Lead Data Engineer

ABOUT CLIENT

Our client is a global technology company that specializes in providing innovative IT solutions for the financial services industry

JOB DESCRIPTION

Design, create, and maintain scalable data infrastructure, which includes data lakes, pipelines, and metadata repositories, to ensure accurate and timely delivery of data to stakeholders.
Collaborate with data scientists to develop and maintain data models, integrate data sources, and facilitate machine learning workflows and experimentation environments.
Build and enhance large-scale, batch, and real-time data processing systems to improve operational efficiency and align with business goals.
Use Python, Apache Airflow, and AWS services to automate data workflows and processes, ensuring efficient scheduling and monitoring.
Utilize AWS services like S3, Glue, EC2, and Lambda to manage data storage and compute resources, striving for high performance, scalability, and cost-effectiveness.
Implement comprehensive testing and validation methods to guarantee the reliability, accuracy, and security of data processing workflows.
Keep updated on the latest industry best practices and emerging technologies in data engineering and data science to suggest innovative solutions and enhancements.

JOB REQUIREMENT

Core Expertise: Proficiency in Python for data processing and scripting (pandas, pyspark), workflow automation (Apache Airflow), and experience with cloud computing services (Glue, S3, EC2, Lambda).
Containerization & Orchestration: Experience working with Kubernetes and Docker for managing containerized environments in the cloud.
Data Engineering Tools: Hands-on experience with columnar and big data databases (Athena, Redshift, Vertica, Hive/Hadoop), along with version control systems like Git.
Cloud Services: Strong familiarity with cloud services for data processing and management.
CI/CD Pipeline: Experience with CI/CD tools for continuous integration and deployment.
Data Engineering Focus (75%): Expertise in building and managing robust data architectures and pipelines for large-scale data operations.
Data Science Support (25%): Ability to support data science workflows, including collaboration on data preparation, feature engineering, and enabling experimentation environments.
Langchain Experience: Familiarity with Langchain for building data applications involving natural language processing or conversational AI frameworks.
Advanced Data Science Tools: Experience with machine learning environments.
Big Data & Analytics: Familiarity with both RDBMS and NoSQL databases.
BI Tools: Experience with enterprise BI tools.
Messaging & Event Streaming: Familiarity with distributed messaging systems for event streaming.
Monitoring & Logging: Experience with monitoring and log management tools.
Data Privacy and Security: Knowledge of best practices for ensuring data privacy and security, particularly in large data infrastructures.

WHAT'S ON OFFER

Offers competitive salary
Salary bands are reviewed annually
Provides 13th-month salary pro rata based on employee's length of service
Monthly lunch allowance of 700,000 VND per employee
Covers monthly parking fee for employee motorbikes
Conducts performance evaluation once a year for performance bonuses and salary increments
Provides private health insurance including accident, outpatient, in-patient, maternity, and dental for permanent employees after 2-month probation
Offers expense claim for eyewear and annual health check-ups
Provides a maximum of 18-day vacation leave per year, with the option to carry over 5 days until the following year
Grants an additional annual leave day for every two-year anniversary
Allocates annual fund for fitness activities based on team's vote
Offers a range of healthy snacks, tea, coffee, milk, and beer on tap
Conducts company townhall meetings every 6 weeks
Engages in CSR activities as per company's CSR guideline
Offers onsite tour/training courses at other offices and client's destination overseas where applicable.

CONTACT

PEGASI – IT Recruitment Consultancy | Email: recruit@pegasi.com.vn | Tel: +84 28 3622 8666
We are PEGASI – IT Recruitment Consultancy in Vietnam. If you are looking for new opportunity for your career path, kindly visit our website www.pegasi.com.vn for your reference. Thank you!

Job Summary

Company Type:

Outsource

Technical Skills:

Data Engineering, Management

Location:

Ho Chi Minh, Ha Noi - Viet Nam

Working Policy:

Hybrid

Salary:

Negotiation

Job ID:

J02046

Status:

Active

Related Job:

DevOps Engineer

Others - Viet Nam


Product

  • Devops
  • Kubernetes
  • Network

Operate and evolve our Kubernetes platform across multiple clusters and environments (Prod, Dev, hybrid on-prem and public cloud), covering control plane operations, node lifecycle, upgrades, and autoscaling at every layer (Cluster Autoscaler, HPA, KEDA). Architect and manage hybrid cloud infrastructure spanning on-premises and public clouds (GCP, AWS), including workload placement, cross-cloud networking, and unified resource management. Own the CI/CD and GitOps experience end-to-end: container build pipelines, image optimization, and progressive delivery via ArgoCD / FluxCD. Own the observability stack as a single pane of glass across all clusters: Grafana, Mimir, Tempo, Loki, Pyroscope, OnCall, Prometheus -- and help push toward agent-assisted SRE workflows. Manage and improve our inference platform: vLLM serving and AIBrix for multi-model orchestration and autoscaling across a fleet of NVIDIA GPUs. Operate platform services: Kafka, Redis, PostgreSQL, OpenSearch. Manage identity and access via Keycloak integrated with Google Workspace; harden SSO, RBAC, and secrets management across the platform. Harden network security across private load balancers, firewalls, and VPC segmentation; design and maintain hub-and-spoke / multi-AZ topologies. Support training infrastructure: self-service VM provisioning, RunPod burst capacity, Weights and Biases integration. Drive infrastructure reliability, cost efficiency, and capacity planning as the platform scales.

Negotiation

View details

Platform Engineer

Ho Chi Minh - Viet Nam


Product

  • Backend
  • Devops
  • Data Engineering

Build and maintain distributed infrastructure handling telemetry, sensory, and control data across cloud and edge environments Design and operate data ingestion and streaming pipelines connecting robot fleets to the cloud in real time, covering video, joint states, audio, and LiDAR Develop and maintain backend services and APIs that power the Company's developer-facing platform, with a focus on reliability and developer experience Manage and evolve cloud native infrastructure using Kubernetes, Docker, and infrastructure as code tooling Ensure platform reliability through monitoring, alerting, autoscaling, failover, and incident response Support ML and robotics teams with data infrastructure for training pipelines, policy rollout, and hardware-in-the-loop simulation Implement secure APIs with access control, rate limiting, and usage metering as we scale

Negotiation

View details

Software Engineer (Digital Twin)

Ho Chi Minh - Viet Nam


Product

  • Python
  • C/C++

Build and maintain high-fidelity digital twin environments for Asimov across MuJoCo, Isaac Sim, and Unreal Engine, calibrated to real hardware behavior. Design and own the systems -- not just the environments -- that let locomotion, autonomy, and perception teams generate, validate, and iterate on simulation scenarios at scale. Build pipelines for asset import, USD and MJCF workflows, sensor modeling, and real-to-sim calibration to keep digital twins synchronized with evolving hardware. Develop photorealistic rendering pipelines in Unreal Engine for synthetic data generation and perception model training. Work with hardware and mechatronics teams to model actuator dynamics, contact physics, and structural behavior, ensuring simulation parameters reflect physical ground truth. Integrate digital twin environments with the Company's locomotion training pipeline (Cyclotron) and autonomy stack, enabling teams to run experiments and close the sim-to-real gap. Contribute to the open-source Asimov simulation stack, including tooling, documentation, and reproducible environment workflows.

Negotiation

View details