Distributed Systems Engineer

ABOUT CLIENT

Our client is a leading research company specializing in technology innovation.

JOB DESCRIPTION

Design and build distributed systems capable of handling large volumes of sensor, telemetry, and control data across cloud and edge environments.
Plan and implement data ingestion and streaming pipelines that connect robot fleets to the cloud in real time (video, LiDAR, joint states, audio).
Construct platforms for extensive training and inference to support robot autonomy and teleoperation using foundation models.
Work closely with ML and Robotics engineers to assist in hardware-in-the-loop simulation, policy rollout, and continuous learning initiatives.
Create internal observability systems to monitor fleet performance and reliability and to support tuning.
Take the lead on infrastructure decisions such as distributed storage, consensus protocols, GPU orchestration, and network reliability.

JOB REQUIREMENTS

Must have more than 7 years of professional experience in software engineering, specializing in distributed systems, networking, or data infrastructure.
Demonstrated ability to build and maintain distributed systems that handle large-scale workloads.
Proficient in Go, Rust, C++, or Python, with a strong foundation in concurrency, networking, and systems performance.
Familiarity with cloud-native technologies such as Kubernetes, gRPC, Kafka, S3, Ray, or similar.
Thorough understanding of data consistency, replication, and fault tolerance in heterogeneous environments.
Experience with GPU-based workloads, model training, or edge compute orchestration is desirable.
Strong analytical skills and a preference for developing fast, measurable, and dependable systems.
Experience building distributed training or large-scale simulation systems.
Knowledge of real-time robotics workloads, including streaming from physical sensors and actuators.
Previous experience with telemetry, observability, or fleet-scale systems in production.
Contributions to open-source infrastructure, AI frameworks, or robotics middleware (ROS, gRPC, Mediasoup, etc.) would be advantageous.

WHAT'S ON OFFER

Join an exceptional research team working on significant, high-impact projects.
Own and shape the primary training infrastructure code used by the team.
Work with real models, real data, and challenges at substantial scale rather than small-scale problems.
Help bridge the gap between research speed and engineering excellence.
Enjoy a flexible work setting with a culture that values depth, transparency, and curiosity.

CONTACT

PEGASI – IT Recruitment Consultancy | Email: recruit@pegasi.com.vn | Tel: +84 28 3622 8666
We are PEGASI – IT Recruitment Consultancy in Vietnam. If you are looking for a new career opportunity, please visit our website at www.pegasi.com.vn. Thank you!

Job Summary

Company Type: Product
Technical Skills: Data Engineering, DevOps, Golang, Rust, C/C++, Python
Location: Others - Viet Nam
Working Policy: Onsite, Remote
Job ID: J01893
Status: Closed
