AI Research Engineer

ABOUT CLIENT

Our client is a leading research company specializing in technology innovation

JOB DESCRIPTION

Focus on improving model usability for users
Conduct training runs and AI experiments
Analyze results and make necessary changes
Collaborate with the product engineering team to implement improvements
Implement and improve upon recent RL techniques like GRPO, DPO, RePO, etc.
Create and manage adaptable, expandable training codebases
Establish and maintain efficient data pipelines, including both synthetic and real data
Ensure training jobs are capable of scaling across multiple GPUs and nodes, such as FSDP, DDP, NCCL
Maintain code health over the long term by writing clean, testable, and reproducible code
Contribute to the enhancement of open source dependencies
(Optional) Publish papers and present research findings

JOB REQUIREMENT

Comprehensive skills in Python and coding using frameworks like PyTorch, or similar
Demonstrated proficiency in training deep learning and reinforcement learning models in practical scenarios
Experience in handling and analyzing large datasets and intricate workflows
Adept understanding of training dynamics, identifying issues and troubleshooting them
Proficiency in job launchers, logging tools (such as Weights & Biases, TensorBoard), and checkpointing systems
An approach of applying engineering precision to research through writing clear code, meticulous design, and reproducible results
Knowledge of TorchScript, ONNX, or custom inference runtimes
Contribution to open-source projects related to PyTorch or machine learning tools
Background in working with transformer models, diffusion models, VLMs, or extensive vision/NLP tasks
Familiarity with batch schedulers (SLURM), cluster environments, and GPU resource management
Ability to collaborate closely with systems engineers or MLOps teams for seamless integration

WHAT'S ON OFFER

Join an exceptional research team to work on significant and impactful projects
Take charge of and influence the primary training code infrastructure utilized by the team
Engage with actual models, real data, and substantial scale challenges, not small-scale problems
Contribute to bridging the gap between research speed and engineering excellence
Enjoy a flexible work setting with a culture that treasures depth, transparency, and inquisitiveness

CONTACT

PEGASI – IT Recruitment Consultancy | Email: recruit@pegasi.com.vn | Tel: +84 28 3622 8666
We are PEGASI – IT Recruitment Consultancy in Vietnam. If you are looking for new opportunity for your career path, kindly visit our website www.pegasi.com.vn for your reference. Thank you!

Job Summary

Company Type:

Product

Technical Skills:

Machine Learning, Python

Location:

Ho Chi Minh - Viet Nam

Working Policy:

Onsite

Salary:

Negotiation

Job ID:

J01951

Status:

Active

Related Job:

Engagement Engineer

Ho Chi Minh - Viet Nam


Product, AI Application Platform

  • Project Management
  • Account Management
  • AI

Drive customer success by strategically managing relationships, acting as a key advocate, and aligning our product capabilities with customer goals while proactively addressing project risks. Stay updated on AI innovation by participating in the design of AI systems for industrial applications and keeping up with the latest trends in AI, machine learning, and data analytics. Foster customer collaboration to clearly define problem statements, develop impactful solutions, translate needs into actionable tasks, and oversee project lifecycles, including roadmap planning, execution, and quality assurance. Lead and collaborate with cross-functional technical teams to efficiently develop and deliver solutions within specified timelines. Provide customers with tailored technical solutions and training to enhance the impact and value of the solutions offered.

Negotiation

View details

API Integration Engineer (Java/Python/Golang)

Ho Chi Minh - Viet Nam


Offshore

  • Java
  • Python

Main responsibilities include integrating using Java/Python/Golang, REST, SOAP APIs, and Identity service. Addressing complex technical and business challenges and staying updated on new technology and frameworks. Collaborating with a team to take accountability for the features you manage. Handling the complete product life cycle—from design and development to testing, deployment, monitoring, and enhancement.

Negotiation

View details

Platform Reliability Engineer

Ho Chi Minh - Viet Nam


Outsource

Maintain production reliability of the Linux-based research and trading platform within a globally distributed engineering team. Respond quickly to production infrastructure issues. Comprehend internal client needs and effectively communicate them to regional and global leadership. Identify risks, develop contingency plans, and implement solutions to mitigate them. Enhance the observability platform to monitor the performance and health of critical computing environments. Take part in occasional on-call rotations and support on-call staff during their shifts. Contribute to organizational knowledge through documentation, education, and writing maintainable code.

Negotiation

View details