AI Research Engineer

ABOUT CLIENT

Our client is a leading research company specializing in technology innovation.

JOB DESCRIPTION

Focus on improving model usability for users
Conduct training runs and AI experiments
Analyze results and make necessary changes
Collaborate with the product engineering team to implement improvements
Implement and improve upon recent RL techniques such as GRPO, DPO, and RePO
Create and manage adaptable, expandable training codebases
Establish and maintain efficient data pipelines, including both synthetic and real data
Ensure training jobs scale across multiple GPUs and nodes, using tools such as FSDP, DDP, and NCCL
Maintain code health over the long term by writing clean, testable, and reproducible code
Contribute to the enhancement of open source dependencies
(Optional) Publish papers and present research findings

JOB REQUIREMENTS

Strong Python skills and experience coding with frameworks such as PyTorch or similar
Demonstrated proficiency in training deep learning and reinforcement learning models in practical scenarios
Experience in handling and analyzing large datasets and intricate workflows
Solid understanding of training dynamics, including identifying and troubleshooting issues
Proficiency with job launchers, logging tools (such as Weights & Biases, TensorBoard), and checkpointing systems
Engineering rigor applied to research: clear code, meticulous design, and reproducible results
Knowledge of TorchScript, ONNX, or custom inference runtimes
Contribution to open-source projects related to PyTorch or machine learning tools
Background in working with transformer models, diffusion models, VLMs, or extensive vision/NLP tasks
Familiarity with batch schedulers (SLURM), cluster environments, and GPU resource management
Ability to collaborate closely with systems engineers or MLOps teams for seamless integration

WHAT'S ON OFFER

Join an exceptional research team to work on significant and impactful projects
Take charge of and influence the primary training code infrastructure utilized by the team
Work with real models, real data, and challenges at substantial scale, not small-scale problems
Contribute to bridging the gap between research speed and engineering excellence
Enjoy a flexible work setting with a culture that treasures depth, transparency, and inquisitiveness

CONTACT

PEGASI – IT Recruitment Consultancy | Email: recruit@pegasi.com.vn | Tel: +84 28 3622 8666
We are PEGASI – IT Recruitment Consultancy in Vietnam. If you are looking for a new opportunity in your career, please visit our website, www.pegasi.com.vn. Thank you!

Job Summary

Company Type: Product
Technical Skills: Machine Learning, Python, AI
Location: Ho Chi Minh - Viet Nam
Working Policy: Onsite
Salary: Negotiable
Job ID: J01951
Status: Closed
