AI Research Engineer

ABOUT CLIENT

Our client is a leading research company specializing in technology innovation.

JOB DESCRIPTION

Focus on improving model usability for users
Conduct training runs and AI experiments
Analyze results and make necessary changes
Collaborate with the product engineering team to implement improvements
Implement and improve upon recent RL techniques such as GRPO, DPO, and RePO
Create and manage adaptable, expandable training codebases
Establish and maintain efficient data pipelines, including both synthetic and real data
Ensure training jobs scale across multiple GPUs and nodes, using tools such as FSDP, DDP, and NCCL
Maintain code health over the long term by writing clean, testable, and reproducible code
Contribute to the enhancement of open source dependencies
(Optional) Publish papers and present research findings

JOB REQUIREMENTS

Strong Python skills and experience coding with PyTorch or similar frameworks
Demonstrated proficiency in training deep learning and reinforcement learning models in practical scenarios
Experience in handling and analyzing large datasets and intricate workflows
Solid understanding of training dynamics, with the ability to identify and troubleshoot issues
Proficiency with job launchers, logging tools (such as Weights & Biases and TensorBoard), and checkpointing systems
A habit of applying engineering precision to research: clear code, meticulous design, and reproducible results
Knowledge of TorchScript, ONNX, or custom inference runtimes
Contribution to open-source projects related to PyTorch or machine learning tools
Background in working with transformer models, diffusion models, VLMs, or large-scale vision/NLP tasks
Familiarity with batch schedulers (SLURM), cluster environments, and GPU resource management
Ability to collaborate closely with systems engineers or MLOps teams for seamless integration

WHAT'S ON OFFER

Join an exceptional research team to work on significant and impactful projects
Take charge of and influence the primary training code infrastructure utilized by the team
Work with real models, real data, and challenges at substantial scale, not small-scale problems
Contribute to bridging the gap between research speed and engineering excellence
Enjoy a flexible work setting with a culture that treasures depth, transparency, and inquisitiveness

CONTACT

PEGASI – IT Recruitment Consultancy | Email: recruit@pegasi.com.vn | Tel: +84 28 3622 8666
We are PEGASI – IT Recruitment Consultancy in Vietnam. If you are looking for a new career opportunity, please visit our website at www.pegasi.com.vn. Thank you!

Job Summary

Company Type: Product
Technical Skills: Machine Learning, Python, AI
Location: Ho Chi Minh - Viet Nam
Working Policy: Onsite
Salary: Negotiable
Job ID: J01951
Status: Active
