AI Research Scientist

ABOUT CLIENT

Our client is a leading research company specializing in technology innovation

JOB DESCRIPTION

Driven by Intellectual Curiosity: Relentlessly explore the unknown, ask sharp questions, and challenge assumptions.
Constructively Combative: Engage in thoughtful debate to uncover truth, refine ideas, and push boundaries.
Experimental Mindset: Design and run AI experiments, analyze outcomes rigorously, and iterate with purpose.
User-Centered Innovation: Adapt AI systems to solve real-world problems and elevate the experience of Menlo's users.
Stay on the Cutting Edge: Continuously learn, absorb state-of-the-art research, and apply it with discernment.
Rapid Prototyping & Iteration: Ideate, implement, debug, and accelerate the cycle of experimentation and insight.
Robust Engineering: Build modular, maintainable training codebases that scale and evolve gracefully.
Efficient Data Infrastructure: Develop high-performance data loaders and utilities to streamline training workflows.
Scalable Training Systems: Architect jobs to run seamlessly across multi-GPU and multi-node setups (e.g., DDP, NCCL).
Performance Optimization: Tune models for speed, stability, and hardware efficiency without compromising accuracy.
Code Quality & Reproducibility: Write clean, testable, and reproducible code that stands the test of time.
Open Source Stewardship: Actively contribute to and improve the broader ecosystem through upstream collaboration.

JOB REQUIREMENT

Strong proficiency in PyTorch, including custom modules, loss functions, and distributed training
Demonstrated experience in training deep learning models in real-world research or production settings
Proficient engineering skills in Python, and optionally C++ for performance-critical components
Experience in working with large datasets, complex pipelines, and real-world debugging
Understanding of training dynamics and the ability to identify and resolve issues
Familiarity with job launchers, logging tools (e.g., Weights & Biases, TensorBoard), and checkpointing systems
Previous involvement with TorchScript, ONNX, or custom inference runtimes
Contributions to PyTorch or open-source ML tooling
Experience with transformer models, diffusion models, or large-scale vision/NLP tasks
Familiarity with batch schedulers (SLURM), cluster environments, and GPU resource management
Ability to collaborate closely with systems engineers or MLOps teams to ensure smooth integration

WHAT'S ON OFFER

Join a top-notch research team in working on impactful projects
Take ownership of and contribute to the development of the core training code infrastructure
Engage with real models, data, and scale, tackling substantial problems
Support in bridging the gap between research speed and engineering excellence
Enjoy a flexible work setup and a culture that promotes depth, clarity, and curiosity

CONTACT

PEGASI – IT Recruitment Consultancy | Email: recruit@pegasi.com.vn | Tel: +84 28 3622 8666
We are PEGASI – IT Recruitment Consultancy in Vietnam. If you are looking for new opportunity for your career path, kindly visit our website www.pegasi.com.vn for your reference. Thank you!

Job Summary

Company Type:

Product

Technical Skills:

Data Science

Location:

Others - Singapore

Working Policy:

Remote

Salary:

Negotiation

Job ID:

J01854

Status:

Active

Related Job:

Senior Deep Learning Engineer - AI for Wireless Systems

Ho Chi Minh, Ha Noi - Viet Nam


Computer Hardware

  • Machine Learning

Design and prototype deep learning models for wireless signal processing tasks such as channel estimation, beam alignment, link adaptation, and scheduling. Work with simulation tools and real-world datasets to build models that generalize across diverse wireless scenarios. Implement, train, and validate neural networks (e.g., CNNs, Transformers, GNNs) using PyTorch or TensorFlow. Collaborate with researchers and system engineers to integrate models into fullstack RAN. Optimize model performance for real-time inference and hardware acceleration. Contribute to model evaluation, benchmarking, and deployment readiness on GPU platforms.

Negotiation

View details

Engineering Manager - AI for RAN and 6G Wireless Systems

Ho Chi Minh, Ha Noi - Viet Nam


Computer Hardware

  • Machine Learning
  • Management

Lead and grow a high-impact engineering team focused on AI-enabled signal processing for the Radio Access Network (RAN). Guide the development of deep learning models for tasks such as channel estimation, beamforming, link adaptation, and CSI compression. Collaborate with global teams across architecture, research, and systems to drive proof-of-concepts and production-quality AI-RAN components. Oversee integration of AI models into full-stack simulations and/or testbeds using frameworks such as PyTorch, TensorFlow, and Sionna. Align project priorities with hardware-software co-design constraints and deployment scenarios on Our Client's platforms. Mentor team members, ensure technical excellence, and contribute to strategic direction.

Negotiation

View details

Director Engineering – Software Engineering and AI Inferencing Platforms

Ho Chi Minh, Ha Noi - Viet Nam


Computer Hardware

  • Management
  • Backend
  • Cloud
  • Data Engineering
  • AI

Build, lead and scale world-class engineering teams in Vietnam, collaborating with global counterparts across system software, data science, and AI platforms. Drive the design, architecture, and delivery of high-performance system software platforms that power Our Client's AI products and services. Partner with global teams across Machine Learning, Inference Services, and Hardware/Software integration to ensure performance, reliability, and scalability. Oversee the development and optimization of AI delivery platforms in Vietnam, including NIMs, Blueprints, and other flagship Our Client's services. Engage with open-source and enterprise data and workflow ecosystems (e.g., Temporal, Gitlab DevOps Platform, RAPIDS, NeMo Curator, Morpheus) to advance accelerated AI factory, data science and data engineering workloads. Champion continuous integration, continuous delivery, and engineering best practices across multi-site R&D Centers. Collaborate with product management and cross-functional stakeholders to ensure enterprise readiness and customer impact. Develop and deploy standard processes for large-scale, distributed system testing, encompassing stress, scale, failover, and resiliency testing. Ensure security and compliance testing aligns with industry standards for cloud and data center products. Mentor and develop talent within the organization, fostering a culture of quality and continuous improvement.

Negotiation

View details