Lead Software Architect

JOB DESCRIPTION

We are seeking a highly skilled and experienced Senior Software Architect to join our fast-paced Product Engineering team. The ideal candidate will have a solid background in modern computer system architecture, computer science, algorithms, data structures, and design patterns, and a minimum of 5 years of experience in Python, and designing and building clean, client-oriented APIs.
Responsibilities:
Architect, develop, and implement complex software applications
Collaborate with cross-functional teams to deliver high-quality solutions
Communicate clearly & persuade a strong product-engineering team through diagrams and design documents

JOB REQUIREMENT

Requirements:
Bachelor's or Master's degree in Computer Science, Computer Engineering, or equivalent
Demonstrated successful experience in software system architecture
Strong proficiency in Python (5+ years of experience)
Experience in desinging & building clean, client-oriented APIs (5+ years of experience)
Nice-to-have:
Experience in designing and implementing complex distributed systems
Experience with Kubernetes
Background in Physics/Engineering

WHAT'S ON OFFER

Awesome colleagues
We will match exceptional talent with exceptional compensation (salary and equity) 
You can shape the company culture where the best ideas always win out–regardless of the role, title or seniority; and where engineers are encouraged to help drive strategic decisions
Unlimited vacation policy
Comprehensive health insurance

CONTACT

PEGASI – IT Recruitment Consultancy | Email: recruit@pegasi.com.vn | Tel: +84 28 3622 8666
We are PEGASI – IT Recruitment Consultancy in Vietnam. If you are looking for new opportunity for your career path, kindly visit our website www.pegasi.com.vn for your reference. Thank you!

Job Summary

Company Type:

Product, AI Application Platform

Technical Skills:

Python

Location:

Ho Chi Minh - Viet Nam

Working Policy:

Salary:

Negotiation

Job ID:

J01204

Status:

Close

Related Job:

Director Engineering – Software Engineering and AI Inferencing Platforms

Ho Chi Minh, Ha Noi - Viet Nam


Computer Hardware

  • Management
  • Backend
  • Cloud
  • Data Engineering
  • AI

Build, lead and scale world-class engineering teams in Vietnam, collaborating with global counterparts across system software, data science, and AI platforms. Drive the design, architecture, and delivery of high-performance system software platforms that power Our Client's AI products and services. Partner with global teams across Machine Learning, Inference Services, and Hardware/Software integration to ensure performance, reliability, and scalability. Oversee the development and optimization of AI delivery platforms in Vietnam, including NIMs, Blueprints, and other flagship Our Client's services. Engage with open-source and enterprise data and workflow ecosystems (e.g., Temporal, Gitlab DevOps Platform, RAPIDS, NeMo Curator, Morpheus) to advance accelerated AI factory, data science and data engineering workloads. Champion continuous integration, continuous delivery, and engineering best practices across multi-site R&D Centers. Collaborate with product management and cross-functional stakeholders to ensure enterprise readiness and customer impact. Develop and deploy standard processes for large-scale, distributed system testing, encompassing stress, scale, failover, and resiliency testing. Ensure security and compliance testing aligns with industry standards for cloud and data center products. Mentor and develop talent within the organization, fostering a culture of quality and continuous improvement.

Negotiation

View details

Principal Engineer, System Software Platform Engineering

Ho Chi Minh, Ha Noi - Viet Nam


Computer Hardware

  • Devops
  • Backend
  • AI

Build and operate the platform for AI: multi-tenant services, identity/policy, configuration, quotas, cost controls, and paved paths for teams. Lead inference platforms at scale, including model-serving routing, autoscaling, rollout safety (canary/A-B), ensuring reliability, and maintaining end-to-end observability. Operate GPUs in Kubernetes: lead Our Client device plugins, GPU Feature Discovery, time-slicing, MPS, and MIG partitioning; implement topology-aware scheduling and bin-packing. Lead GPU lifecycle: driver/firmware/Runtime (CUDA, cuDNN, NCCL) updates via GPU Operator; ensure kernel/RHEL/Ubuntu compatibility and safe rollouts. Enable virtualization strategies: vGPU (e.g., on vSphere/KVM), PCIe passthrough, mediated devices, and pool-based GPU sharing; define placement, isolation, and preemption policies. Build secure traffic and networking: API gateways, service mesh, rate limiting, authN/authZ, multi-region routing, and DR/failover. Improve observability and operations through metrics, tracing, and logging for DCGM/GPUs, runbooks, incident response, performance, and cost optimization. Establish platform blueprints: reusable templates, SDKs/CLIs, golden CI/CD pipelines, and infrastructure-as-code standards. Lead through influence: write design docs, conduct reviews, mentor engineers, and shape platform roadmaps aligned to AI product needs.

Negotiation

View details

Senior Manager, System Software Platform Engineering

Ho Chi Minh, Ha Noi - Viet Nam


Computer Hardware

  • Devops
  • AI

Be responsible for the design and delivery of the most reliable, performing and efficient system software platform for AI products and services. Define, develop and design process, manage teams of junior and experienced System SW engineers Work with continuous integration, continuous delivery of system software Be responsible for Our Client's System Software Platform, work closely with the testing team, support team and stakeholders across time zones Innovate! Bring Our Client's AI software and services to shine in customer's view

Negotiation

View details