Python Developer (DevOps - focused)
ABOUT CLIENT
JOB DESCRIPTION
JOB REQUIREMENT
WHAT'S ON OFFER
CONTACT
Job Summary
Company Type:
Outsourcing
Technical Skills:
Python, Devops
Location:
Ho Chi Minh - Viet Nam
Working Policy:
Onsite
Salary:
Negotiation
Job ID:
J01897
Status:
Active
Related Job:
Principal Engineer, System Software Platform Engineering
Ho Chi Minh, Ha Noi - Viet Nam
Computer Hardware
- Devops
- Backend
- AI
Build and operate the platform for AI: multi-tenant services, identity/policy, configuration, quotas, cost controls, and paved paths for teams. Lead inference platforms at scale, including model-serving routing, autoscaling, rollout safety (canary/A-B), ensuring reliability, and maintaining end-to-end observability. Operate GPUs in Kubernetes: lead Our Client device plugins, GPU Feature Discovery, time-slicing, MPS, and MIG partitioning; implement topology-aware scheduling and bin-packing. Lead GPU lifecycle: driver/firmware/Runtime (CUDA, cuDNN, NCCL) updates via GPU Operator; ensure kernel/RHEL/Ubuntu compatibility and safe rollouts. Enable virtualization strategies: vGPU (e.g., on vSphere/KVM), PCIe passthrough, mediated devices, and pool-based GPU sharing; define placement, isolation, and preemption policies. Build secure traffic and networking: API gateways, service mesh, rate limiting, authN/authZ, multi-region routing, and DR/failover. Improve observability and operations through metrics, tracing, and logging for DCGM/GPUs, runbooks, incident response, performance, and cost optimization. Establish platform blueprints: reusable templates, SDKs/CLIs, golden CI/CD pipelines, and infrastructure-as-code standards. Lead through influence: write design docs, conduct reviews, mentor engineers, and shape platform roadmaps aligned to AI product needs.
Negotiation
View detailsSenior Manager, System Software Platform Engineering
Ho Chi Minh, Ha Noi - Viet Nam
Computer Hardware
- Devops
- AI
Be responsible for the design and delivery of the most reliable, performing and efficient system software platform for AI products and services. Define, develop and design process, manage teams of junior and experienced System SW engineers Work with continuous integration, continuous delivery of system software Be responsible for Our Client's System Software Platform, work closely with the testing team, support team and stakeholders across time zones Innovate! Bring Our Client's AI software and services to shine in customer's view
Negotiation
View detailsSenior Systems Software Engineer
Ho Chi Minh, Ha Noi - Viet Nam
Computer Hardware
- Devops
Contribute the development and maintenance of advanced machine learning software and frameworks, optimizing for performance and scalability. Enhance CI/CD pipelines to streamline the development, testing, and deployment of large-scale machine learning models. Implement and manage cloud infrastructure for continuous integration, delivery, and deployment, ensuring high availability and scalability. Collaborate with cross-functional teams, including engineering, QA, and research, to improve development workflows and enhance software delivery speed and quality. Troubleshoot and resolve complex issues related to software development, containerization, and cloud infrastructure in production environments. Write and maintain robust documentation for development and deployment processes. Communicate effectively with technical and non-technical stakeholders to set shared expectations and ensure visibility around the release and deployment process. Lead code reviews, testing, and debugging to ensure high-quality code and efficient workflows. Mentor and guide junior engineers, fostering professional growth and enhancing team capabilities.