Director of Platform

ABOUT CLIENT

Our client is working on global products, revolutionizing how people and businesses use the internet to instill confidence in every online interaction

JOB DESCRIPTION

You will be responsible for defining the platform requirements, architecture, and integration with existing ops systems.
Your role will involve contributing to the technical roadmap for the platform launch and promoting its adoption across all development teams.
You will closely examine the technical aspects and architecture of the transformation project, documenting and communicating the progress internally.
Over time, you will play a part in maintaining a comprehensive global architectural vision.

JOB REQUIREMENT

At least 5 years of relevant experience in a management role
Proficiency in AWS, GCP, SRE, Kubernetes, Linux, Docker, Jenkins, Golang, Python, Bash, NoSQL, and Service Discovery
Hands-on experience with containers
Strong understanding of security and compliance
Familiarity with a version control system like Git
Enthusiasm for learning new skills and expanding knowledge
Goal-oriented and detail-focused
Ability to collaborate with global teams asynchronously
Familiarity with issue-tracking systems like Jira
Experience with Kubernetes and MongoDB
Excellent written and spoken English skills
Solid analytical and problem-solving abilities
Passion for developing innovative products
A growth mindset and a commitment to excellence

WHAT'S ON OFFER

Opportunities for learning and personal development with allocated allowance
Comprehensive health insurance
Competitive salary and bonus package
Annual salary performance review
Flexibility with hybrid working arrangements and modern office location
Annual company trip and year-end party
Regular team-building activities
In-office snacks and drinks
International workplace environment
Weekly yoga classes in the office

CONTACT

PEGASI – IT Recruitment Consultancy | Email: recruit@pegasi.com.vn | Tel: +84 28 3622 8666
We are PEGASI – IT Recruitment Consultancy in Vietnam. If you are looking for new opportunity for your career path, kindly visit our website www.pegasi.com.vn for your reference. Thank you!

Job Summary

Company Type:

Product

Technical Skills:

Devops, Google Cloud, Kubernetes

Location:

Ho Chi Minh - Viet Nam

Working Policy:

Hybrid

Salary:

Negotiation

Job ID:

J01649

Status:

Close

Related Job:

Director Engineering – Software Engineering and AI Inferencing Platforms

Ho Chi Minh, Ha Noi - Viet Nam


Computer Hardware

  • Management
  • Backend
  • Cloud
  • Data Engineering
  • AI

Lead and expand engineering teams in Vietnam across system software, data science, and AI platforms. Drive the creation, structure, and delivery of high-performance system software platforms that support AI products and services. Collaborate with global teams across Machine Learning, Inference Services, and Hardware/Software integration to guarantee performance, reliability, and scalability. Oversee the development and optimization of AI delivery platforms in Vietnam, including NIMs, Blueprints, and other flagship services. Collaborate with open-source and enterprise data and workflow ecosystems to advance accelerated AI factory, data science, and data engineering workloads. Promote continuous integration, continuous delivery, and engineering best practices across multi-site R&D Centers. Work with product management and other stakeholders to ensure enterprise readiness and customer impact. Establish and implement standard processes for large-scale, distributed system testing including stress, scale, failover, and resiliency testing. Ensure security and compliance testing aligns with industry standards for cloud and data center products. Mentor and develop talent within the organization, fostering a culture of quality and continuous improvement.

Negotiation

View details

Principal Engineer, System Software Platform Engineering

Ho Chi Minh, Ha Noi - Viet Nam


Computer Hardware

  • Devops
  • Backend
  • AI

Create and manage a platform for AI that provides services for multiple users, handles identity and policy management, configures quotas, and controls costs. Additionally, this platform should offer easy paths for teams to work on AI projects. Oversee the deployment of AI models at scale, including routing, autoscaling, and implementing safety measures to ensure reliability and observability. Manage GPU resources in a Kubernetes environment, including device plugins, feature discovery, and scheduling strategies, among other responsibilities. Take charge of the entire lifecycle of GPUs, ensuring that driver, firmware, and runtime updates are implemented safely and consistently. Implement virtualization strategies for GPU resources, such as vGPU and PCIe passthrough, while defining policies for resource placement, isolation, and preemptive actions. Establish secure traffic and networking protocols, including gateways, service mesh, and authentication/authorization measures. Enhance observability and operational efficiency through monitoring tools for GPUs, response protocols for incidents, and optimization of costs. Develop reusable templates, integrate SDKs and CLIs, and implement infrastructure-as-code standards for the platform. Influence the platform's direction by creating design documents, mentoring engineers, and aligning platform development with the needs of AI products.

Negotiation

View details

Senior Manager, System Software Platform Engineering

Ho Chi Minh, Ha Noi - Viet Nam


Computer Hardware

  • Devops
  • AI

Take on the responsibility of creating a highly reliable, efficient system software platform for AI products and services. Create and oversee processes for developing and managing teams of system SW engineers. Collaborate with different teams and stakeholders across various time zones for continuous integration and delivery of system software. Focus on enhancing Our Client's AI software and services to impress customers.

Negotiation

View details