Senior System Software Engineer - AI Data Platform - Inference Factory

ABOUT CLIENT

Our client is a leading technology company specializing in graphics processing units (GPUs) and artificial intelligence (AI).

JOB DESCRIPTION

Create infrastructure and tools to automate complex software processes effectively.
Improve performance: Deploy advanced test harnesses, benchmarking frameworks, and analytical tools to thoroughly evaluate and enhance the performance and efficiency of software and hardware platforms.
Utilize expertise in operating systems, kernel internals, device drivers, memory management, storage, networking, and high-speed interconnects to construct and troubleshoot high-performance systems.
Collaborate with engineering teams to comprehend requirements and deliver efficient solutions.
Establish performance objectives, assess feedback, analyze data, and continually enhance system reliability.
Shape technical strategies: Contribute to developing technical strategies and roadmaps for platform automation initiatives to ensure they are in line with company goals and industry best practices.

JOB REQUIREMENT

Required: Bachelor's or equivalent experience in Computer Science, Computer Engineering, or a related technical field, or Master's degree or equivalent experience in a similar field.
Minimum 5 years of industry experience in software development, focusing on infrastructure, distributed systems, automation, and/or performance engineering.
Proficiency in System-Level Programming: Proven ability to develop robust tools and automation using programming languages such as C++, Python, or Go.
Thorough Understanding of System Software: Experience with operating system internals, device drivers, memory management, and debugging performance issues in complex compute applications.
Distributed Systems Expertise: Experience in designing, building, and operating large-scale distributed systems, with knowledge of networking protocols, cluster management, and high-performance interconnects.
Automation and CI/CD Proficiency: Experience building and maintaining automated testing, benchmarking, and continuous integration/continuous deployment pipelines.
Strong Problem-Solving and Analytical Skills: Outstanding analytical, problem-solving, and debugging skills, with a track record of resolving complex technical challenges.
Collaboration and Communication Skills: Excellent interpersonal and communication skills, with the ability to articulate complex technical concepts to diverse audiences and collaborate effectively across teams.
Preferred qualifications
Experience optimizing performance for AI/Machine Learning workloads, especially inference applications, on diverse hardware platforms.
Prior experience building or contributing to large-scale compute infrastructure solutions in cloud environments or on-premises data centers.
Familiarity with containerization and orchestration technologies, such as Docker and Kubernetes.
Knowledge of performance profiling tools and methodologies for hardware and software systems.
Track record of driving significant efficiency gains or architectural improvements in large-scale systems.

WHAT'S ON OFFER

CONTACT

PEGASI – IT Recruitment Consultancy | Email: recruit@pegasi.com.vn | Tel: +84 28 3622 8666
We are PEGASI – IT Recruitment Consultancy in Vietnam. If you are looking for new opportunity for your career path, kindly visit our website www.pegasi.com.vn for your reference. Thank you!

Job Summary

Company Type:

Product

Technical Skills:

Devops, C/C++, Python, Golang

Location:

Ho Chi Minh - Viet Nam

Working Policy:

Hybrid

Job ID:

J02058

Status:

Close

Related Job:

AI Software Transformation Engineer (Distributed Computing)

Ho Chi Minh - Viet Nam


Product

  • Data Engineering
  • Backend
  • Spark
  • AI

Create an advanced AI-powered software transformation framework to speed up the modernization of complex analytical applications. Develop architectural patterns and transformation methodologies for converting outdated computational tools into scalable cloud-native solutions. Utilize AI agents, LLMs, and emerging AI engineering techniques to automate software analysis, code transformation, validation, and optimization. Work with distributed computing specialists to design target architectures that leverage Spark-based execution models for large-scale data processing. Lead technical investigations into restructuring, decomposing, or re-implementing existing software systems for efficient operation in distributed environments. Develop reusable transformation pipelines, automation tooling, and engineering frameworks for large-scale software modernization. Establish validation strategies and quality frameworks to ensure that transformed systems maintain functional correctness and reproducibility. Make architectural decisions regarding scalability, maintainability, performance, and long-term platform evolution. Collaborate with domain experts to understand application requirements and translate them into scalable technical solutions. Prototype and assess new AI-assisted engineering approaches to enhance transformation speed, engineering productivity, and software quality. Contribute to the organization's long-term strategy for AI-driven software modernization and engineering automation.

Negotiation

View details

Senior Quality Engineer (Automation, Backend)

Ho Chi Minh - Viet Nam


Product

  • Automation Test

Lead test automation strategy and framework design for backend and cloud-based services. Drive end-to-end test automation initiatives using Cypress to ensure seamless user experiences. Perform thorough manual testing for complex workflows requiring deep attention to UX and usability details. Implement continuous integration and deployment test practices such as GitHub Actions and Jenkins. Collaborate with developers and DevOps to enhance test reliability and coverage. Review code and advocate for QA best practices across teams. Identify quality risks early and actively seek solutions. Ensure release compliance through test result reporting.

Negotiation

View details

Senior Quality Engineer (Automation, Full Stack)

Ho Chi Minh - Viet Nam


Product

  • Automation Test

Develop a test automation strategy and framework for backend and cloud-based services. Implement E2E test automation initiatives, using Cypress to ensure smooth user experiences. Perform thorough manual testing for complex workflows focusing on UX and usability details. Write and manage frontend component and unit tests using Jest and React Testing Library. Create and execute API-level test suites, covering REST endpoints and validating request/response contracts and error handling. Verify data integrity from UI interactions through the API layer down to database state. Implement continuous integration and deployment test practices (e.g., GitHub Actions, Jenkins). Collaborate with developers and DevOps to enhance test reliability and coverage. Review code and advocate for QA best practices. Anticipate quality risks and drive proactive solutions. Ensure compliance with releases through test result reporting.

Negotiation

View details