Senior System Software Engineer - AI Data Platform - Inference Factory

ABOUT CLIENT

Our client is a leading technology company specializing in graphics processing units (GPUs) and artificial intelligence (AI).

JOB DESCRIPTION

Create infrastructure and tools to automate complex software processes effectively.
Improve performance: Deploy advanced test harnesses, benchmarking frameworks, and analytical tools to thoroughly evaluate and enhance the performance and efficiency of software and hardware platforms.
Utilize expertise in operating systems, kernel internals, device drivers, memory management, storage, networking, and high-speed interconnects to construct and troubleshoot high-performance systems.
Collaborate with engineering teams to understand requirements and deliver efficient solutions.
Establish performance objectives, assess feedback, analyze data, and continually enhance system reliability.
Shape technical strategies: Contribute to developing technical strategies and roadmaps for platform automation initiatives to ensure they are in line with company goals and industry best practices.

JOB REQUIREMENTS

Required: Bachelor's or Master's degree in Computer Science, Computer Engineering, or a related technical field, or equivalent experience.
Minimum 5 years of industry experience in software development, focusing on infrastructure, distributed systems, automation, and/or performance engineering.
Proficiency in System-Level Programming: Proven ability to develop robust tools and automation using programming languages such as C++, Python, or Go.
Thorough Understanding of System Software: Experience with operating system internals, device drivers, memory management, and debugging performance issues in complex compute applications.
Distributed Systems Expertise: Experience in designing, building, and operating large-scale distributed systems, with knowledge of networking protocols, cluster management, and high-performance interconnects.
Automation and CI/CD Proficiency: Experience building and maintaining automated testing, benchmarking, and continuous integration/continuous deployment pipelines.
Strong Problem-Solving and Analytical Skills: Outstanding analytical, problem-solving, and debugging skills, with a track record of resolving complex technical challenges.
Collaboration and Communication Skills: Excellent interpersonal and communication skills, with the ability to articulate complex technical concepts to diverse audiences and collaborate effectively across teams.
Preferred qualifications
Experience optimizing performance for AI/Machine Learning workloads, especially inference applications, on diverse hardware platforms.
Prior experience building or contributing to large-scale compute infrastructure solutions in cloud environments or on-premises data centers.
Familiarity with containerization and orchestration technologies, such as Docker and Kubernetes.
Knowledge of performance profiling tools and methodologies for hardware and software systems.
Track record of driving significant efficiency gains or architectural improvements in large-scale systems.

WHAT'S ON OFFER

CONTACT

PEGASI – IT Recruitment Consultancy | Email: recruit@pegasi.com.vn | Tel: +84 28 3622 8666
We are PEGASI – IT Recruitment Consultancy in Vietnam. If you are looking for a new opportunity in your career, please visit our website at www.pegasi.com.vn. Thank you!

Job Summary

Company Type: Product
Technical Skills: DevOps, C/C++, Python, Golang
Location: Ho Chi Minh - Viet Nam
Working Policy: Hybrid
Salary: Negotiable
Job ID: J02058
Status: Active
