Senior System Software Engineer - AI Data Platform - Inference Factory

ABOUT CLIENT

Our client is a leading technology company specializing in graphics processing units (GPUs) and artificial intelligence (AI).

JOB DESCRIPTION

Create infrastructure and tools to automate complex software processes effectively.
Improve performance: Deploy advanced test harnesses, benchmarking frameworks, and analytical tools to thoroughly evaluate and enhance the performance and efficiency of software and hardware platforms.
Utilize expertise in operating systems, kernel internals, device drivers, memory management, storage, networking, and high-speed interconnects to construct and troubleshoot high-performance systems.
Collaborate with engineering teams to comprehend requirements and deliver efficient solutions.
Establish performance objectives, assess feedback, analyze data, and continually enhance system reliability.
Shape technical strategies: Contribute to developing technical strategies and roadmaps for platform automation initiatives to ensure they are in line with company goals and industry best practices.

JOB REQUIREMENT

Required: Bachelor's or equivalent experience in Computer Science, Computer Engineering, or a related technical field, or Master's degree or equivalent experience in a similar field.
Minimum 5 years of industry experience in software development, focusing on infrastructure, distributed systems, automation, and/or performance engineering.
Proficiency in System-Level Programming: Proven ability to develop robust tools and automation using programming languages such as C++, Python, or Go.
Thorough Understanding of System Software: Experience with operating system internals, device drivers, memory management, and debugging performance issues in complex compute applications.
Distributed Systems Expertise: Experience in designing, building, and operating large-scale distributed systems, with knowledge of networking protocols, cluster management, and high-performance interconnects.
Automation and CI/CD Proficiency: Experience building and maintaining automated testing, benchmarking, and continuous integration/continuous deployment pipelines.
Strong Problem-Solving and Analytical Skills: Outstanding analytical, problem-solving, and debugging skills, with a track record of resolving complex technical challenges.
Collaboration and Communication Skills: Excellent interpersonal and communication skills, with the ability to articulate complex technical concepts to diverse audiences and collaborate effectively across teams.
Preferred qualifications
Experience optimizing performance for AI/Machine Learning workloads, especially inference applications, on diverse hardware platforms.
Prior experience building or contributing to large-scale compute infrastructure solutions in cloud environments or on-premises data centers.
Familiarity with containerization and orchestration technologies, such as Docker and Kubernetes.
Knowledge of performance profiling tools and methodologies for hardware and software systems.
Track record of driving significant efficiency gains or architectural improvements in large-scale systems.

WHAT'S ON OFFER

CONTACT

PEGASI – IT Recruitment Consultancy | Email: recruit@pegasi.com.vn | Tel: +84 28 3622 8666
We are PEGASI – IT Recruitment Consultancy in Vietnam. If you are looking for new opportunity for your career path, kindly visit our website www.pegasi.com.vn for your reference. Thank you!

Job Summary

Company Type:

Product

Technical Skills:

Devops, C/C++, Python, Golang

Location:

Ho Chi Minh - Viet Nam

Working Policy:

Hybrid

Job ID:

J02058

Status:

Close

Related Job:

Head of Engineer - Tech Fraud & Scams VN

Ho Chi Minh


Product

Translate the Customer Onboarding and Mastery, Financial Crime and Fraud's strategic ambitions into an integrated roadmap for strategic execution, and drive this from shaping through to delivery Lead multiple engineering teams across Customer Onboarding and Mastery, Financial Crime and Fraud Domains to drive outcomes - hence Domain knowledge of these areas is desirable. Work closely with the business teams, product owners to validate requirements before and after delivery through showcases and Day 2 production monitoring Own not just the build, but the runtime of applications in production through active operational support, clearly defined support model with engineers proficient in site reliability engineering Own and lead the efforts of cyber security updates such as keeping software currency versions up to date, patch infrastructure every sprint Oversee investment delivery across CET to maintain alignment between Domains, ensure investment is spent effectively, and provide insights on effectiveness and prioritisation of spend

Negotiation

View details

Head of Engineer - CET

Ho Chi Minh - Viet Nam


Product

Translate the Customer Onboarding and Mastery, Financial Crime and Fraud's strategic ambitions into an integrated roadmap for strategic execution, and drive this from shaping through to delivery Lead multiple engineering teams across Customer Onboarding and Mastery, Financial Crime and Fraud Domains to drive outcomes - hence Domain knowledge of these areas is desirable. Work closely with the business teams, product owners to validate requirements before and after delivery through showcases and Day 2 production monitoring Own not just the build, but the runtime of applications in production through active operational support, clearly defined support model with engineers proficient in site reliability engineering Own and lead the efforts of cyber security updates such as keeping software currency versions up to date, patch infrastructure every sprint Oversee investment delivery across CET to maintain alignment between Domains, ensure investment is spent effectively, and provide insights on effectiveness and prioritisation of spend

Negotiation

View details

Head of Engineer - Home Ownership

Ho Chi Minh - Viet Nam


Product

Provide technical leadership for the Sub-Domain and are accountable for on-time delivery of Software Development Life Cycle Epics and Features. Lead, coach and mentor technology resources to uplift skills/knowledge to perform their role and to build a high-performing team. Drive technical delivery with a focus on improving speed, cost and quality of outcomes, ensuring the squads are aligned on objective and outcomes. Support and drive lean portfolio management across the Domain through linking business roadmaps with software delivery. Manage cross-functional, agile teams across business units to achieve performance targets. Accountable for removing Technical or delivery impediments that can't be resolved at squad level. Cost Management-Oversee financials to ensure achievement of plan and drive CI/CD. Responsible for developing market-leading capacity; the strategic deployment of the departmental resources leads to optimal resource allocation and successful product development.

Negotiation

View details