AWS DevOps Lead

ABOUT CLIENT

Our client is using new technology to develop products for the banking industry.

JOB DESCRIPTION

Work with various teams to collect requirements and create scalable and sustainable software solutions.
Create integration solutions to facilitate smooth communication between microservices, APIs, and external systems.
Contribute to the development of continuous delivery, automation frameworks, and pipelines to enhance the developer and customer experience.
Improve database interactions and maintain data integrity across distributed systems.
Establish best practices for messaging, integration, and data pipeline architectures.
Identify and implement enhancements for automation processes and tools.
Acquire new skills and support the adoption of a continuous delivery and cloud-first approach.

JOB REQUIREMENTS

Minimum 7 years of backend development experience using Python or Java, including at least 2 years in a lead role.
Proficiency in Apache Kafka, including Kafka Connect, Schema Registry, and related components.
Expertise in building and deploying microservice and event-driven architectures, distributed systems, event sourcing, and CQRS patterns.
Experience with AWS foundation services such as VPC, ECS, Lambda, RDS, SNS, SQS, and EventBridge.
Hands-on experience with tools like Kafka Connectors and Debezium.
Strong experience with application integration patterns, RESTful APIs, and messaging protocols.
Ability to conduct hands-on troubleshooting and optimization of the platform, collaborating closely with team members.
Capability to design scalable systems and multi-country patterns for platforms.
Familiarity with AWS CloudFormation, Terraform, or CDK for infrastructure provisioning.
A focus on automation and the ability to develop tooling for enhancing the efficiency of repeatable tasks, reliability, and performance.
Understanding of cloud change management practices, compliance, and security standards.
Strong English language skills for effective communication and coordination with business partners and technical teams.
Strong logical thinking and problem-solving abilities.
Curiosity and a self-learning attitude are highly desirable.
Big Plus:
AWS Certification in DevOps, SysOps, or Advanced Networking Specialty.

WHAT'S ON OFFER

Company offers meal and parking benefits.
Full salary and benefits during the probationary period.
Insurance coverage as per Vietnamese labor law and premium health care for employees and their families.
Work environment is values-driven, international, and agile in nature.
Opportunities for overseas travel related to training and work.
Participation in internal hackathons and company events such as team building, coffee runs, and blue card activities.
Additional benefits include a 13th-month salary and performance bonuses.
Employees receive 15 days of annual leave and 3 days of sick leave per year.
Work-life balance with a 40-hour workweek from Monday to Friday.

CONTACT

PEGASI – IT Recruitment Consultancy | Email: recruit@pegasi.com.vn | Tel: +84 28 3622 8666
We are PEGASI – IT Recruitment Consultancy in Vietnam. If you are looking for a new opportunity in your career, please visit our website at www.pegasi.com.vn. Thank you!

Job Summary

Company Type: Offshore
Technical Skills: DevOps, AWS
Location: Ho Chi Minh - Viet Nam
Working Policy: Hybrid
Salary: Negotiable
Job ID: J01325
Status: Active
