Site Reliability Engineer (Shift-working)

ABOUT CLIENT

Our client is a global technology company that specializes in providing innovative IT solutions for the financial services industry

JOB DESCRIPTION

The Senior SRE plays a vital role in overseeing the everyday operations of the organization. It is crucial for this position to have a solid understanding of various technical aspects such as production system access and control, production deployment, Amazon Web Services, Kubernetes, continuous deployment, and systems observability.
 
Key Responsibilities
Take part in on-call rotations to provide round-the-clock support for critical systems.
Address system incidents promptly and effectively
Implement changes in staging and production environments
Collaborate with Platform Engineers to comprehend the changes
Establish deployment pipeline for changes
Comprehend the changes and build observability (monitoring and alert) as per the changes
Design and execute resiliency testing solutions
Continuously improve monitoring solutions
Create and update operational runbooks
Automate operational runbooks

JOB REQUIREMENT

Technical Skills
Proficient in Amazon Web Services
Proficient in Kubernetes system
Proficient in Python or Bash scripting
Familiarity with continuous deployment tools
Familiarity with Harness is a plus
Familiarity with infrastructure as code (IaC) tools, particularly Terraform
Experience with observability solutions like Prometheus and Grafana
Familiarity with SumoLogic is a plus
 
Soft Skills
Effective communication skills, fluent in English
Strong problem-solving abilities
Self-motivated and quick learner

WHAT'S ON OFFER

Attractive salary
13th-month salary and performance bonus
Professional English course available for all employees
Comprehensive health insurance package

CONTACT

PEGASI – IT Recruitment Consultancy | Email: recruit@pegasi.com.vn | Tel: +84 28 3622 8666
We are PEGASI – IT Recruitment Consultancy in Vietnam. If you are looking for new opportunity for your career path, kindly visit our website www.pegasi.com.vn for your reference. Thank you!

Job Summary

Company Type:

Outsource

Technical Skills:

Devops, AWS, Google Cloud

Location:

Ho Chi Minh, Ha Noi - Viet Nam

Working Policy:

Hybrid

Salary:

Negotiation

Job ID:

J01150

Status:

Close

Related Job:

Platform Lead

Others - Singapore


Product

  • Backend
  • Devops
  • Data Engineering

Develop and expand distributed systems to handle large volumes of sensory, telemetry, and control data across cloud and edge environments, facilitating real-time connections for fleets of robots. Create the API Platform with a focus on high reliability, exceptional developer experience, and robust multimodal AI capabilities accessible through user-friendly APIs and SDKs. Establish extensive training and inference platforms for foundation models used in robot autonomy, teleoperation, and developer integrations. Devise data ingestion and streaming pipelines for real-time connectivity of robot fleets to the cloud, covering various data inputs such as video, LiDAR, joint states, and audio. Oversee and advance a modern cloud native infrastructure stack employing Kubernetes, Docker, and infrastructure as code tools. Ensure platform reliability through telemetry, monitoring, alerting, autoscaling, failover, and disaster recovery measures. Make infrastructure decisions pertaining to distributed storage, consensus protocols, GPU orchestration, network reliability, and API security. Foster collaboration across ML, robotics, and product teams to facilitate hardware in the loop simulation, policy rollout, continuous learning, and CI/CD workflows. Implement secure APIs featuring fine-grained access control, usage metering, rate limiting, and billing integration to accommodate a growing user base.

Negotiation

View details

Embedded Software Engineer (Chinese Speaking)

Ho Chi Minh - Viet Nam


Outsource

  • Embedded

Create, maintain, and enhance complex embedded software components as per technical and business needs. Conduct software requirement engineering by validating and analyzing customer requirements. Integrate software components and merge them into a unified build. Develop and implement test cases to verify software functionality and ensure it meets quality standards. Adhere to established software development processes and coding standards to produce reliable code for embedded systems. Use debugging and analysis tools to troubleshoot software defects and performance issues. Provide guidance to junior engineers on technical tasks, coding practices, and problem-solving. Contribute to technical reviews and knowledge-sharing sessions within the team. Ensure compliance with industry standards, regulatory requirements, and quality frameworks relevant to assigned projects.

Negotiation

View details

Senior Backend Engineer (Python/AWS)

Ho Chi Minh - Viet Nam


Outsource

  • Python

Our company, with expert teams in Berlin and Ho Chi Minh City, provides innovative software solutions for startups and leading enterprise businesses in Germany. The team in Berlin and Ho Chi Minh City collaborates to develop high-quality solutions. We are seeking a Senior Backend Engineer (Python/AWS) to join our team in Ho Chi Minh City. This role is ideal for team players interested in building an international career as a product builder as well as a coder. Developing and maintaining scalable backend services using Python for a live product Designing and implementing robust RESTful APIs and backend systems following industry best practices Leading the design and development of cloud-native backend solutions on AWS (e.g., ECS, SQS, SNS) Driving the architecture and scalability of backend systems for reliability, performance, and maintainability Defining and enforcing coding standards, testing strategies, and best practices across the backend codebase Implementing and overseeing observability practices, including monitoring, logging, and alerting Collaborating closely with frontend engineers, QA, DevOps, and product stakeholders to deliver high-quality solutions Conducting code reviews, technical design reviews, and architectural discussions Leading troubleshooting and root-cause analysis of complex production issues Mentoring junior and mid-level engineers and supporting their technical growth Contributing to technical documentation, system design documentation, and knowledge sharing Staying up to date with emerging technologies and driving technical innovation within the team

Negotiation

View details