Site Reliability Engineer

JOB DESCRIPTION

As a Senior SRE Engineer in the DF squad, you will be responsible for the deployment, configuration, and maintenance of the data platform, focusing on key areas such as database management, data processing tools, ETL frameworks, and AWS-hosted applications. Your key duties will include:
Deploying and maintaining batch and stream jobs on ETL frameworks, metadata management tools, BI platforms, and other data solutions.
Collaborating with the Data Platform Software Engineers and Infrastructure/Platform squads to maintain the overall MS1 infrastructure.
Ensuring smooth operations of cloud-based and on-premise data services, guaranteeing scalability, high availability (HA), disaster recovery (DR), and adherence to Service Level Objectives (SLOs).
Key Responsibilities
Administering AWS resources including IAM, ECS, EKS, Lambda, RDS, and CloudWatch, ensuring compliance with security best practices.
Managing and deploying containerized applications/services, with a strong emphasis on Kubernetes operations.
Using Infrastructure-as-Code tools (e.g., Terraform, Scalr) to automate infrastructure deployment and configuration.
Proficiently configuring and maintaining CI/CD pipelines, with experience in ArgoCD considered a plus.
Designing and maintaining robust cloud architecture with a strong understanding of cloud security, SSO, and authentication mechanisms (Auth0, AzureAD, Okta, SAML/OIDC, OAuth).

JOB REQUIREMENT

Minimum 7 years of experience as an SRE Engineer, with exposure to data platform solutions being an advantage.
Extensive experience with AWS, including IAM, ECS, EKS, Lambda, and CloudWatch.
Expertise in deploying and managing containerized services, especially on Kubernetes.
Hands-on experience with Infrastructure-as-Code and automation tools like Terraform or Scalr.
Strong knowledge of cloud architecture, with a focus on maintaining service SLAs and ensuring high availability.
Experience with cloud security practices, SSO solutions, and authentication protocols (e.g., Auth0, SAML/OIDC, OAuth).
Familiarity with deploying and maintaining data processing frameworks and ML platforms such as Airflow, Airbyte, Superset, Metabase, Databricks, Snowflake, MLflow, etc., is a strong advantage.
Preferred Skills
Experience with Google Cloud Platform (GCP) or Microsoft Azure.
Familiarity with CI/CD tools such as ArgoCD.

WHAT'S ON OFFER

Why join with us
We build a professional & fun working environment.
We focus on your growth, yes the long-term growth.
We develop the future-ready digital bank platform.
Benefits
Competitive salary and bonus.
Opportunities for your professional growth in fintech, especially in digital banking
Social insurance (max), premium medical insurance
Parking allowance, snacks, and coffee.
Monthly team-building and social activities.

CONTACT

PEGASI – IT Recruitment Consultancy | Email: recruit@pegasi.com.vn | Tel: +84 28 3622 8666
We are PEGASI – IT Recruitment Consultancy in Vietnam. If you are looking for new opportunity for your career path, kindly visit our website www.pegasi.com.vn for your reference. Thank you!

Job Summary

Company Type:

Information Technology & Services

Technical Skills:

System, Devops

Location:

Ho Chi Minh, Ha Noi - Viet Nam

Salary:

Negotiation

Job ID:

J00771

Status:

Active