Site Reliability Engineer (Shift-working)
JOB DESCRIPTION
Senior SRE ensures smooth day-to-day operations of the Bank. Understanding of production system access and control, production deployment, Amazon Web Services, Kubernetes, continuous deployment and systems observability is essential for this role.
Key Responsibilities
Participate in on-call rotations to provide 24/7 support for critical systems.
Resolve system incident when occurs
Deployment of changes into staging and production environments
Work with Platform Engineers to understand the changes
Develop deployment pipeline for changes
Understand the changes and develop observability (monitoring and alert) according to the changes
Develop and conduct resiliency testing solution
Continuous enhancement of monitoring solution
Create and update operation runbooks
Automate operation runbooks
JOB REQUIREMENT
Technical Skill
Strong experience with Amazon Web Services
Strong experience and understanding of Kubernetes system
Scripting skills with Python or Bash
Experience in continuous deployment tools
Harness (good to have)
Experience in infrastructure as code (IaC) tools
Terraform
Experience with observability solutions
Prometheus & Grafana
SumoLogic (good to have)
Soft Skills
Good in communication and able to communicate fluently in English
Good problem solving skill
Self-motivated and able to learn fast
WHAT'S ON OFFER
Competitive salary
13th-month salary guarantee
Performance bonus
Professional English course for employees
Premium health insurance
CONTACT
PEGASI – IT Recruitment Consultancy | Email: recruit@pegasi.com.vn | Tel: +84 28 3622 8666
We are PEGASI – IT Recruitment Consultancy in Vietnam. If you are looking for new opportunity for your career path, kindly visit our website www.pegasi.com.vn for your reference. Thank you!
Job Summary
Company Type:
Information Technology & Services
Technical Skills:
Devops, AWS, Google Cloud
Location:
Ho Chi Minh, Ha Noi, Others - Viet Nam
Salary:
Negotiation
Job ID:
J01150
Status:
Active