SRE Lead/Manager (DevOps, AWS)

JOB DESCRIPTION

As a Support Site Reliability Engineer (SRE) leader, you will lead our efforts in establishing a support SRE team that works closely with The Company's Product SRE to increase productivity. The ideal candidate will utilize leadership and technical skills to streamline operational tasks affecting Product SRE team efficiency through collaboration with SRE teams located in Japan and Vietnam
Design and execute the Support SRE team's strategic roadmap.
Collaborate with The Company's Product SRE teams to identify opportunities for improving operational efficiency and reducing toil.
Mentor and coach team members to foster their growth and development in technical and collaboration areas.
Drive a culture of continuous improvement and knowledge sharing within the team.
Design and implement automation solutions to standardize operational tasks, reducing manual effort and improving efficiency.
Develop and maintain tools, scripts and processes to automate routine operational tasks.
Build, maintain, and improve our infrastructure, including monitoring, diagnosing, and resolving incidents promptly.
Participate in incident response, on-call rotations, and post-mortem analysis.

JOB REQUIREMENT

At least 5 years experience as a DevOps Engineer (Experience on on-premises environments being a plus) or similar.
3+ years of hands-on experience with AWS or other cloud platforms. Experience with managed AWS services is a plus.
Solid understanding of CI/CD pipelines and best practices.
Working understanding of containerization technologies (Docker and Kubernetes).
Experience with monitoring and logging solutions.
Proficiency with IaC (e.g., Terraform).
Deep understanding and hands-on experience with MySQL or similar relational databases.
Proven track record in training and educating team members, promoting a culture of continuous learning.
Strong ownership and responsibility, with a proactive and solutions-oriented mindset.
Experience in developing and operating web applications built in Go or Ruby is a plus.
Project management experience.
English language proficiency at a professional working level.
People management or team leadership experience is a plus.

WHAT'S ON OFFER

Caring Mental & Physical Recreation:
Hybrid working: 2 days at the office and 3 days WFH
Working hour: Flexible start 8AM-9AM from Mon-Fri
Full salary in probation
Insurance: Applied from Probation period:
Social Insurance, Health Insurance, Unemployment Insurance (on 100% salary)
Private health insurance & accident insurance. From Managing level: extra for family members
Bonus: 13th month salary
17 - 24 paid days off and more
Paternity leave: Extra 5 days
Annual company trip; Quarterly team building
Billiards & Running club
Annual health check
Well-equipped facility: Macbook pro, additional monitor, ..
Caring Career & Development:
Clear Career path
Foreign language & International technology-related certifications sponsoring
External & internal training courses
Soft-skill workshops
Tech seminars
Monthly and biannual Recognition Awards
Performance & salary review: twice/year (Jun & Dec)

CONTACT

PEGASI – IT Recruitment Consultancy | Email: recruit@pegasi.com.vn | Tel: +84 28 3622 8666
We are PEGASI – IT Recruitment Consultancy in Vietnam. If you are looking for new opportunity for your career path, kindly visit our website www.pegasi.com.vn for your reference. Thank you!

Job Summary

Company Type:

Product, Fintech

Technical Skills:

Devops, AWS

Location:

Ha Noi - Viet Nam

Salary:

Negotiation

Job ID:

J01508

Status:

Close

Related Job:

Tech Lead Software Developer (Delphi, Oracle PL-SQL)

Ho Chi Minh - Viet Nam


Global Software Delivery Centers

  • Delphi

Supervising development teams in a local management role, reporting to the Software Engineering Manager in Europe Setting targets and offering guidance to local teams Ensuring quality in team development Participating in sprint planning and retrospective meetings Assigning and delivering development tasks as per sprint planning Estimating complexity and workload Selecting the most suitable technical solution to meet user requirements Designing, developing, and implementing changes to the LIMS in line with customer and business user needs Collaborating with other team members to support the LIMS Working with other team members (Engineers/QA) to assure high-quality solutions Implementing and enforcing good practices and high-quality standards

Negotiation

View details

Senior DevOps (Data Platform)

Ho Chi Minh - Viet Nam


Digital Bank, Product

  • Devops
  • Spark

Managing workloads on EC2 clusters using DataBricks/EMR for efficient data processing Collaborating with stakeholders to implement a Data Mesh architecture for multiple closely related enterprise entities Utilizing Infrastructure as Code (IaC) tools for defining and managing data platform user access Implementing role-based access control (RBAC) mechanisms to enforce least privilege principles Collaborating with cross-functional teams to design, implement, and optimize data pipelines and workflows Utilizing distributed engines such as Spark for efficient data processing and analysis Establishing operational best practices for data warehousing tools Managing storage technologies to meet business requirements Troubleshooting and resolving platform-related issues Staying updated on emerging technologies and industry trends Documenting processes, configurations, and changes for comprehensive system documentation.

Negotiation

View details

Lead Engineer (Power Platform)

Ho Chi Minh - Viet Nam


IT Service Provider

  • Power Platform

As the lead engineer, the role involves using generative AI technologies in conjunction with Microsoft Power Platform services to solve business and development challenges for clients. The primary responsibilities include project acquisition and management, as well as ensuring efficient communication with stakeholders during all project phases to meet project goals and ensure client satisfaction. The role also involves leading projects using tools such as Azure OpenAI, Microsoft Copilot, Microsoft 365, and Power Platform. This includes supporting the implementation of generative AI and Power Platform, prototype development, business application development, business process automation, RAG system utilization, and ongoing operation and maintenance. Additionally, a key part of the role is fostering effective communication with internal and external stakeholders to ensure successful project execution.

Negotiation

View details