Site Reliability Engineer

ABOUT CLIENT

Our client is a leading digital and technology company specializing in the food industry. They are constantly innovating and leveraging technology to enhance the customer experience. With their robust online ordering system and cutting-edge digital marketing strategies. Our client is at the forefront of revolutionizing the way people enjoy their favorite pizza.

JOB DESCRIPTION

We are looking for a Site Reliability Engineer to join our evolving Incident Management team. As part of this role, you will be responsible for establishing frameworks and best practices while transitioning Incident Management into a Site Reliability Engineering team. In addition, you will work closely with other teams to design, deploy, and maintain cloud native applications and infrastructure, and provide support through investigation, analysis, technical resolution, and post-mortem actions.

JOB REQUIREMENT

2-3 years of experience in a professional technical role with multi-cloud experience (GCP, AWS, Azure) is preferred
Familiarity with Infrastructure as Code methodologies, particularly Terraform
Proficiency in CI/CD best practices and methodologies, specifically GitLab
Fluent in written and spoken English
Familiarity with Cost Optimization and best practices
Strong collaboration skills with multiple engineering functions, business leaders, and vendors
Strong teamwork and communication skills, both verbal and written, along with troubleshooting and analytical abilities
Knowledge of current trends in large-scale infrastructure environments
Shifts and hybrid working hours (required to be at the office three times per week)
Preferred:
Experience in e-Commerce
Experience working in an international company
Experience in Monitoring and Observability platforms (e.g., Datadog, Splunk, New Relic, Prometheus)

WHAT'S ON OFFER

During the probation period, employees receive their full salary
Employees are entitled to 18 days of annual leave per year
Five additional "Recharge Days" are available in addition to company holidays
Employees enjoy flexible Friday afternoons
Full salary insurance is provided
Employees receive a 13th-month bonus
A gift and one day off are provided for birthdays
Advanced health insurance (Generali) is offered
Regular engagement activities such as sport clubs, monthly company lunches, and internal events are organized
Macbook and Monitor support is provided

CONTACT

PEGASI – IT Recruitment Consultancy | Email: recruit@pegasi.com.vn | Tel: +84 28 3622 8666
We are PEGASI – IT Recruitment Consultancy in Vietnam. If you are looking for new opportunity for your career path, kindly visit our website www.pegasi.com.vn for your reference. Thank you!

Job Summary

Company Type:

product

Technical Skills:

Devops

Location:

Ho Chi Minh - Viet Nam

Salary:

Negotiation

Job ID:

J01634

Status:

Close

Related Job:

Senior DevOps (Data Platform)

Ho Chi Minh - Viet Nam


Digital Bank, Product

  • Devops
  • Spark

Managing workloads on EC2 clusters using DataBricks/EMR for efficient data processing Collaborating with stakeholders to implement a Data Mesh architecture for multiple closely related enterprise entities Utilizing Infrastructure as Code (IaC) tools for defining and managing data platform user access Implementing role-based access control (RBAC) mechanisms to enforce least privilege principles Collaborating with cross-functional teams to design, implement, and optimize data pipelines and workflows Utilizing distributed engines such as Spark for efficient data processing and analysis Establishing operational best practices for data warehousing tools Managing storage technologies to meet business requirements Troubleshooting and resolving platform-related issues Staying updated on emerging technologies and industry trends Documenting processes, configurations, and changes for comprehensive system documentation.

Negotiation

View details

Senior Machine Learning Engineer

Ho Chi Minh, Ha Noi - Viet Nam


Information Technology & Services

  • Machine Learning

Creating the V1 Evaluation Platform: You will be responsible for designing and building the core backend systems for our new LLM Evaluation Platform, using Arize Phoenix as the basis for traces, evaluations, and experiments. Implementing Production Observability: You will need to architect and implement the observability backbone for our AI services by integrating Phoenix with OpenTelemetry to establish a centralized system for logging, tracing, and evaluating LLM behavior in production. Standardizing LLM Deployment Pipeline: You will be in charge of designing and implementing the CI/CD framework for versioning, testing, and deploying prompt-based logic and LLM configurations, ensuring reproducible and auditable deployments across all AI features. Providing Practical Solutions: Your role will involve making pragmatic technical decisions that prioritize business value and speed of delivery, in line with our early-stage startup environment. Collaborating with Other Teams: You will work closely with the Data Science team to understand their workflow and ensure that the platform you build meets their core needs for experiment tracking and validation. Establishing Core Patterns: You will also help in establishing and documenting the initial technical patterns for MLOps and model evaluation that will serve as the foundation for future development.

Negotiation

View details

Fullstack Engineer - BRAIN

Ho Chi Minh - Viet Nam


product, Investment Management

  • Frontend
  • Backend

Create intricate single page applications. Construct components that can be used across various interfaces. Design layouts that are responsive for both desktop and mobile devices. Automate the testing procedures for the user interface. Develop services and APIs for backend applications. Incorporate AWS and external cloud services. Enhance application speed and scalability. Actively contribute to an agile engineering team focused on continual improvement. Utilize leading open-source technologies like MySQL, PostgreSQL, ELK stack, Sentry, Redis, Git, etc. Take part in periodic on-call responsibilities.

Negotiation

View details