Site Reliability Engineer

JOB DESCRIPTION

Maintain systems and troubleshoot system issues.
Identifying bottleneck in various Java applications and implement performance improvements.
Identify and analyze user requirements.
Prioritize, assign, and execute tasks throughout the software development life cycle.
Develop, configure, and deploy tools for cloud-based systems and services.
Containerize new and legacy applications.
Maintain awareness of new and emerging technologies.
Support development and operations teams.
Enhance, modify or debug developer code as needed.

JOB REQUIREMENT

Must-have
Understanding of an object-orientated language, preferably the latest version of Java (with experience in Hibernate, Multi-thread, Spring Boot)
Experience in configuration, in Jenkins for CI/CD pipeline creation, automation scripts and Kubernetes implementation with Google.
Proficiency in supporting a 24×7 critical operation.
Experience in a cloud computing platform and associated automation patterns it provides, preferably GCP.
Proficient in production systems design including High Availability, Disaster Recovery, Performance, Efficiency, and Security user, application performance, system, log, time-series, and dashboarding.
Familiarity with Open-Source concepts and tools like Prometheus, Grafana, ELK etc. 
Proficient in a modern infrastructure automation toolkit such as Terraform/Helm
Proficient in a Linux or Unix based environment.
Experience in destructive testing methodologies and tools such as chaos monkey
Experience in defensive coding practices and patterns for high availability
Nice-to-have
Experience in a cloud computing platform and associated automation patterns it provides, preferably GCP
Proficient in a modern scripting language like GO or Python
Knowledge of APM fundamentals or experience in tools like New Relic or AppDynamics.

WHAT'S ON OFFER

Open to deal base salary with additional project allowances.
Full salary during probation & Full coverage of social insurance.
Performance & salary review: twice a year
Monthly childcare support.
Premium Healthcare insurance and Health check-up services for employee and family ones.
15 Annual Leaves plus 10 days for Bereavement leave and 1.5 months for Paternity leave.
Premium package at top Gym service provider.
Diverse internal activities: Football, Billiards, Badminton, E-sport clubs & other regular company events.
Frequent opportunities to travel to US headquarter from 3-6 months.
Free parking for motorbike and car

CONTACT

PEGASI – IT Recruitment Consultancy | Email: recruit@pegasi.com.vn | Tel: +84 28 3622 8666
We are PEGASI – IT Recruitment Consultancy in Vietnam. If you are looking for new opportunity for your career path, kindly visit our website www.pegasi.com.vn for your reference. Thank you!

Job Summary

Company Type:

Information Technology & Services

Technical Skills:

Devops, Java

Location:

Ho Chi Minh, Da Nang - Viet Nam

Working Policy:

Salary:

Negotiation

Job ID:

J01196

Status:

Close

Related Job:

AI Research Engineer

Ho Chi Minh - Viet Nam


Product

  • Machine Learning
  • Python

Focus on improving model usability for users Conduct training runs and AI experiments Analyze results and make necessary changes Collaborate with the product engineering team to implement improvements Implement and improve upon recent RL techniques like GRPO, DPO, RePO, etc. Create and manage adaptable, expandable training codebases Establish and maintain efficient data pipelines, including both synthetic and real data Ensure training jobs are capable of scaling across multiple GPUs and nodes, such as FSDP, DDP, NCCL Maintain code health over the long term by writing clean, testable, and reproducible code Contribute to the enhancement of open source dependencies (Optional) Publish papers and present research findings

Negotiation

View details

Product Quality Engineer

Ho Chi Minh - Viet Nam


Product

  • Automation Test
  • Devops

Develop and implement thorough testing strategies across web, API platform, desktop, and mobile platforms Create automated test suites and Auto QA agents for continuous releases, model updates, and API integrations Manage build and CI/CD pipelines to ensure product functionality across major operating systems Verify compatibility between web clients and different server node versions, including upgrade paths and backwards compatibility testing Validate resource management and performance optimizations across various hardware configurations, including GPU acceleration Engage in the Discord community and GitHub Issues to translate feedback into practical test cases Oversee release cycles, prioritize bugs, and provide timely alerts Generate user-friendly documentation to help users resolve issues

Negotiation

View details

Android Software Engineer (Senior)

Ho Chi Minh - Viet Nam


Product

  • Android
  • Kotlin
  • Java

We are looking for experienced Android Software Engineers to join our team. As an Engineer, you will be responsible for developing scalable and high-performance applications with a focus on performance optimization and clean architecture. Your responsibilities will include working on implementing Android (Kotlin) and Flutter components of our SaaS product, leading the entire mobile software lifecycle, from prototyping to post-launch support, and producing clean, well-tested, and maintainable code that aligns with our cross-platform architecture and performance objectives. In addition, you will be expected to participate in design and code reviews, contribute to cross-functional discussions, and collaborate with Product, Design, and Backend teams to deliver quality Android applications. For Senior candidates, there will be additional opportunities to propose architecture and design reviews, set high technical standards, optimize engineering processes, and mentor junior/middle engineers to foster their growth and technical skill development.

Negotiation

View details