Middle DevOps Engineer (SRE, Kubernetes)

ABOUT CLIENT

Our client is focusing on providing end-to-end solutions

JOB DESCRIPTION

We are looking for a capable DevOps/SRE/Platform Engineer to join our team and contribute to the development of robust systems. This role offers the opportunity to have a significant impact on our product and address stimulating technical challenges in a supportive and collaborative setting.
Manage infrastructure and applications on the cloud
Address performance and security issues with guidance
Streamline routine tasks and contribute to documentation
Work alongside development teams for deployments and troubleshooting
Participate in on-call rotations and incident response efforts

JOB REQUIREMENT

Must have 3+ years of experience in DevOps, SRE, or Platform Engineering roles
Must have a strong understanding of Kubernetes fundamentals and experience deploying and managing applications on Kubernetes. CKA certification is preferred.
Must have experience with Infrastructure as Code (IaC) using tools like Terraform or similar, and practical experience writing and managing infrastructure configurations
Proficiency in coding with Python or Go (preferred) for basic automation tasks is required
Experience with CI/CD pipelines and related tooling (e.g., Jenkins, GitLab CI) is necessary. Understanding of CI/CD principles and practical experience building and maintaining pipelines is a must
Familiarity with monitoring and logging systems (e.g., Prometheus, Grafana, ELK stack) is expected
Strong English reading and writing skills are required, with an interest in working in an international environment with clients from various regions.

WHAT'S ON OFFER

Generous compensation package including a 13th-month bonus and performance incentives
Overtime pay
Monday to Friday work schedule, with 12 annual leave days and 11 public holidays
Chance to improve English skills by engaging with international clients and products
Regular salary reviews based on performance
Monthly team-building exercises
Opportunities for company trips
Full employee insurance as mandated by law
Yearly health check-ups
Access to personal development courses, covering technical and non-technical programs
Hands-on experience with advanced technologies
Complimentary snacks, coffee, tea, and cakes provided

CONTACT

PEGASI – IT Recruitment Consultancy | Email: recruit@pegasi.com.vn | Tel: +84 28 3622 8666
We are PEGASI – IT Recruitment Consultancy in Vietnam. If you are looking for new opportunity for your career path, kindly visit our website www.pegasi.com.vn for your reference. Thank you!

Job Summary

Company Type:

Outsource

Technical Skills:

Devops

Location:

Ho Chi Minh - Viet Nam

Working Policy:

Onsite

Salary:

Negotiation

Job ID:

J01549

Status:

Close

Related Job:

Partner Implementation Engineer (Security & Digital Trust)

Ha Noi - Viet Nam


Outsource

Đóng vai trò là người thực hiện triển khai chủ chốt, chịu trách nhiệm triển khai, cấu hình và tích hợp các giải pháp Security & Digital Trust (PKI, Chữ ký số, Mã hóa, MFA) vào hệ thống thực tế của khách hàng, đảm bảo hệ thống vận hành ổn định, bảo mật và đúng thiết kế. Triển khai hệ thống (Implementation) Chuẩn bị môi trường: kiểm tra hạ tầng (Server, Hệ điều hành, Cơ sở dữ liệu, Mạng) Cài đặt & cấu hình giải pháp: PKI / CA / Chữ ký số / MFA / Mã hóa Thiết lập chính sách bảo mật, quy trình nghiệp vụ Kết nối với thiết bị bảo mật (HSM, Quản lý Khóa) Triển khai trên nền tảng Cloud / Container (nếu có) Triển khai hệ thống trên Kubernetes / OpenShift Cấu hình tài nguyên (YAML: Pod, Dịch vụ, Ingress, Bản đồ Cấu hình, Bí mật) Thiết lập lưu trữ (Khối Lưu trữ Không gian); mạng nội bộ Áp dụng các chính sách bảo mật cho container Tích hợp hệ thống (Integration) Hỗ trợ tích hợp với: Trang web/ Ứng dụng/ Giao diện lập trình ứng dụng và IAM / SSO / AD / LDAP Hướng dẫn sử dụng API/SDK Kiểm tra luồng dữ liệu & bảo mật giao tiếp Phối hợp với nhóm khách hàng (Phát triển / Cơ sở hạ tầng / Bảo mật) Kiểm thử & nghiệm thu (QA/UAT) Thực hiện kiểm thử kỹ thuật & kịch bản vận hành Hỗ trợ UAT với khách hàng Kiểm tra tính đúng đắn của: Chữ ký số; Chứng thư và Luồng xác thực Vận hành & hỗ trợ Giám sát hệ thống, phân tích log, xử lý sự cố Hỗ trợ sau triển khai (L2/L3) Đảm bảo hệ thống hoạt động ổn định & HA Tài liệu & chuyển giao Xây dựng tài liệu triển khai (cấu trúc, cấu hình) Hướng dẫn vận hành cho khách hàng Đào tạo kỹ thuật cơ bản

Negotiation

View details

AI Product Builder

Ha Noi - Viet Nam


Product

  • AI
  • Backend
  • Frontend
  • Devops
  • Java
  • Golang
  • Product Management

Collaborate with domain experts to develop business requirements and constraints for designing prompt AI-assisted workflows and system specifications. Utilize AI tools, no-code/low-code, and coding to rapidly prototype UI/UX mockups and foundational implementations. Test prototypes through hypothesis validation cycles and provide detailed handovers to engineering teams. Decode legacy specifications and enhance existing products with AI-assisted analysis and implementation. Constantly enhance the product team's building-tooling, templates, and practices to adapt to changes in models and platforms.

Negotiation

View details

DevOps Engineer

Others - Viet Nam


Product

  • Devops
  • Kubernetes
  • Network

Managing and developing our Kubernetes platform across multiple clusters and environments including production, development, on-premises and public cloud. Designing and overseeing hybrid cloud infrastructure across on-premises and public clouds (such as GCP, AWS), including workload placement, cross-cloud networking, and unified resource management. Taking responsibility for the end-to-end CI/CD and GitOps process, including container build pipelines, image optimization, and progressive delivery using tools like ArgoCD/FluxCD. Taking charge of the observability stack to provide a comprehensive view across all clusters using tools like Grafana, Mimir, Tempo, Loki, Pyroscope, OnCall, Prometheus, and supporting agent-assisted SRE workflows. Managing and enhancing our inference platform, including vLLM serving and AIBrix for multi-model orchestration and autoscaling with a fleet of NVIDIA GPUs. Operating platform services such as Kafka, Redis, PostgreSQL, OpenSearch. Managing identity and access management with Keycloak integrated with Google Workspace, strengthening SSO, RBAC, and secrets management across the platform. Strengthening network security across private load balancers, firewalls, and VPC segmentation and designing and maintaining hub-and-spoke/multi-AZ topologies. Supporting training infrastructure with self-service VM provisioning, RunPod burst capacity, and Weights and Biases integration. Driving infrastructure reliability, cost efficiency, and capacity planning as the platform scales.

Negotiation

View details