Senior DevOps Engineer (Cloud Solutions)

JOB DESCRIPTION

Plan, design, manage, maintain and support cloud infrastructure for high-traffic workloads that operate at enterprise scale
Drive the automation of tasks and the implementation of infrastructure services
Identify and rectify potential risks within the infrastructure, network or security
Research, set up, test and implement technologies and solutions to improve the performance, reliability, availability, security and efficiency of infrastructure on AWS
Troubleshoot, perform root cause analysis and work closely with developers to implement corrective and preventive actions during and after an incident

JOB REQUIREMENTS

8+ years of experience using a broad range of AWS technologies (e.g. EC2, IAM, VPC, CloudWatch, EKS, ECS, Security Hub, DynamoDB, Secrets Manager, GuardDuty, etc.)
Solid experience in Terraform as Infrastructure as Code
Knowledge of Golang and Python is a plus
Experience with containerized workloads
Experience in a 24x7x365 uptime AWS environment using Git repositories and CI/CD tools such as Jenkins
Ability to analyze and resolve complex infrastructure resource and application deployment issues (e.g. by using APM tools like New Relic, Dynatrace, etc.)
Knowledge of cloud security and best practices (AWS Well-Architected Framework)
 
Nice to have
AWS Data Engineering expertise (e.g. AWS Certified Big Data - Specialty)
Experience with data engineering services like Lake Formation, Glue, Athena, Redshift, SageMaker, Kinesis, Kafka, etc.
Experience with SQL and NoSQL databases like DynamoDB, RDS Aurora, MySQL, Elasticsearch, Solr, etc.

WHAT'S ON OFFER

We are an equal opportunity employer and do not discriminate based on gender, race, age, religion, disability, or any other locally protected class. We are committed to cultivating an inclusive environment for all employees, and we welcome the diversity that you will bring!
If you are looking for a rapid-growth environment and great teams to work with, you should apply now.
Please note that only shortlisted candidates will be notified, as we may receive a large number of applications. If you do not get a reply from us right away, don’t give up on us just yet!
18 days of annual leave
Quarterly bonus based on employee performance and the company's business results
Health insurance: private insurance that covers you and your immediate dependents (spouse and children, if any)
Laptop provided
Work-from-home allowance
Wellness benefit of up to 60 USD per quarter (covers gym membership, etc.; can also be used to top up personal insurance)
Any government-regulated perks
We have an L&D budget that supports the following:
Online or classroom courses held by an external provider
Conferences, workshops, or seminars
Coursework for a relevant diploma, degree, or professional certification

CONTACT

PEGASI – IT Recruitment Consultancy | Email: recruit@pegasi.com.vn | Tel: +84 28 3622 8666
We are PEGASI – IT Recruitment Consultancy in Vietnam. If you are looking for a new opportunity in your career, kindly visit our website www.pegasi.com.vn. Thank you!

Job Summary

Company Type:

Internet, Payment, Product

Technical Skills:

DevOps, AWS, Security

Location:

Ho Chi Minh - Viet Nam

Working Policy:

Salary:

Negotiable

Job ID:

J00564

Status:

Closed

Related Job:

Partner Implementation Engineer (Security & Digital Trust)

Ha Noi - Viet Nam


Outsource

Act as the key implementer, responsible for deploying, configuring and integrating Security & Digital Trust solutions (PKI, digital signatures, encryption, MFA) into customers' production systems, ensuring that the systems operate stably, securely and as designed. System implementation: prepare the environment by checking the infrastructure (servers, operating systems, databases, network); install and configure solutions (PKI / CA / digital signatures / MFA / encryption); set up security policies and business processes; connect to security devices (HSM, key management). Deployment on cloud / container platforms (if applicable): deploy systems on Kubernetes / OpenShift; configure resources (YAML: Pod, Service, Ingress, ConfigMap, Secret); set up storage volumes and internal networking; apply security policies to containers. System integration: support integration with websites, applications and APIs, and with IAM / SSO / AD / LDAP; provide guidance on using the API/SDK; verify data flows and communication security; coordinate with customer teams (development / infrastructure / security). Testing and acceptance (QA/UAT): perform technical and operational scenario testing; support UAT with customers; verify the correctness of digital signatures, certificates and authentication flows. Operations and support: monitor systems, analyze logs and resolve incidents; provide post-deployment support (L2/L3); ensure stable operation and high availability (HA). Documentation and handover: prepare deployment documentation (architecture, configuration); provide operations guidance to customers; deliver basic technical training.

Negotiable


AI Product Builder

Ha Noi - Viet Nam


Product

  • AI
  • Backend
  • Frontend
  • DevOps
  • Java
  • Golang
  • Product Management

Collaborate with domain experts to develop business requirements and constraints for designing prompt AI-assisted workflows and system specifications. Use AI tools, no-code/low-code platforms, and coding to rapidly prototype UI/UX mockups and foundational implementations. Test prototypes through hypothesis validation cycles and provide detailed handovers to engineering teams. Decode legacy specifications and enhance existing products with AI-assisted analysis and implementation. Continually improve the product team's build tooling, templates, and practices to adapt to changes in models and platforms.

Negotiable


DevOps Engineer

Others - Viet Nam


Product

  • DevOps
  • Kubernetes
  • Network

Managing and developing our Kubernetes platform across multiple clusters and environments including production, development, on-premises and public cloud. Designing and overseeing hybrid cloud infrastructure across on-premises and public clouds (such as GCP, AWS), including workload placement, cross-cloud networking, and unified resource management. Taking responsibility for the end-to-end CI/CD and GitOps process, including container build pipelines, image optimization, and progressive delivery using tools like ArgoCD/FluxCD. Taking charge of the observability stack to provide a comprehensive view across all clusters using tools like Grafana, Mimir, Tempo, Loki, Pyroscope, OnCall, Prometheus, and supporting agent-assisted SRE workflows. Managing and enhancing our inference platform, including vLLM serving and AIBrix for multi-model orchestration and autoscaling with a fleet of NVIDIA GPUs. Operating platform services such as Kafka, Redis, PostgreSQL, OpenSearch. Managing identity and access management with Keycloak integrated with Google Workspace, strengthening SSO, RBAC, and secrets management across the platform. Strengthening network security across private load balancers, firewalls, and VPC segmentation and designing and maintaining hub-and-spoke/multi-AZ topologies. Supporting training infrastructure with self-service VM provisioning, RunPod burst capacity, and Weights and Biases integration. Driving infrastructure reliability, cost efficiency, and capacity planning as the platform scales.

Negotiable
