Senior Data Engineer

ABOUT CLIENT

Our client is a startup company focusing on building a marketplace for insurance products

JOB DESCRIPTION

Designing and implementing scalable data architectures aligned with business objectives
Building and maintaining data pipelines for ingestion, transformation, and delivery using modern orchestration tools
Architecting data solutions across data warehousing, data lakes, and real-time analytics
Creating and maintaining data models and documenting the enterprise data landscape
Building and maintaining data infrastructure supporting ML model training, deployment, and monitoring
Designing and implementing vector database solutions for AI-powered features
Developing data pipelines for AI-driven capabilities and ensuring global scalability
Implementing DataOps practices to ensure data quality, lineage, and governance
Defining and enforcing data strategy and architectural principles
Building monitoring and alerting for pipeline health, data quality, and SLA compliance
Optimizing query performance and cost efficiency
Collaborating with product and engineering teams to translate business requirements into data solutions
Driving adoption of modern data practices across the organization
Contributing to architectural reviews and technical decision-making
Taking ownership of data problems through to resolution

JOB REQUIREMENT

A minimum of 8-10 years of experience in data engineering, demonstrating increasing level of responsibility for architecture and technical leadership
Recent 3-5 years of experience in tech startups, specializing in building and scaling data infrastructure in high-growth settings
Recent expertise in insurance, banking, fintech, or e-commerce, with a preference for insurance domain experience
Track record of building data platforms capable of handling production AI/ML workloads at scale
Profound knowledge of AWS and Azure cloud data services, with multi-cloud experience
Proficiency in MongoDB Atlas, including aggregation pipelines, Atlas Data Federation, and data modeling for document databases
Hands-on experience with pipeline orchestration tools such as Airflow, Spark (Databricks or Apache), and Kafka
Familiarity with transformation frameworks like dbt or similar
Advanced skills in SQL and Python
Experience with Infrastructure as Code (Terraform) and GitOps practices
Strong understanding of containerization (Docker, Kubernetes) for data workloads
Ability to build data pipelines for ML model training and inference
Hands-on experience with vector databases (e.g., MongoDB Atlas Vector Search, Pinecone, Weaviate, Qdrant)
Knowledge of MLOps practices including model versioning, feature stores, monitoring, and A/B testing infrastructure
Understanding of embedding models and retrieval-augmented generation (RAG) patterns
Active use of AI coding assistants (GitHub Copilot, Claude, Cursor, or similar) in daily work
Strong data modeling and architecture skills, with the ability to communicate designs to both technical and non-technical stakeholders
Self-directed and capable of managing and prioritizing autonomously
Collaborative mindset, comfortable working across engineering, product, and business teams
Comfortable with ambiguity, able to context-switch, and energized by building rather than maintaining, reflecting a startup mentality

WHAT'S ON OFFER

Competitive compensation package
Health benefits for yourself and two family members
Additional allowances for meals, phone, and transportation
Year-end bonus
Strong business and technical support
Positive and diverse work environment
Potential for travel and work in Southeast Asia
New, modern office in District 1 (HCM city)
Access to latest technologies and flexible work hours
Other perks and benefits available

CONTACT

PEGASI – IT Recruitment Consultancy | Email: recruit@pegasi.com.vn | Tel: +84 28 3622 8666
We are PEGASI – IT Recruitment Consultancy in Vietnam. If you are looking for new opportunity for your career path, kindly visit our website www.pegasi.com.vn for your reference. Thank you!

Job Summary

Company Type:

Product

Technical Skills:

Data Engineering

Location:

Ho Chi Minh - Viet Nam

Working Policy:

Onsite

Salary:

Negotiation

Job ID:

J02016

Status:

Close

Related Job:

PreSales Solutions Engineer

Ho Chi Minh - Viet Nam


Product

  • Presale
  • System
  • Google Cloud

PreSales Support: Collaborating with the Sales team to understand client needs and develop tailored solutions using Google Maps and Google Cloud services. This involves conducting technical presentations, product demonstrations, and creating proof of concepts (POCs) for prospective clients, as well as contributing to proposals and RFP responses with detailed technical information. Post-Sales Support: Leading the technical implementation of Google Maps and Google Cloud services, ensuring smooth deployment and integration. Providing ongoing technical support and troubleshooting for clients after implementation, working closely with cross-functional teams to ensure client satisfaction and build long-term relationships. Technical Expertise: Staying up-to-date with the latest Google Maps and Google Cloud technologies, serving as a subject matter expert (SME) for both internal teams and clients. Integrating new features and services into client solutions and providing guidance on best practices. Collaboration: Working closely with Sales, Product, Infrastructure, Data, and Engineering teams to align solutions with client needs and company goals. Mentoring junior team members and contributing to training initiatives.

Negotiation

View details

Partner Implementation Engineer (Security & Digital Trust)

Ha Noi - Viet Nam


Outsource

  • System

Đóng vai trò là người thực hiện triển khai chủ chốt, chịu trách nhiệm triển khai, cấu hình và tích hợp các giải pháp Security & Digital Trust (PKI, Chữ ký số, Mã hóa, MFA) vào hệ thống thực tế của khách hàng, đảm bảo hệ thống vận hành ổn định, bảo mật và đúng thiết kế. Triển khai hệ thống (Implementation) Chuẩn bị môi trường: kiểm tra hạ tầng (Server, Hệ điều hành, Cơ sở dữ liệu, Mạng) Cài đặt & cấu hình giải pháp: PKI / CA / Chữ ký số / MFA / Mã hóa Thiết lập chính sách bảo mật, quy trình nghiệp vụ Kết nối với thiết bị bảo mật (HSM, Quản lý Khóa) Triển khai trên nền tảng Cloud / Container (nếu có) Triển khai hệ thống trên Kubernetes / OpenShift Cấu hình tài nguyên (YAML: Pod, Dịch vụ, Ingress, Bản đồ Cấu hình, Bí mật) Thiết lập lưu trữ (Khối Lưu trữ Không gian); mạng nội bộ Áp dụng các chính sách bảo mật cho container Tích hợp hệ thống (Integration) Hỗ trợ tích hợp với: Trang web/ Ứng dụng/ Giao diện lập trình ứng dụng và IAM / SSO / AD / LDAP Hướng dẫn sử dụng API/SDK Kiểm tra luồng dữ liệu & bảo mật giao tiếp Phối hợp với nhóm khách hàng (Phát triển / Cơ sở hạ tầng / Bảo mật) Kiểm thử & nghiệm thu (QA/UAT) Thực hiện kiểm thử kỹ thuật & kịch bản vận hành Hỗ trợ UAT với khách hàng Kiểm tra tính đúng đắn của: Chữ ký số; Chứng thư và Luồng xác thực Vận hành & hỗ trợ Giám sát hệ thống, phân tích log, xử lý sự cố Hỗ trợ sau triển khai (L2/L3) Đảm bảo hệ thống hoạt động ổn định & HA Tài liệu & chuyển giao Xây dựng tài liệu triển khai (cấu trúc, cấu hình) Hướng dẫn vận hành cho khách hàng Đào tạo kỹ thuật cơ bản

Negotiation

View details

DevOps Engineer

Others - Viet Nam


Product

  • Devops
  • Kubernetes
  • Network

Managing and developing our Kubernetes platform across multiple clusters and environments including production, development, on-premises and public cloud. Designing and overseeing hybrid cloud infrastructure across on-premises and public clouds (such as GCP, AWS), including workload placement, cross-cloud networking, and unified resource management. Taking responsibility for the end-to-end CI/CD and GitOps process, including container build pipelines, image optimization, and progressive delivery using tools like ArgoCD/FluxCD. Taking charge of the observability stack to provide a comprehensive view across all clusters using tools like Grafana, Mimir, Tempo, Loki, Pyroscope, OnCall, Prometheus, and supporting agent-assisted SRE workflows. Managing and enhancing our inference platform, including vLLM serving and AIBrix for multi-model orchestration and autoscaling with a fleet of NVIDIA GPUs. Operating platform services such as Kafka, Redis, PostgreSQL, OpenSearch. Managing identity and access management with Keycloak integrated with Google Workspace, strengthening SSO, RBAC, and secrets management across the platform. Strengthening network security across private load balancers, firewalls, and VPC segmentation and designing and maintaining hub-and-spoke/multi-AZ topologies. Supporting training infrastructure with self-service VM provisioning, RunPod burst capacity, and Weights and Biases integration. Driving infrastructure reliability, cost efficiency, and capacity planning as the platform scales.

Negotiation

View details