ABOUT CLIENT
Our client is a startup building a marketplace for insurance products.
JOB DESCRIPTION
Designing and implementing scalable data architectures aligned with business objectives
Building and maintaining data pipelines for ingestion, transformation, and delivery using modern orchestration tools
Architecting data solutions across data warehousing, data lakes, and real-time analytics
Creating and maintaining data models and documenting the enterprise data landscape
Building and maintaining data infrastructure supporting ML model training, deployment, and monitoring
Designing and implementing vector database solutions for AI-powered features
Developing data pipelines for AI-driven capabilities and ensuring global scalability
Implementing DataOps practices to ensure data quality, lineage, and governance
Defining and enforcing data strategy and architectural principles
Building monitoring and alerting for pipeline health, data quality, and SLA compliance
Optimizing query performance and cost efficiency
Collaborating with product and engineering teams to translate business requirements into data solutions
Driving adoption of modern data practices across the organization
Contributing to architectural reviews and technical decision-making
Taking ownership of data problems through to resolution
JOB REQUIREMENT
8-10 years of experience in data engineering, with an increasing level of responsibility for architecture and technical leadership
3-5 recent years of experience at tech startups, building and scaling data infrastructure in high-growth settings
Recent expertise in insurance, banking, fintech, or e-commerce, with a preference for insurance domain experience
Track record of building data platforms capable of handling production AI/ML workloads at scale
Deep knowledge of AWS and Azure cloud data services, with multi-cloud experience
Proficiency in MongoDB Atlas, including aggregation pipelines, Atlas Data Federation, and data modeling for document databases
Hands-on experience with pipeline orchestration tools such as Airflow, Spark (Databricks or Apache), and Kafka
Familiarity with transformation frameworks like dbt or similar
Advanced skills in SQL and Python
Experience with Infrastructure as Code (Terraform) and GitOps practices
Strong understanding of containerization (Docker, Kubernetes) for data workloads
Ability to build data pipelines for ML model training and inference
Hands-on experience with vector databases (e.g., MongoDB Atlas Vector Search, Pinecone, Weaviate, Qdrant)
Knowledge of MLOps practices including model versioning, feature stores, monitoring, and A/B testing infrastructure
Understanding of embedding models and retrieval-augmented generation (RAG) patterns
Active use of AI coding assistants (GitHub Copilot, Claude, Cursor, or similar) in daily work
Strong data modeling and architecture skills, with the ability to communicate designs to both technical and non-technical stakeholders
Self-directed, able to manage and prioritize work autonomously
Collaborative mindset, comfortable working across engineering, product, and business teams
Comfortable with ambiguity, able to context-switch, and energized by building rather than maintaining, reflecting a startup mentality
WHAT'S ON OFFER
Competitive compensation package
Health benefits for yourself and two family members
Additional allowances for meals, phone, and transportation
Year-end bonus
Strong business and technical support
Positive and diverse work environment
Potential for travel and work in Southeast Asia
New, modern office in District 1, Ho Chi Minh City
Access to latest technologies and flexible work hours
Other perks and benefits available
CONTACT
We are PEGASI – IT Recruitment Consultancy in Vietnam. If you are looking for a new opportunity in your career path, kindly visit our website www.pegasi.com.vn for reference. Thank you!