MLOps Engineer

JOB DESCRIPTION

Be a part of building the ideal data and ML/AI ecosystem from scratch. Spearheaded the integration of the latest capabilities to enhance customer experiences and transform business operations. Embrace the vision of democratizing ML/AI technology, making it accessible to all by establishing robust engineering standards, simplifying complexities, and designing effective controls and guardrails. This leadership role goes beyond conventional boundaries, empowering you to lead and innovate across many aspects of our data enablement value stream.
Your role as an MLOps Engineer will be similar to a DevOps engineer, with a stretched focus on productionizing Machine Learning features:
Design and implement scalable AI solutions that enables data engineers and ML scientists to train, build, and maintain machine learning models effectively.
Develop automated processes for continuous model training and evaluation pipelines specifically for ML applications.
Ensure the seamless integration of Company Plus's current architecture with newly added ML functionalities, enhancing overall system capabilities.
Collaborating with diverse stakeholders including business partners, risk, legal, and security teams, as well as UX designers and architects to define and implement robust validation and verification strategies
Fostering a culture of quality coding practices, including test-driven development, unit testing, and secure coding awareness
Focus on business practicality and the 80/20 rule, aiming for a high bar for code quality, but recognize the business benefit of "having something now" vs "perfection sometime in the future"

JOB REQUIREMENT

To grow and be successful in this role, you will bring extensive analytical and technical skills, business acumen and natural curiosity to deliver on product investigations and analysis and support initiatives through insights.
You will ideally bring the following:
Proficiency in one of the scripting/programming languages (Python).
Experience in building data products using GCP/ AWS technologies.
Experience with containerization, Terraform, and GitOps principles for automation and deployment.
Strong background in ML concepts and applications and in-depth knowledge of MLOps best practices.
Agile development mindset, appreciating the benefit of constant iteration and improvement.
Have experience in addressing Tech Debt with minimizing production incidents.
Familiarity with RAG architectures and/or have a good understanding of their application.

WHAT'S ON OFFER

Attractive package including fixed 13-month salary and variable performance bonus
Insurance plan based on full salary
100% full salary and benefits as an official employee from the 1st day of working
Medical benefit (private insurance) for employee and their family
18 paid leaves/year (12 annual leaves and 6 personal leaves)
Working in a fast-paced, flexible, and multinational working environment.
Chance to travel for business trip in foreign countries
Free snacks, refreshment, and parking
Career development in a giant tech hub just entering Vietnam market, with very challenging project
Hybrid working mode, flexible time (3 days in office per week)

CONTACT

PEGASI – IT Recruitment Consultancy | Email: recruit@pegasi.com.vn | Tel: +84 28 3622 8666
We are PEGASI – IT Recruitment Consultancy in Vietnam. If you are looking for new opportunity for your career path, kindly visit our website www.pegasi.com.vn for your reference. Thank you!

Job Summary

Company Type:

Outsource

Technical Skills:

Machine Learning, Devops, Data Science, Python, Java

Location:

Ho Chi Minh - Viet Nam

Working Policy:

Salary:

Negotiation

Job ID:

J01554

Status:

Close

Related Job:

PreSales Solutions Engineer

Ho Chi Minh - Singapore


Product

  • System
  • Google Cloud
  • Presale

PreSales Support: Collaborating with the Sales team to understand client needs and develop tailored solutions using Google Maps and Google Cloud services. This involves conducting technical presentations, product demonstrations, and creating proof of concepts (POCs) for prospective clients, as well as contributing to proposals and RFP responses with detailed technical information. Post-Sales Support: Leading the technical implementation of Google Maps and Google Cloud services, ensuring smooth deployment and integration. Providing ongoing technical support and troubleshooting for clients after implementation, working closely with cross-functional teams to ensure client satisfaction and build long-term relationships. Technical Expertise: Staying up-to-date with the latest Google Maps and Google Cloud technologies, serving as a subject matter expert (SME) for both internal teams and clients. Integrating new features and services into client solutions and providing guidance on best practices. Collaboration: Working closely with Sales, Product, Infrastructure, Data, and Engineering teams to align solutions with client needs and company goals. Mentoring junior team members and contributing to training initiatives.

Negotiation

View details

Technical Lead

Ho Chi Minh - Viet Nam


Product

  • NodeJS
  • Python

Leading the backend development team, providing technical direction, mentorship, and best practices. Designing and implementing scalable, secure, and high-performance microservices-based architectures. Architecting and implementing agentic AI workflows and RAG (Retrieval-Augmented Generation) systems for personalized user interactions and automated coaching features. Overseeing data pipelines and infrastructure required for real-time AI model inference within a microservices-based environment. Collaborating with stakeholders to align on requirements and delivery timelines. Optimizing application performance, monitoring system reliability, and proactively troubleshooting issues. Advocating for CI/CD pipelines, automated testing, and robust version control strategies. Documenting key architectural decisions, APIs, and processes for internal use.

Negotiation

View details

Chief Technology Officer

Ha Noi - Viet Nam


Product

  • Cloud
  • Backend

Planning & designing overall system architecture: Creating a Technology Roadmap for a Game Server system with high concurrency and low latency for global players. Cost optimization: Deciding on the strategy for using Cloud infrastructure (AWS, GCP, Azure) or Hybrid Cloud to balance performance and operational expenses. High-level consultation: Participating in the Executive Board to address the relationship between speed-to-market of features and system stability. Tech-stack selection: Evaluating and finalizing programming languages (Go, C++, Java, Node.js) and processing models (Microservices vs Monolith) suitable for the complex logic of the game. Scalability solution: Directing the development of Auto-scaling, Load Balancing mechanisms, and managing Player State on large clusters. Data management: Designing Database structure (SQL/NoSQL) and Cache system (Redis, Memcached) to handle billions of queries daily without congestion. Ensuring Uptime: Building real-time monitoring and alerting systems to maintain 99.99% Availability. Network security: Implementing solutions to combat DDoS attacks, game fraud (Anti-cheat), and comprehensive user data security. Infrastructure & CI/CD: Standardizing automatic deployment processes to ensure game updates (Hotfix/Update) do not disrupt players. Deployment strategy & Optimization: Developing plans to optimize Cloud Services costs (AWS/GCP/Azure), evaluating the use of Spot Instances, Reserved Instances, or Private Cloud solutions to save operational budget. Meanwhile, establishing 24/7 monitoring and incident response systems.

Negotiation

View details