Site Reliability Engineer

JOB DESCRIPTION

We are seeking an engineer to ensure the reliability and performance of Our Client's Data Platform. Successful candidates will work with researchers, operations, and other technology teams to establish the smooth functioning of our production data pipeline sourced from an enormous and continuously updating catalog of vendor and market data. This engineer will also develop solutions to improve the efficiency and scalability of our ever-growing business-critical management system
Operate, monitor, and provision the system to make sure it works smoothly
Provide feedback for system improvement
Provide solutions for live monitoring of the production data pipeline
Design and implement continuous integration and test automation
Deliver release management solutions
Collaborate with engineering, analyst, and research teams to ensure the reliability and operability of new data pipeline components
Analyze and diagnose platform performance and reliability problems
Understand, manage, and utilize the right technologies for building our platforms such as Kubernetes, Kafka, and Spark

JOB REQUIREMENT

Bachelor’s degree in Computer Science or equivalent experience
Excellent analytical skills and a passion for solving problems
Experience in Linux administration; fluent in Linux standard command line programs
Fluency in Python and its ecosystem (numpy, pandas, etc.) is strongly recommended
Experience in metrics and logs aggregation and analysis with a focus on performance optimization
Understanding of Git and CI/CD concept
A great support attitude (our job is to make life easier for other teams!)
Strong written and verbal communication skills; Fluency in the English language
Knowledgeable in:
Computer science fundamentals (algorithms and data structures)
Relational databases
Modern service architectures
Experience in the following technologies is relevant: Kafka, Docker, Helm, Kubernetes, GC, AWS, Spark and Pyspark, Hadoop, Redis, MySQL, gRPC, Apache Arrow, Apache Airflow

WHAT'S ON OFFER

Competitive and attractive compensation package with a clear career road-map – where you feel challenged every day
We offer a strong culture of learning and development: training courses, library, speakers, share and learn events
Learn from who sits next to you! Working in our client's environment, you are surrounded by smart and talented people
Employee resources groups with strong diversity and inclusion culture
Premium Health Insurance and Employee Assistance Program
Generous time-off policy, unlimited sick days, re-creation sabbatical leave (based on tenure), Trade Union benefits for staff and family
Team building activities every month: Local engagement events, monthly team lunches – Employee clubs: football, ping-pong, badminton, yoga, running, PS5, movies, etc.
Annual company trips and occasional global conferences – the opportunity to travel and connect with our global teams
Happy hour with tea breaks, snacks, and meals every day in the office!

CONTACT

PEGASI – IT Recruitment Consultancy | Email: recruit@pegasi.com.vn | Tel: +84 28 3622 8666
We are PEGASI – IT Recruitment Consultancy in Vietnam. If you are looking for new opportunity for your career path, kindly visit our website www.pegasi.com.vn for your reference. Thank you!

Job Summary

Company Type:

Product

Technical Skills:

Devops, Kubernetes, Kafka, Python

Location:

Ho Chi Minh, Ha Noi - Viet Nam

Working Policy:

Salary:

Negotiation

Job ID:

J01251

Status:

Close

Related Job:

Windows Engineer (C++/C#) - GSaaS

Ho Chi Minh - Viet Nam


Product

  • C/C++

Develop and maintain applications using C# (WinUI framework) and C++ (Qt framework and Win32 API). Participate in the company's software development projects and collaborate with cross-functional teams on software architecture. Develop new features according to requirements, provide development documentation, and participate in code reviews. Troubleshoot, debug, and optimize performance for existing software features and applications. Write high-quality, testable code, ensuring adherence to high code quality standards. Research and integrate new technologies to enhance software products. Mentor junior developers and contribute to team knowledge sharing.

Negotiation

View details

Senior AI Engineer

Ho Chi Minh - Viet Nam


Product

  • Python
  • AI
  • Machine Learning

We're seeking an AI Engineer with strong academic foundations and deep technical expertise who excels at translating research into production banking systems. This role is 80% focused on engineering excellence-deploying models, optimizing infrastructure, ensuring reliability, and solving real-world implementation challenges-and 20% on staying current with cutting-edge AI research and emerging technologies. You'll bridge the gap between state-of-the-art AI research and scalable production systems in the financial services sector.#AI Engineering & Deployment (80%) Design, build, and deploy production-ready AI/ML systems on AWS with focus on reliability, scalability, and performance for banking applications Implement and maintain MLOps pipelines using AWS services (SageMaker, Bedrock, Lambda, Step Functions) including model versioning, monitoring, and automated retraining workflows Build and optimize AI solutions using AWS Bedrock, OpenAI API, and Gemini API combining with Model Context Protocol (MCP), Agent-to-Agent (A2A) protocol for various banking use cases Design and implement prompt engineering frameworks and prompt management systems for LLM-based applications Develop graph analysis solutions for fraud detection, customer relationship mapping, and network analysis in banking contexts Debug and troubleshoot production AI systems, identifying and resolving issues in model performance, data pipelines, and AWS infrastructure Build and maintain AIOps practices including automated monitoring, alerting, and incident response for AI systems on AWS Optimize model serving infrastructure for latency, throughput, and cost-efficiency using AWS services Implement robust data pipelines using AWS Glue, Kinesis, and related services for training and inference Collaborate with software engineering and risk teams to integrate AI capabilities into banking products and services Ensure compliance with banking regulations and security standards in all AI deployments Monitor model performance in production and implement drift detection and retraining strategies#AI Research & Innovation (20%) Stay current with latest AI research papers and breakthroughs, evaluating applicability to banking and financial services Research and prototype emerging AI architectures and techniques for financial use cases Evaluate new paradigms in model training, inference optimization, and architectural innovations Share knowledge through technical discussions, paper reviews, and internal research presentations Identify opportunities to apply cutting-edge research to improve fraud detection, customer service, risk assessment, and other banking operations

Negotiation

View details

Frontend Software Engineer (Middle/Senior/Lead)

Ho Chi Minh - Viet Nam


Product

  • ReactJS
  • Frontend

Front-end research and development of products, including designing, implementing, and optimizing user interfaces. Making technical decisions, reviewing code, and promoting best practices in development and architecture. Using relevant tools and platforms to address user experience challenges, integrate complex business logic, and deliver business solutions efficiently. Collaborating with stakeholders to analyze business needs, define technical requirements, and contribute to system architecture design and development. Researching emerging technologies and evaluating leading industry products to drive continuous improvement in user experience and overall product quality. Providing mentorship and guidance to junior developers and fostering a collaborative and knowledge-sharing environment. Leading big projects from end-to-end and mentoring/guiding team members.

Negotiation

View details