DevOps Leader

JOB DESCRIPTION

Define and execute the strategic plan for the DevOps team.
Build and operate the system according to the project teams' requirements and designs.
Implement CI/CD for the microservices architecture platform across the DEV, QA, STAGING, UAT, and PRODUCTION environments.
Follow up on issues that occur in the system and on the actions needed to resolve them.
Identify problematic areas and implement strategic solutions to improve the system.
Collaborate with the project teams and provide them with engineering support.
Manage the work plan for DevOps staff; assign work activities; supervise, review, and evaluate team members' performance.
Help build and develop the careers of your team members.
Work with the Technical Director on infrastructure roadmap development.

JOB REQUIREMENT

5+ years of working experience in a DevOps or Infrastructure System Engineer position.
In-depth knowledge of VMware, Docker containers, Kubernetes, or similar systems.
General knowledge of networking, network security, and the OSI model.
General knowledge of IT systems and of SQL and NoSQL database systems; knowledge of distributed databases such as Hadoop, HBase, Hive, Solr/Elasticsearch, or Cloudera.
Experience implementing CI/CD with Git, GitHub, GitLab, and Jenkins.
Experience working with and deploying discovery, monitoring, and caching systems such as Apache Kafka, Istio, Grafana, Prometheus, and Redis.
Experience building and developing microservice systems on the Kubernetes platform.
Scripting experience with shell scripts or one of Node.js, Python, or Perl.
Experience in creating backups and managing disaster recovery.
Development methodology: Scrum, Agile.
REPORT TO: Technical Director

WHAT'S ON OFFER

Basic salary, 13th-month pay, bonus, and share package
12 days of annual leave per year
PVI Premium Healthcare
A powerful Mac for daily development work
Annual health check
Clear career path
Yearly company trip and quarterly team building
Opportunity to join the start-up project and buy company stock
Opportunity to go to the US office

CONTACT

PEGASI – IT Recruitment Consultancy | Email: recruit@pegasi.com.vn | Tel: +84 28 3622 8666
We are PEGASI – IT Recruitment Consultancy in Vietnam. If you are looking for a new opportunity in your career, please visit our website, www.pegasi.com.vn. Thank you!

Job Summary

Company Type: Product
Technical Skills: DevOps
Location: Ho Chi Minh - Viet Nam
Working Policy:
Salary: Negotiable
Job ID: J01145
Status: Closed
