Python Developer (Operations Team)

ABOUT CLIENT

Our client is a reputable company specializing in software development and IT consulting services.

JOB DESCRIPTION

The role involves managing and resolving issues, with more complex problems escalated to the DevOps team. The support engineer retains ownership of an issue even after escalation.
The position also involves devising solutions to enhance issue resolution efficiency. Examples include automating common tasks using bots and writing scripts to categorize and manage JIRA service desk tickets programmatically.

JOB REQUIREMENTS

Proficiency in Python programming, including experience with libraries such as NumPy and Pandas.
Understanding of Unix/Linux environments and shell scripting.
Practical experience with SQL and relational databases.
Familiarity with JIRA, Git, and monitoring tools such as Grafana and Prometheus.
Strong problem-solving abilities with a proactive approach.
Capability to take ownership of issues and see them through to resolution.
Background in DevOps or operational support roles.
Experience in developing and maintaining web-based applications using technologies such as JavaScript (Node.js) for backend services and Python frameworks (Django, Flask, FastAPI).
Exposure to Machine Learning or Artificial Intelligence techniques.
Familiarity with Generative AI tools (e.g., ChatGPT, Gemini) for enhancing support efficiency.
Strong communication and coordination skills in English.

WHAT'S ON OFFER

Come join a diverse, energetic, and innovative team focusing on cutting-edge projects and emerging technologies.
Work alongside global experts and top tech talent to develop and enhance your skills.
Thrive in an environment that values openness, forward-thinking, and innovation, while supporting your full potential.
Competitive salary, 13th month salary, and performance bonuses.
Flexibility to work in a hybrid model, splitting time between the office and remote work.
Comprehensive healthcare and accident insurance.
Annual health check-up package.
Various allowances including lunch, marriage, newborn baby, and bereavement.
Fully equipped pantry with amenities for a comfortable lunch break.
A range of sports and social activities such as yoga, football, badminton, and tech clubs.
Annual company trip and team building activities.
Recognition awards for individuals, teams, and long-term service.
Advanced English and soft skills training for career development.
Monthly events including team gatherings, games, birthday celebrations, and year-end parties.
Company-funded support for personal loans such as home, vehicle, and tuition.

CONTACT

PEGASI – IT Recruitment Consultancy | Email: recruit@pegasi.com.vn | Tel: +84 28 3622 8666
We are PEGASI – IT Recruitment Consultancy in Vietnam. If you are looking for a new opportunity in your career, please visit our website, www.pegasi.com.vn. Thank you!

Job Summary

Company Type: Outsource
Technical Skills: Python, DevOps
Location: Ho Chi Minh - Viet Nam
Working Policy: Onsite
Salary: Negotiation
Job ID: J01897
Status: Active
