AWS and Linux System Administrator

JOB DESCRIPTION

Administer AntiVirus Solutions, Mobile Device Management and UNIX based systems.
Deploying, automating, maintaining and managing AWS cloud based production system to ensure the availability, performance, scalability and security of productions systems.
Build, release and configuration management of production systems
System troubleshooting and problem solving across platform and application domains.
Suggesting architecture improvements and recommending process improvements. 

JOB REQUIREMENT

4+ years experience managing Linux systems on a wider scale (e.g. NGINX, Apache, Tomcat, openVPN, openLDAP, etc)
2+ years experience with AWS technologies
Experience with Configuration Management Solutions (e.g. Ansible, Puppet, Chef, etc)
Experience with Log Aggregation technologies (e.g. ELK, Splunk, Datadog, etc)
Experience with Application Performance Monitoring solutions (e.g. Dynatrace, NewRelic, etc)
Experience with Monitoring technologies (e.g. AWS Cloudwatch, Grafana, Prometheus, etc)
Experience with DNS management (e.g. AWS Route53, GoDaddy, etc)
Experience with Certificate Management (HTTPS)
Ability to analyze and resolve complex infrastructure resource and application deployment issues

WHAT'S ON OFFER

We are an equal opportunity employer and do not discriminate based on gender, race, age, religion, disability, or other local protected class. We are committed to cultivating an inclusive environment for all employees, and we welcome the diversity that you will bring!
If you are looking for a rapid-growth environment and great teams to work with, you should apply now.
We are sorry to inform you that only shortlisted candidates will be notified as we may be overwhelmed by the number of applicants coming into our system; hence if you do not get a reply from us - don’t give up on us just yet!
18 days Annual leaves
Quarter bonus based on employee performance and company's business
Health insurance, private insurance provided that covers yourself and your immediate dependents (spouse and children if any)
Laptop provided
Work from home allowances
Wellness benefit (cover for gym membership etc; can also be used to top up personal insurance too) up to 60usd/quarter
Any government-regulated perks
We have an L&D budget that supports the following:
Online or classroom courses held by an external provider
Conferences, workshops, or seminars
Coursework for a relevant diploma, degree, or professional certification

CONTACT

PEGASI – IT Recruitment Consultancy | Email: recruit@pegasi.com.vn | Tel: +84 28 3622 8666
We are PEGASI – IT Recruitment Consultancy in Vietnam. If you are looking for new opportunity for your career path, kindly visit our website www.pegasi.com.vn for your reference. Thank you!

Job Summary

Company Type:

Internet, Payment, Product

Technical Skills:

System, Security, Cloud

Location:

Ho Chi Minh - Viet Nam

Working Policy:

Salary:

$ 2,000 - $ 3,000

Job ID:

J00591

Status:

Close

Related Job:

Engineering Manager - AI for RAN and 6G Wireless Systems

Ho Chi Minh, Ha Noi - Viet Nam


Computer Hardware

  • Machine Learning
  • Management

Manage and expand an engineering team focused on AI-enabled signal processing for the Radio Access Network (RAN). Supervise the development of deep learning models for various tasks related to RAN. Work with global teams to drive proof-of-concepts and production-quality AI-RAN components. Supervise the integration of AI models into full-stack simulations and/or testbeds using various frameworks. Align project priorities with hardware-software co-design constraints and deployment scenarios. Provide mentorship and guidance to team members, ensure technical excellence, and contribute to strategic direction.

Negotiation

View details

Director Engineering – Software Engineering and AI Inferencing Platforms

Ho Chi Minh, Ha Noi - Viet Nam


Computer Hardware

  • Management
  • Backend
  • Cloud
  • Data Engineering
  • AI

Lead and expand engineering teams in Vietnam across system software, data science, and AI platforms. Drive the creation, structure, and delivery of high-performance system software platforms that support AI products and services. Collaborate with global teams across Machine Learning, Inference Services, and Hardware/Software integration to guarantee performance, reliability, and scalability. Oversee the development and optimization of AI delivery platforms in Vietnam, including NIMs, Blueprints, and other flagship services. Collaborate with open-source and enterprise data and workflow ecosystems to advance accelerated AI factory, data science, and data engineering workloads. Promote continuous integration, continuous delivery, and engineering best practices across multi-site R&D Centers. Work with product management and other stakeholders to ensure enterprise readiness and customer impact. Establish and implement standard processes for large-scale, distributed system testing including stress, scale, failover, and resiliency testing. Ensure security and compliance testing aligns with industry standards for cloud and data center products. Mentor and develop talent within the organization, fostering a culture of quality and continuous improvement.

Negotiation

View details

Principal Engineer, System Software Platform Engineering

Ho Chi Minh, Ha Noi - Viet Nam


Computer Hardware

  • Devops
  • Backend
  • AI

Create and manage a platform for AI that provides services for multiple users, handles identity and policy management, configures quotas, and controls costs. Additionally, this platform should offer easy paths for teams to work on AI projects. Oversee the deployment of AI models at scale, including routing, autoscaling, and implementing safety measures to ensure reliability and observability. Manage GPU resources in a Kubernetes environment, including device plugins, feature discovery, and scheduling strategies, among other responsibilities. Take charge of the entire lifecycle of GPUs, ensuring that driver, firmware, and runtime updates are implemented safely and consistently. Implement virtualization strategies for GPU resources, such as vGPU and PCIe passthrough, while defining policies for resource placement, isolation, and preemptive actions. Establish secure traffic and networking protocols, including gateways, service mesh, and authentication/authorization measures. Enhance observability and operational efficiency through monitoring tools for GPUs, response protocols for incidents, and optimization of costs. Develop reusable templates, integrate SDKs and CLIs, and implement infrastructure-as-code standards for the platform. Influence the platform's direction by creating design documents, mentoring engineers, and aligning platform development with the needs of AI products.

Negotiation

View details