Cloud Architect

JOB DESCRIPTION

We are on the hunt for a talented cloud architect to manage our company's cloud architecture and position in cloud environments. You will play a strategic role in maintaining all cloud systems including the frontend platforms, servers, storage, and management networks.
Create a well-informed cloud strategy and manage the adaption process.
Regularly evaluate cloud applications, hardware, and software.
Develop and organize cloud systems.
Work closely with IT security to monitor the company's cloud privacy.
Respond to technical issues in a professional and timely manner.
Offer guidance in infrastructure movement techniques including bulk application transfers into the cloud.
Identify the top cloud architecture solutions to successfully meet the strategic needs of the company.
Lead our organization through cloud adoption and establish best practices

JOB REQUIREMENT

Bachelor's degree in computer science, computer engineering, information technology, or relevant field.
3-5+ years of experience designing, executing, and supporting IT cloud solutions.
Deep knowledge of Microsoft Azure, from common features to data security solutions
Strong experience in configuring, maintaining, and troubleshooting Microsoft-based production systems.
Strong understanding of cloud, automation, and infrastructure.
Strong troubleshooting skills for Cloud and Automation.
Strong written and spoken capability in English.
Positive attitude and a strong commitment to delivering quality work.
Excellent knowledge of cloud computing technologies and current computing trends.
Effective communication skills (written and verbal) to properly articulate complicated cloud reports to management and other IT development partners.

WHAT'S ON OFFER

Performance Management – Assess based on Company’s Performance Management Plan annually.
14 days of Annual Leave in a calendar year.
Great allowances (parking, birthday, happy hours, promotion...)
Outing/team-building activities (company trip, sports club…)
Monthly Gross Salary will be 100% for insurance and income tax purposes.
Probation Period: Two (2) Months with 100% monthly gross salary.
Other benefits as per stated in Vietnamese Labor Law
Training opportunity: both technical and soft skills to develop your career path

CONTACT

PEGASI – IT Recruitment Consultancy | Email: recruit@pegasi.com.vn | Tel: +84 28 3622 8666
We are PEGASI – IT Recruitment Consultancy in Vietnam. If you are looking for new opportunity for your career path, kindly visit our website www.pegasi.com.vn for your reference. Thank you!

Job Summary

Company Type:

Product

Technical Skills:

System, Azure

Location:

Ho Chi Minh - Viet Nam

Working Policy:

Salary:

$ 3,000 - $ 3,500

Job ID:

J01254

Status:

Close

Related Job:

Storage System Engineer (Linux)

Ho Chi Minh - Viet Nam


Outsource

We are seeking a highly motivated and talented individual to join our team as a Storage Operations Engineer. This role requires a strong understanding of storage systems, automation skills, and experience in Linux system administration. As a Storage and Linux Operations Engineer, you will be responsible for managing and optimizing our storage infrastructure while actively contributing to automation initiatives and providing Linux system administration support.#Responsibilities: Monitor storage performance, capacity, and availability to ensure optimal performance and reliability. Troubleshoot storage-related issues and provide timely resolutions to users spanning across the globe. Develop and maintain scripts and automation tools to streamline storage administration tasks. Perform regular data backup and recovery procedures to ensure data availability.

Negotiation

View details

Director Engineering – Software Engineering and AI Inferencing Platforms

Ho Chi Minh, Ha Noi - Viet Nam


Product

  • Management
  • Backend
  • Devops
  • Data Engineering
  • Cloud
  • AI

Lead and expand engineering teams in Vietnam across system software, data science, and AI platforms. Drive the creation, structure, and delivery of high-performance system software platforms that support AI products and services. Collaborate with global teams across Machine Learning, Inference Services, and Hardware/Software integration to guarantee performance, reliability, and scalability. Oversee the development and optimization of AI delivery platforms in Vietnam, including NIMs, Blueprints, and other flagship services. Collaborate with open-source and enterprise data and workflow ecosystems to advance accelerated AI factory, data science, and data engineering workloads. Promote continuous integration, continuous delivery, and engineering best practices across multi-site R&D Centers. Work with product management and other stakeholders to ensure enterprise readiness and customer impact. Establish and implement standard processes for large-scale, distributed system testing including stress, scale, failover, and resiliency testing. Ensure security and compliance testing aligns with industry standards for cloud and data center products. Mentor and develop talent within the organization, fostering a culture of quality and continuous improvement.

Negotiation

View details

Principal Engineer, System Software Platform Engineering

Ho Chi Minh, Ha Noi - Viet Nam


Product

  • Devops
  • Backend
  • AI

Create and manage a platform for AI that provides services for multiple users, handles identity and policy management, configures quotas, and controls costs. Additionally, this platform should offer easy paths for teams to work on AI projects. Oversee the deployment of AI models at scale, including routing, autoscaling, and implementing safety measures to ensure reliability and observability. Manage GPU resources in a Kubernetes environment, including device plugins, feature discovery, and scheduling strategies, among other responsibilities. Take charge of the entire lifecycle of GPUs, ensuring that driver, firmware, and runtime updates are implemented safely and consistently. Implement virtualization strategies for GPU resources, such as vGPU and PCIe passthrough, while defining policies for resource placement, isolation, and preemptive actions. Establish secure traffic and networking protocols, including gateways, service mesh, and authentication/authorization measures. Enhance observability and operational efficiency through monitoring tools for GPUs, response protocols for incidents, and optimization of costs. Develop reusable templates, integrate SDKs and CLIs, and implement infrastructure-as-code standards for the platform. Influence the platform's direction by creating design documents, mentoring engineers, and aligning platform development with the needs of AI products.

Negotiation

View details