IT Operations Manager
JOB DESCRIPTION
JOB REQUIREMENT
WHAT'S ON OFFER
CONTACT
Job Summary
Company Type:
Blockchain
Technical Skills:
System, IT inhouse
Location:
Ho Chi Minh - Viet Nam
Working Policy:
Salary:
$ 3,000 - $ 5,000
Job ID:
J01053
Status:
Close
Related Job:
Storage System Engineer (Linux)
Ho Chi Minh - Viet Nam
Outsource
Monitoring storage performance, capacity, and availability for optimal performance and reliability. Troubleshooting storage-related issues and providing timely resolutions to users. Developing and maintaining scripts and automation tools for storage administration tasks. Performing regular data backup and recovery procedures to ensure data availability.
Negotiation
View detailsEngineering Manager - AI for RAN and 6G Wireless Systems
Ho Chi Minh, Ha Noi - Viet Nam
Product
- Machine Learning
- Management
- AI
Manage and expand an engineering team focused on AI-enabled signal processing for the Radio Access Network (RAN). Supervise the development of deep learning models for various tasks related to RAN. Work with global teams to drive proof-of-concepts and production-quality AI-RAN components. Supervise the integration of AI models into full-stack simulations and/or testbeds using various frameworks. Align project priorities with hardware-software co-design constraints and deployment scenarios. Provide mentorship and guidance to team members, ensure technical excellence, and contribute to strategic direction.
Negotiation
View detailsPrincipal Engineer, System Software Platform Engineering
Ho Chi Minh, Ha Noi - Viet Nam
Product
- Devops
- Backend
- AI
Create and manage a platform for AI that provides services for multiple users, handles identity and policy management, configures quotas, and controls costs. Additionally, this platform should offer easy paths for teams to work on AI projects. Oversee the deployment of AI models at scale, including routing, autoscaling, and implementing safety measures to ensure reliability and observability. Manage GPU resources in a Kubernetes environment, including device plugins, feature discovery, and scheduling strategies, among other responsibilities. Take charge of the entire lifecycle of GPUs, ensuring that driver, firmware, and runtime updates are implemented safely and consistently. Implement virtualization strategies for GPU resources, such as vGPU and PCIe passthrough, while defining policies for resource placement, isolation, and preemptive actions. Establish secure traffic and networking protocols, including gateways, service mesh, and authentication/authorization measures. Enhance observability and operational efficiency through monitoring tools for GPUs, response protocols for incidents, and optimization of costs. Develop reusable templates, integrate SDKs and CLIs, and implement infrastructure-as-code standards for the platform. Influence the platform's direction by creating design documents, mentoring engineers, and aligning platform development with the needs of AI products.