DevOps Engineer (SRE), AI Exposure
Opening Code : 0410/MH102
Posted on 4 Oct 2025
Job Highlights
- Well experience in SRE for K8s
- Up to 4 months bonus
- Exposure in AI computing center
MNC is looking for an experienced SRE Engineer to support Devops Operation
Job Description
Key Responsibilities
- Work with IT Infrastructure teams to design and engineer AI computing infrastructure for high-performance computing (HPC) solutions.
- Design, implement, and manage scalable GPU infrastructure using Kubernetes.
- Ensure the availability, performance, and reliability of production systems, proactively identifying and resolving issues.
- Utilize configuration management tools to automate the provisioning and management of servers and applications at Data Center.
Qualifications:
- Bachelor’s degree or higher in Telecommunications, Information Systems Management, Electrical Engineering, or a related field.
- At least 5 years of experience in supporting sizable infrastructure.
- Hands-on experience in performing SRE for devops operation using Kubernetes.
- Familiarity with computing power-related technologies (e.g., CPU/GPU/AI accelerators, high-performance computing, distributed computing) would be a great advantage.
- Good communication in English and Chinese (Cantonese & Mandarin).