HPC Storage Administrator
Job Title: Storage Administrator – HPC Managed Services
Location: Singapore
Working Hours: 08:30 – 18:00, Monday to Friday (excluding Public Holidays)
Role Overview
We are looking for a dedicated Storage Administrator to join our High-Performance Computing (HPC) Managed Services team. The successful candidate will play a key role in administering, maintaining, and supporting the large-scale storage systems and parallel filesystems that underpin HPC operations.
Key Responsibilities
- Administer and maintain HPC storage environments, including parallel filesystems such as Lustre or General Parallel File System (GPFS).
- Monitor performance, manage capacity, and ensure data integrity across critical storage infrastructures.
- Troubleshoot and resolve storage and filesystem-related issues in multi-user, high-performance computing environments.
- Support storage lifecycle management, including upgrades, patching, and configuration changes.
- Collaborate with system and network teams to optimize end-to-end I/O performance for HPC workloads.
- Participate in a 24×7 on-call rotation to address P1/P2 incident escalations.
- Maintain proper documentation, operational guidelines, and adherence to ITIL service management processes.
Requirements
- At least two (2) years of recent experience managing large-scale parallel filesystems such as Lustre, GPFS, or similar technologies.
- Strong proficiency in Linux system administration, particularly Red Hat Enterprise Linux or compatible distributions.
- Solid understanding of storage technologies, RAID architectures, metadata servers, and network interconnects (InfiniBand, Ethernet).
- Scripting ability (e.g., Bash, Python) for automation, monitoring, and reporting tasks.
- Excellent analytical, troubleshooting, and communication skills.
- Team-oriented and capable of working in mission-critical, production-grade environments.
Minimum Certification Requirements
- ITIL Foundation, or an equivalent or higher certification.
- Red Hat Certified System Administrator (RHCSA), or an equivalent or higher certification.