HPC Storage Administrator

Job Title: Storage Administrator – HPC Managed Services

Location: Singapore

Working Hours: 08:30 – 18:00, Monday to Friday (excluding Public Holidays)

Role Overview

  • We are looking for a dedicated Storage Administrator to join our High-Performance Computing (HPC) Managed Services team. The successful candidate will play a key role in administering, maintaining, and supporting large-scale storage systems and parallel filesystems that underpin HPC operations.

Key Responsibilities

  • Administer and maintain HPC storage environments, including parallel filesystems such as Lustre or General Parallel File System (GPFS).
  • Monitor performance, manage capacity, and ensure data integrity across critical storage infrastructures.
  • Troubleshoot and resolve storage and filesystem-related issues in multi-user, high-performance computing environments.
  • Support storage lifecycle management, including upgrades, patching, and configuration changes.
  • Collaborate with system and network teams to optimize end-to-end I/O performance for HPC workloads.
  • Participate in a 24×7 on-call rotation to address P1/P2 incident escalations.
  • Maintain proper documentation, operational guidelines, and adherence to ITIL service management processes.

Requirements

  • At least two (2) years of recent experience managing large-scale parallel filesystems such as Lustre, GPFS, or similar technologies.
  • Strong proficiency in Linux system administration, particularly Red Hat Enterprise Linux or compatible distributions.
  • Solid understanding of storage technologies, RAID architectures, metadata servers, and network interconnects (InfiniBand, Ethernet).
  • Scripting ability (e.g., Bash, Python) for automation, monitoring, and reporting tasks.
  • Excellent analytical, troubleshooting, and communication skills.
  • Team-oriented and capable of working in mission-critical, production-grade environments.

Minimum Certification Requirements

  • ITIL Foundation, or an equivalent or higher certification.
  • Red Hat Certified System Administrator (RHCSA), or an equivalent or higher certification.