About the job Platform Operations Engineer 24014
The Platform Operations Engineer plays a critical role in maintaining and enhancing government agencies' mission-critical on-premises and cloud infrastructure platforms. This position combines operational excellence with strategic modernisation initiatives, ensuring robust platform reliability while driving the adoption of industry best operational practices. Working within a collaborative environment, you will support infrastructure transformation while maintaining seamless service delivery for Sentosa IT systems.
Key Responsibilities:
Infrastructure Management & Operations
Maintain and optimise critical infrastructure platforms encompassing compute, storage, virtualisation, and supporting systems across development, staging, and production environments. Ensure consistent platform performance through proactive monitoring, capacity planning, and lifecycle management of virtualisation platforms and GCC 2.0 Azure environments.
Operational Excellence & Automation
Implement and maintain platform standards whilst executing infrastructure automation initiatives to enhance operational efficiency and system reliability. Drive the adoption of modern operational practices including configuration management processes
Security & Compliance
Execute comprehensive monthly server patching strategies across GCC 2.0 Azure and on-premises environments, maintaining security posture while minimizing service disruption. Implement and maintain security controls including access management frameworks, security hardening protocols, and continuous compliance monitoring systems.
Incident Management & Support
Provide expert L1/L2 technical support for platform-related incidents, conducting thorough problem determination and resolution activities. Collaborate effectively with application teams to address platform stability, performance optimisation, and scalability requirements whilst maintaining service level agreements.
Modernisation & Innovation
Support containerisation initiatives and maintain hybrid cloud solutions for both modern and traditional workloads. Contribute to platform enhancement projects and implementation of innovative infrastructure solutions, ensuring alignment with enterprise architecture standards and government technology strategies.
Business Continuity & Documentation
Maintain robust backup, disaster recovery, and high-availability solutions for critical platform components. Develop and maintain comprehensive platform documentation, operational runbooks, and standard operating procedures to ensure knowledge transfer and operational consistency.
Technical Expertise:
- Cloud Platforms: Demonstrated experience with GCC 2.0 Azure services, hybrid cloud architectures, and cloud-native technologies
- Operating Systems: Advanced proficiency in Linux and Windows Server administration, including performance tuning and troubleshooting
- Containerisation: Practical knowledge of container technologies including Docker, Kubernetes, and container orchestration platforms
- Infrastructure as Code: Experience with automation tools and IaC practices using technologies such as Terraform, Ansible, or similar platforms
- Virtualisation: Understanding of Nutanix hyperconverged infrastructure concepts and virtualisation best practices
- Networking: Solid grasp of networking concepts, protocols, and technologies including TCP/IP, DNS, load balancing, and network security
- Scripting & Automation: Proficiency in scripting languages including Python, PowerShell, and Bash for automation and operational tasks
- Monitoring & Observability: Experience with monitoring platforms, logging systems, and observability tools for proactive infrastructure management
- Understanding of Government IM8 compliances requirement and/or best industry practices
Required Qualifications:
- Bachelor's degree in Computer Science, Information Technology, Engineering, or related technical discipline
- Minimum 3-5 years of experience in infrastructure operations, platform engineering, or related technical roles
- Proven track record in supporting large-scale infrastructure modernization initiatives within enterprise environments
- Demonstrated experience in maintaining high-availability systems and managing critical infrastructure platforms
Preferred Qualifications:
- Industry certifications in cloud platforms (Azure, AWS), virtualization technologies, or infrastructure management
- Experience with DevOps practices and CI/CD pipeline implementation
- Knowledge of ITIL framework and service management practices
- Previous experience in government or regulated industry environments
- Understanding of cybersecurity frameworks and compliance requirements
Key Competencies:
- Responsible, dependable, and able to commit to standby duties.
- Calm under pressure, with good problem-solving and escalation judgment.
- Detail-oriented with strong documentation discipline.
- Team player with eagerness to learn.