Job Openings
G01 - Platform Operations Engineer
About the job G01 - Platform Operations Engineer
Key Responsibilities
Infrastructure Management & Operations
- Maintain and optimise critical infrastructure platforms encompassing compute, storage, virtualisation, and supporting systems across development, staging, and production environments.
- Ensure consistent platform performance through proactive monitoring, capacity planning, and lifecycle management of virtualisation platforms and GCC 2.0 Azure environments.
Operational Excellence & Automation
- Implement and maintain platform standards whilst executing infrastructure automation initiatives to enhance operational efficiency and system reliability.
- Drive the adoption of modern operational practices including configuration management processes
Security & Compliance
- Execute comprehensive monthly server patching strategies across GCC 2.0 Azure and on-premises environments, maintaining security posture while minimizing service disruption.
- Implement and maintain security controls including access management frameworks, security hardening protocols, and continuous compliance monitoring systems.
Incident Management & Support
- Provide expert L1/L2 technical support for platform-related incidents, conducting thorough problem determination and resolution activities.
- Collaborate effectively with application teams to address platform stability, performance optimisation, and scalability requirements whilst maintaining service level agreements.
Modernisation & Innovation
- Support containerisation initiatives and maintain hybrid cloud solutions for both modern and traditional workloads.
- Contribute to platform enhancement projects and implementation of innovative infrastructure solutions, ensuring alignment with enterprise architecture standards.
Business Continuity & Documentation
- Maintain robust backup, disaster recovery, and high-availability solutions for critical platform components.
- Develop and maintain comprehensive platform documentation, operational runbooks, and standard operating procedures to ensure knowledge transfer and operational consistency.
Technical Expertise
- Cloud Platforms: Demonstrated experience with GCC 2.0 Azure services, hybrid cloud architectures, and cloud-native technologies
- Operating Systems: Advanced proficiency in Linux and Windows Server administration, including performance tuning and troubleshooting
- Containerisation: Practical knowledge of container technologies including Docker, Kubernetes, and container orchestration platforms
- Infrastructure as Code: Experience with automation tools and IaC practices using technologies such as Terraform, Ansible, or similar platforms
- Virtualisation: Understanding of Nutanix hyperconverged infrastructure concepts and virtualisation best practices
- Networking: Solid grasp of networking concepts, protocols, and technologies including TCP/IP, DNS, load balancing, and network security
- Scripting & Automation: Proficiency in scripting languages including Python, PowerShell, and Bash for automation and operational tasks
- Monitoring & Observability: Experience with monitoring platforms, logging systems, and observability tools for proactive infrastructure management
- Understanding of Government IM8 compliances requirement and/or best industry practices
Required Qualifications
- Bachelor's degree in Computer Science, Information Technology, Engineering, or related technical discipline
- Minimum 3 - 5 years of experience in infrastructure operations, platform engineering, or related technical roles
- Proven track record in supporting large-scale infrastructure modernization initiatives within enterprise environments
- Demonstrated experience in maintaining high-availability systems and managing critical infrastructure platforms
Preferred Qualifications
- Industry certifications in cloud platforms (Azure, AWS), virtualization technologies, or infrastructure management
- Experience with DevOps practices and CI/CD pipeline implementation
- Knowledge of ITIL framework and service management practices
- Understanding of cybersecurity frameworks and compliance requirements