Job Openings G01 - Platform Operations Engineer

About the job G01 - Platform Operations Engineer

Key Responsibilities

Infrastructure Management & Operations

  • Maintain and optimise critical infrastructure platforms encompassing compute, storage, virtualisation, and supporting systems across development, staging, and production environments.
  • Ensure consistent platform performance through proactive monitoring, capacity planning, and lifecycle management of virtualisation platforms and GCC 2.0 Azure environments.

Operational Excellence & Automation

  • Implement and maintain platform standards whilst executing infrastructure automation initiatives to enhance operational efficiency and system reliability.
  • Drive the adoption of modern operational practices including configuration management processes

Security & Compliance

  • Execute comprehensive monthly server patching strategies across GCC 2.0 Azure and on-premises environments, maintaining security posture while minimizing service disruption.
  • Implement and maintain security controls including access management frameworks, security hardening protocols, and continuous compliance monitoring systems.

Incident Management & Support

  • Provide expert L1/L2 technical support for platform-related incidents, conducting thorough problem determination and resolution activities.
  • Collaborate effectively with application teams to address platform stability, performance optimisation, and scalability requirements whilst maintaining service level agreements.

Modernisation & Innovation

  • Support containerisation initiatives and maintain hybrid cloud solutions for both modern and traditional workloads.
  • Contribute to platform enhancement projects and implementation of innovative infrastructure solutions, ensuring alignment with enterprise architecture standards.

Business Continuity & Documentation

  • Maintain robust backup, disaster recovery, and high-availability solutions for critical platform components.
  • Develop and maintain comprehensive platform documentation, operational runbooks, and standard operating procedures to ensure knowledge transfer and operational consistency.

Technical Expertise

  • Cloud Platforms: Demonstrated experience with GCC 2.0 Azure services, hybrid cloud architectures, and cloud-native technologies
  • Operating Systems: Advanced proficiency in Linux and Windows Server administration, including performance tuning and troubleshooting
  • Containerisation: Practical knowledge of container technologies including Docker, Kubernetes, and container orchestration platforms
  • Infrastructure as Code: Experience with automation tools and IaC practices using technologies such as Terraform, Ansible, or similar platforms
  • Virtualisation: Understanding of Nutanix hyperconverged infrastructure concepts and virtualisation best practices
  • Networking: Solid grasp of networking concepts, protocols, and technologies including TCP/IP, DNS, load balancing, and network security
  • Scripting & Automation: Proficiency in scripting languages including Python, PowerShell, and Bash for automation and operational tasks
  • Monitoring & Observability: Experience with monitoring platforms, logging systems, and observability tools for proactive infrastructure management
  • Understanding of Government IM8 compliances requirement and/or best industry practices

Required Qualifications

  • Bachelor's degree in Computer Science, Information Technology, Engineering, or related technical discipline
  • Minimum 3 - 5 years of experience in infrastructure operations, platform engineering, or related technical roles
  • Proven track record in supporting large-scale infrastructure modernization initiatives within enterprise environments
  • Demonstrated experience in maintaining high-availability systems and managing critical infrastructure platforms

Preferred Qualifications

  • Industry certifications in cloud platforms (Azure, AWS), virtualization technologies, or infrastructure management
  • Experience with DevOps practices and CI/CD pipeline implementation
  • Knowledge of ITIL framework and service management practices
  • Understanding of cybersecurity frameworks and compliance requirements