Job Openings G16 - DevOps Engineer

About the job G16 - DevOps Engineer

Were looking for a DevOps Engineer to join our diverse, experienced team and help drive the adoption of modern application delivery practices across agencies. In this role, you'll play a key part in modernising development processes, improving system reliability, and strengthening observability for mission-critical applications.

What you'll do

  • Champion the Application Development Modernisation initiative, accelerating the adoption of practices such as CI/CD, DevSecOps, Shift Left security testing, SRE, and Observability to improve quality, security, and delivery speed in an outsourced development environment.
  • Design and implement SRE practices, including SLIs, SLOs, error budgets, and reliability frameworks to boost resilience and operational excellence.
  • Develop and roll out observability strategies with metrics, logs, and traces, leveraging modern tools for better visibility and faster troubleshooting.
  • Create and maintain observability playbooks, templates, and best practices, including distributed tracing and metrics collection across applications.
  • Design monitoring solutions, automated alerts, and dashboards that deliver real-time insights into system health and performance.

What you'll bring

  • Degree or Diploma in Computer Science, IT, Engineering, or a related discipline.
  • At least 1 year of experience with CI/CD pipelines.
  • Hands-on experience with enterprise observability platforms (e.g. ELK Stack, Dynatrace); familiarity with Prometheus and Grafana is a plus.
  • Experience with distributed tracing (e.g. AWS X-Ray) and log aggregation tools.
  • Strong knowledge of defining and implementing SLIs/SLOs and error budgets.
  • Proven experience in alerting strategies, dashboard design, and monitoring tools.
  • Familiarity with RUM and synthetic monitoring.
  • Strong problem-solving and troubleshooting skills.
  • Excellent communication, presentation, and stakeholder management skills.
  • Experience conducting post-mortems and implementing reliability improvements.

Bonus points if you have

  • Experience with SHIP-HATS stack.
  • Hands-on knowledge of cloud platforms (AWS, Azure, GCP).
  • Experience with Atlassian tools (JIRA, Confluence).
  • Proficiency in scripting languages (Python, Bash, PowerShell).
  • Experience with containerised platforms (Docker, Kubernetes).
  • Knowledge of infrastructure automation (Ansible, AWS Systems Manager).