About the job Senior DevOps Engineer (HealthTech/SaaS)
Our client is a US-based technology start-up that builds advanced open-source and enterprise data integration and analytics software for public health and development entities in low- and middle-income countries. Their vision is to transform how public and private sector stakeholders access and use data to address the world's most pressing development challenges. As a global company, diversity, equity, and inclusion are core parts of their values. They define impact by their success in helping their users access the full power of their data so they can act on valuable insights to build and sustain high performing, equitable systems. Their software is currently used by 12 countries across Africa, Asia and South America and they are funded by some of the largest global health foundations.
Role Overview
Our client is seeking an experienced Senior DevOps Engineer who will play a critical role in ensuring the seamless operation, scalability, and security of their technology platform, which supports the integration and analysis of complex data sets for governments and international organizations. Success in this role means building and maintaining robust infrastructure across cloud, on-prem, and government cloud environments, while continuously optimizing automation, deployment pipelines, and tooling to support ongoing product development. This role involves working closely with cross-functional teams, particularly product and engineering, to drive innovation and ensure the stability of deployments in mission-critical environments. Ultimately, the Senior DevOps Engineer will safeguard operational excellence, enabling our client to maintain the trust of their clients and continue delivering high-impact solutions to low and middle-income countries.
Job Type: Full Time
Workplace: Hybrid
Location: Cape Town
Requirements
- Bachelors degree in Computer Science, Information Technology, or a related field.
- Preferred: Masters degree in a relevant field or equivalent industry experience.
- Relevant certifications in cloud technologies (AWS, Azure, GCP) or DevOps tools (Terraform, Ansible, Docker) are a strong plus.
- 5+ years of experience as a DevOps Engineer, with a proven track record of managing and deploying infrastructure in complex environments.
- 5+ years of experience with Python.
- Hands-on experience with: Linux and advanced networkingCloud services (AWS preferred), automation tools (Ansible, Terraform), and containerization (Docker, Kubernetes).
- Managing CI/CD pipelines, automated deployments, and system monitoring.
- Operating within fast-paced, dynamic startup environments, supporting teams remotely.
- Preferred experience: Working with data and analytics products or tools, and exposure to working with governments, NGOs, or international organizations
- Proficient in cloud infrastructure management (AWS, Azure, or GCP), including provisioning, scaling, and securing cloud services.
- Advanced expertise in automation and infrastructure-as-code using tools like Ansible, Terraform, and Jenkins.
- Strong scripting skills in Python, Bash, or similar languages to develop automation scripts and manage infrastructure
- Deep understanding of containerization and orchestration technologies like Docker and Kubernetes.
- Excellent understanding of networking, security practices, and VPN management in cloud and on-prem environments.
- Excellent written and verbal communication skills, with the ability to clearly articulate complex technical concepts to both technical and non-technical stakeholders.
- Strong collaboration skills, working effectively across remote teams and cross-functional groups.
- Adaptable, capable of responding to rapidly changing priorities, and able to operate in a high-ambiguity environment.
- Self-starter with the ability to independently manage tasks, make decisions, and drive initiatives to completion.
Responsibilities
Infrastructure and Systems Ownership (30%)
- Design, implement, and manage scalable, secure infrastructure across cloud, on-premises, and government environments.
- Optimize systems for performance, security, and growth using AWS, Ansible, and Terraform.
- Lead the development of reliable, automated deployment tools to support our clients global platform.
Security and Compliance Management (25%)
- Implement security best practices to protect infrastructure and data, especially in high-security environments.
- Ensure compliance with industry standards through regular audits, vulnerability assessments, and timely fixes.
- Maintain robust cybersecurity to safeguard our clients reputation with governments and international organizations.
Operational Support and Troubleshooting (20%)
- Own system health and uptime by providing rapid, effective support for infrastructure issues.
- Conduct root cause analysis, apply fixes, and implement preventive measures.
- Proactively monitor systems to prevent bottlenecks or service disruptions, ensuring continuous service for clients.
Automation and CI/CD Leadership (15%)
- Drive automation to improve deployment efficiency and consistency.
- Optimize CI/CD pipelines for seamless integration and reliable, rapid deployments.
- Lead efforts to reduce manual interventions, enhance scalability, and improve operational efficiency across engineering and product development.
Cross-functional Collaboration and Strategic Input (10%)
- Work closely with product, engineering, and data teams to align infrastructure with business goals.
- Provide strategic input on infrastructure needs during product development and planning discussions to support long-term platform and client requirements.