Sofia, Bulgaria

Senior Site Reliability Engineer

Job Description:

At DIGITALL, we don't just deliver technology; we deliver the future! We are explorers, knowledge-hunters, tech geeks, problem solvers and game changers who want to inspire and be inspired. Our DIGITALL people are always one step ahead: working with top-notch technologies, creating innovations ahead of market trends, and sharing a passion for discovering better ways. As a human-centric organization, we build our teams on mutual respect and open communication, allowing everyone to be authentic, express ideas and unleash their potential. We are proud of our DIGITALL bright minds and never stop developing their skills, so that together we keep pushing boundaries and doing what we love. DIGITALL operates globally with a team of 1000 experts in 16 locations across 8 countries.

The Senior Site Reliability Engineer (SRE) plays a key role in driving the reliability, scalability, and performance of Tricentis SaaS products. You will design and maintain cloud infrastructure, implement observability solutions, and collaborate closely with engineering teams to ensure seamless operations. This position combines software engineering principles with operational excellence to deliver highly available and resilient systems.

This is your job:

  • Design, build, and maintain scalable cloud infrastructure on AWS/Azure.
  • Develop advanced monitoring and alerting systems for proactive issue detection.
  • Automate deployments and operational tasks using Terraform, GitHub Actions, and Kubernetes.
  • Collaborate with product engineers to influence architecture and reliability standards.
  • Participate in on-call rotations (PagerDuty) for incident response and root cause analysis.
  • Continuously improve system availability, performance, and operational processes.

Your qualifications:

  • IT infrastructure management experience (servers, virtualization, storage, networking, applications).
  • Good understanding of cloud computing: virtualization technologies; the Infrastructure as a Service, Platform as a Service, and Software as a Service delivery models; and the current competitive landscape.
  • Strong proficiency in Terraform and GitHub Actions (CI/CD, GitOps).
  • Hands-on experience with Azure or AWS and Kubernetes.
  • Solid understanding of SaaS architecture and observability practices.
  • Experience with Linux OS configuration and troubleshooting.
  • Ability to define SLIs and manage metrics, alerts, and silences.
  • Excellent communication skills, a collaborative mindset, and a commitment to continuous learning.
  • Ability to work in a multicultural office environment.
  • Excellent written and spoken English.

Organizational information:

  • All applications will be treated in strict confidence.
  • Please note that only shortlisted candidates will be invited to an interview.

Required Skills:

Reliability