DevOps Monitoring Engineer

Belgrade, Belgrade, Serbia System Administrator Department

Job Openings DevOps Monitoring Engineer

About the job DevOps Monitoring Engineer

Hello there, thanks for paying attention to our vacancy and let us introduce ourselves.

We are an IT company occupying a remarkable place in marketing, gaming, and financial segments of the iGaming industry. Our products are successfully implemented in European, African, and Latin American markets.

Ever since we first started, we have been driven to find the best solutions in everything we do. At the same time, we have been creating strong connections all over the world which have helped us establish our presence in the industry.

We strive for fun, elegance, and efficiency in everything we do. You can feel this in our relaxed, homelike working environment. We offer employment and seek to hire only the brightest, most driven, and most gifted minds around.

Currently, we are looking for an ambitious DevOps Monitoring Engineer to join our team for full-time employment in our beautiful office in New Belgrade.

Here are a few reasons to join us:

competitive salary and constant encouragement for your efforts and contribution (annual salary review);
rapid growth (junior to CEO is a familiar journey for our company. We value business results and individuals eager to raise the bar. As an employer we provide supportive space to pursue your ideas);
we are taking care of the overall well-being of our people by providing private health insurance and Fit Pass for all;
work-life balance (every second-week Power-Up Friday with 4 hours of working time, 25 working days of paid vacation, flexible schedule);
modern and comfy office (easy-to-get office in new business center with a spectacular view, free fruits, and snacks in the office, playroom with billiard, tennis table, and PlayStation);
we like to celebrate special days and have fun at our corporate events, and team-building activities, and we have special vouchers for our people's birthdays so you can enjoy your special day.

Responsibilities:

Design and implement observability solutions (metrics, logs, and traces) to ensure end-to-end visibility across applications and infrastructure.
Monitor system performance and availability, proactively identifying and resolving issues to minimize downtime and service disruption.
Configure and maintain monitoring and alerting tools (e.g., Prometheus, Grafana, Datadog, ELK) to ensure accurate and timely insights.
Develop and maintain dashboards and alerts that provide actionable insights for engineering and operations teams.
Analyze incidents and system behavior to perform root cause analysis and recommend preventive improvements.
Define and track SLIs, SLOs, and SLAs to measure system reliability and drive continuous performance improvements.
Collaborate with DevOps, SRE, and development teams to integrate observability into CI/CD pipelines and system design.
Automate monitoring processes and workflows to improve efficiency and reduce manual intervention.
Document observability practices, configurations, and incident response procedures to ensure knowledge sharing and consistency.
Evaluate and implement new tools and technologies to enhance monitoring capabilities and support evolving business needs.

Experience:

Bachelor’s degree in Computer Science, Information Technology, or a related field.
3–5+ years of experience in observability, monitoring, SRE, or DevOps roles within complex production environments.
Strong proficiency with monitoring and observability tools (e.g., Prometheus, Grafana, ELK stack, Datadog) and log analysis.
Experience with cloud platforms (AWS, Azure, or GCP) and containerized environments (Docker, Kubernetes).
Strong analytical, troubleshooting, and collaboration skills to work effectively with cross-functional engineering teams.

Good to Have:

Experience with distributed systems and microservices architectures in large-scale environments.
Familiarity with OpenTelemetry or similar frameworks for implementing standardized observability.
Proficiency in scripting or programming languages (e.g., Python, Go, Bash) for automation tasks.
Experience with incident management and on-call practices (e.g., PagerDuty, Opsgenie, ITIL frameworks).
Relevant certifications (e.g., AWS, Kubernetes, or observability/monitoring tools certifications).

From our part, we are ready to provide you with our full and comprehensive support in order to successfully cope with this list of tasks. Our System Administration Department is waiting for you to become an indispensable part of it!

Or refer someone