Job Openings
Observability Engineer - Incident Management and Service Reliability.
About the job Observability Engineer - Incident Management and Service Reliability.
WE ARE HIRING: Observability Engineer - Incident Management and Service Reliability.
Job Role:
- Define and implement observability standards for the bank, integrating relevant
tools and platforms to enhance visibility. - Design and maintain standard dashboards, monitors, and actionable alerts to
ensure the availability of critical applications and services. - Continuously analyze performance trends to identify bottlenecks and optimize
infrastructure, applications, and services. - Collaborate with development and infrastructure teams to proactively resolve
performance issues. - Leverage observability data to support capacity planning and performance
forecasting. - Drive high system uptime and reliability by improving observability tools and
enabling rapid issue detection and resolution. - Regularly review and update observability strategies and tools to keep pace with
evolving technologies and system architecture. - Provide training and documentation to ensure best practices in the use and
interpretation of observability tools across team.
Job Specification:
- A Bachelors degree in Information Technology, Computer Science, or a related
field recognized by the University Grants Commission. - Minimum of 2 years' experience in a similar capacity.
- Understanding of incident management principles and best practices (knowledge
of DataDog or ITIL certification will be an added advantage). - Basic understanding of application architectures, microservices, middleware,
distributed systems, DevOps, and application runtimes (Java/IIS), as well as
operating systems (Windows/Linux). - Strong interpersonal and communication skills.
- Problem-solving mindset with a strong orientation towards result
Ready to take your career to new heights?
We're InTalent Asia, your go-to recruitment partner in Sri Lanka, and we've got an exciting opportunity for you! Our client is looking for a dynamic individual to fill the role of Observability Engineer - Incident Management and Service Reliability.
At InTalent Asia, we're not just recruiters; we're here to architect your career success. Join us in the journey of matching your skills with unparalleled opportunities. Apply now and see how you can be the perfect fit for this exclusive position!