Job Openings Observability Engineer - Incident Management and Service Reliability.

About the job Observability Engineer - Incident Management and Service Reliability.

WE ARE HIRING:  Observability Engineer - Incident Management and Service Reliability.


      Job Role:

  • Define and implement observability standards for the bank, integrating relevant
    tools and platforms to enhance visibility.
  • Design and maintain standard dashboards, monitors, and actionable alerts to
    ensure the availability of critical applications and services.
  • Continuously analyze performance trends to identify bottlenecks and optimize
    infrastructure, applications, and services.
  • Collaborate with development and infrastructure teams to proactively resolve
    performance issues.
  • Leverage observability data to support capacity planning and performance
    forecasting.
  • Drive high system uptime and reliability by improving observability tools and
    enabling rapid issue detection and resolution.
  • Regularly review and update observability strategies and tools to keep pace with
    evolving technologies and system architecture.
  • Provide training and documentation to ensure best practices in the use and
    interpretation of observability tools across team.
    
    Job Specification:
  • A Bachelors degree in Information Technology, Computer Science, or a related
    field recognized by the University Grants Commission.
  • Minimum of 2 years' experience in a similar capacity.
  • Understanding of incident management principles and best practices (knowledge
    of DataDog or ITIL certification will be an added advantage).
  • Basic understanding of application architectures, microservices, middleware,
    distributed systems, DevOps, and application runtimes (Java/IIS), as well as
    operating systems (Windows/Linux).
  • Strong interpersonal and communication skills.
  • Problem-solving mindset with a strong orientation towards result

Ready to take your career to new heights?

We're InTalent Asia, your go-to recruitment partner in Sri Lanka, and we've got an exciting opportunity for you! Our client is looking for a dynamic individual to fill the role of Observability Engineer - Incident Management and Service Reliability.

At InTalent Asia, we're not just recruiters; we're here to architect your career success. Join us in the journey of matching your skills with unparalleled opportunities. Apply now and see how you can be the perfect fit for this exclusive position!