Job Openings SRE Lead

About the job SRE Lead

We are looking for a Site Reliability Engineer responsible for building scalable and reliable infrastructure at Nference. As an SRE, you will move the SRE practice forward by restructuring current procedures and building from scratch when needed. You will work as part of an experienced, close-knit team to design, implement, test, and launch products and solutions.

Responsibilities:

  • Creative thinker and strong problem solver with meticulous attention to detail
  • Highly organized, creative, motivated, and passionate about achieving results
  • Able to balance multiple tasks and projects effectively and quickly adapt to new situations and technologies
  • Able to work both independently and as part of a team
  • Systematic problem solver with a strong sense of ownership and drive

Requirements:

  • 7+ years of experience as a Site Reliability Engineer or in a role combining software engineering and DevOps.
  • Strong hands-on knowledge of Linux fundamentals: system administration, scripting, performance tuning and scalability, and troubleshooting.
  • Team leadership experience
  • Ability to write high-quality code following SOLID principles, including unit and integration tests.
  • Hands-on development experience in an object-oriented programming language like Python.
  • Hands-on experience developing task automation
  • Experience using tools to create and manage CI (continuous integration) and CD (continuous delivery) pipelines.
  • Familiarity with software development tools: source code management (SCM systems), code review systems, issue tracking tools, build tools, test frameworks, code quality tools.
  • Experience implementing open-source observability and alerting tools such as Prometheus, Grafana, Cortex, Thanos, and Alertmanager
  • Working knowledge of networking (VPC, VNet, DNS, etc.) and of the TCP/IP stack, internet routing, and load balancing.
  • Experience with log management and configuration management tools
  • Prior experience working with AWS, Azure, or GCP is a plus
  • Prior experience working with Kubernetes, Docker, and containers is a plus
  • Strong interpersonal communication skills (listening, speaking, and writing) and the ability to work well in a diverse, team-focused environment with other SREs, engineers, product managers, etc.
  • Documenting your work should be in your DNA