Job Openings
SRE Lead
About the job SRE Lead
We are looking for a Site Reliability Engineer, responsible for building scalable and reliable infrastructure at Nference. As an SRE, you will be moving the SRE tech-forward by getting involved in restructuring the current procedures as well as starting from scratch when needed. You will work as part of an experienced and close-knit team to design, implement, test and launch products and solutions.
Responsibilities:
- Creative thinker and strong problem solver with meticulous attention to detail
- Highly organized, creative, motivated, and passionate about achieving results
- Able to balance multiple tasks and projects effectively and quickly adapt to new situations and technologies
- Able to work both independently and as part of a team
- Systematic problem-solver coupled with a strong sense of ownership and drive.
Requirements:
- 7+ years of experience as a Site Reliability Engineer or a mix of a software engineer and DevOps.
- Strong hands-on knowledge of Linux fundamentals System administration scripting, performance tuning/scalability, troubleshooting.
- Team Leading experience
- Write great quality code using SOLID principles including unit and integration tests.
- Hands-on development experience in an object-oriented programming language like Python.
- Hands-on experience developing task automations
- Experience using tools to create and manage CI (continuous integration) and CD (continuous delivery) pipelines.
- Familiarity with software development tools: source code management (SCM systems), code review systems, issue tracking tools, build tools, test frameworks, code quality tools.
- Experience implementing open-source observability and alerting tools, like Prometheus, Grafana, Cortex, Thanos, Alertmanager etc
- Have decent knowledge on networking (VPC, VNet, DNS etc) and of the TCP/IP stack, internet routing and load balancing.
- Worked with log and configuration management tool
- Prior experience of working with AWS, Azure, GCP is a plus
- Prior experience of working with Kubernetes, Docker and containers is plus
- Strong interpersonal communication skills (including listening, speaking, and writing) and ability to work well in a diverse, team-focused environment with other SREs, Engineers, Product Managers, etc
- Documenting your work should be in your DNA