Job Openings
Site Reliability Engineer (SRE) Team Lead (cancelled)
About the job Site Reliability Engineer (SRE) Team Lead (cancelled)
Who We Are
Walnut is a fast growing startup in the sales automation space, backed by top-notch investors.
We help some of the best sales teams in the world to dramatically perform better, by easily creating failure-free, interactive, and personalized demo experiences for each prospect - without coding.
We are a young company that values culture, transparency, and achievements, and put our founding members at the very heart of our company.
We are looking for a Site Reliability Engineer Team Lead to design and implement scalable systems that will keep Walnut running smoothly and support our business growth. You will lead a global team of SREs and work with cutting-edge technologies in a dynamic and agile environment.
Key Responsibilities:
- Lead the SRE team in developing technical solutions, leveraging your deep knowledge of the latest technologies and tools in system reliability and resilience.
- Make decisions that will help craft our infrastructure stack, tackling concerns ranging from user configuration, resource isolation, capacity management, and cost.
- Ensure the reliability of our system, abiding by SLAs, setting SLIs and SLOs that allow for a manageable error budget for Walnut teams.
- Build and evolve tools and standards that provide the best developer experience possible.
- Measure, own, and derive the improvement of metrics like deployment frequency, lead time for changes, change failure rate, and time to restore service.
- Be part of a 24/7 on-call rotation.
Requirements:
- 5+ years of professional experience as a Site Reliability Engineer or DevOps.
- 2+ years of management experience
- Worked in a team ensuring system reliability through automation, SLI definition, observability, incident management, and error budget allocation.
- Proficiency in AWS, Pulumi and/or Terraform.
- Knowledge of at least one backend programming language like Go, Python, NodeJs, etc.
- Understanding of web development tools (HTML, CSS, Javascript, TypeScript, React).
- Experience with cloud native platforms and tools: Kubernetes, AWS Lambda, AWS Lambda@edge, etc.
- Experience with end-to-end frontend testing frameworks (e.g., Playwright) a plus.
- Experience working in an Agile organization which values team autonomy.
- Youre product oriented, сalm, and focused under pressure.