Job Openings Site Reliability Engineer (SRE) Team Lead (cancelled)

About the job Site Reliability Engineer (SRE) Team Lead (cancelled)

Who We Are

Walnut is a fast growing startup in the sales automation space, backed by top-notch investors.

We help some of the best sales teams in the world to dramatically perform better, by easily creating failure-free, interactive, and personalized demo experiences for each prospect - without coding.

We are a young company that values culture, transparency, and achievements, and put our founding members at the very heart of our company.

We are looking for a Site Reliability Engineer Team Lead to design and implement scalable systems that will keep Walnut running smoothly and support our business growth. You will lead a global team of SREs and work with cutting-edge technologies in a dynamic and agile environment.

Key Responsibilities:

  • Lead the SRE team in developing technical solutions, leveraging your deep knowledge of the latest technologies and tools in system reliability and resilience.
  • Make decisions that will help craft our infrastructure stack, tackling concerns ranging from user configuration, resource isolation, capacity management, and cost.
  • Ensure the reliability of our system, abiding by SLAs, setting SLIs and SLOs that allow for a manageable error budget for Walnut teams.
  • Build and evolve tools and standards that provide the best developer experience possible.
  • Measure, own, and derive the improvement of metrics like deployment frequency, lead time for changes, change failure rate, and time to restore service.
  • Be part of a 24/7 on-call rotation.

Requirements:

  • 5+ years of professional experience as a Site Reliability Engineer or DevOps.
  • 2+ years of management experience
  • Worked in a team ensuring system reliability through automation, SLI definition, observability, incident management, and error budget allocation.
  • Proficiency in AWS, Pulumi and/or Terraform.
  • Knowledge of at least one backend programming language like Go, Python, NodeJs, etc.
  • Understanding of web development tools (HTML, CSS, Javascript, TypeScript, React).
  • Experience with cloud native platforms and tools: Kubernetes, AWS Lambda, AWS Lambda@edge, etc.
  • Experience with end-to-end frontend testing frameworks (e.g., Playwright) a plus.
  • Experience working in an Agile organization which values team autonomy.
  • Youre product oriented, сalm, and focused under pressure.