Job Openings: Site Reliability Engineering

About the job: Site Reliability Engineering

Experience: 14 to 20 Years

Work Type: Hybrid

Employment Type: Full Time

Responsibilities:


The Senior Manager, Site Reliability Engineering, will be responsible for leading and managing the SRE team to establish, enhance, and maintain SRE practices across Tech Product teams.

  • Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. 
  • Additionally, SREs will keep an ever-watchful eye on our systems' capacity and performance. Much of our software development focuses on optimizing existing systems, building infrastructure, and eliminating work through automation.
  • You will work on systems (eCommerce/stores and supply chain apps) built for a multi-brand business model, while using your expertise in coding, algorithms, complexity analysis, and large-scale system design.
  • SRE's culture of diversity, intellectual curiosity, problem-solving, and openness is key to its success. We encourage our engineers to collaborate, think big, and take risks in a blame-free environment.

Requirements:

  • 14+ years of software development and engineering experience in a large-scale enterprise.
  • Bachelor's degree in Computer Science or similar, and/or related experience.
  • System-level understanding of storage, computing, distributed systems, and networking.
  • Experience with multiple programming languages (Java/J2EE, Python, Node.js), messaging (MQ, Kafka), databases (SQL and NoSQL), and middleware such as Tomcat.
  • Willingly shares relevant technical and/or industry knowledge and expertise to mentor team members.
  • Proficiency in cloud services such as Azure PaaS and AKS, API development, and Pivotal Cloud Foundry (PCF).
  • Understanding of SRE operations methodology and its implementation in an enterprise environment.
  • Proficiency in cloud fundamentals, especially Azure, OCI, and GCP.
  • Sound troubleshooting skills and experience participating in severity-issue and CODE RED calls.
  • Good understanding of monitoring tools such as Splunk, New Relic, Nagios, Graphite, Grafana, etc.
  • Proficient with modern DevOps practices including CI/CD.