Site Reliability Engineering
About the job
Experience: 14 to 20 Years
Work Type: Hybrid
Employment Type: Full Time
Responsibilities:
The Senior Manager, Site Reliability Engineering, will be responsible for leading and managing the SRE team to establish, enhance, and maintain SRE practices across Tech Product teams.
- Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems.
- Additionally, SREs will keep an ever-watchful eye on our systems' capacity and performance. Much of our software development focuses on optimizing existing systems, building infrastructure, and eliminating work through automation.
- Work on applications (eCommerce/stores and supply chain apps) built for a multi-brand business model, using your expertise in coding, algorithms, complexity analysis, and large-scale system design.
- SRE's culture of diversity, intellectual curiosity, problem-solving, and openness is key to its success. We encourage our SREs to collaborate, think big, and take risks in a blame-free environment.
Requirements:
- 14+ years of software development and engineering experience in large-scale enterprise environments.
- Bachelor's degree in Computer Science or similar, and/or related experience.
- System-level understanding of storage, computing, distributed systems, and networking.
- Experience in multiple programming languages (Java/J2EE, Python, Node.js), messaging (MQ, Kafka), databases (SQL and NoSQL), and middleware (Tomcat).
- Willingly shares relevant technical and/or industry knowledge and expertise to mentor team members.
- Proficiency in cloud services such as Azure PaaS and AKS, API development, and Pivotal Cloud Foundry (PCF).
- Understanding of SRE operations methodology and its implementation in an enterprise environment.
- Proficiency in cloud fundamentals, especially Azure, OCI, and GCP.
- Sound troubleshooting skills and experience participating in severity-issue and CODE RED calls.
- Good understanding of monitoring tools such as Splunk, New Relic, Nagios, Graphite, Grafana, etc.
- Proficient with modern DevOps practices including CI/CD.