Job Description:
JOB DESCRIPTION
Job Title: Cloud Infrastructure & Platform Engineering Lead
Business Division: Cloud Services
Department: Cloud Advisory & Transformation
JOB CONTENT
Role Objective
As the Cloud & Infrastructure Platform Engineering Lead, you will be the driving force behind the creation and management of a groundbreaking platform. This platform will automate every facet of cloud resource management, from onboarding to offboarding, from provisioning to patching, and from securing to supporting. You will lead a talented team of engineers to design, develop, and maintain this platform, revolutionizing the way we interact with cloud resources.
Detailed Roles and Responsibilities:
Platform Ownership:
- Take full ownership of designing, building, and evolving a cutting-edge platform for cloud resource management.
- Oversee the day-to-day operations of the platform, ensuring reliability, scalability, and security.
- Lead incident response efforts and maintain a resilient and highly available platform.
Automation Excellence:
- Lead the development of automation solutions that encompass on-boarding, off-boarding, provisioning, patching, securing, monitoring, and backup processes for cloud resources.
- Build the development of repeatable, highly automated solutions to enable and support hyper-scale cloud implementations.
- Drive a culture of automation and continuous improvement across the organization by eliminating toil.
Team Leadership:
- Manage and inspire a team of passionate engineers who are committed to delivering excellence.
- Provide mentorship, guidance, and opportunities for skill development within the team.
Security and Compliance:
- Implement and enforce robust security measures and compliance standards across the platform.
- Collaborate with security teams to address vulnerabilities and maintain the highest security posture.
Testing and Quality Assurance:
- Develop and execute testing strategies for the platform components, including performance testing, integration testing, and security testing.
- Identify and resolve issues through thorough testing and validation.
Scripting and Tool Skills:
- Demonstrate proficiency in scripting languages such as Python, PowerShell, and Bash/Shell scripting to automate tasks and manage cloud resources.
- Utilize Infrastructure as Code (IaC) tools like Terraform, Azure Resource Manager (ARM) Templates, and Bicep for defining and provisioning infrastructure.
- Implement configuration management using tools like Ansible.
Innovation and Technology Evaluation:
- Stay ahead of the curve on emerging cloud technologies and trends.
- Evaluate and introduce new technologies and tools to enhance the platform's capabilities.
Documentation and Knowledge Sharing:
- Maintain comprehensive documentation for system configurations, procedures, and troubleshooting guides.
- Share knowledge and expertise with team members and stakeholders.
Technical Standards and Change Management:
- Develop technical standards and maintain a repository for deliverables, methodologies, and deployment documents.
- Lead change management efforts by preparing implementation plans, rollback plans, test plans, and conducting risk and impact analyses for critical or complex changes.
KEY INTERACTIONS
Internal:
External:
- Management
- Project Teams
- Architecture Team
- Engineering Team
- Developers
- Vendors and suppliers
- Customers
BEHAVIORAL SKILLS, KNOWLEDGE, AND EXPERIENCE
(Special behavioral skills, knowledge and experience needed for the satisfactory performance of the job)
Educational Qualifications:
- Diploma or Degree in Computer Science or Engineering
- Accreditations:
- o Azure Solutions Architect Expert is a must
- CKA (Certified Kubernetes Administrator) advantageous / additional
Skills & Experience:
- A minimum of 5 years of hands-on experience in cloud platform engineering
- Experience working for Azure Expert MSPs (e.g. Accenture, Avanade, Rackspace, NTT, Wipro, HCL, TCS, etc.) with a track record of building automation and cloud platform solutions for managing end-to-end lifecycle management of cloud resources
- Proficiency in cloud platforms, particularly Microsoft Azure.CMP commercial tools or development tools
- Strong expertise in infrastructure as code (IaC) and automation tools.
- Exceptional leadership and team management skills.
- Proven problem-solving and decision-making abilities.
- Effective communication, facilitation and influencing skills - ability to present ideas clearly and concisely.
- Experience on transformation projects and successful transitions to implementation support teams.
- Strong analytical/troubleshooting skills
Behavioral Skills:
- Strong communication skills
- Ability to work under pressure, independently and prioritize with minimal supervision
- Multi-tasking skills and attention to detail
- Team player with ability to work with cross functional teams
- Strong communication skills
- Critical thinking skills
- Attention to detail
- Ability to juggle multiple, competing, frequently changing time-sensitive deadlines and priorities
The Job holder may be required to undertake additional duties, which may be reasonably expected and forms part of the function of the job.