Site Reliability Engineering (SRE) Project Manager

DIGITALHUB is a Peruvian outsourcing company providing BPO and IT services. Our vision is a future in which every person can find the best job and our partners can discover the best of Latin American talent.

We are currently looking for a "Site Reliability Engineering Project Manager" for a remote position. Candidates must meet the following requirements:

SUMMARY

We are looking for a Project Manager to join a Site Reliability team. As a PM, they will report on the "Reliability SLO" of technology platforms scoped to a business unit or corporate function.

They will be responsible for coordinating the team's work using agile methodologies and will serve as the key engagement point with development teams.

Our definition of Reliability is an aggregation of the four golden signals (latency/error rate/uptime/cost) as well as security. We ask that they use modern data-driven techniques to track cycle time and the DORA metrics, and use those measurements to tune team efficiency and productivity. Engineering activities will involve developing dashboards, charts, and graphs from data pulled from telemetry tools and software platforms such as GitHub and JIRA, and using data trends, in collaboration with the technical lead, to deliver decision-making tools.
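
As an illustration of the kind of tracking described above, here is a minimal sketch that summarizes three commonly cited DORA metrics (deployment frequency, lead time for changes, change failure rate) from a list of deployment records. The record layout, the sample data, and the function name dora_summary are assumptions made for this example; they are not part of the role's actual tooling, and real data would come from platforms such as GitHub and JIRA.

```python
from datetime import datetime
from statistics import median

# Hypothetical deployment records: (commit time, deploy time, caused_incident).
# The layout and values are illustrative assumptions, not real data.
DEPLOYS = [
    (datetime(2024, 1, 1, 9), datetime(2024, 1, 1, 15), False),
    (datetime(2024, 1, 2, 10), datetime(2024, 1, 3, 10), True),
    (datetime(2024, 1, 4, 8), datetime(2024, 1, 4, 12), False),
    (datetime(2024, 1, 5, 9), datetime(2024, 1, 7, 9), False),
]


def dora_summary(deploys, period_days):
    """Summarize three DORA metrics for a non-empty list of deployments."""
    # Lead time for changes: hours from commit to production deploy.
    lead_hours = [
        (deployed - committed).total_seconds() / 3600
        for committed, deployed, _ in deploys
    ]
    # Change failure rate: share of deployments that caused an incident.
    failures = sum(1 for _, _, failed in deploys if failed)
    return {
        "deploys_per_day": len(deploys) / period_days,
        "median_lead_time_hours": median(lead_hours),
        "change_failure_rate": failures / len(deploys),
    }


if __name__ == "__main__":
    for name, value in dora_summary(DEPLOYS, period_days=7).items():
        print(f"{name}: {value:.2f}")
```

The median is used for lead time so that one unusually slow change does not dominate the summary; a team could equally report a mean or a percentile.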

WORK ARRANGEMENT

Remote

DURATION

6 months, remote.

REQUIREMENTS

An ideal candidate would have over 6 years of project management experience with a team responsible for DevSecOps and Site Reliability activities. Additionally, qualified candidates should have an engineering background with experience in DevSecOps functions and/or cloud engineering activities (preferably in AWS). They should also have hands-on experience gathering business requirements, analyzing workflows/processes, prioritizing team backlogs, writing functional specifications in user story format, assessing risks, and reporting on key business indicators. Strong communication, organizational, and task-tracking skills are highly valued.

RESPONSIBILITIES

  • Organization Enablement:
    • Perform Team Health Checks with recurring feedback.
    • Define communication strategy and execution across portfolios.
    • Perform impact analysis.
    • Enable Service adoption and sustainability measures.
  • Governance:
    • Perform Financial Reporting & Analysis of hosting charges across cloud providers.
    • Oversee operational reporting of events, incidents, issues, and root cause analysis lifecycle management.
    • Establish and report on business insights / KPIs and review key changes with engineering stakeholders.
    • Develop and execute strategy for industry certification compliance (SOC-2 / NIST / 1EdTech) across the various products inside the platform.
  • Business Product Management:
    • Establish demand management and improve business agility.
    • Perform functional decomposition on complex problems through collaboration with engineering leaders.
    • Prioritize work activities through a combination of stakeholder input, business value, and cost to achieve.
    • Curate a roadmap by establishing a technical vision in collaboration with stakeholders.
  • Program Management:
    • Refine and advocate for agile delivery management in the role of scrum master: lead ceremonies to maximize team productivity, help resolve blockers and dependencies, and enable sizing of work and task breakdown.
    • Establish charters through the identification of business product opportunities and collaborate with software developers to assess the feasibility of software solutions.
    • Collaborate with software developers, TPMs and business product managers to establish development, testing and deployment plans.
    • Draft agile themes, epics and stories, maintain backlog with high-quality stories, acceptance criteria, and clear priorities.
    • Manage the schedule; identify, communicate, and resolve blockers; and ensure that delivery timelines and scope are well understood by team members, stakeholders, and dependent teams.
    • Perform risk assessments throughout product development.
    • Manage defect / security issue triage in partnership with product managers, business partners and customer support teams.
  • Resiliency Engineering:
    • Collaborate with dev teams to identify failure points and blast radius of systems.
    • Validate effectiveness of monitoring and observability configurations.
    • Coordinate failure injection testing.
    • Observe and document steady-state production levels and growth patterns.
    • Plan and forecast for seasonal growth, communicate trend lines to leadership, and enhance infrastructure scaling plans to accommodate 2x planned load.
    • Coordinate improvements of existing software and infrastructure to meet resiliency goals.
  • Cloud Engineering:
    • Participate in continual learning of the cloud ecosystem, game day scenarios, and professional conferences.
    • Collaborate with development teams on solutions for enterprise applications using our software stack.
    • Actively monitor cost and use an optimizer to maximize efficiency while maintaining the Service Level Objectives for application performance.

You can send your CV to the following email address: omar.navarro@digitalhub.pe

Job type: Full-time

Salary: Up to S/.8,000.00 per month

Application question(s):

  • Do you hold a university professional title (Titulado) or a bachelor's degree (Bachiller) in the fields indicated in the summary?
  • How many years of experience do you have in the requested profile?
  • Do you hold certifications relevant to the profile?
  • Can you hold a fluent conversation in English?
  • What is your salary expectation in soles, billed through Recibos por Honorarios (fee receipts)?