Data Platform Engineer (Management & DevOps)

Position Overview

We are seeking a Data Platform Engineer to manage enterprise data platform operations, including upgrades, patches, monitoring, and incident management. The role uses tooling such as Sentry, Grafana, K6, Google Analytics, and Redgate Monitor, and implements multi-cloud self-provisioning services using DevOps practices.

Key Responsibilities

Platform Operations & Management

  • Plan and execute data platform upgrades, patches, and maintenance across multi-cloud environments

  • Manage database and analytics platform lifecycle including version migrations and compatibility testing

  • Implement automated patching strategies and change management procedures

  • Coordinate maintenance windows and minimize service disruptions during updates

  • Maintain platform documentation, runbooks, and operational procedures

Monitoring & Observability

  • Design comprehensive monitoring solutions using Grafana dashboards and alerting systems

  • Implement application performance monitoring with Sentry for error tracking and debugging

  • Configure performance testing frameworks using K6 for load testing and capacity validation

  • Set up analytics tracking with Google Analytics and custom telemetry solutions

  • Deploy database monitoring using Redgate Monitor and performance analysis tools

Incident Management & Response

  • Establish incident response procedures and on-call rotation management

  • Implement automated alerting, escalation procedures, and service recovery protocols

  • Conduct root cause analysis and post-incident reviews for continuous improvement

  • Manage service level agreements (SLAs) and uptime monitoring across data platforms

  • Coordinate with vendor support teams and manage critical issue resolution

Self-Provisioning & Automation

  • Implement multi-cloud self-provisioning services using platforms like Onyxia

  • Develop infrastructure as code (IaC) templates for automated resource deployment

  • Build CI/CD pipelines for data platform deployments and configuration management

  • Create self-service portals for development teams and data scientists

  • Implement policy-based governance and resource optimization automation

Required Qualifications

Technical Skills

  • 5+ years of experience managing enterprise data platforms and database systems

  • Expert knowledge of monitoring tools (Grafana, Prometheus, Sentry) and observability practices

  • Strong experience with DevOps tools and practices (Docker, Kubernetes, Terraform, Ansible)

  • Proficiency in scripting languages (Python, Bash, PowerShell) for automation and tooling

  • Knowledge of multi-cloud architectures (AWS, Azure, GCP) and cloud-native services

  • Experience with performance testing tools (K6, JMeter) and capacity planning methodologies

Platform Management Skills

  • Proven experience with database administration and platform upgrade procedures

  • Strong understanding of incident management frameworks (ITIL, SRE principles)

  • Experience with configuration management and infrastructure as code practices

  • Knowledge of security frameworks, compliance monitoring, and access management

Preferred Qualifications

  • Bachelor's degree in Computer Science, Engineering, or a related field

  • Cloud platform certifications (AWS DevOps, Azure DevOps, Google Cloud Engineer)

  • Experience with data platform technologies (Hadoop, Spark, Kafka, Elasticsearch)

  • Background in site reliability engineering (SRE) and platform engineering roles

  • Knowledge of service mesh technologies and microservices architectures