About the job Data Platform Engineer (Management & DevOps)
Data Platform Engineer (Management & DevOps)
Position Overview
We are seeking a Data Platform Engineer to manage enterprise data platform operations including upgrades, patches, monitoring, and incident management, utilizing advanced tooling (Sentry, Grafana, K6, Google Analytics, Redgate Monitor) and implementing multi-cloud self-provisioning services with DevOps methodologies.
Key Responsibilities
Platform Operations & Management
Plan and execute data platform upgrades, patches, and maintenance across multi-cloud environments
Manage database and analytics platform lifecycle including version migrations and compatibility testing
Implement automated patching strategies and change management procedures
Coordinate maintenance windows and minimize service disruptions during updates
Maintain platform documentation, runbooks, and operational procedures
Monitoring & Observability
Design comprehensive monitoring solutions using Grafana dashboards and alerting systems
Implement application performance monitoring with Sentry for error tracking and debugging
Configure performance testing frameworks using K6 for load testing and capacity validation
Set up analytics tracking with Google Analytics and custom telemetry solutions
Deploy database monitoring using Redgate Monitor and performance analysis tools
Incident Management & Response
Establish incident response procedures and on-call rotation management
Implement automated alerting, escalation procedures, and service recovery protocols
Conduct root cause analysis and post-incident reviews for continuous improvement
Manage service level agreements (SLAs) and uptime monitoring across data platforms
Coordinate with vendor support teams and manage critical issue resolution
Self-Provisioning & Automation
Implement multi-cloud self-provisioning services using platforms like Onyxia
Develop infrastructure as code (IaC) templates for automated resource deployment
Build CI/CD pipelines for data platform deployments and configuration management
Create self-service portals for development teams and data scientists
Implement policy-based governance and resource optimization automation
Required Qualifications
Technical Skills
5+ years experience managing enterprise data platforms and database systems
Expert knowledge of monitoring tools (Grafana, Prometheus, Sentry) and observability practices
Strong experience with DevOps tools and practices (Docker, Kubernetes, Terraform, Ansible)
Proficiency in scripting languages (Python, Bash, PowerShell) for automation and tooling
Knowledge of multi-cloud architectures (AWS, Azure, GCP) and cloud-native services
Experience with performance testing tools (K6, JMeter) and capacity planning methodologies
Platform Management Skills
Proven experience with database administration and platform upgrade procedures
Strong understanding of incident management frameworks (ITIL, SRE principles)
Experience with configuration management and infrastructure as code practices
Knowledge of security frameworks, compliance monitoring, and access management
Preferred Qualifications
Bachelor's degree in Computer Science, Engineering, or related field
Cloud platform certifications (AWS DevOps, Azure DevOps, Google Cloud Engineer)
Experience with data platform technologies (Hadoop, Spark, Kafka, Elasticsearch)
Background in site reliability engineering (SRE) and platform engineering roles
Knowledge of service mesh technologies and microservices architectures