PaaS Engineer

 Job Description:

At DIGITALL we dont just deliver technology we deliver the future! We are explorers, knowledge-hunters, tech geeks, problem solvers and game changers who want to inspire and be inspired. Our DIGITALL people are always one step forward: working with top-notch technologies, creating innovations ahead of the market trends, sharing the passion for discovering better ways. As a human-centric organization, our teams are built on mutual respect and open communication, allowing everyone to be authentic, express ideas and unleash their potential. We are proud of our DIGITALL bright minds and never stop developing their skills to keep pushing boundaries together and do what we love. DIGITALL operates globally with a team of 1000 experts in 16 locations across 8 countries.

Our client is an IT service provider located in Germany with a mission to digitalize the core business of their clients. We are seeking a skilled and experienced Intune Admin Workplace Services (Intune Engineer, Level 2). The candidate will be responsible for managing and optimizing clients endpoint environments, comprising 3,000 to 4,000 endpoints. This role demands expert knowledge of Microsoft Intune, Windows Autopilot, and Mobile Device Management (MDM) solutions, including Microsoft Defender, iOS, and Android.

This is your job:

  • Operate & Evolve Kubernetes Platform: Build, configure, and maintain bare metal hosts and Kubernetes clusters to run GPU/AI workloads.
  • Container Orchestration & Automation: Manage Helm charts, GitOps workflows, Ansible scripts, possibly Terraform code and automation for deploying services and AI workloads.
  • Operate Kubernetes Workloads: Act as primary contact for all Kubernetes-related topics, including troubleshooting, performance tuning, and scaling.
  • CI/CD & GitOps: Develop and maintain CI/CD pipelines with Jenkins and GitLab; implement GitOps practices for consistent deployments and infrastructure changes. Terraform basics.
  • Monitoring & Observability: Operate and enhance Prometheus and Grafana monitoring stacks for bare metal hosts, Kubernetes and platform services.
  • Container Images & Registries: Build, optimize and secure container images (Docker, Podman); manage registries and versioning, image scanning (Trivy).
  • Object Storage & Persistent Volumes: Integrate and maintain object storage solutions for AI workloads.
  • Run AI & HPC Workloads: Support and operate distributed AI workloads within bare metal hosts and Kubernetes environments.
  • Collaboration with Infrastructure & AI Teams: Coordinate closely with Infrastructure Engineers, data center staff and AI developers to ensure smooth delivery of services.
  • ITIL Processes: Follow incident, problem, and change management workflows; create and maintain operational runbooks. Adhere to ZERO outage guidelines.

Your qualifications:

  • Kubernetes Certified Administrator (CKA) or equivalent experience in production environments. CKS advantage.
  • Strong experience with CI/CD tools (Jenkins, GitLab) and GitOps practices.
  • Proficiency with Helm charts and Kubernetes resource management.
  • Scripting/programming in Python or Bash; Infrastructure-as-Code with Terraform and Ansible.
  • Experience with container images (Docker, Podman) and image scanning.
  • Familiarity with object storage systems and persistent volume management.
  • Knowledge of monitoring and observability tools (Prometheus, Grafana).
  • Understanding of running AI/HPC workloads at scale.
  • Strong troubleshooting and operational support skills in mission-critical environments.

Organizational information:

  • All applications will be treated in strict confidentiality
  • Please note that only shortlisted candidates will be invited to an interview