DevOps Engineer

 Job Description:

Job Scope

  • This role is a cross between traditional DevOps and Site Reliability Engineering.

  • You will be responsible for all aspects of our infrastructure, working alongside our software engineering and product teams to ensure the system's uptime, reliability, security, and maintainability.

  • Troubleshoot system problems for both the internal engineering team and external customers.

  • Complete incident resolution and root-cause analysis for any infrastructure-related issue.

  • Participating in on-call rotations, incident management, and emergency response for manufacturing incidents

  • As part of our main product, we build scalable and reliable data pipelines.

  • Creating an automated monitoring system, internal tools, and a CI/CD pipeline to boost engineering productivity.

  • Turn on logging, monitoring, and alerting for all applications.

  • In the team, promote and evangelise DevOps best practices.

  • Scalable infrastructure design using Terraform, Kubernetes, and Docker

  • Working with the software engineering team to create a general, scalable, and dependable data pipeline as a key component of our main product.

  • Create a high-performance caching system for our manufacturing environment.

Requirements

  • Strong working experiences of Linux system administration

  • Having worked as a full-stack developer or directly supporting full-stack development

  • Experience with code systems for automation/configuration management/infrastructure such as Ansible, Chef, Puppet, Terraform, or an equivalent Experience with SQL databases such as PostgreSQL, MySQL, Oracle, SqlServer, etc.

  • Scripts in common programming languages such as Ruby, Python, JavaScript/TypeScript, and Bash can be understood and developed.

  • Capable of understanding and troubleshooting basic web programming issues

  • Some understanding of security, performance, and disaster recovery best practices

  • Excellent communication skills in English (reading and writing)

  • Knowledge and experience with Docker and Kubernetes are a plus.

  • Experience with observation systems such as Grafana, Prometheus, Loki, Tempo, and others is a plus.