Job Openings Site Reliability Engineer (SRE)

About the job Site Reliability Engineer (SRE)

Job Description: Site Reliability Engineer

Job Responsibilities:

  • Build and implement CI/CD solutions in AWS environment.
  • Automate the code delivery pipeline with the goal of one click deployments, rollbacks, and parameterized builds.
  • Build, operate and maintain application infrastructure, infrastructure automation, and monitoring of infrastructure and applications.
  • Work with the development team to cross-pollenate DevOps processes and ensure that new architectures are drawn up with ease of management, delivery, and operability in mind.
  • Troubleshoot application and service issues or system outages while clearly communicating status updates with management and engineering teams.
  • Manage the scaling of all systems.
  • Participate in an on-call rotation.

Required Skills:

  • 5+ years of hands on AWS experience.
  • Effective communication skills and the ability to work in a fast-paced environment with other DevOps engineers, product managers, developers, etc.
  • Strong experience with Docker, AWS ECS or EKS.
  • Strong experience with CI/CD solutiosn using Jenkins/GitHub actions.
  • Strong experience with automation tool - Ansible, Terraform, Cloud formation. Strong scripting experience in Bash, Python, particularly in system automation and monitoring.
  • Solid experience in Unix/ Linux administration.
  • Solid networking experience within complex environments - load balancing, routing, DNS, network firewalls, and application firewalls.
  • Hands on experience with different AWS services - EC2, ASG, ElastiCache, Aurora MySQL, ALB/NLB, S3, Lambda etc.
  • Hands on experience with setting logging/monitoring solutions using tools such as CloudWatch, Jaeger, Prometheus, ELK/DataDog etc
  • Effective communication skills and the ability to work in a fast-paced environment with other DevOps engineers, product managers, developers, etc.


Preferred Skills:

  • Experience with CFD trading platform MT4/MT5
  • Experience in hosting and configuring highly available Consul, Jaeger, ELK cluster in AWS cloud
  • Experience in setting up production grade Kubernetes cluster from scratch
  • Experience in a leadership role