Singapore, Singapore, Singapore

Site Reliability Engineer (Linux/Kernel)

 Job Description:

We are looking for a skilled Site Reliability Engineer to join our client's global SRE Team in Singapore.

Responsibilities:

  • Overseeing and ensuring the continuous operation of the firm's Linux based trading infrastructure, addressing day to day operational needs
  • Providing second level support, including:
    • Rapid response to emergencies
    • Implementing scheduled updates and deployments
    • In depth analysis and resolution of performance issues
    • Engage in a rotational on call schedule, including early morning and weekend shifts, to provide timely support
  • Contributing towards the development of automated solutions for server provisioning, configuration, and monitoring, targeting a scalable management of thousands of servers
  • Engaging in interactions with the Trading and Core Engineering teams
  • Managing essential Core services such as DHCP, LDAP, DNS, and NFS for on prem and hosted data centers as well as public clouds
  • Participating in an on call rotation and occasional weekend shifts


Qualifications:

  • Sound expertise in Linux production environments
  • Basic knowledge of Python and Bash scripting
  • Engagement with automation and monitoring tool sets
  • Comprehensive knowledge of operating system principles, with a particular focus on Linux internals
  • Familiarity with Intel based server hardware and components
  • Competence in server side networking, including understanding network protocols and configurations
  • Familiarity in cloud services and architectural solutions
  • Experience in designing, building, and troubleshooting complex systems
  • Good problem solving skills, underpinned by a methodical approach to technical challenges. This includes an ability to communicate effectively, demonstrating strong interpersonal skills, a sense of responsibility, and a commitment to driving projects to completion.
  • Sense of ownership and drive