Site Reliability Engineer United States, Remote

< back to jobs opening

Job Description:

Site Reliability Engineer

United States, Remote

Are you driven to be an innovative Site Reliability Engineer, and looking to join a team where open collaboration, customer focus, and a commitment to excellence are core values?

At , we work passionately and authentically, striving to win together and make a real impact for our customers and each other. Join us to elevate your career and help deliver innovative solutions in a dynamic, empowering environment.

Why this role matters:

The Site Reliability Engineering role is a blend of infrastructure, networking, operating systems, automation, development, and application administration. It is a hands-on technical position in a fast-paced atmosphere. The ideal candidate has prior experience managing cloud-based SaaS applications and strives to solve traditional operations problems through automation and software. More so, the candidate must possess a high standard of excellence, have a strong customer focus, and is capable of technical deep dives into code, app servers, databases, load balancers, operating systems, and networks.

Site Reliability Engineering (SRE) is a growing team that partners closely with Product Engineering, Security, and Support. We are responsible for the reliability, deployment, and continuous operation of the Cloud services. We need your help to take our existing platform to the next level with observability, release automation, chaos engineering, and more.

What you'll do:

Work 4 x 10-hour days (Wednesday through Saturday, approximately 9 AM to 8 PM U.S. Pacific Time, including lunch)

Deploy, manage, and secure production Software-as-a-Service (SaaS) environments in AWS and Azure

Work with a geographically dispersed, cross-departmental teams to solve difficult problems

Automate common and repetitive tasks

Write documentation and training material

Train other colleagues

Participate in on-call rotations for 24x7 coverage (follow-the-sun model) for incident response, issue triage, and problem resolution

What you will bring:

US citizenship and must be located domestically in the U.S.

A BSc in Computer Science, a related field, or equivalent practical experience

3+ years of relevant industry experience (2+ with an achieved BSc in Computer Science or Equivalent Degree)

Proficiency with Python and experience with one of the following languages:

Java

Golang

C#

Proficiency working with Bash or PowerShell programmatically

Familiarity with public cloud platforms (AWS or Azure preferred)

Experience troubleshooting Java and .NET applications

Experience troubleshooting network and storage infrastructure issues

Experience working with core Linux distributions (Debian, RHEL, SUSE, Slackware)

Experience working with Windows

Experience working with one or more: SQL Server, PostgreSQL, Redis, Kafka, MongoDB, Elasticsearch, or similar

Ability to configure and fine tune at least one: HA Proxy, Apache, Nginx, IIS, or similar

Ability to configure: New Relic, DataDog, Splunk, or similar monitoring tools

Familiarity with container orchestration technologies (AWS EKS or Azure AKS preferred)

Experience with deployment pipeline tools such as Ansible, Jenkins, and/or GitHub Actions

Proficiency working and developing Infrastructure as Code (IaC)

Working Place:

Remote, Oregon, United States

Company :

ESR Healthcare

< back to jobs opening