Sr. IT and Infrastructure Engineer

San Francisco, California, United States

Senior IT & Infrastructure Engineer

Location: Bay Area, remote flexible

Compensation: Competitive base/equity mix

As a Senior IT & Infrastructure Engineer with our SF Bay Area client, you will keep their mid-Peninsula engineering organization running, owning the compute infrastructure, CI/CD systems, monitoring, lab equipment, and internal IT services that engineering depends on every day. You will work hands-on across Linux administration, build pipelines, virtualization, AWS, lab operations, and tier-1 internal support, and partner closely with their 5G DevOps counterpart in Romania on shared infrastructure standards. This is a broad, hands-on role in the 5G and LTE telecommunications industry for someone energized by being the person engineers come to when things need to work.

JOB RESPONSIBILITIES

Infrastructure & Operations

Manage their compute infrastructure, including on-site lab equipment (Linux hosts, Jenkins servers, lab gear) and cloud resources.

Administer Linux hosts across the engineering environment, including SaltStack-based configuration management and NFS.

Create and operate virtualized resources, including Docker containers and KVM hosts.

Maintain backup and disaster recovery systems for managed Linux hosts.

Support core engineering SaaS tools (1Password, GitHub, GitLab, Cursor, etc.).

Provide operational support for their robotics setup, including integration of robotic testing and test results into engineering workflows.

CI/CD & Build Systems

Build and maintain CI/CD pipelines in GitHub and Jenkins for firmware and host image work, ensuring build consistency, automated linting, testing, and versioned artifact creation.

Integrate quality gates into pipelines using SonarScanner, Pytest, and linting tools to maintain code quality standards.

Integrate hardware-in-the-loop (HITL) testing into firmware and host image build automation pipelines.

Monitoring & Alerting

Operate and evolve their Grafana observability stacks across infrastructure and testing environments, covering data ingest, visualizations, storage, and alerting.

Implement and troubleshoot alerts for system health conditions, including power states (battery vs. wall power), UPS auto-shutdowns, full disks, and other infrastructure faults.

AWS & Cloud Operations

Provide AWS organization-wide support, including new account creation, periodic security audits, and cost analysis reporting.

Internal IT Support

Provide internal helpdesk support across hardware, software, and account issues for the engineering organization.

Manage internal account creation and host configuration workflows.

Coordinate with external IT partners and vendors on compliance, vendor management, and IT-adjacent engineering projects.

MUST HAVE ATTRIBUTES/SKILLS

5+ years of experience in IT operations, systems administration, DevOps, or infrastructure engineering, with hands-on responsibility across both IT and engineering systems.

Strong Linux system administration skills (Bash), including configuration management (SaltStack, Ansible, or similar), NFS, and backup and disaster recovery.

Hands-on experience with virtualization, including Docker and KVM.

Working experience operating CI/CD pipelines in GitHub and Jenkins, including build automation, testing integration, and artifact management.

Working experience operating Grafana-based monitoring and alerting in production environments.

Familiarity with cloud administration at the organization level, including account management, security, and cost reporting (AWS preferred).

Comfort providing tier-1 internal IT support to a technical engineering team.

Based in the Bay Area, with on-site presence at their mid-Peninsula, California office at least 3 days per week.

SHOULD HAVE ATTRIBUTES/SKILLS

Experience coordinating with external IT vendors and managing compliance-adjacent IT projects (security audits, vendor management).

Experience operating in a regulated or compliance-sensitive engineering environment.

Exposure to early-stage or high-growth startup environments.

Experience supporting firmware or embedded engineering teams, including hardware-in-the-loop (HITL) test integration into build pipelines.

Familiarity with 5G or LTE telecommunications at a working level.

Operational experience with lab equipment, test rigs, or robotics infrastructure in an engineering organization.

Working familiarity with code quality and static analysis tooling (SonarScanner, Pytest).

Familiarity with Elasticsearch operations (client's legacy systems; may require migration or retirement).
