Senior IT & Infrastructure Engineer
Location: Bay Area, remote flexible
Compensation: Competitive base/equity mix
As a Senior IT & Infrastructure Engineer with our SF Bay Area client, you will keep their mid-Peninsula engineering organization running by owning the compute infrastructure, CI/CD systems, monitoring, lab equipment, and internal IT services that engineering depends on every day. You will work hands-on across Linux administration, build pipelines, virtualization, AWS, lab operations, and tier-1 internal support, and partner closely with their 5G DevOps counterpart in Romania on shared infrastructure standards. This is a broad, hands-on role in the 5G and LTE telecommunications industry for someone energized by being the person engineers come to when things need to work.
JOB RESPONSIBILITIES
Infrastructure & Operations
- Manage their compute infrastructure, including on-site lab systems (Linux hosts, Jenkins servers, and other lab gear) and cloud resources.
- Administer Linux hosts across the engineering environment, including SaltStack-based configuration management and NFS.
- Create and operate virtualized resources, including Docker containers and KVM hosts.
- Maintain backup and disaster recovery systems for managed Linux hosts.
- Support core engineering SaaS tools (1Password, GitHub, GitLab, Cursor, etc.).
- Provide operational support for their robotics setup, including integration of robotic testing and test results into engineering workflows.
CI/CD & Build Systems
- Build and maintain CI/CD pipelines in GitHub and Jenkins for firmware and host image work, ensuring build consistency, automated linting, testing, and versioned artifact creation.
- Integrate quality gates into pipelines using SonarScanner, Pytest, and linting tools to maintain code quality standards.
- Integrate hardware-in-the-loop (HITL) testing into firmware and host image build automation pipelines.
Monitoring & Alerting
- Operate and evolve their Grafana observability stacks across infrastructure and testing environments, covering data ingest, visualizations, storage, and alerting.
- Implement and troubleshoot alerts for system health conditions including power states (battery vs. wall power), UPS auto-shutdowns, full disks, and other infrastructure faults.
AWS & Cloud Operations
- Provide AWS organization-wide support, including new account creation, periodic security audits, and cost analysis reporting.
Internal IT Support
- Provide internal helpdesk support across hardware, software, and account issues for the engineering organization.
- Manage internal account creation and host configuration workflows.
- Coordinate with external IT partners and vendors on compliance, vendor management, and IT-adjacent engineering projects.
MUST HAVE ATTRIBUTES/SKILLS
- 5+ years of experience in IT operations, systems administration, DevOps, or infrastructure engineering, with hands-on responsibility across both IT and engineering systems.
- Strong Linux system administration skills (Bash), including configuration management (SaltStack, Ansible, or similar), NFS, and backup and disaster recovery.
- Hands-on experience with virtualization, including Docker and KVM.
- Working experience operating CI/CD pipelines in GitHub and Jenkins, including build automation, testing integration, and artifact management.
- Working experience operating Grafana-based monitoring and alerting in production environments.
- Familiarity with cloud administration at the organization level, including account management, security, and cost reporting (AWS preferred).
- Comfort providing tier-1 internal IT support to a technical engineering team.
- Based in the Bay Area, with on-site presence at their mid-Peninsula, California office at least 3 days per week.
SHOULD HAVE ATTRIBUTES/SKILLS
- Experience coordinating with external IT vendors and managing compliance-adjacent IT projects (security audits, vendor management).
- Experience operating in a regulated or compliance-sensitive engineering environment.
- Exposure to early-stage or high-growth startup environments.
- Experience supporting firmware or embedded engineering teams, including hardware-in-the-loop (HITL) test integration into build pipelines.
- Familiarity with 5G or LTE telecommunications at a working level.
- Operational experience with lab equipment, test rigs, or robotics infrastructure in an engineering organization.
- Working familiarity with code quality, testing, and static analysis tooling (SonarScanner, Pytest).
- Familiarity with Elasticsearch operations (used in the client's legacy systems, which may require migration or retirement).