Sr. Site Reliability Engineer, DevOps

Atec Spine, Carlsbad, CA

Job Description

The Senior Site Reliability Engineer (SRE) will be responsible for ensuring the availability, performance, scalability, and operational efficiency of the Informatix cloud platform. This role is focused on reducing manual operations work (toil), automating system reliability, and ensuring production-grade observability. The ideal candidate is a systems-focused engineer who is passionate about uptime, incident response, and continuous improvement through engineering solutions.

Essential Duties and Responsibilities

  • Serve as a primary contributor to the on-call rotation to maintain 24/7 uptime for production systems.
  • Proactively monitor and continuously improve SLAs, SLOs, and SLIs across critical services.
  • Develop and maintain robust observability tooling including logging, metrics, and tracing (e.g., Azure Monitor, OpenTelemetry, Prometheus).
  • Proactively conduct postmortems and root cause analysis; implement fixes to prevent repeat incidents.
  • Identify and eliminate manual operational toil through scripting and automation.
  • Design and maintain automated incident detection and response systems.
  • Establish and maintain runbooks, playbooks, and escalation protocols for system support.
  • Contribute to chaos testing and failure injection to proactively uncover weaknesses.
  • Promote a culture of operational excellence through data-driven reliability practices.
  • Proactively communicating status
