
Sr Software Engineer, Reliability Engineering
Automate your job search with Sonara.
Submit 10x as many applications with less effort than one manual application.1
Reclaim your time by letting our AI handle the grunt work of job searching.
We continuously scan millions of openings to find your top matches.

Job Description
Sr Software Engineer, Reliability Engineering
Here's what you'll do day-to-day:
- Build Tooling & Infrastructure: Design and implement reliability dashboards, AI-driven alerting systems, and internal developer tools that promote operational excellence and self-service.
- Drive Strategic Initiatives: Lead the adoption of DevOps practices across product engineering teams, including environment standardization, service readiness, and release reliability.
- Automate Reliability & Observability: Develop intelligent systems for automated alerting, diagnostics, and incident response using AI/ML approaches. Enhance observability through centralized dashboards and proactive monitoring strategies.
- Mentor & Influence: Coach engineers and leaders on DevOps best practices, champion reliability-focused principles, and mentor peers in systems thinking and operational maturity.
- Establish Standards & Automation: Define engineering standards and implement deterministic automation with a focus on usability, accessibility, and long-term system resilience.
Here's what we're looking for:
- Strategic thinker, driven to identify high impact opportunities and efficiently implement systemic solutions.
- Resilient problem solver, inspired to be in service of our peers and Gusto's customers.
- Strong communicator, committed to drive alignment across technical and non-technical stakeholders.
Required Previous Experience:
- 5+ years of professional experience as a software engineer.
- Implementation and integration of observability platforms. (Datadog preferred)
- Experience with incident remediation and development of incident management programs.
Preferred Previous Experiences:
- Experience with Ruby, Python, and TypeScript.
- Deployment and operation of cloud infrastructure. (AWS preferred)
- Provisioning and managing infrastructure using Infrastructure-as-Code tools. (Terraform preferred)
- Deploying and operating container orchestration. (Kubernetes preferred)
- Proficient in Linux system administration and comfortable working in shell environments.
- Designed and supported high-availability architectures and scalability strategies.
- Participated in service extraction efforts to break apart monoliths and transition toward a service-oriented architecture.
Our cash compensation amount for this role is targeted at $164,000-$204,000 in Denver & most remote locations, and $197,000-$235,000 for San Francisco & New York. Final offer amounts are determined by multiple factors including candidate experience and expertise and may vary from the amounts listed above.
Automate your job search with Sonara.
Submit 10x as many applications with less effort than one manual application.
