Staff SRE - Data Infrastructure

Fastly Inc., San Francisco, CA

Job Description

Posting Open Date: Sept. 8, 2025

Anticipated Posting Close Date*: Oct. 6, 2025

  • Job posting may close early due to the volume of applicants.

Staff Site Reliability Engineer - Data Infrastructure

The Data Reliability team is looking for a talented Staff Site Reliability Engineer to help build and support the next generation of data stores for Fastly. The ideal candidate will have experience working with backend and data services in both cloud and physical systems, be skilled with configuration management tools such as Terraform, and be able to develop internal administration tools in Go and similar languages. Our team is responsible for supporting the infrastructure, orchestration, and reliability needs of some of Fastly's most data-intensive applications, using technologies like Terraform, Elasticsearch, ClickHouse, Prometheus, MySQL, and Redis in both cloud- and hardware-based environments. Our systems directly contribute to our customers' success by providing our product teams with a platform for effective and reliable delivery of high-quality, high-throughput, globally distributed data systems and products. You will be integral to this mission. We are a distributed team and employ a variety of styles to get our work done. Though we put a high value on working collaboratively, we also rely on asynchronous communication.

What You'll Do:

  • Lead full lifecycle projects from design and development through roll out and maintenance
  • Deploy and maintain several different types of critical data storage systems on scales from gigabytes to petabytes
  • Develop statistics and dashboards to measure service-level objectives for these systems
  • Maintain and create tools for management of configuration, backup, and authenticated access to data systems using peer review, CI/CD, and both daemon- and container-based deployment
  • Write code that is performant, maintainable, clear, and concise, and contribute to code reviews, improving the codebase and other team processes
  • Provide technical leadership of full lifecycle projects, driving project progress and collaborating with project stakeholders. Coordinate and communicate with team members and across other technical and cross-functional teams. Foster relationships with other teams to understand and provide for end-user needs.
  • Help project, plan, and scale for growth
  • Document processes and requirements for both intra- and cross-team knowledge transfer
  • Mentor and support other engineers, fostering a culture of knowledge sharing, innovation, and collaboration within the team
  • Participate in on-call rotation as needed

What We're Looking For:

  • Significant professional experience building and maintaining at least one category of backend system, including both APIs and data stores (relational-, column-, vector-, or document-oriented). Most Staff Engineers at Fastly have more than 7 years of related experience.
  • Experience measuring customer-focused performance using tools like Prometheus or DataDog
  • Linux system skills, including file systems, networking, I/O analysis, and general kernel tuning
  • A strong grasp of networking, routing, network protocols, and related concepts
  • Advanced Terraform skills, including working with modules and remote state
  • Experience developing automation tools in Go and other languages, with an aptitude for learning new languages and technologies
  • Experience with container technology, Kubernetes/Docker/Helm, or similar
  • A collaborative mindset with experience working across cross-functional teams, fostering a culture of respect, collaboration, and knowledge sharing
  • A great teammate: communicative, collaborative, empathetic with a thoughtful, customer-driven approach

We'll be super impressed if you have experience in any of these:

  • Performance measurement using eBPF, flame graphs, etc.
  • Creation of Kubernetes operators
  • Leveraging a service mesh like Istio or Linkerd for service discovery and ACLs
  • Knowledge of one or more host management tools (Chef, Ansible, Puppet)
  • More data storage familiarity, especially Elasticsearch

Work Hours:

  • This position will require you to be available during core North American business hours and evenings and weekends as needed for on-call support. Our team's timezones span from UTC-8 (Pacific) to UTC-3.

Work Location(s) & Travel Requirements:

This position is open to the following office locations:

  • San Francisco, CA
  • Denver, CO
  • New York, NY

Fastly currently embraces a largely hybrid model for most roles which allows employees flexibility to split their time between the office and home.

This position may require travel as required by your role or requested by your manager.

SF / LA Fair Chance Ordinance Statement

Pursuant to the San Francisco Fair Chance Ordinance and the Los Angeles Fair Chance Initiative for Hiring Ordinance, we will consider for employment qualified applicants with arrest and conviction records.

Salary:

The estimated salary range for this position is $195,720 to $234,864.

Starting salary may vary based on permissible, non-discriminatory factors such as experience, skills, qualifications, and location.

This role may be eligible to participate in Fastly's equity and discretionary bonus programs.

Benefits:

We care about you. Fastly works hard to create a positive environment for our employees, and we think your life outside of work is important too. We support our teams with great benefits that start on the first day of your employment with Fastly. Curious about our offerings?

We offer a comprehensive benefits package including medical, dental, and vision insurance. Family planning, mental health support along with an Employee Assistance Program, insurance (life, disability, and accident), a flexible vacation policy, and up to 18 days of accrued paid sick leave are there to help support our employees. We also offer a 401(k) (including company match) and an Employee Stock Purchase Program. For 2025, we offer 11 paid local holidays and 11 paid company wellness days.
