
Sr Staff Software Engineer - AI

Overview
Job Description
At GEICO, we offer a rewarding career where your ambitions are met with endless possibilities.
Every day we honor our iconic brand by offering quality coverage to millions of customers and being there when they need us most. We thrive through relentless innovation to exceed our customers' expectations while making a real impact for our company through our shared purpose.
When you join our company, we want you to feel valued, supported and proud to work here. That's why we offer The GEICO Pledge: Great Company, Great Culture, Great Rewards and Great Careers.
Position Summary
GEICO is seeking an experienced Engineer with a passion for building high-performance, low-maintenance, zero-downtime platforms and applications. You will help drive our insurance business transformation as we transition from a traditional IT model to a tech organization with engineering excellence as its mission, while co-creating a culture of psychological safety and continuous improvement.
Position Description
The Senior Staff Engineer in Availability and Incident Management will design and deploy machine learning systems that enable intelligent incident detection, automated root cause analysis, and predictive reliability improvements across the platform. This role focuses on building a multi-agent AI platform where specialized agents autonomously detect anomalies, diagnose failures, recommend remediation actions, and learn from historical patterns to prevent recurring incidents. You will lead the technical strategy for an AI-powered incident response system that reduces mean time to resolution, minimizes operational toil, and enables proactive reliability improvements through predictive analytics and autonomous workflows. The ideal candidate combines deep expertise in machine learning systems, agentic AI, and multi-agent architectures with strong knowledge of site reliability engineering, observability tooling, and large-scale distributed systems.
Position Responsibilities
As a Senior Staff Engineer, you will:
- Design and build a multi-agent AI platform where specialized agents autonomously detect, diagnose, and resolve issues through agent-to-agent (A2A) collaboration
- Develop intelligent agents using LLMs and agentic frameworks that coordinate detection, diagnostic, remediation, and knowledge tasks with minimal human intervention
- Define agent interaction protocols, A2A communication standards, and evaluation frameworks for agent decision quality and autonomous action safety
- Architect vector database solutions (Milvus, pgvector, Qdrant) for semantic search and RAG to enable context-aware agent decision-making
- Build end-to-end ML pipelines for severity classification, anomaly detection, failure pattern recognition, and impact forecasting using observability data
- Establish scalable orchestration infrastructure for multi-agent workflows with CI/CD, automated evaluation, canary releases, and rollback strategies
- Implement monitoring for agent interactions, A2A communication patterns, decision quality, data drift, and system reliability
- Lead technical architecture ensuring scalability, observability, and integration with existing alerting, logging, and monitoring systems
- Define standards for agent safety, explainability, governance, and human-in-the-loop controls for high-impact automated actions
- Partner with SRE, Product, and Engineering teams to translate reliability goals into measurable ML objectives and maintain pragmatic technical roadmaps
- Mentor engineers through complex AI platform implementations and establish best practices, coding standards, and technical documentation
- Stay current with AI/ML and multi-agent systems; educate engineering leadership on emerging technologies
Qualifications
- Experience building and deploying ML systems in production with cross-functional engineering teams
- Fluency in at least two modern languages such as Python, Go, Java, C++, or C# including object-oriented design
- Experience architecting multi-component ML platforms using open-source/cloud-agnostic components:
  - Datastores: PostgreSQL, NoSQL (MongoDB, Cassandra, CosmosDB)
  - Streaming: Kafka, Flink, or Spark Streaming
- Experience with end-to-end ML lifecycle: version control, CI/CD, Kubernetes, testing, monitoring, and production support
- Experience with cloud providers (Azure, AWS or GCP) in production ML environments
- Experience with observability tools and distributed systems monitoring, logging, tracing, and root cause analysis
- Experience building multi-agent systems using LLMs and agentic frameworks (e.g., LangChain, LangGraph, AutoGen, Semantic Kernel, CrewAI)
- Hands-on experience with RAG, semantic search, and vector databases (e.g., Milvus, pgvector, Qdrant, ElasticSearch)
- Experience designing human-in-the-loop workflows and safety controls for autonomous systems
- Strong architecture and design skills with ability to influence technical direction and roadmap
- Proven ability to solve complex problems with data-driven approaches
- Experience fine-tuning or deploying open-source LLMs (Llama, Mistral, Phi) is a plus
- Experience with data warehouse/lakehouse platforms (e.g., Snowflake, Databricks, Parquet, Delta, Iceberg)
Experience
- 10+ years of professional platform development or general development experience
- 8+ years of experience with architecture and design
- 6+ years of experience building and deploying machine learning systems in production
- 6+ years of experience in open-source frameworks
- 4+ years of experience with AWS, GCP, Azure, or another cloud service
- 2+ years of experience with LLMs, agentic AI frameworks, or multi-agent systems
Education
- Bachelor's degree in Computer Science, Information Systems, or equivalent education or work experience
Annual Salary
$110,000.00 - $230,000.00
The above annual salary range is a general guideline. Multiple factors are taken into consideration to arrive at the final hourly rate or annual salary to be offered to the selected candidate. Factors include, but are not limited to, the scope and responsibilities of the role; the selected candidate's work experience, education and training; the work location; and market and business considerations.
GEICO will consider sponsoring a new qualified applicant for employment authorization for this position.
The GEICO Pledge:
Great Company: At GEICO, we help our customers through life's twists and turns. Our mission is to protect people when they need it most and we're constantly evolving to stay ahead of their needs.
We're an iconic brand that thrives on innovation, exceeding our customers' expectations and enabling our collective success. From day one, you'll take on exciting challenges that help you grow and collaborate with dynamic teams who want to make a positive impact on people's lives.
Great Careers: We offer a career where you can learn, grow, and thrive through personalized development programs, created with your career - and your potential - in mind. You'll have access to industry-leading training, certification assistance, career mentorship and coaching with supportive leaders at all levels.
Great Culture: We foster an inclusive culture of shared success, rooted in integrity, a bias for action and a winning mindset. Grounded by our core values, we have an established culture of caring, inclusion, and belonging that values different perspectives. Our dynamic, multi-faceted teams are led by supportive leaders, driven by performance excellence and unified under a shared purpose.
As part of our culture, we also offer employee engagement and recognition programs that reward the positive impact our work makes on the lives of our customers.
Great Rewards: We offer compensation and benefits built to enhance your physical well-being, mental and emotional health and financial future.
- Comprehensive Total Rewards program that offers personalized coverage tailor-made for you and your family's overall well-being.
- Financial benefits including market-competitive compensation; a 401K savings plan vested from day one that offers a 6% match; performance and recognition-based incentives; and tuition assistance.
- Access to additional benefits like mental healthcare as well as fertility and adoption assistance.
- Support for flexibility: We provide workplace flexibility as well as our GEICO Flex program, which offers the ability to work from anywhere in the US for up to four weeks per year.
The equal employment opportunity policy of the GEICO Companies provides for a fair and equal employment opportunity for all associates and job applicants regardless of race, color, religious creed, national origin, ancestry, age, gender, pregnancy, sexual orientation, gender identity, marital status, familial status, disability or genetic information, in compliance with applicable federal, state and local law. GEICO hires and promotes individuals solely on the basis of their qualifications for the job to be filled.
GEICO reasonably accommodates qualified individuals with disabilities to enable them to receive equal employment opportunity and/or perform the essential functions of the job, unless the accommodation would impose an undue hardship to the Company. This applies to all applicants and associates. GEICO also provides a work environment in which each associate is able to be productive and work to the best of their ability. We do not condone or tolerate an atmosphere of intimidation or harassment. We expect and require the cooperation of all associates in maintaining an atmosphere free from discrimination and harassment with mutual respect by and for all associates and applicants.
