Cantina logo

Media Software Engineer, Speech (All Levels)

CantinaSan Francisco, California

$120,000 - $180,000 / year

Automate your job search with Sonara.

Submit 10x as many applications with less effort than one manual application.1

Reclaim your time by letting our AI handle the grunt work of job searching.

We continuously scan millions of openings to find your top matches.

pay-wall

Overview

Schedule
Full-time
Career level
Senior-level
Remote
Option for remote
Compensation
$120,000-$180,000/year
Benefits
Health Insurance
Dental Insurance
Vision Insurance

Job Description

About Cantina:

Cantina Labs is a social AI company, developing a suite of advanced real-time models that push the boundaries of expression, personality, and realism. We bring characters to life, transforming how people tell stories, connect, and create. We build and power ecosystems. Cantina, our flagship social AI platform, is just the beginning.

If you're excited about the potential AI has to shape human creativity and social interactions, join us in building the future!

About the Role:

The Media Team at Cantina is building the real-time infrastructure powering live conversations between people and AI. Our goal is simple but technically challenging: make interacting with AI feel fast, natural, and truly conversational.

We’re looking for a Software Engineer to help improve the speech, audio, and media systems at the heart of the Cantina experience. A major focus of this role is reducing latency and improving responsiveness so AI bots can hear users, process intent, and respond in real time — without awkward pauses or delays.

This team works across everything from low-level media pipelines and WebRTC frameworks to globally distributed infrastructure supporting real-time voice and video interactions across iOS, Android, and web.

If you’re excited by high-performance C++, real-time systems, speech technologies, and building the future of conversational AI, we’d love to talk.

What You’ll Do:

  • Improve the real-time speech and media systems powering live AI conversations.

  • Reduce latency and optimize responsiveness across audio streaming and speech pipelines.

  • Build new voice and video capabilities that enable more immersive interactions between users and AI bots.

  • Improve and extend our custom WebRTC infrastructure across iOS, Android, and web.

  • Work closely with product and platform teams to shape the future of conversational AI experiences.

What You’ll Bring: We welcome applicants across a wide range of experience levels, from new graduates to senior engineers. Responsibilities and leveling will be tailored to match the candidate’s background.

These are the minimum qualifications:

  • BS or MS in Computer Science, Computer Engineering, or a related field; or equivalent experience.

  • Excellent communications skills.

  • Experience with C or C++.

  • Strong computer science fundamentals, including familiarity with data structures and concurrent / multithreaded programming.

  • Exposure to system programming concepts, including network protocols; memory management; and distributed systems fundamentals.

  • Object-oriented programming and design skills.

  • Interest in solving challenging, subtle engineering problems.

These are the preferred qualifications:

  • Previous experience with WebRTC, streaming protocols, or other media-related technologies.

  • Familiarity with audio or video processing techniques and algorithms.

  • Experience creating backend server infrastructure.

  • Experience developing software for iOS and Android.

  • Familiarity with building services using Node.js.

  • Familiarity with artificial intelligence and machine learning techniques, particularly in relation to speech recognition and synthesis.

Location:

While we offer fully remote and hybrid employment opportunities, our Media Engineering team strongly desires candidates to be available (or willing to relocate) to work in the Bay Area. For reference, 95% of the Media Engineering team works from the Bay Area.

Compensation:

The anticipated annual base salary range for this role is between $120,000-$180,000. When determining compensation, a number of factors will be considered, including skills, experience, job scope, location, and competitive compensation market data.

Benefits:

  • Competitive salary and generous company equity

  • Medical, dental, and vision insurance – 99.99% of premiums covered by Cantina

  • 42 days of paid time off, including:

    • 15 PTO days

    • 10 sick days

    • 15 company holidays

    • 2 floating holidays

  • Generous parental leave & fertility support

  • 401(k) retirement savings plan

  • Lifestyle spending account – $500/month to use however you’d like

  • Complimentary lunch and snacks for in-office employees

  • One Medical membership, and more!

Automate your job search with Sonara.

Submit 10x as many applications with less effort than one manual application.

pay-wall

FAQs About Media Software Engineer, Speech (All Levels) Jobs at Cantina

What is the work location for this position at Cantina?
This job at Cantina is located in San Francisco, California, according to the details provided by the employer. Some roles may also include multiple work locations depending on the requirement.
What pay range can candidates expect for this role at Cantina?
Candidates can expect a pay range of $120,000 and $180,000 per year.
What employment applies to this position at Cantina?
Cantina lists this role as a Full-time position.
What experience level is required for this role at Cantina?
Cantina is looking for a candidate with "Senior-level" experience level.
Does Cantina allow remote work for this role?
Yes, this position at Cantina supports remote work, giving candidates the flexibility to work outside the primary office location.
What benefits are offered by Cantina for this role?
Cantina offers following benefits: Health Insurance, Dental Insurance, Vision Insurance, Family/Dependent Health, Paid Holidays, Paid Vacation, Paid Sick Leave, Parental and Family Leave, 401k Matching/Retirement Savings, and Health & Wellness Programs for this position. Actual benefits may vary depending on the employer's policies and employment terms.
What is the process to apply for this position at Cantina?
You can apply for this role at Cantina either through Sonara's automated application system, which helps you submit applications 10X faster with minimal effort, or by applying manually using the direct link on the job page.