Product Manager - Infrastructure

Baseten, San Francisco, California

Job Description

ABOUT BASETEN

Baseten powers inference for the world's most dynamic AI companies, like OpenEvidence, Clay, Mirage, Gamma, Sourcegraph, Writer, Abridge, Bland, and Zed. By uniting applied AI research, flexible infrastructure, and seamless developer tooling, we enable companies operating at the frontier of AI to bring cutting-edge models into production. With our recent $150M Series D funding, backed by investors including BOND, IVP, Spark Capital, Greylock, and Conviction, we’re scaling our team to meet accelerating customer demand.

THE ROLE

As an Infrastructure Product Manager at Baseten, you’ll own the roadmap for our core inference and compute infrastructure, ensuring our platform delivers world-class reliability, scalability, and performance. You’ll work closely with engineering teams to define how we handle large-scale distributed systems, optimize GPU utilization, and provide enterprise-grade security and observability. This is a deeply technical role that bridges engineering excellence and customer impact, ensuring Baseten’s infrastructure is a foundation our users can depend on.

EXAMPLE INITIATIVES

You'll get to work on these types of projects as part of our Infrastructure team:

  • Multi-cloud capacity management

  • Inference on B200 GPUs

  • Multi-node inference

  • Fractional H100 GPUs for efficient model serving

RESPONSIBILITIES

  • Define the product vision and roadmap for Baseten’s inference, serving, and orchestration infrastructure

  • Collaborate with engineering to improve the reliability, latency, and cost efficiency of model deployments

  • Partner with Forward Deployed Engineering and customer teams to translate performance needs into infrastructure investments

  • Drive internal platform scalability from multi-GPU support to hybrid cloud architecture

  • Establish metrics for uptime, latency, and cost, ensuring we deliver best-in-class performance and efficiency

  • Lead cross-functional initiatives around observability, deployment automation, and infrastructure security

REQUIREMENTS

  • 4+ years of Product Management experience in developer platforms, infrastructure, or ML systems

  • Engineering background (e.g., degree in Computer Science, Electrical Engineering, or related field; or equivalent hands-on experience as a software engineer)

  • Strong technical understanding of distributed systems, cloud computing, and GPU-based workloads

  • Proven track record of shipping technical products with measurable reliability or performance improvements

  • Excellent communication and prioritization skills with deeply technical teams

NICE TO HAVE

  • Experience with Kubernetes, autoscaling systems, or inference optimization

  • Understanding of LLM and multimodal model serving

  • Prior experience at a company building infrastructure for ML or developer tools

BENEFITS

  • Competitive compensation, including meaningful equity

  • 100% coverage of medical, dental, and vision insurance for employees and their dependents

  • Generous PTO policy, including a company-wide Winter Break (our offices are closed from Christmas Eve to New Year's Day!)

  • Paid parental leave

  • Company-facilitated 401(k)

  • Exposure to a variety of ML startups, offering unparalleled learning and networking opportunities

Apply now to embark on a rewarding journey in shaping the future of AI! If you are a motivated individual with a passion for machine learning and a desire to be part of a collaborative and forward-thinking team, we would love to hear from you.

At Baseten, we are committed to fostering a diverse and inclusive workplace. We provide equal employment opportunities to all employees and applicants without regard to race, color, religion, gender, sexual orientation, gender identity or expression, national origin, age, genetic information, disability, or veteran status.
