
Product Manager - Infrastructure
Automate your job search with Sonara.
Submit 10x as many applications with less effort than one manual application.1
Reclaim your time by letting our AI handle the grunt work of job searching.
We continuously scan millions of openings to find your top matches.

Job Description
ABOUT BASETEN
Baseten powers inference for the world's most dynamic AI companies, like OpenEvidence, Clay, Mirage, Gamma, Sourcegraph, Writer, Abridge, Bland, and Zed. By uniting applied AI research, flexible infrastructure, and seamless developer tooling, we enable companies operating at the frontier of AI to bring cutting-edge models into production. With our recent $150M Series D funding, backed by investors including BOND, IVP, Spark Capital, Greylock, and Conviction, we’re scaling our team to meet accelerating customer demand.
THE ROLE
As an Infrastructure Product Manager at Baseten, you’ll own the roadmap for our core inference and compute infrastructure, ensuring our platform delivers world-class reliability, scalability, and performance. You’ll work closely with engineering teams to define how we handle large-scale distributed systems, optimize GPU utilization, and provide enterprise-grade security and observability. This is a deeply technical role that bridges engineering excellence and customer impact, ensuring Baseten’s infrastructure is a foundation our users can depend on.
EXAMPLE INITIATIVES
You'll get to work on these types of projects as part of our Infrastructure team:
Multi-cloud capacity management
Inference on B200 GPUs
Multi-node inference
Fractional H100 GPUs for efficient model serving
RESPONSIBILITIES
Define the product vision and roadmap for Baseten’s inference, serving, and orchestration infrastructure
Collaborate with engineering to improve the reliability, latency, and cost efficiency of model deployments
Partner with Forward Deployed Engineering and customer teams to translate performance needs into infrastructure investments
Drive internal platform scalability from multi-GPU support to hybrid cloud architecture
Establish metrics for uptime, latency, and cost, ensuring we deliver best-in-class performance and efficiency
Lead cross-functional initiatives around observability, deployment automation, and infrastructure security
REQUIREMENTS
4+ years of Product Management experience in developer platforms, infrastructure, or ML systems
Engineering background (e.g., degree in Computer Science, Electrical Engineering, or related field; or equivalent hands-on experience as a software engineer)
Strong technical understanding of distributed systems, cloud computing, and GPU-based workloads
Proven track record of shipping technical products with measurable reliability or performance improvements
Excellent communication and prioritization skills with deeply technical teams
NICE TO HAVE
Experience with Kubernetes, autoscaling systems, or inference optimization
Understanding of LLM and multimodal model serving
Prior experience at a company building infrastructure for ML or developer tools
BENEFITS
Competitive compensation, including meaningful equity.
100% coverage of medical, dental, and vision insurance for employee and dependents
Generous PTO policy including company wide Winter Break (our offices are closed from Christmas Eve to New Year's Day!)
Paid parental leave
Company-facilitated 401(k)
Exposure to a variety of ML startups, offering unparalleled learning and networking opportunities.
Apply now to embark on a rewarding journey in shaping the future of AI! If you are a motivated individual with a passion for machine learning and a desire to be part of a collaborative and forward-thinking team, we would love to hear from you.
At Baseten, we are committed to fostering a diverse and inclusive workplace. We provide equal employment opportunities to all employees and applicants without regard to race, color, religion, gender, sexual orientation, gender identity or expression, national origin, age, genetic information, disability, or veteran status.
Automate your job search with Sonara.
Submit 10x as many applications with less effort than one manual application.
