Computer Software Jobs 2026 (Now Hiring) – Smart Auto Apply

We've scanned millions of jobs. Simply select your favorites, and we can fill out the applications for you.

Senior Software Engineer - Model Performance

Inference
San Francisco, California

$220,000 - $320,000 / year

Help us make inference blazingly fast. If you love squeezing every last drop of performance out of GPUs, diving deep into CUDA kernels, and turning optimization techniques into pro...

Posted 30+ days ago

Software Engineer - Data Infrastructure

Maxima
San Mateo, California
Company Description: At Maxima, we’re building an agentic AI platform to automate enterprise accounting. Our system handles high-volume financial data, complex accounting workflows,...

Posted 30+ days ago

Future Opportunities - Software Engineering

Epirus
Torrance, California
About Epirus: Epirus is a high-growth technology company dedicated to overcoming the asymmetric challenges inherent to the future of national security. Epirus' flagship product, Leo...

Posted 30+ days ago

Software Engineer – Developer and Production Systems Support

Nightwing Intelligence Solutions
Sterling, Virginia
Nightwing provides technically advanced full-spectrum cyber, data operations, systems integration and intelligence mission support services to meet our customers’ most demanding ch...

Posted 30+ days ago

Software Engineer Intern

Brevium
American Fork, Utah

$25 - $32 / hour

About Brevium: Brevium is a fast-growing tech company that develops innovative software solutions for medical practices, focusing on patient appointment lifecycle management. By us...

Posted 30+ days ago

Senior Software Engineer

Woolpert
Pittsburgh, Pennsylvania

$118,200 - $147,800 / year

We seek to move the world forward through innovative thinking. Woolpert is an award-winning, global leader in architecture, engineering, and geospatial services. We blend design ex...

Posted 30+ days ago

Software Engineer - Santa Barbara, CA

Pearly
Santa Barbara, California
The Role: We are looking for a Santa Barbara, CA-based Software Engineer to develop platform capabilities that position Pearly as the leading payments layer in the dental industry....

Posted 30+ days ago

Software Engineer, Privacy Infrastructure

OpenAI
San Francisco, California
About the Team: OpenAI’s Privacy Engineering team sits at the intersection of Security, Privacy, Legal, and Core Infrastructure. Our mission is to build data infrastructure and syst...

Posted 30+ days ago

Senior Software Engineer - AI/ML

Bevel
New York, New York
About Bevel: Bevel is an AI Health Companion that helps people make smarter decisions about their health every day. Our app brings together data across sleep, recovery, activity, nu...

Posted 30+ days ago

Software Engineer - Fullstack (Frontend Leaning)

Artemis
New York, New York
About Artemis: Traditional finance and crypto-enabled finance are merging into digital finance: an open, global financial system for everyone. Stripe launched its own blockchain, J...

Posted 30+ days ago

Software Engineering Intern

Monogram
San Mateo, California
About: Monogram is the first visual interface for AI. Our brains are wired for vision. Half of our neural horsepower is dedicated to visual input. Evolution didn't train us to parse...

Posted 30+ days ago

Senior System Software Engineer - Neural Graphics SDKs

NVIDIA
Redmond, Washington

$184,000 - $287,500 / year

NVIDIA is a world-leader in Gaussian Splatting and Neural reconstruction. Our team builds the Omniverse NuRec SDK to enable robotic, healthcare, and AV developers to build better m...

Posted 30+ days ago

Senior Software Engineer (Space Communications)

northwoodspace
Los Angeles, California
About Northwood: Northwood is on a mission to transform connectivity between Earth and space, bringing the benefits of space to the masses through innovations in space communicatio...

Posted 30+ days ago

Staff Software Engineering Manager (TS/SCI) {S}

ARKA Group, L.P.
Aurora, Colorado

$150,000 - $190,000 / year

ARKA Group L.P. (“ARKA”) is an advanced technologies company serving the U.S. military, intelligence community, and commercial space industry delivering next-generation solutions t...

Posted 30+ days ago

Software Engineer, Product

Kalshi
New York City, New York

$200,000 - $280,000 / year

What is Kalshi? Kalshi has defined a new category: prediction markets. Kalshi allows people to trade on the outcome of any event and turn any question about the future into a fina...

Posted 30+ days ago

Embedded Software Developer

Hewlett Packard Enterprise
Spring, Texas

$106,000 - $243,000 / year

Embedded Software Developer. This role has been designed as ‘Onsite’ with an expectation that you will primarily work from an HPE office. Who We Are: Hewlett Packard Enterprise is t...

Posted 30+ days ago

Staff Software Engineer, Backend

Coco
Los Angeles, California
At Coco, we’re dedicated to perfecting the last-mile delivery experience through robotics. We believe the delivery service industry in its current state is massively under-serving...

Posted 30+ days ago

Senior/Staff Embedded Software Engineer, Robotics Devices

RoboForce
Milpitas, California
Why RoboForce: RoboForce is an AI robotics company developing Physical AI–powered Robo-Labor for dull, dirty, and dangerous work. The company’s robots are engineered for demanding i...

Posted 30+ days ago

Software Engineer (Lincoln, NE)

Xage Security
Lincoln, Nebraska
About the Role: In this role, you will work with highly talented engineers at all levels to build and deliver the most advanced security solutions focused on high quality, sc...

Posted 30+ days ago

Principal Software Engineer, Profiling Services

NVIDIA
Austin, Texas

$272,000 - $431,250 / year

Help design and ship an Always-On, low-overhead GPU profiling service that runs in production, scales across cluster environments, and delivers actionable insights for ML workloads...

Posted 30+ days ago

Senior Software Engineer - Model Performance

Inference
San Francisco, California

$220,000 - $320,000 / year

Automate your job search with Sonara.

Submit 10x as many applications with less effort than one manual application.

Reclaim your time by letting our AI handle the grunt work of job searching.

We continuously scan millions of openings to find your top matches.

Overview

Schedule: Full-time
Career level: Senior-level
Remote: Hybrid remote
Compensation: $220,000-$320,000/year

Job Description

Help us make inference blazingly fast. If you love squeezing every last drop of performance out of GPUs, diving deep into CUDA kernels, and turning optimization techniques into production systems, we'd love to meet you.

About Inference.net

Inference.net trains and hosts specialized language models for companies that need frontier-quality AI at a fraction of the cost. The models we train match GPT-5 accuracy but are smaller, faster, and up to 90% cheaper. Our platform handles everything end-to-end: distillation, training, evaluation, and planet-scale hosting.

We are a well-funded ten-person team of engineers who work in-person in downtown San Francisco on difficult, high-impact engineering problems. Everyone on the team has been writing code for over 10 years, and has founded and run their own software companies. We are high-agency, adaptable, and collaborative. We value creativity alongside technical prowess and humility. We work hard, and deeply enjoy the work that we do. Most of us are in the office 4 days a week in SF; hybrid works for Bay Area candidates.

About the Role

You will be responsible for making our inference stack as fast and efficient as possible. Your work spans from implementing known optimization techniques to experimenting with novel approaches, always with the goal of serving models faster and cheaper at scale.

Your north star is inference performance: latency, throughput, cost efficiency, and how quickly we can bring new model architectures into production. You'll work across the full inference stack, from CUDA kernels to serving frameworks, to find and eliminate bottlenecks. This role reports directly to the founding team. You'll have autonomy, a large compute budget, and the technical support to push the limits of what's possible in model serving.

Key Responsibilities

  • Implement and productionize optimization techniques including quantization, speculative decoding, KV cache optimization, continuous batching, and LoRA serving

  • Deep dive into inference frameworks (vLLM, SGLang, TensorRT-LLM) and underlying libraries to debug and improve performance

  • Profile and optimize CUDA kernels and GPU utilization across our serving infrastructure

  • Add support for new model architectures, ensuring they meet our performance standards before going to production

  • Experiment with novel inference techniques and bring successful approaches into production

  • Build tooling and benchmarks to measure and track inference performance across our fleet

  • Collaborate with applied ML engineers to ensure trained models can be served efficiently

Requirements

  • 2+ years of experience in ML systems, inference optimization, or GPU programming

  • Strong proficiency in Python and familiarity with C++

  • Hands-on experience with LLM inference frameworks (vLLM, SGLang, TensorRT-LLM, or similar)

  • Deep understanding of GPU architecture and experience profiling GPU workloads

  • Familiarity with LLM optimization techniques (quantization, speculative decoding, continuous batching, KV cache management)

  • Experience with PyTorch and understanding of how models execute on hardware

  • Track record of measurably improving system performance

Nice-to-Have

  • Experience with CUDA programming

  • Familiarity with serving non-LLM models (TTS, vision, embeddings)

  • Experience with distributed inference and multi-GPU serving

  • Contributions to open-source inference frameworks

  • Experience with Docker and Kubernetes

You don't need to tick every box. Curiosity and the ability to learn quickly matter more.

Compensation

We offer competitive compensation, equity in a high-growth startup, and comprehensive benefits. The base salary range for this role is $220,000 - $320,000, plus equity and benefits, depending on experience.

Equal Opportunity

Inference.net is an equal opportunity employer. We welcome applicants from all backgrounds and don't discriminate based on race, color, religion, gender, sexual orientation, national origin, genetics, disability, age, or veteran status.

If you're excited about making AI inference faster for everyone, we'd love to hear from you. Please send your resume and GitHub to amar@inference.net and/or apply here on Ashby.
