San FranciscoUSAFullTimeashby2026-06-16
Why this is a real AI job: The role is explicitly focused on AI/LLM inference, solution architecture for AI products, and working with customers deploying AI models. The responsibilities heavily involve technical AI concepts and deployments.
ABOUT BASETEN Baseten powers mission-critical inference for the world's most dynamic AI companies, like Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma and Writer. By uniting applied AI research, flexible infrastructure, and seamless developer tooling, we enable companies operating at the fronti…
Details Open source / apply
San FranciscoUSAgreenhouse2026-06-16
Why this is a real AI job: The role is focused on building and optimizing a platform for custom models and inference, specifically for video and audio generation. The responsibilities directly involve ML bottlenecks, model bring-up, optimization, and scaling. The company is a research-…
About the Role Our team focuses on enabling custom models and dedicated inference on Together. We are responsible for building a container platform, optimizing autoscaling, minimizing cold starts, achieving the best end-to-end model performance, and providing a best-in-class developer experience wi…
Details Open source / apply