San Francisco | USA | Full Time | ashby | 2026-06-16
Why this is a real AI job: The role is explicitly focused on building, scaling, and optimizing LLM inference workloads. The team is a 'Forward Deployed Engineering' team working directly with customers on AI deployments. The requirements clearly state experience with LLMs and ML infere…
ABOUT BASETEN Baseten powers mission-critical inference for the world's most dynamic AI companies, like Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma and Writer. By uniting applied AI research, flexible infrastructure, and seamless developer tooling, we enable companies operating at the fronti…
San Francisco | USA | greenhouse | 2026-06-16
Why this is a real AI job: The role is explicitly focused on building and optimizing the model serving layer for voice applications, working with state-of-the-art voice models and inference engines. The responsibilities are heavily centered around ML engineering tasks.
About the Role Together AI is building the best inference infrastructure for voice applications. Our Voice AI platform powers production-grade, real-time voice agents and applications — serving speech-to-text and text-to-speech models with best-in-class latency and reliability. We're looking for a…
San Francisco | USA | greenhouse | 2026-06-16
Why this is a real AI job: The role is entirely focused on building and optimizing the model serving layer for voice applications, including LLMs, STT, and TTS. It requires deep expertise in ML engineering, inference optimization, and GPU utilization. The responsibilities and requireme…
About the Role Together AI is building the best inference infrastructure for voice applications. Our Voice AI platform powers production-grade, real-time voice agents and applications — serving speech-to-text and text-to-speech models with best-in-class latency and reliability. We're looking for a…
San Francisco, California | USA | greenhouse | 2026-06-16
Why this is a real AI job: The role is explicitly focused on the architecture, development, and optimization of the inference engine for large language models (LLMs). The job description details deep technical requirements related to ML inference internals, GPU programming, and distrib…
P-1285 About This Role As a staff software engineer for GenAI inference, you will lead the architecture, development, and optimization of the inference engine that powers Databricks’ Foundation Model API. You’ll bridge research advances and production demands, ensuring high throughput, low latency,…
San Francisco, California | USA | greenhouse | 2026-06-16
Why this is a real AI job: The role is explicitly focused on designing, developing, and optimizing the inference engine for Databricks' Foundation Model API (LLMs). The job description details deep technical work with model architectures, optimization, and distributed systems – all cor…
P-1284 About This Role As a software engineer for GenAI inference, you will help design, develop, and optimize the inference engine that powers Databricks’ Foundation Model API. You’ll work at the intersection of research and production, ensuring our large language model (LLM) serving systems are f…
Belgrade, Serbia | USA | greenhouse | 2026-06-16
Why this is a real AI job: The role explicitly focuses on building and deploying ML/AI models and systems, improving the performance of AI-powered products, and working with foundational models. The description highlights core AI/ML engineering tasks.
P-1439 As a Senior Applied ML/AI Engineer at Databricks, you will apply machine learning and optimization algorithms to improve the usability and efficiency of the current AutoML and several other user-facing products that will benefit from better classification, regression, forecasting, and recomm…
San Francisco, California | USA | greenhouse | 2026-06-16
Why this is a real AI job: The role is explicitly focused on building a generative AI platform, including all stages of the ML lifecycle (data generation, training, evaluation, serving, agent-building). The job description heavily emphasizes ML and AI technologies.
P-984 Founded in late 2020 by a small group of machine learning engineers and researchers, Mosaic AI enables companies to securely fine-tune, train and deploy custom AI models on their own data, for maximum security and control. Compatible with all major cloud providers, the Mosaic AI platform prov…
New York City, New York | USA | greenhouse | 2026-05-26
Why this is a real AI job: The role deals directly with developing and leading products that span LLM endpoints, model serving, and AI governance. The title and description show clear technical and conceptual AI work, particularly in the area of LLM inferenc…
RDQ127R255 At Databricks, we are passionate about enabling data teams to solve the world’s toughest problems — from making the next mode of transportation a reality to accelerating the development of medical breakthroughs. We do this by building and running the world’s best data and AI infrastructu…
Dallas, Texas | USA | greenhouse | 2026-05-26
Why this is a real AI job: The role involves in-depth work with Data & AI technologies, including analyzing and optimizing AI workflows, model serving, Spark, and Delta. The job is strongly focused on developing, maintaining, and supporting complex Data & AI systems…
P-1398 Note: this is a hybrid role and requires ~3 days in the office in Plano, TX. Mission As a Staff Data & AI Technical Solutions Engineer, you will personally drive and mentor others in producing Data & AI technical solutions for any issues reported by customers - including deep diving into pro…
Brazil | USA | greenhouse | 2026-05-26
Why this is a real AI job: The role involves in-depth work with AI workflows, data pipelines, Spark, Delta, model serving, and ML/AI applications. The TSE analyzes code-level problems, optimizes performance, and supports customers in using AI technologies on the Databr…
P-993 Mission As a Data & AI Technical Solutions Engineer, you play a critical role by helping customers debug and maintain stable production data pipelines, AI workflows, and more using the Databricks platform. You will develop product expertise in a couple of areas by advising a broad set of cust…