San Francisco, CAUSAgreenhouse2026-06-16
Why this is a real AI job: The role is explicitly focused on building and optimizing machine learning systems for enterprise generative AI applications, including post-training of LLMs using reinforcement learning techniques. The job description heavily emphasizes AI/ML skills and expe…
AI is becoming vitally important in every function of our society. At Scale, our mission is to accelerate the development of AI applications. For 9 years, Scale has been the leading AI data foundry, helping fuel the most exciting advancements in AI, including generative AI, defense applications, an…
Details Open source / apply
San Francisco, CaliforniaUSAgreenhouse2026-06-16
Why this is a real AI job: The role is explicitly focused on the architecture, development, and optimization of the inference engine for large language models (LLMs). The job description details deep technical requirements related to ML inference internals, GPU programming, and distrib…
P-1285 About This Role As a staff software engineer for GenAI inference, you will lead the architecture, development, and optimization of the inference engine that powers Databricks Foundation Model API.. You’ll bridge research advances and production demands, ensuring high throughput, low latency,…
Details Open source / apply
San Francisco, CaliforniaUSAgreenhouse2026-06-16
Why this is a real AI job: The role explicitly focuses on optimizing GPU kernels for GenAI inference, requiring deep expertise in ML, GPU architecture, and performance engineering. The tasks are overwhelmingly centered around AI/ML technologies.
P-1285 About This Role As a staff software engineer for GenAI Performance and Kernel, you will own the design, implementation, optimization, and correctness of the high-performance GPU kernels powering our GenAI inference stack. You will lead development of highly-tuned, low-level compute paths, ma…
Details Open source / apply
San Francisco, CaliforniaUSAgreenhouse2026-06-16
Why this is a real AI job: The role is explicitly focused on designing, developing, and optimizing the inference engine for Databricks' Foundation Model API (LLMs). The job description details deep technical work with model architectures, optimization, and distributed systems – all cor…
P-1284 About This Role As a software engineer for GenAI inference, you will help design, develop, and optimize the inference engine that powers Databricks’ Foundation Model API. You’ll work at the intersection of research and production, ensuring our large language model (LLM) serving systems are f…
Details Open source / apply
San Francisco, CaliforniaUSAgreenhouse2026-06-16
Why this is a real AI job: The role explicitly focuses on research and engineering related to GenAI, LLMs, and optimizing training workflows for large models. The tasks are deeply technical and directly contribute to advancing the state-of-the-art in AI.
At Databricks, we are obsessed with enabling data teams to solve the world’s toughest problems, from security threat detection to cancer drug development. We do this by building and running the world’s best data and AI platform so our customers can focus on the high-value challenges that are centra…
Details Open source / apply