CUDA Jobs – Page 2 | AI Job Radar

How to use CUDA in applications

If a job requires CUDA, the skill should be supported by a project, course or portfolio example. The application check reviews whether the skill is actually evidenced in your CV.

Document a small project or notebook
Name relevant tools and methods
Connect the evidence to the target role

Results

Companies

95.2

Average score

Remote

7 results on this page. 17 results in total. More results are available via pagination, company pages, skill pages and job detail pages.

Machine Learning Systems Research Engineer, Agent Post-training - Enterprise GenAI

Scale AI · San Francisco, CA

95/100

San Francisco, CAUSAgreenhouse2026-07-31

Why this is a real AI job: The role is explicitly focused on building and optimizing machine learning systems for enterprise generative AI applications, including post-training of LLMs using reinforcement learning techniques. The job description heavily emphasizes AI/ML skills and expe…

AI is becoming vitally important in every function of our society. At Scale, our mission is to accelerate the development of AI applications. For 9 years, Scale has been the leading AI data foundry, helping fuel the most exciting advancements in AI, including generative AI, defense applications, an…

Details Open source / apply

Staff Software Engineer - GenAI inference

Databricks · San Francisco, California

95/100

San Francisco, CaliforniaUSAgreenhouse2026-07-31

LLM GenAI ML inference CUDA GPU programming MLOps Model Serving Deep Learning Data Science Distributed Systems

Why this is a real AI job: The role is explicitly focused on the architecture, development, and optimization of the inference engine for large language models (LLMs). The job description details deep technical requirements related to ML inference internals, GPU programming, and distrib…

P-1285 About This Role As a staff software engineer for GenAI inference, you will lead the architecture, development, and optimization of the inference engine that powers Databricks Foundation Model API.. You’ll bridge research advances and production demands, ensuring high throughput, low latency,…

Details Open source / apply

Staff Software Engineer - GenAI Performance and Kernel

Databricks · San Francisco, California

95/100

San Francisco, CaliforniaUSAgreenhouse2026-07-31

CUDA Triton LLVM IR GPU/accelerator architecture ML-specific kernel libraries cuBLAS cuDNN CUTLASS oneDNN Auto-tuning

Why this is a real AI job: The role explicitly focuses on optimizing GPU kernels for GenAI inference, requiring deep expertise in ML, GPU architecture, and performance engineering. The tasks are overwhelmingly centered around AI/ML technologies.

P-1285 About This Role As a staff software engineer for GenAI Performance and Kernel, you will own the design, implementation, optimization, and correctness of the high-performance GPU kernels powering our GenAI inference stack. You will lead development of highly-tuned, low-level compute paths, ma…

Details Open source / apply

Software Engineer - GenAI inference

Databricks · San Francisco, California

95/100

San Francisco, CaliforniaUSAgreenhouse2026-07-31

LLM GenAI ML inference CUDA GPU programming MLOps Model Serving Data Science Deep Learning Distributed Systems

Why this is a real AI job: The role is explicitly focused on designing, developing, and optimizing the inference engine for Databricks' Foundation Model API (LLMs). The job description details deep technical work with model architectures, optimization, and distributed systems – all cor…

P-1284 About This Role As a software engineer for GenAI inference, you will help design, develop, and optimize the inference engine that powers Databricks’ Foundation Model API. You’ll work at the intersection of research and production, ensuring our large language model (LLM) serving systems are f…

Details Open source / apply

Senior GenAI Research Engineer - Optimization and Kernels

Databricks · San Francisco, California

95/100

San Francisco, CaliforniaUSAgreenhouse2026-07-14

LLM GenAI CUDA PyTorch Deep Learning Machine Learning GPU Optimization Distributed Training MLOps

Why this is a real AI job: The role explicitly focuses on research and engineering related to GenAI, LLMs, and optimizing training workflows for large models. The tasks are deeply technical and directly contribute to advancing the state-of-the-art in AI.

At Databricks, we are obsessed with enabling data teams to solve the world’s toughest problems, from security threat detection to cancer drug development. We do this by building and running the world’s best data and AI platform so our customers can focus on the high-value challenges that are centra…

Details Open source / apply

Research Engineer, Machine Learning

Mistral AI · Palo Alto

95/100

Palo AltoFranceFull-timelever2026-07-10

Machine Learning Deep Learning LLMs NLP PyTorch TensorFlow JAX CUDA Distributed Training Data Pipelines

Why this is a real AI job: The role explicitly focuses on building and optimizing large-scale learning systems for open-weight models, working directly with research scientists on core ML tasks. The description details hands-on work with cutting-edge deep learning techniques and produc…

About Mistral At Mistral AI, we believe in the power of AI to simplify tasks, save time, and enhance learning and creativity. Our technology is designed to integrate seamlessly into daily working life. We democratize AI through high-performance, optimized, open-source and cutting-edge models, produ…

Details Open source / apply

Research Engineer, Machine Learning - Paris/London/Zurich/Warsaw

Mistral AI · Paris

95/100

ParisFranceFull-timelever2026-07-10

Machine Learning Deep Learning NLP LLMs PyTorch JAX TensorFlow CUDA Data Pipelines Distributed Training

Why this is a real AI job: The role explicitly focuses on building and optimizing large-scale learning systems for open-weight models, working directly with research scientists on cutting-edge deep learning techniques and integrating them into production. The job description heavily em…

Details Open source / apply