AI Job Radar

Reinforcement Learning Jobs

Aktuelle KI-Jobs mit Reinforcement Learning, passende Lernpfade und Bewerbungsbezug.

How to use Reinforcement Learning in applications

If a job requires Reinforcement Learning, the skill should be supported by a project, course or portfolio example. The application check reviews whether the skill is actually evidenced in your CV.

17
Results
7
Companies
94.6
Average score
Remote

10 results on this page. 17 results in total. More results are available via pagination, company pages, skill pages and job detail pages.

Mountain View, CA (HQ)USAgreenhouse2026-06-08

Why this is a real AI job: Die Rolle ist klar auf die Entwicklung und Optimierung von LLM-basierten Agenten, Reinforcement Learning, Evaluationsrahmen und agilen Architekturen ausgerichtet. KI ist der überwiegende Kern der Tätigkeit, sowohl in Forschung als auch in Produktionsumgebung.

About Glean: Glean is the Work AI platform that helps everyone work smarter with AI. What began as the industry’s most advanced enterprise search has evolved into a full-scale Work AI ecosystem, powering intelligent Search, an AI Assistant, and scalable AI agents on one secure, open platform. With…

Details Open source / apply

Herzliya, IsraelUKgreenhouse2026-06-16

Why this is a real AI job: The role explicitly focuses on developing and training AI models for autonomous driving (L2-L4), owning the entire ML lifecycle, and deploying these models into vehicles. The core responsibilities are heavily centered around AI/ML tasks.

About us Founded in 2017, Wayve is the leading developer of Embodied AI technology. Our advanced AI software and foundation models enable vehicles to perceive, understand, and navigate any complex environment, enhancing the usability and safety of automated driving systems. Our vision is to create…

Details Open source / apply

Machine Learning Engineer, Enterprise Brain

Glean · Mountain View, CA (HQ), San Francisco, CA

95/100
Mountain View, CA (HQ), San Francisco, CAUSAgreenhouse2026-06-16

Why this is a real AI job: The role explicitly focuses on building and improving AI-powered products (Enterprise Brain) using LLMs, ML techniques, and agent orchestration. The tasks are heavily centered around core AI/ML engineering principles.

About Glean: Glean is the Work AI platform that helps everyone work smarter with AI. What began as the industry’s most advanced enterprise search has evolved into a full-scale Work AI ecosystem, powering intelligent Search, an AI Assistant, and scalable AI agents on one secure, open platform. With…

Details Open source / apply

San Francisco, CAUSAgreenhouse2026-06-16

Why this is a real AI job: The role is explicitly focused on building and maintaining systems for evaluating and improving LLM-powered AI assistants and agents. The tasks directly involve ML models, evaluation pipelines, and LLM-powered judges.

About Glean: Glean is the Work AI platform that helps everyone work smarter with AI. What began as the industry’s most advanced enterprise search has evolved into a full-scale Work AI ecosystem, powering intelligent Search, an AI Assistant, and scalable AI agents on one secure, open platform. With…

Details Open source / apply

Applied Machine Learning Research Scientist

Cerebras Systems · Headquarters/Sunnyvale Office, Toronto Office

95/100
Headquarters/Sunnyvale Office, Toronto OfficeUSAgreenhouse2026-06-16

Why this is a real AI job: The role explicitly focuses on applying and improving machine learning techniques, specifically LLMs, for training, optimization, and deployment. The responsibilities are heavily centered around ML systems and pipelines.

Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training…

Details Open source / apply

San FranciscoUSAFullTimeashby2026-06-16

Why this is a real AI job: The role explicitly focuses on AI research and development, specifically large language models (LLMs) and their application to Perplexity's core products. The responsibilities are heavily centered around model training, optimization, and implementation, indic…

Perplexity is seeking top-tier AI Research Scientists and Engineers to advance our AI products and capabilities. We're building the future of AI-powered search and agent experiences through our Sonar models, Deep Research Agent, Comet Agent, and Search products. Join us in creating SOTA experiences…

Details Open source / apply

New York, NY, San Francisco, CA, Seattle, WAUSAgreenhouse2026-06-16

Why this is a real AI job: The role explicitly focuses on designing, building, and deploying AI agents using LLMs and related technologies. The job description heavily emphasizes AI/ML concepts and their application to real-world enterprise problems.

Scale AI is the data foundation for AI, helping organizations build and deploy reliable production AI applications. We partner with leading enterprises and government organizations to accelerate their AI initiatives through our data annotation platform, generative AI solutions, and enterprise AI ca…

Details Open source / apply

Washington, DCUSAgreenhouse2026-06-16

Why this is a real AI job: The role is explicitly focused on building and managing data pipelines for Generative AI models (LLMs), including SFT and RLHF. The job description heavily emphasizes AI/ML concepts and their application to customer projects.

Scale is at the frontier of the AI industry, improving the world’s leading generative AI and large language models through model evaluations, human-powered supervised fine-tuning datasets, world-class reinforcement learning with human feedback, and more. Scale AI’s Public Sector team is growing in…

Details Open source / apply

San Francisco, CAUSAgreenhouse2026-06-16

Why this is a real AI job: The role explicitly focuses on research and development of post-training techniques for LLMs, including SFT, RLHF, and reward modeling. The job description highlights the application of these techniques to enhance LLM capabilities and solve core AI problems.

Scale works with the industry’s leading AI labs to provide high quality data and accelerate progress in GenAI research. We are looking for Research Scientists and Research Engineers with expertise in LLM post-training (SFT, RLHF, reward modeling). This role will focus on optimizing data curation an…

Details Open source / apply

London, UKUSAgreenhouse2026-06-16

Why this is a real AI job: The role is explicitly focused on research and engineering of reinforcement learning for large language models, directly contributing to the core AI capabilities of Anthropic's products. The job description heavily emphasizes AI/ML/LLM concepts and techniques.

About Anthropic Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working to…

Details Open source / apply