Model Evaluation Jobs – Page 2

Data Scientist - Algorithms, Community Support

Airbnb · United States

95/100

United StatesUSAgreenhouse2026-07-31

Why this is a real AI job: The role explicitly focuses on building and deploying LLM/ML models for Community Support, automating evaluation processes, curating datasets for training, understanding customer issues, and personalization. The job description heavily emphasizes AI/ML expert…

Airbnb was born in 2007 when two hosts welcomed three guests to their San Francisco home, and has since grown to over 5 million hosts who have welcomed over 2 billion guest arrivals in almost every country across the globe. Every day, hosts offer unique stays and experiences that make it possible f…

Details Open source / apply

Post-Doctoral AI Researcher (2 year Fixed-Term)

Snowflake AI · US-WA-Bellevue

95/100

US-WA-BellevueUSAFullTimeashby2026-07-31

Machine Learning Deep Learning Python PyTorch JAX TensorFlow NLP LLMs Data Science Model Evaluation

Why this is a real AI job: The role is explicitly focused on AI research and application to real-world domains. The job description heavily emphasizes machine learning techniques, model development, and publishing research.

At Snowflake, we are powering the era of the agentic enterprise. To usher in this new era, we seek AI-native thinkers across every function who are energized by the opportunity to reinvent how they work. You don’t just use tools; you possess an innate curiosity, treating AI as a high-trust collabor…

Details Open source / apply

Senior Data Scientist

Snowflake AI · US-CA-Menlo Park

95/100

US-CA-Menlo ParkUSAFullTimeashby2026-07-31

Machine Learning Time-Series Analysis Statistical Modeling Probabilistic Modeling Python SQL Data Science Forecasting Model Evaluation MLOps

Why this is a real AI job: The role explicitly focuses on building and deploying production-grade ML/statistical systems for forecasting, a core data science task. The description details advanced modeling techniques and a strong emphasis on ML and statistical research.

At Snowflake, we are powering the era of the agentic enterprise. To usher in this new era, we seek AI-native thinkers across every function who are energized by the opportunity to reinvent how they work. You don’t just use tools; you possess an innate curiosity, treating AI as a high-trust collabor…

Details Open source / apply

Machine Learning Engineer, AV Engineering

Wayve · Herzliya, Israel

95/100

Herzliya, IsraelUKgreenhouse2026-07-31

Machine Learning Deep Learning Data Mining Data Curation Model Training Model Evaluation Reinforcement Learning Transformer Networks PyTorch MLOps

Why this is a real AI job: The role explicitly focuses on developing and training AI models for autonomous driving (L2-L4), owning the entire ML lifecycle, and deploying these models into vehicles. The core responsibilities are heavily centered around AI/ML tasks.

About us Founded in 2017, Wayve is the leading developer of Embodied AI technology. Our advanced AI software and foundation models enable vehicles to perceive, understand, and navigate any complex environment, enhancing the usability and safety of automated driving systems. Our vision is to create…

Details Open source / apply

Applied Scientist / Machine Learning Engineer

Wayve · Sunnyvale, California USA

95/100

Sunnyvale, California USAUKgreenhouse2026-07-31

Machine Learning Data Science Deep Learning Foundation Models LLMs Computer Vision NLP Reinforcement Learning Data Curation MLOps

Why this is a real AI job: The role is explicitly focused on building and improving foundation models for autonomous driving. The core responsibilities revolve around data curation, enrichment, model training, evaluation, and deployment – all central to AI/ML work.

About us Founded in 2017, Wayve is the leading developer of Embodied AI technology. Our advanced AI software and foundation models enable vehicles to perceive, understand, and navigate any complex environment, enhancing the usability and safety of automated driving systems. Our vision is to create…

Details Open source / apply

Machine Learning Engineer, ADAS

Wayve · London, United Kingdom

95/100

London, United KingdomUKgreenhouse2026-07-31

Machine Learning Computer Vision Deep Learning Data Pipelines MLOps 3D Perception Data Annotation Model Evaluation

Why this is a real AI job: The role explicitly focuses on building and improving computer vision and 3D perception models for autonomous driving systems. It involves the entire ML lifecycle, from data creation to model deployment and iteration.

About us Founded in 2017, Wayve is the leading developer of Embodied AI technology. Our advanced AI software and foundation models enable vehicles to perceive, understand, and navigate any complex environment, enhancing the usability and safety of automated driving systems. Our vision is to create…

Details Open source / apply

Senior Machine Learning Scientist

Intercom · Dublin, Ireland

95/100

Dublin, IrelandGlobalgreenhouse2026-07-31

Machine Learning Deep Learning NLP Data Analysis Algorithm Research Model Evaluation MLOps Statistical Modeling Experiment Design

Why this is a real AI job: The role is explicitly focused on building and deploying machine learning models for a core AI product (Fin - AI Customer Agent). The job description details tasks such as identifying ML opportunities, researching algorithms, conducting data analysis, and bri…

Fin is the AI Customer Agent company on a mission to help businesses provide perfect customer experiences. Our AI Agent Fin is the highest-performing AI Customer Agent on the market today, enabling businesses to deliver impeccable, always-on customer support across the customer journey – from servi…

Details Open source / apply

Staff Machine Learning Scientist

Intercom · London, England

95/100

London, EnglandGlobalgreenhouse2026-07-31

Machine Learning Deep Learning NLP Data Analysis Algorithm Research Model Evaluation MLOps Data Science Python SQL

Why this is a real AI job: The role is explicitly focused on machine learning for a core AI product (Fin AI Agent). The job description details research, development, and deployment of ML models, with a strong emphasis on product impact. The candidate will be a technical leader in an M…

Fin is the AI Customer Agent company on a mission to help businesses provide perfect customer experiences. Our AI Agent Fin is the highest-performing AI Customer Agent on the market today, enabling businesses to deliver impeccable, always-on customer support across the customer journey – from servi…

Details Open source / apply

Staff Machine Learning Scientist

Intercom · Dublin, Ireland

95/100

Dublin, IrelandGlobalgreenhouse2026-07-31

Machine Learning Deep Learning NLP Data Analysis Algorithm Research Model Evaluation MLOps Data Science Python SQL

Why this is a real AI job: The role is explicitly focused on building and deploying machine learning models to power the core product (Fin AI Agent). The description details research, development, and productionization of ML features, algorithms, and models. The team is 'ML focused' an…

Fin is the AI Customer Agent company on a mission to help businesses provide perfect customer experiences. Our AI Agent Fin is the highest-performing AI Customer Agent on the market today, enabling businesses to deliver impeccable, always-on customer support across the customer journey – from servi…

Details Open source / apply

AI Engineer - Public Sector

Unstructured · Remote

95/100

RemoteUSAFullTimeashby2026-07-31

LLMs RAG Agentic Systems Python Machine Learning Data Pipelines NLP Computer Vision Vector Databases Embedding Models

Why this is a real AI job: The role is explicitly focused on building and deploying AI solutions (RAG pipelines, agentic systems) for government clients. The job description heavily emphasizes AI/ML technologies and their application to real-world problems. A significant portion of the…

Unstructured is defining the standard for enterprise data transformation in the age of LLMs and generative AI. In just two years, we've raised over $65M from world-class investors, including Menlo Ventures, Bain Capital, Databricks, NVIDIA, Microsoft, and IBM. Our open-source toolkit has been downl…

Details Open source / apply

How to use Model Evaluation in applications