Latest research
November 2025 - OlmoEarth: A new state-of-the-art Earth observation foundation model family
OlmoEarth is a new family of industry-leading open foundation models built to make Earth AI practical, scalable,…
October 2025 - SamudrACE: Highly efficient coupled global climate modeling with the Ai2 climate emulator
SamudrACE couples 3D models of both the ocean and the atmosphere, giving it a deep understanding of global Earth…
October 2025 - Asta DataVoyager: Data-driven discovery and analysis
DataVoyager is our new feature in Asta built to address the challenges scientists face in drilling down into…
September 2025 - Fluid language model benchmarking
We explore how Fluid Benchmarking can adapt evaluation items to a language model’s capability level.
August 2025 - OLMoASR: A series of open speech recognition models
We release OLMoASR, a family of open automatic speech recognition (ASR) models trained from scratch on a curated,…
August 2025 - Asta: Accelerating science through trustworthy agentic AI
We announce Asta, our bold initiative to accelerate science through trustworthy, truly open agentic AI.
August 2025 - AstaBench: Rigorous benchmarking of AI agents with a holistic scientific research suite
Introducing AstaBench, a novel AI agents evaluation framework and scientific research benchmark suite.
August 2025 - Signal and Noise: Reducing uncertainty in language model evaluation
We find that two simple metrics, signal and noise, reveal key differences in the utility of current LLM benchmarks.
August 2025 - MoNaCo: More natural questions for reasoning across dozens of documents
Introducing MoNaCo, a benchmark of highly challenging questions spanning dozens of documents for evaluating large…