Foundation Models Meet Embodied Agents
@ 2025 BEHAVIOR Challenge
December 6-7, 2025
San Diego Convention Center, San Diego, California
Robots in the BEHAVIOR simulator perform everyday activities (like preparing food) in virtual home environments.
BEHAVIOR (Benchmark for Everyday Household Activities in Virtual, Interactive, and Realistic environments) is a large-scale embodied AI benchmark with 1,000 defined household tasks grounded in real human needs. These tasks introduce long-horizon mobile manipulation challenges in realistic settings, bridging the gap between current research and real-world, human-centric applications.
Even state-of-the-art robot learning models still struggle with the complexity and extended duration of BEHAVIOR's activities, which is why we are thrilled to announce the 1st BEHAVIOR Challenge at NeurIPS 2025. This competition invites the community to tackle 50 full-length tasks in a realistic simulator, pushing the frontiers of both high-level planning and low-level control in house-scale environments.
Participants will need to make progress on hierarchical planning, robust perception under realistic visual conditions, and reliable manipulation across long-horizon episodes. By focusing on full-length, human-scale household tasks, the challenge aims to surface the practical limitations of current methods and drive advances that matter for real-world robot deployments.
More information is available on the official 2025 BEHAVIOR Challenge website.
The benchmark includes 1,000 everyday household activities covering diverse behaviors across:
50 fully interactive scenes with house-scale layouts
10,000+ richly annotated objects
The simulation environment supports:
The benchmark includes 10,000 human-demonstrated trajectories with diverse behaviors across all task categories. Each demonstration contains:
Participants have access to training and evaluation pipelines for these baseline methods:
Agents are evaluated across three areas: