CogArena

A benchmark for testing AI agents on behavioral experiments.

Kianté Fernandez · University of California, Los Angeles — Department of Psychology

What is CogArena?

CogArena is a benchmark that tests AI agents on behavioral experiments through a web browser. Tasks are sampled widely across cognitive science, from attention and memory to decision-making, reinforcement learning, and social cognition.

Agents must interpret visual stimuli and provide responses, just as a human participant in an experiment would.

Citation

@article{fernandez2025cogarena,
  title     = {CogArena: Benchmarking AI Agents on Interactive Behavioral Experiments},
  author    = {Fernandez, Kiant{\'e} and Chen, Caitlin and Zhou, Grace and Miceli, Anthony and Sadowski, Bartek and Krajbich, Ian},
  year      = {2025},
  url       = {https://cog-arena.vercel.app}
}