#llm-benchmarking

[ follow ]
Venture
fromTechCrunch
4 days ago

The leaderboard "you can't game," funded by the companies it ranks | TechCrunch

Arena has become the dominant public leaderboard for evaluating frontier AI models, rapidly growing from a UC Berkeley research project to a $1.7 billion valuation while influencing industry funding, product launches, and competitive dynamics.
[ Load more ]