Lm Arena

Tech Crunch - May 1st, 2025
Score 7.2

Study accuses LM Arena of helping top AI labs game its benchmark

AI benchmark fairness questioned: LM Arena accused of bias in leaderboard tests

Tech Crunch - Apr 11th, 2025
Score 7.0

Meta’s vanilla Maverick AI model ranks below rivals on a popular chat benchmark

Meta's Llama 4 Maverick struggles after AI benchmark controversy.

Previous Next

Showing 1 to 2 of 2 results