Skip to content

Commit

Permalink
update (#52)
Browse files Browse the repository at this point in the history
  • Loading branch information
infwinston authored Dec 7, 2023
1 parent e74cfe2 commit f31015f
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion blog/2023-12-07-leaderboard.md
Original file line number Diff line number Diff line change
Expand Up @@ -2,7 +2,7 @@
title: "Chatbot Arena: New models & Elo system update"
author: "Wei-Lin Chiang, Tim Li, Joseph E. Gonzalez, Ion Stoica"
date: "Dec 7, 2023"
previewImg: /images/blog/slora/thumbnail_preview.png
previewImg: /images/blog/leaderboard_202312/mle_elo.png
---

Welcome to our latest update on the Chatbot Arena, our open evaluation platform to test the most advanced LLMs. We're excited to share that over 130,000 votes that are now collected to rank the most capable 40+ models! In this blog post, we'll cover the results of six new models, the transition from the online Elo system to the Bradley-Terry model, which gives us significantly more stable ratings and precise confidence intervals, and our findings from differentiating versions of proprietary models (e.g., GPT-4 => GPT-4-0314, GPT-4-0613).
Expand Down

0 comments on commit f31015f

Please sign in to comment.