diff --git a/404/index.html b/404/index.html index 9182f3f6..b2763b68 100644 --- a/404/index.html +++ b/404/index.html @@ -1 +1 @@ -

404 - Not Found


Go back home
\ No newline at end of file +

404 - Not Found


Go back home
\ No newline at end of file diff --git a/_next/data/baH8RYLXGx1d8VufpT6wE/about.json b/_next/data/baH8RYLXGx1d8VufpT6wE/about.json new file mode 100644 index 00000000..5dcb5fdf --- /dev/null +++ b/_next/data/baH8RYLXGx1d8VufpT6wE/about.json @@ -0,0 +1 @@ +{"pageProps":{"frontmatter":{"title":"About"},"content":"\nLarge Model Systems Organization (LMSYS Org) is an open research organization founded by students and faculty from UC Berkeley in collaboration with Stanford, UCSD, and CMU.\n\nWe aim to make large models accessible to everyone by co-development of open models, datasets, systems, and evaluation tools. Our work encompasses research in both machine learning and systems. We train large language models and make them widely available, while also developing distributed systems to accelerate their training and inference.\n\n### Members\n[Lianmin Zheng](https://lmzheng.net/), [Ying Sheng](https://sites.google.com/view/yingsheng/home), [Wei-Lin Chiang](https://infwinston.github.io/), [Lisa Dunlap](https://lisabdunlap.com), [Shiyi Cao](https://shiyicao.com/), [Tianle Li](https://codingwithtim.github.io/), [Christopher Chou](https://github.com/BabyChouSr), [Evan Frick](https://efrick2002.github.io/), [Isaac Ong](https://isaacong.me), [Dacheng Li](https://dachengli1.github.io/), [Zhuohan Li](https://people.eecs.berkeley.edu/~zhuohan/), [Zi Lin](https://zi-lin.com/), [Zhanghao Wu](https://zhanghaowu.me/), [Shuo Yang](https://github.com/andy-yang-1), [Yineng Zhang](https://zhyncs.com/), [Siyuan Zhuang](https://github.com/suquark), [Yonghao Zhuang](https://github.com/ZYHowell)\n\n#### Advisors\n[Joseph E. Gonzalez](https://people.eecs.berkeley.edu/~jegonzal/), [Ion Stoica](https://people.eecs.berkeley.edu/~istoica/), [Eric P. Xing](http://www.cs.cmu.edu/~epxing/), [Hao Zhang](https://people.eecs.berkeley.edu/~hao/), [Trevor Darrell](https://people.eecs.berkeley.edu/~trevor/)\n\n#### Institutions\nUC Berkeley, Stanford, UCSD, CMU, MBZUAI\n\n### Contact us\n- Email us at [lmsys.org@gmail.com](mailto:lmsysorg@gmail.com).\n- Join us on [discord](https://discord.com/invite/HSWAKCrnFx).\n- Follow us on [twitter](https://twitter.com/lmsysorg).\n"},"__N_SSG":true} \ No newline at end of file diff --git a/_next/data/tcCHXShxVCzo9Qciur9ha/blog.json b/_next/data/baH8RYLXGx1d8VufpT6wE/blog.json similarity index 100% rename from _next/data/tcCHXShxVCzo9Qciur9ha/blog.json rename to _next/data/baH8RYLXGx1d8VufpT6wE/blog.json diff --git a/_next/data/tcCHXShxVCzo9Qciur9ha/blog/2023-03-30-vicuna.json b/_next/data/baH8RYLXGx1d8VufpT6wE/blog/2023-03-30-vicuna.json similarity index 100% rename from _next/data/tcCHXShxVCzo9Qciur9ha/blog/2023-03-30-vicuna.json rename to _next/data/baH8RYLXGx1d8VufpT6wE/blog/2023-03-30-vicuna.json diff --git a/_next/data/tcCHXShxVCzo9Qciur9ha/blog/2023-05-03-arena.json b/_next/data/baH8RYLXGx1d8VufpT6wE/blog/2023-05-03-arena.json similarity index 100% rename from _next/data/tcCHXShxVCzo9Qciur9ha/blog/2023-05-03-arena.json rename to _next/data/baH8RYLXGx1d8VufpT6wE/blog/2023-05-03-arena.json diff --git a/_next/data/tcCHXShxVCzo9Qciur9ha/blog/2023-05-10-leaderboard.json b/_next/data/baH8RYLXGx1d8VufpT6wE/blog/2023-05-10-leaderboard.json similarity index 100% rename from _next/data/tcCHXShxVCzo9Qciur9ha/blog/2023-05-10-leaderboard.json rename to _next/data/baH8RYLXGx1d8VufpT6wE/blog/2023-05-10-leaderboard.json diff --git a/_next/data/tcCHXShxVCzo9Qciur9ha/blog/2023-05-25-leaderboard.json b/_next/data/baH8RYLXGx1d8VufpT6wE/blog/2023-05-25-leaderboard.json similarity index 100% rename from _next/data/tcCHXShxVCzo9Qciur9ha/blog/2023-05-25-leaderboard.json rename to _next/data/baH8RYLXGx1d8VufpT6wE/blog/2023-05-25-leaderboard.json diff --git a/_next/data/tcCHXShxVCzo9Qciur9ha/blog/2023-06-09-api-server.json b/_next/data/baH8RYLXGx1d8VufpT6wE/blog/2023-06-09-api-server.json similarity index 100% rename from _next/data/tcCHXShxVCzo9Qciur9ha/blog/2023-06-09-api-server.json rename to _next/data/baH8RYLXGx1d8VufpT6wE/blog/2023-06-09-api-server.json diff --git a/_next/data/tcCHXShxVCzo9Qciur9ha/blog/2023-06-22-leaderboard.json b/_next/data/baH8RYLXGx1d8VufpT6wE/blog/2023-06-22-leaderboard.json similarity index 100% rename from _next/data/tcCHXShxVCzo9Qciur9ha/blog/2023-06-22-leaderboard.json rename to _next/data/baH8RYLXGx1d8VufpT6wE/blog/2023-06-22-leaderboard.json diff --git a/_next/data/tcCHXShxVCzo9Qciur9ha/blog/2023-06-29-longchat.json b/_next/data/baH8RYLXGx1d8VufpT6wE/blog/2023-06-29-longchat.json similarity index 100% rename from _next/data/tcCHXShxVCzo9Qciur9ha/blog/2023-06-29-longchat.json rename to _next/data/baH8RYLXGx1d8VufpT6wE/blog/2023-06-29-longchat.json diff --git a/_next/data/tcCHXShxVCzo9Qciur9ha/blog/2023-07-20-dataset.json b/_next/data/baH8RYLXGx1d8VufpT6wE/blog/2023-07-20-dataset.json similarity index 100% rename from _next/data/tcCHXShxVCzo9Qciur9ha/blog/2023-07-20-dataset.json rename to _next/data/baH8RYLXGx1d8VufpT6wE/blog/2023-07-20-dataset.json diff --git a/_next/data/tcCHXShxVCzo9Qciur9ha/blog/2023-10-30-toxicchat.json b/_next/data/baH8RYLXGx1d8VufpT6wE/blog/2023-10-30-toxicchat.json similarity index 100% rename from _next/data/tcCHXShxVCzo9Qciur9ha/blog/2023-10-30-toxicchat.json rename to _next/data/baH8RYLXGx1d8VufpT6wE/blog/2023-10-30-toxicchat.json diff --git a/_next/data/tcCHXShxVCzo9Qciur9ha/blog/2023-11-14-llm-decontaminator.json b/_next/data/baH8RYLXGx1d8VufpT6wE/blog/2023-11-14-llm-decontaminator.json similarity index 100% rename from _next/data/tcCHXShxVCzo9Qciur9ha/blog/2023-11-14-llm-decontaminator.json rename to _next/data/baH8RYLXGx1d8VufpT6wE/blog/2023-11-14-llm-decontaminator.json diff --git a/_next/data/tcCHXShxVCzo9Qciur9ha/blog/2023-11-15-slora.json b/_next/data/baH8RYLXGx1d8VufpT6wE/blog/2023-11-15-slora.json similarity index 100% rename from _next/data/tcCHXShxVCzo9Qciur9ha/blog/2023-11-15-slora.json rename to _next/data/baH8RYLXGx1d8VufpT6wE/blog/2023-11-15-slora.json diff --git a/_next/data/tcCHXShxVCzo9Qciur9ha/blog/2023-11-21-lookahead-decoding.json b/_next/data/baH8RYLXGx1d8VufpT6wE/blog/2023-11-21-lookahead-decoding.json similarity index 100% rename from _next/data/tcCHXShxVCzo9Qciur9ha/blog/2023-11-21-lookahead-decoding.json rename to _next/data/baH8RYLXGx1d8VufpT6wE/blog/2023-11-21-lookahead-decoding.json diff --git a/_next/data/tcCHXShxVCzo9Qciur9ha/blog/2023-12-07-leaderboard.json b/_next/data/baH8RYLXGx1d8VufpT6wE/blog/2023-12-07-leaderboard.json similarity index 100% rename from _next/data/tcCHXShxVCzo9Qciur9ha/blog/2023-12-07-leaderboard.json rename to _next/data/baH8RYLXGx1d8VufpT6wE/blog/2023-12-07-leaderboard.json diff --git a/_next/data/tcCHXShxVCzo9Qciur9ha/blog/2024-01-17-sglang.json b/_next/data/baH8RYLXGx1d8VufpT6wE/blog/2024-01-17-sglang.json similarity index 100% rename from _next/data/tcCHXShxVCzo9Qciur9ha/blog/2024-01-17-sglang.json rename to _next/data/baH8RYLXGx1d8VufpT6wE/blog/2024-01-17-sglang.json diff --git a/_next/data/tcCHXShxVCzo9Qciur9ha/blog/2024-02-05-compressed-fsm.json b/_next/data/baH8RYLXGx1d8VufpT6wE/blog/2024-02-05-compressed-fsm.json similarity index 100% rename from _next/data/tcCHXShxVCzo9Qciur9ha/blog/2024-02-05-compressed-fsm.json rename to _next/data/baH8RYLXGx1d8VufpT6wE/blog/2024-02-05-compressed-fsm.json diff --git a/_next/data/tcCHXShxVCzo9Qciur9ha/blog/2024-03-01-policy.json b/_next/data/baH8RYLXGx1d8VufpT6wE/blog/2024-03-01-policy.json similarity index 100% rename from _next/data/tcCHXShxVCzo9Qciur9ha/blog/2024-03-01-policy.json rename to _next/data/baH8RYLXGx1d8VufpT6wE/blog/2024-03-01-policy.json diff --git a/_next/data/tcCHXShxVCzo9Qciur9ha/blog/2024-04-19-arena-hard.json b/_next/data/baH8RYLXGx1d8VufpT6wE/blog/2024-04-19-arena-hard.json similarity index 100% rename from _next/data/tcCHXShxVCzo9Qciur9ha/blog/2024-04-19-arena-hard.json rename to _next/data/baH8RYLXGx1d8VufpT6wE/blog/2024-04-19-arena-hard.json diff --git a/_next/data/tcCHXShxVCzo9Qciur9ha/blog/2024-05-02-kaggle-competition.json b/_next/data/baH8RYLXGx1d8VufpT6wE/blog/2024-05-02-kaggle-competition.json similarity index 100% rename from _next/data/tcCHXShxVCzo9Qciur9ha/blog/2024-05-02-kaggle-competition.json rename to _next/data/baH8RYLXGx1d8VufpT6wE/blog/2024-05-02-kaggle-competition.json diff --git a/_next/data/tcCHXShxVCzo9Qciur9ha/blog/2024-05-08-llama3.json b/_next/data/baH8RYLXGx1d8VufpT6wE/blog/2024-05-08-llama3.json similarity index 100% rename from _next/data/tcCHXShxVCzo9Qciur9ha/blog/2024-05-08-llama3.json rename to _next/data/baH8RYLXGx1d8VufpT6wE/blog/2024-05-08-llama3.json diff --git a/_next/data/tcCHXShxVCzo9Qciur9ha/blog/2024-05-17-category-hard.json b/_next/data/baH8RYLXGx1d8VufpT6wE/blog/2024-05-17-category-hard.json similarity index 100% rename from _next/data/tcCHXShxVCzo9Qciur9ha/blog/2024-05-17-category-hard.json rename to _next/data/baH8RYLXGx1d8VufpT6wE/blog/2024-05-17-category-hard.json diff --git a/_next/data/tcCHXShxVCzo9Qciur9ha/blog/2024-06-27-multimodal.json b/_next/data/baH8RYLXGx1d8VufpT6wE/blog/2024-06-27-multimodal.json similarity index 100% rename from _next/data/tcCHXShxVCzo9Qciur9ha/blog/2024-06-27-multimodal.json rename to _next/data/baH8RYLXGx1d8VufpT6wE/blog/2024-06-27-multimodal.json diff --git a/_next/data/tcCHXShxVCzo9Qciur9ha/blog/2024-07-01-routellm.json b/_next/data/baH8RYLXGx1d8VufpT6wE/blog/2024-07-01-routellm.json similarity index 100% rename from _next/data/tcCHXShxVCzo9Qciur9ha/blog/2024-07-01-routellm.json rename to _next/data/baH8RYLXGx1d8VufpT6wE/blog/2024-07-01-routellm.json diff --git a/_next/data/tcCHXShxVCzo9Qciur9ha/blog/2024-07-25-sglang-llama3.json b/_next/data/baH8RYLXGx1d8VufpT6wE/blog/2024-07-25-sglang-llama3.json similarity index 100% rename from _next/data/tcCHXShxVCzo9Qciur9ha/blog/2024-07-25-sglang-llama3.json rename to _next/data/baH8RYLXGx1d8VufpT6wE/blog/2024-07-25-sglang-llama3.json diff --git a/_next/data/tcCHXShxVCzo9Qciur9ha/blog/2024-08-28-style-control.json b/_next/data/baH8RYLXGx1d8VufpT6wE/blog/2024-08-28-style-control.json similarity index 100% rename from _next/data/tcCHXShxVCzo9Qciur9ha/blog/2024-08-28-style-control.json rename to _next/data/baH8RYLXGx1d8VufpT6wE/blog/2024-08-28-style-control.json diff --git a/_next/data/tcCHXShxVCzo9Qciur9ha/blog/2024-09-04-sglang-v0-3.json b/_next/data/baH8RYLXGx1d8VufpT6wE/blog/2024-09-04-sglang-v0-3.json similarity index 100% rename from _next/data/tcCHXShxVCzo9Qciur9ha/blog/2024-09-04-sglang-v0-3.json rename to _next/data/baH8RYLXGx1d8VufpT6wE/blog/2024-09-04-sglang-v0-3.json diff --git a/_next/data/tcCHXShxVCzo9Qciur9ha/donations.json b/_next/data/baH8RYLXGx1d8VufpT6wE/donations.json similarity index 100% rename from _next/data/tcCHXShxVCzo9Qciur9ha/donations.json rename to _next/data/baH8RYLXGx1d8VufpT6wE/donations.json diff --git a/_next/data/tcCHXShxVCzo9Qciur9ha/vicuna_eval.json b/_next/data/baH8RYLXGx1d8VufpT6wE/vicuna_eval.json similarity index 100% rename from _next/data/tcCHXShxVCzo9Qciur9ha/vicuna_eval.json rename to _next/data/baH8RYLXGx1d8VufpT6wE/vicuna_eval.json diff --git a/_next/data/tcCHXShxVCzo9Qciur9ha/about.json b/_next/data/tcCHXShxVCzo9Qciur9ha/about.json deleted file mode 100644 index 8962e2dc..00000000 --- a/_next/data/tcCHXShxVCzo9Qciur9ha/about.json +++ /dev/null @@ -1 +0,0 @@ -{"pageProps":{"frontmatter":{"title":"About"},"content":"\nLarge Model Systems Organization (LMSYS Org) is an open research organization founded by students and faculty from UC Berkeley in collaboration with UCSD and CMU.\n\nWe aim to make large models accessible to everyone by co-development of open models, datasets, systems, and evaluation tools. Our work encompasses research in both machine learning and systems. We train large language models and make them widely available, while also developing distributed systems to accelerate their training and inference.\n\n### Members\n[Lianmin Zheng](https://lmzheng.net/), [Ying Sheng](https://sites.google.com/view/yingsheng/home), [Wei-Lin Chiang](https://infwinston.github.io/), [Lisa Dunlap](https://lisabdunlap.com), [Shiyi Cao](https://shiyicao.com/), [Tianle Li](https://codingwithtim.github.io/), [Christopher Chou](https://github.com/BabyChouSr), [Evan Frick](https://efrick2002.github.io/), [Isaac Ong](https://isaacong.me), [Dacheng Li](https://dachengli1.github.io/), [Zhuohan Li](https://people.eecs.berkeley.edu/~zhuohan/), [Zi Lin](https://zi-lin.com/), [Zhanghao Wu](https://zhanghaowu.me/), [Shuo Yang](https://github.com/andy-yang-1), [Yineng Zhang](https://zhyncs.com/), [Siyuan Zhuang](https://github.com/suquark), [Yonghao Zhuang](https://github.com/ZYHowell)\n\n#### Advisors\n[Joseph E. Gonzalez](https://people.eecs.berkeley.edu/~jegonzal/), [Ion Stoica](https://people.eecs.berkeley.edu/~istoica/), [Eric P. Xing](http://www.cs.cmu.edu/~epxing/), [Hao Zhang](https://people.eecs.berkeley.edu/~hao/), [Trevor Darrell](https://people.eecs.berkeley.edu/~trevor/)\n\n#### Institutions\nUC Berkeley, UCSD, CMU, MBZUAI\n\n### Contact us\n- Email us at [lmsys.org@gmail.com](mailto:lmsysorg@gmail.com).\n- Join us on [discord](https://discord.com/invite/HSWAKCrnFx).\n- Follow us on [twitter](https://twitter.com/lmsysorg).\n"},"__N_SSG":true} \ No newline at end of file diff --git a/_next/static/tcCHXShxVCzo9Qciur9ha/_buildManifest.js b/_next/static/baH8RYLXGx1d8VufpT6wE/_buildManifest.js similarity index 100% rename from _next/static/tcCHXShxVCzo9Qciur9ha/_buildManifest.js rename to _next/static/baH8RYLXGx1d8VufpT6wE/_buildManifest.js diff --git a/_next/static/tcCHXShxVCzo9Qciur9ha/_middlewareManifest.js b/_next/static/baH8RYLXGx1d8VufpT6wE/_middlewareManifest.js similarity index 100% rename from _next/static/tcCHXShxVCzo9Qciur9ha/_middlewareManifest.js rename to _next/static/baH8RYLXGx1d8VufpT6wE/_middlewareManifest.js diff --git a/_next/static/baH8RYLXGx1d8VufpT6wE/_ssgManifest.js b/_next/static/baH8RYLXGx1d8VufpT6wE/_ssgManifest.js new file mode 100644 index 00000000..13dd29c3 --- /dev/null +++ b/_next/static/baH8RYLXGx1d8VufpT6wE/_ssgManifest.js @@ -0,0 +1 @@ +self.__SSG_MANIFEST=new Set(["\u002Fabout","\u002Fdonations","\u002Fblog","\u002Fvicuna_eval","\u002Fblog\u002F[slug]"]);self.__SSG_MANIFEST_CB&&self.__SSG_MANIFEST_CB() \ No newline at end of file diff --git a/_next/static/tcCHXShxVCzo9Qciur9ha/_ssgManifest.js b/_next/static/tcCHXShxVCzo9Qciur9ha/_ssgManifest.js deleted file mode 100644 index 73959270..00000000 --- a/_next/static/tcCHXShxVCzo9Qciur9ha/_ssgManifest.js +++ /dev/null @@ -1 +0,0 @@ -self.__SSG_MANIFEST=new Set(["\u002Fabout","\u002Fvicuna_eval","\u002Fdonations","\u002Fblog","\u002Fblog\u002F[slug]"]);self.__SSG_MANIFEST_CB&&self.__SSG_MANIFEST_CB() \ No newline at end of file diff --git a/about/index.html b/about/index.html index bff188ca..16241cf9 100644 --- a/about/index.html +++ b/about/index.html @@ -1,15 +1,15 @@ -About | LMSYS Org

ABOUT


Large Model Systems Organization (LMSYS Org) is an open research organization founded by students and faculty from UC Berkeley in collaboration with UCSD and CMU.

+About | LMSYS Org

ABOUT


Large Model Systems Organization (LMSYS Org) is an open research organization founded by students and faculty from UC Berkeley in collaboration with Stanford, UCSD, and CMU.

We aim to make large models accessible to everyone by co-development of open models, datasets, systems, and evaluation tools. Our work encompasses research in both machine learning and systems. We train large language models and make them widely available, while also developing distributed systems to accelerate their training and inference.

Members

Lianmin Zheng, Ying Sheng, Wei-Lin Chiang, Lisa Dunlap, Shiyi Cao, Tianle Li, Christopher Chou, Evan Frick, Isaac Ong, Dacheng Li, Zhuohan Li, Zi Lin, Zhanghao Wu, Shuo Yang, Yineng Zhang, Siyuan Zhuang, Yonghao Zhuang

Advisors

Joseph E. Gonzalez, Ion Stoica, Eric P. Xing, Hao Zhang, Trevor Darrell

Institutions

-

UC Berkeley, UCSD, CMU, MBZUAI

+

UC Berkeley, Stanford, UCSD, CMU, MBZUAI

Contact us

-
\ No newline at end of file +
\ No newline at end of file diff --git a/blog/2023-03-30-vicuna/index.html b/blog/2023-03-30-vicuna/index.html index 3b74e835..1539123a 100644 --- a/blog/2023-03-30-vicuna/index.html +++ b/blog/2023-03-30-vicuna/index.html @@ -1,4 +1,4 @@ -Vicuna: An Open-Source Chatbot Impressing GPT-4 with 90%* ChatGPT Quality | LMSYS Org

Vicuna: An Open-Source Chatbot Impressing GPT-4 with 90%* ChatGPT Quality

by: The Vicuna Team, Mar 30, 2023


We introduce Vicuna-13B, an open-source chatbot trained by fine-tuning LLaMA on user-shared conversations collected from ShareGPT. Preliminary evaluation using GPT-4 as a judge shows Vicuna-13B achieves more than 90%* quality of OpenAI ChatGPT and Google Bard while outperforming other models like LLaMA and Stanford Alpaca in more than 90%* of cases. The cost of training Vicuna-13B is around $300. The code and weights, along with an online demo, are publicly available for non-commercial use.

+Vicuna: An Open-Source Chatbot Impressing GPT-4 with 90%* ChatGPT Quality | LMSYS Org

Vicuna: An Open-Source Chatbot Impressing GPT-4 with 90%* ChatGPT Quality

by: The Vicuna Team, Mar 30, 2023


We introduce Vicuna-13B, an open-source chatbot trained by fine-tuning LLaMA on user-shared conversations collected from ShareGPT. Preliminary evaluation using GPT-4 as a judge shows Vicuna-13B achieves more than 90%* quality of OpenAI ChatGPT and Google Bard while outperforming other models like LLaMA and Stanford Alpaca in more than 90%* of cases. The cost of training Vicuna-13B is around $300. The code and weights, along with an online demo, are publicly available for non-commercial use.

Vicuna (generated by stable diffusion 2.1)

*According to a fun and non-scientific evaluation with GPT-4. Further rigorous evaluation is needed.

@@ -171,4 +171,4 @@

Judging LLM-as-a-judge with MT-Bench and Chatbot Arena.

-

\ No newline at end of file +
\ No newline at end of file diff --git a/blog/2023-05-03-arena/index.html b/blog/2023-05-03-arena/index.html index 4ad23d58..f1e65097 100644 --- a/blog/2023-05-03-arena/index.html +++ b/blog/2023-05-03-arena/index.html @@ -1,4 +1,4 @@ -Chatbot Arena: Benchmarking LLMs in the Wild with Elo Ratings | LMSYS Org

Chatbot Arena: Benchmarking LLMs in the Wild with Elo Ratings

by: Lianmin Zheng*, Ying Sheng*, Wei-Lin Chiang, Hao Zhang, Joseph E. Gonzalez, Ion Stoica, May 03, 2023


We present Chatbot Arena, a benchmark platform for large language models (LLMs) that features anonymous, randomized battles in a crowdsourced manner. In this blog post, we are releasing our initial results and a leaderboard based on the Elo rating system, which is a widely-used rating system in chess and other competitive games. We invite the entire community to join this effort by contributing new models and evaluating them by asking questions and voting for your favorite answer.

+Chatbot Arena: Benchmarking LLMs in the Wild with Elo Ratings | LMSYS Org

Chatbot Arena: Benchmarking LLMs in the Wild with Elo Ratings

by: Lianmin Zheng*, Ying Sheng*, Wei-Lin Chiang, Hao Zhang, Joseph E. Gonzalez, Ion Stoica, May 03, 2023


We present Chatbot Arena, a benchmark platform for large language models (LLMs) that features anonymous, randomized battles in a crowdsourced manner. In this blog post, we are releasing our initial results and a leaderboard based on the Elo rating system, which is a widely-used rating system in chess and other competitive games. We invite the entire community to join this effort by contributing new models and evaluating them by asking questions and voting for your favorite answer.

-
\ No newline at end of file +
\ No newline at end of file