Hyungson / ko_chatbot_arena Public

Notifications You must be signed in to change notification settings
Fork 1
Star 1

Open KO Chatbot Leaderboard using Langchain, Gradio, API

1 star 1 fork Branches Tags Activity

Notifications

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
data		data
image		image
.DS_Store		.DS_Store
README.md		README.md
chat_arena.py		chat_arena.py
chatbot_arena_leaderboard.py		chatbot_arena_leaderboard.py
gradio_app_fn.py		gradio_app_fn.py
llm_api.py		llm_api.py
main.py		main.py
rag_arena.py		rag_arena.py

Repository files navigation

KO Chatbot Arena(RAG) Leaderboard

해결하고 싶은 문제

한국어 LLM 리더보드 성능이 높은데, 한국어를 잘 못하는 모델이 많음.
내가 개발하려는 서비스에 어떤 모델의 적합할지, LLM 리더보드로는 알수 없음.

해결방안

과거에 만들어진 test set이 아닌 현재 사용자의 경험을 통해 평가 받는 리더보드
서비스에 자주 사용 기능들을 각각 평가

구현 방안

기능 별 블라인드 테스트
테스트 데이터 누적 후 RAW 데이터 허깅페이스에 공개 -> 자동화
Elo Rating System 기반 리더보드 구축

Demo

한국어 LLM chatbot arena
한국어 RAG arena
ELO rating 리더보드

reference

lmsys chatbot arena

https://huggingface.co/spaces/lmsys/chatbot-arena-leaderboard

About

Open KO Chatbot Leaderboard using Langchain, Gradio, API

Report repository

Releases

No releases published

Packages

No packages published

Languages

Python 100.0%