Change the repository type filter
All
Repositories list
22 repositories
- Holistic Evaluation of Language Models (HELM) is an open source Python framework created by the Center for Research on Foundation Models (CRFM) at Stanford for holistic, reproducible and transparent evaluation of foundation models, including large language models (LLMs) and multimodal models.
levanter
Publiclm-evaluation-harness
Publicimage2struct
Publicchatnoir-resiliparse
Publicair-bench-2024
Publicfmti
Publicdata-overlap
Publichelm-efficiency
Publichalie
Publicmistral
Publicmosaicml-benchmarks
PublicEUAIActJune15
PublicBioMedLM
Publiccomposer
Publicsprucfluo
Publictransformers
Publicjanus
Public