Stanford Center for Research on Foundation Models

All

22 repositories

helm
Public
Holistic Evaluation of Language Models (HELM) is an open source Python framework created by the Center for Research on Foundation Models (CRFM) at Stanford for holistic, reproducible and transparent evaluation of foundation models, including large language models (LLMs) and multimodal models.
Python
•
Apache License 2.0
•326•2.4k•139•15•Updated Aug 19, 2025Aug 19, 2025
haliax
Public
Named Tensors for Legible Deep Learning in JAX
Python
•
Apache License 2.0
•19•201•22•14•Updated Aug 19, 2025Aug 19, 2025
levanter
Public
Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax
Python
•
Apache License 2.0
•109•644•56•24•Updated Aug 19, 2025Aug 19, 2025
lm-evaluation-harness
Public
Fork of lm-evaluation-harness
Python
•
MIT License
•0•4•0•0•Updated Jul 24, 2025Jul 24, 2025
image2struct
Public
A Benchmark for Evaluating Vision-Language Models in extracting Structured Information from Images
Python
•
Apache License 2.0
•0•8•0•0•Updated Apr 17, 2025Apr 17, 2025
chatnoir-resiliparse
Public
A robust web archive analytics toolkit
Cython
•
Apache License 2.0
•15•1•0•0•Updated Mar 25, 2025Mar 25, 2025
cc-index-server
Public
Common Crawl Index Server
HTML
•28•1•0•0•Updated Feb 25, 2025Feb 25, 2025
ecosystem-graphs
Public
JavaScript
•36•267•0•0•Updated Jan 24, 2025Jan 24, 2025
air-bench-2024
Public
AIR-Bench 2024 is a safety benchmark that aligns with emerging government regulations and company policies
Jupyter Notebook
•
Apache License 2.0
•3•24•0•0•Updated Aug 14, 2024Aug 14, 2024
fmti
Public
The Foundation Model Transparency Index
Creative Commons Attribution 4.0 International
•8•83•0•1•Updated May 23, 2024May 23, 2024
data-overlap
Public
Python
•0•7•0•0•Updated Mar 27, 2024Mar 27, 2024
helm-efficiency
Public
Jupyter Notebook
•1•10•0•0•Updated Dec 12, 2023Dec 12, 2023
halie
Public
HTML
•
Apache License 2.0
•2•17•0•0•Updated Dec 11, 2023Dec 11, 2023
mistral
Public
Mistral: A strong, northwesterly wind: Framework for transparent and accessible large-scale language model training, built with Hugging Face 🤗 Transformers.
Python
•
Apache License 2.0
•52•573•18•3•Updated Nov 10, 2023Nov 10, 2023
mosaicml-benchmarks
Public
Fast and flexible reference benchmarks
Python
•
Apache License 2.0
•129•0•0•1•Updated Oct 27, 2023Oct 27, 2023
EUAIActJune15
Public
Stanford CRFM's initiative to assess potential compliance with the draft EU AI Act
MIT License
•6•94•1•0•Updated Oct 18, 2023Oct 18, 2023
BioMedLM
Public
Python
•67•633•21•2•Updated Aug 20, 2023Aug 20, 2023
transformers_fsdp_checkpoint_fix
Public
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Python
•
Apache License 2.0
•30k•1•0•0•Updated Jul 31, 2023Jul 31, 2023
composer
Public
Composing methods for ML training efficiency
Python
•
Apache License 2.0
•451•2•0•0•Updated Sep 23, 2022Sep 23, 2022
sprucfluo
Public
Data streaming for LMs. WIP
Python
•
Apache License 2.0
•1•3•0•0•Updated May 10, 2022May 10, 2022
transformers
Public
🤗Transformers: State-of-the-art Natural Language Processing for Pytorch and TensorFlow 2.0.
Python
•
Apache License 2.0
•30k•6•1•0•Updated Sep 28, 2021Sep 28, 2021
janus
Public
A Streamlit interface that's a doorway into GPT-X.
Python
•0•2•0•1•Updated May 23, 2021May 23, 2021