    Repositories list

    • AP5

      Public
      Shell
      0200 Updated Oct 10, 2024
    • A framework for few-shot evaluation of autoregressive language models.
      Python
      MIT License
      1.8k9298 Updated Oct 8, 2024
    • Python
      0000 Updated Oct 7, 2024
    • Ongoing research training transformer models at scale
      Python
      Other
      2.3k401 Updated Aug 29, 2024
    • aqua

      Public
      viz framework for data quality introspection
      Jupyter Notebook
      1100 Updated Apr 14, 2024
    • illuminer

      Public
      Python
      Apache License 2.0
      1600 Updated Mar 28, 2024
    • docs

      Public
      Documentation for platform users
      Jupyter Notebook
      0001 Updated Oct 5, 2023
    • DeepSpeed

      Public
      DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
      Python
      MIT License
      4.1k002 Updated Jul 13, 2023
    • llama

      Public
      Inference code for LLaMA models
      Python
      GNU General Public License v3.0
      9.5k000 Updated Apr 20, 2023
    • Pipeline for pulling and processing online language model pretraining data from the web
      Python
      Apache License 2.0
      23071 Updated Feb 23, 2023
    • Code used for sourcing and cleaning data within the olm-datasets repo
      Jupyter Notebook
      Apache License 2.0
      40000 Updated Feb 23, 2023
    • For testing executing existing data pipelines
      0000 Updated Jan 15, 2023
    • Dockerfile
      0001 Updated Dec 6, 2022
    • Go
      0000 Updated Sep 12, 2022
    • Mustache
      0000 Updated Jun 14, 2022