Skip to content
Change the repository type filter

All

    Repositories list

    • jan

      Public
      Jan is an open source alternative to ChatGPT that runs 100% offline on your computer. Multiple engine support (llama.cpp, TensorRT-LLM)
      TypeScript
      GNU Affero General Public License v3.0
      1.4k23k1618Updated Nov 8, 2024Nov 8, 2024
    • Local AI API Platform
      C++
      Apache License 2.0
      1162.1k1077Updated Nov 8, 2024Nov 8, 2024
    • cortex.llamacpp is a high-efficiency C++ inference engine for edge computing. It is a dynamic library that can be loaded by any server at runtime.
      C++
      GNU Affero General Public License v3.0
      31923Updated Nov 8, 2024Nov 8, 2024
    • models

      Public
      Models support in Jan and Cortex
      Python
      MIT License
      25150Updated Nov 1, 2024Nov 1, 2024
    • For Remote API issues
      0000Updated Oct 31, 2024Oct 31, 2024
    • C++ code that run Python embedding
      C++
      GNU Affero General Public License v3.0
      0411Updated Oct 17, 2024Oct 17, 2024
    • docs

      Public archive
      Jan.ai Website & Documentation
      MDX
      821501Updated Oct 14, 2024Oct 14, 2024
    • cortex.so

      Public archive
      TypeScript
      2432Updated Oct 9, 2024Oct 9, 2024
    • Cortex.Tensorrt-LLM is a C++ inference library that can be loaded by any server at runtime. It submodules NVIDIA’s TensorRT-LLM for GPU accelerated inference on NVIDIA's GPUs.
      C++
      Apache License 2.0
      9783973Updated Sep 26, 2024Sep 26, 2024
    • 0000Updated Aug 23, 2024Aug 23, 2024
    • Ruby
      0000Updated Aug 22, 2024Aug 22, 2024
    • ppa

      Public
      0000Updated Aug 21, 2024Aug 21, 2024
    • C++
      GNU Affero General Public License v3.0
      0220Updated Aug 13, 2024Aug 13, 2024
    • C++
      0621Updated Aug 12, 2024Aug 12, 2024
    • cortex.js

      Public
      The official Node.js / Typescript library for the OpenAI API
      TypeScript
      Apache License 2.0
      8623019Updated Aug 9, 2024Aug 9, 2024
    • C++
      0000Updated Jul 9, 2024Jul 9, 2024
    • Ruby
      0100Updated Jul 5, 2024Jul 5, 2024
    • cortex.py

      Public
      The official Python library for the OpenAI API
      Python
      Apache License 2.0
      3.2k201Updated May 20, 2024May 20, 2024
    • pymaker

      Public archive
      Make the py
      0000Updated Apr 9, 2024Apr 9, 2024
    • tensorrtllm_backend

      Public archive
      The Triton TensorRT-LLM Backend
      Python
      Apache License 2.0
      104000Updated Mar 19, 2024Mar 19, 2024
    • triton_tensorrt_llm

      Public archive
      Shell
      0100Updated Mar 15, 2024Mar 15, 2024
    • openai_trtllm

      Public archive
      OpenAI compatible API for TensorRT LLM triton backend
      Rust
      MIT License
      25000Updated Mar 15, 2024Mar 15, 2024
    • This reference can be used with any existing OpenAI integrated apps to run with TRT-LLM inference locally on GeForce GPU on Windows instead of cloud.
      Python
      Other
      12000Updated Mar 8, 2024Mar 8, 2024
    • architecture

      Public archive
      1100Updated Feb 28, 2024Feb 28, 2024
    • llama.cpp-avx-vnni

      Public archive
      Port of Facebook's LLaMA model in C/C++
      C++
      MIT License
      9.7k000Updated Feb 19, 2024Feb 19, 2024
    • infinity

      Public archive
      The AI-native database built for LLM applications, providing incredibly fast vector and full-text search
      C++
      Apache License 2.0
      273000Updated Feb 19, 2024Feb 19, 2024
    • langchainjs

      Public archive
      TypeScript
      MIT License
      2.2k000Updated Feb 19, 2024Feb 19, 2024
    • model-converter

      Public archive
      Python
      52020Updated Dec 14, 2023Dec 14, 2023
    • JavaScript
      GNU Affero General Public License v3.0
      0100Updated Nov 29, 2023Nov 29, 2023
    • py-nitro

      Public
      0000Updated Nov 16, 2023Nov 16, 2023