Skip to content
@nndeploy

AI推理部署加速

Pinned Loading

  1. nndeploy nndeploy Public

    nndeploy是一款模型端到端部署框架。以多端推理以及基于有向无环图模型部署为基础,致力为用户提供跨平台、简单易用、高性能的模型部署体验。

    C++ 668 102

Repositories

Showing 6 of 6 repositories
  • nndeploy Public

    nndeploy是一款模型端到端部署框架。以多端推理以及基于有向无环图模型部署为基础,致力为用户提供跨平台、简单易用、高性能的模型部署体验。

    nndeploy/nndeploy’s past year of commit activity
    C++ 668 Apache-2.0 102 8 0 Updated Dec 26, 2024
  • safetensors-cpp Public Forked from syoyo/safetensors-cpp

    Header-only safetensors loader and saver in C++

    nndeploy/safetensors-cpp’s past year of commit activity
    C++ 0 MIT 8 0 0 Updated Nov 19, 2024
  • onnx-llm Public Forked from wangzhaode/onnx-llm

    llm deploy project based onnx.

    nndeploy/onnx-llm’s past year of commit activity
    C++ 0 Apache-2.0 4 0 0 Updated Oct 9, 2024
  • tokenizers-cpp Public Forked from mlc-ai/tokenizers-cpp

    Universal cross-platform tokenizers binding to HF and sentencepiece

    nndeploy/tokenizers-cpp’s past year of commit activity
    C++ 1 Apache-2.0 66 0 0 Updated Jun 3, 2024
  • Awesome-LLM-Inference Public Forked from DefTruth/Awesome-LLM-Inference

    💻A small Collection for Awesome LLM Inference [Papers|Blogs|Docs] with codes, contains TensorRT-LLM, streaming-llm, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.

    nndeploy/Awesome-LLM-Inference’s past year of commit activity
    2 GPL-3.0 208 0 0 Updated Dec 3, 2023
  • onnx-simplifier Public Forked from daquexian/onnx-simplifier

    Simplify your onnx model

    nndeploy/onnx-simplifier’s past year of commit activity
    Python 1 Apache-2.0 395 0 0 Updated Apr 27, 2022

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…

Most used topics

Loading…