refactor: Refactor nitro with cortext.tensorrtllm engine #37

Closed
wants to merge 78 commits into from

Conversation

@CameronNg commented May 28, 2024

  • Refactor nitro to cortex.tensorrt-llm
  • Add code for cortex.tensorrt-llm engine
  • Add code for example server
  • Add script for build flow and e2e testing

dan-homebrew and others added 30 commits March 8, 2024 14:22
WIP: Draft README for `nitro-tensorrt-llm`
[Release] Implement Nitro on top of TensorRT-LLM C++ repo
docs: add instructions for pointing jan fe to the engine
README: add Quickstart, Model Load and Inference Request docs
Add Package Contents, and README
@CameronNg CameronNg marked this pull request as ready for review May 29, 2024 09:33
6 participants