Support for Nvidia TensorRT #1665

Open
wants to merge 16 commits into
base: main

imartinez
Collaborator

Add support for Nvidia TensorRT LLM

  • Followed the LlamaIndex integration steps here, adapted to PrivateGPT
  • Updated the documentation with instructions on the Installation page
  • Added a ready-to-use settings-tensorrt.yaml config file

Base automatically changed from feature/upgrade-llamaindex to main March 6, 2024 16:51
Contributor

github-actions bot commented Mar 7, 2024

@iukea1

iukea1 commented Mar 15, 2024

You are a damn king sir

@rolandomar

Great work! I was curious and tried to deploy this. I managed to get it running, but now I'm getting this error:
NotImplementedError: Nvidia TensorRT-LLM does not currently support streaming completion.
Is this a limitation of LlamaIndex? Any pointers for a workaround?
Thanks!
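
One possible workaround, sketched under assumptions rather than taken from this PR: wrap the LLM so that a streaming request falls back to a single non-streaming call when streaming is not implemented. The sketch assumes the LLM object exposes `complete(prompt)` and `stream_complete(prompt)` and that responses carry a `.text` attribute, which mirrors the LlamaIndex-style interface. `Chunk` and `stream_with_fallback` are hypothetical names introduced only for illustration.

```python
from dataclasses import dataclass
from typing import Any, Iterator


@dataclass
class Chunk:
    """Hypothetical stand-in for one streamed completion chunk."""

    text: str   # full text produced so far
    delta: str  # text added by this chunk


def stream_with_fallback(llm: Any, prompt: str) -> Iterator[Any]:
    """Yield chunks from llm.stream_complete, or one chunk from llm.complete.

    Assumes (not confirmed by the PR) that `llm` has `complete` and
    `stream_complete` methods and that responses expose `.text`.
    """
    started = False
    try:
        for chunk in llm.stream_complete(prompt):
            started = True
            yield chunk
        return
    except NotImplementedError:
        # Only fall back if nothing was streamed yet; otherwise the caller
        # would receive duplicated output.
        if started:
            raise
    # Fallback: run the blocking completion and emit it as a single chunk.
    response = llm.complete(prompt)
    yield Chunk(text=response.text, delta=response.text)
```

The caller still iterates over chunks as usual, but with a backend that lacks streaming the whole answer arrives at once, so the UI shows no incremental output.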

3 participants