Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

LLM Gateway Using LiteLLM #632

Open
vara-bonthu opened this issue Sep 3, 2024 · 0 comments
Open

LLM Gateway Using LiteLLM #632

vara-bonthu opened this issue Sep 3, 2024 · 0 comments
Labels
enhancement New feature or request gen-ai pattern Distributed Training and Inference Patterns for Various Generative AI Large Language Models (LLMs)

Comments

@vara-bonthu
Copy link
Collaborator

Community Note

  • Please vote on this issue by adding a 👍 reaction to the original issue to help the community and maintainers prioritize this request
  • Please do not leave "+1" or other comments that do not add relevant new information or questions, they generate extra noise for issue followers and do not help prioritize the request
  • If you are interested in working on this issue or have submitted a pull request, please leave a comment

This issue proposes the addition of a new Data on EKS blueprint and example pattern to deploy and leverage LiteLLM as an LLM Gateway.

Feature Request:

Develop a blueprint for deploying LiteLLM as an LLM Gateway on EKS.
Provide an example pattern that demonstrates how to integrate and utilize LiteLLM within existing AI/ML workloads on EKS.
Ensure compatibility with existing EKS infrastructure and support for common LLM models.
Include detailed documentation and configuration options to customize the deployment.
Use Cases:

Streamline LLM deployment and management on EKS.
Provide a standardized gateway for accessing multiple LLM models through LiteLLM.
Enhance scalability and flexibility in deploying LLMs on Kubernetes.
Additional Context:

LiteLLM is an emerging tool that provides lightweight, scalable LLM deployment capabilities, making it a suitable choice for cloud-native environments like EKS.
References:

Please feel free to add any comments or suggestions regarding this feature request. Contributions and feedback are highly appreciated!

What is the outcome that you are trying to reach?

Describe the solution you would like

Describe alternatives you have considered

Additional context

@vara-bonthu vara-bonthu added the gen-ai pattern Distributed Training and Inference Patterns for Various Generative AI Large Language Models (LLMs) label Sep 3, 2024
@askulkarni2 askulkarni2 added the enhancement New feature or request label Sep 9, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request gen-ai pattern Distributed Training and Inference Patterns for Various Generative AI Large Language Models (LLMs)
Projects
None yet
Development

No branches or pull requests

2 participants