Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

TF Deploy LLM model on SageMaker #4

Open
4 tasks
wesdottoday opened this issue Sep 26, 2023 · 0 comments
Open
4 tasks

TF Deploy LLM model on SageMaker #4

wesdottoday opened this issue Sep 26, 2023 · 0 comments
Assignees
Labels
documentation Improvements or additions to documentation enhancement New feature or request

Comments

@wesdottoday
Copy link
Contributor

  • TF takes the LLM model from S3 and makes it available as an endpoint in SageMaker
  • TF takes the instance type variable and uses that to decide what instance to deploy (default types?)
  • Set the endpoint URI to a variable in TF
  • Documentation of the URI and how to use it

Terraform should take the user's input and deploy the LLM model of choice to SageMaker.

@wesdottoday wesdottoday added documentation Improvements or additions to documentation enhancement New feature or request labels Sep 26, 2023
@wesdottoday wesdottoday self-assigned this Sep 26, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
documentation Improvements or additions to documentation enhancement New feature or request
Projects
None yet
Development

When branches are created from issues, their pull requests are automatically linked.

1 participant