Normalize input tensor names #59

nnshah1 · 2023-10-09T20:08:23Z

Updating to make names lowercase and change 'prompt' and 'text' to the more generic 'text_input' and 'text_output'.
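As a rough illustration of the renaming described above, here is a minimal sketch, not code from this PR. The helper names `normalize_tensor_name` and `normalize_payload` are hypothetical, the uppercase keys in the usage example are an assumption about how the old names looked, and the pairing of 'text' with 'text_output' is inferred from the order in the description.

```python
# Hypothetical helpers for illustration only; none of these names come from the PR.
# Assumed mapping: 'prompt' -> 'text_input' (per the commit message) and
# 'text' -> 'text_output' (inferred from the order in the PR description).
OLD_TO_NEW = {"prompt": "text_input", "text": "text_output"}


def normalize_tensor_name(name: str) -> str:
    """Lowercase a tensor name, then apply the generic renames."""
    lowered = name.lower()
    return OLD_TO_NEW.get(lowered, lowered)


def normalize_payload(payload: dict) -> dict:
    """Return a copy of a request/response dict with normalized tensor names as keys."""
    return {normalize_tensor_name(key): value for key, value in payload.items()}


if __name__ == "__main__":
    # Assumed old-style, uppercase names; the values are placeholders.
    print(normalize_payload({"PROMPT": "Hello", "STREAM": False}))
```

A mapping like this could help clients that still send the old names during a transition, though the PR itself only renames the tensors in the tutorial model.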

rmccorm4

LGTM, but need to sync with @nv-hwoo @matthewkotila for PA VLLM guide updates, and @pskiran1 for VLLM CI test updates.

L0_http/generate_endpoint_test.py uses a mock model so it can be updated separately, should be unaffected by this.

Creates parity with triton-inference-server/tutorials#59

matthewkotila · 2023-10-09T21:41:11Z

@rmccorm4: LGTM, but need to sync with @nv-hwoo @matthewkotila for PA VLLM guide updates, and @pskiran1 for VLLM CI test updates.

L0_http/generate_endpoint_test.py uses a mock model so it can be updated separately, should be unaffected by this.

triton-inference-server/client#412 ready to go 👍🙏

nv-hwoo

One nit comment but LGTM. I ran the changes with PA LLM guide and confirmed that it works 👍

Quick_Deploy/vLLM/model_repository/vllm/1/model.py

nnshah1 · 2023-10-11T15:09:19Z

@tanmayv25, @jbkyang-nvi let me know whether we want to merge this or close it, since the changes have already merged into the backend.

matthewkotila · 2023-10-12T19:33:05Z

What's still blocking this?

matthewkotila · 2023-10-23T17:05:12Z

@nnshah1 @tanmayv25 @rmccorm4 @jbkyang-nvi Friendly ping on what is blocking this?

nnshah1 · 2023-10-23T17:20:31Z

@matthewkotila: @nnshah1 @tanmayv25 @rmccorm4 @jbkyang-nvi Friendly ping on what is blocking this?

The plan is to move this tutorial to reference the new backend, where the naming changes have already been made. We wanted to stop making changes here to avoid duplication and sync issues.

@matthewkotila Let us know if you see any issues with that approach.

rmccorm4 and others added 2 commits October 9, 2023 12:27

Make stream input optional

6fbc02c

changing prompt to text_input and making all inputs lowercase

dd3676e

nnshah1 requested review from rmccorm4 and tanmayv25 October 9, 2023 20:20

rmccorm4 previously approved these changes Oct 9, 2023

nnshah1 requested a review from nv-hwoo October 9, 2023 21:26

matthewkotila added a commit to triton-inference-server/client that referenced this pull request Oct 9, 2023

Update llm.md

c62f6c2

Creates parity with triton-inference-server/tutorials#59

matthewkotila mentioned this pull request Oct 9, 2023

Update LLM guide and profile.py to use new Triton+vLLM input names triton-inference-server/client#412

Closed

nv-hwoo previously approved these changes Oct 9, 2023

Quick_Deploy/vLLM/model_repository/vllm/1/model.py

Merge branch 'main' into nnshah-input-normalization

94f2337

nnshah1 dismissed stale reviews from nv-hwoo and rmccorm4 via 94f2337 October 9, 2023 21:59

nnshah1 added 2 commits October 9, 2023 15:52

removing whitespace

f0df18a

remove whitespace

f8903ac

nnshah1 requested review from rmccorm4 and nv-hwoo October 9, 2023 22:55

nv-hwoo previously approved these changes Oct 9, 2023

rmccorm4 previously approved these changes Oct 9, 2023

tanmayv25 previously approved these changes Oct 9, 2023

tanmayv25 mentioned this pull request Oct 10, 2023

Renaming the tensors and removing tools triton-inference-server/vllm_backend#7

Merged

Merge branch 'main' into nnshah-input-normalization

a83585f

nnshah1 dismissed stale reviews from tanmayv25, rmccorm4, and nv-hwoo via a83585f October 11, 2023 15:06

nv-hwoo mentioned this pull request Oct 11, 2023

Add LLM support to Brute Search triton-inference-server/model_analyzer#769

Merged

nnshah1 closed this Nov 21, 2023

nnshah1 deleted the nnshah-input-normalization branch November 21, 2023 19:10
