Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

MultimodalQnA Image and Audio Support Phase 1 #1071

Open
wants to merge 52 commits into
base: main
Choose a base branch
from

Conversation

mhbuehler
Copy link

@mhbuehler mhbuehler commented Nov 5, 2024

Description

This PR adds the following new features as specified in "Phase 1" of this RFC. The related PR in GenAIComps is opea-project/GenAIComps#852 and it will need to be merged before this one in GenAIExamples.

Data prep and ingestion enhancements:

  • Accept image only
  • Accept image and text
  • Accept speech audio only

Other enhancements:

  • Allow the user to choose the embedding model and LVM when starting the services

Note that the planned query enhancement "Accept speech audio only" has been moved to Phase 2 and a PR for that phase will be submitted for the next release.

Issues

MultimodalQnA Image & Audio Support RFC

Type of change

List the type of change like below. Please delete options that are not relevant.

  • New feature (non-breaking change which adds new functionality)
  • Breaking change (fix or feature that would break existing design and interface)
  • Others (enhancement, documentation, validation, etc.)

Dependencies

No new dependencies

Tests

Updated the individual microservice's test scripts, the GenAIExamples' MultimodalQnA test scripts, and did manual testing of the UI and documented curl commands.

mhbuehler and others added 30 commits October 14, 2024 16:28
* added entrypoint override

Signed-off-by: okhleif-IL <[email protected]>

* fixed entrypoint container

Signed-off-by: okhleif-IL <[email protected]>

* removed workdir

Signed-off-by: okhleif-IL <[email protected]>

* reverted set env vars changes

Signed-off-by: okhleif-IL <[email protected]>

* added entrypoint for embedding model

Signed-off-by: okhleif-IL <[email protected]>

* added llava model to readme

Signed-off-by: okhleif-IL <[email protected]>

* removed stray \n

Signed-off-by: okhleif-IL <[email protected]>

* added entrypoint override in bridgetower hpu

Signed-off-by: okhleif-IL <[email protected]>

---------

Signed-off-by: okhleif-IL <[email protected]>
Signed-off-by: Melanie Buehler <[email protected]>
* changed variable names

Signed-off-by: okhleif-IL <[email protected]>

* changed gaudi env vars

Signed-off-by: okhleif-IL <[email protected]>

* changed vars + docs

Signed-off-by: okhleif-IL <[email protected]>

* verified + updated readme

Signed-off-by: okhleif-IL <[email protected]>

* updated readme

Signed-off-by: okhleif-IL <[email protected]>

* updated tests

Signed-off-by: okhleif-IL <[email protected]>

---------

Signed-off-by: okhleif-IL <[email protected]>
Signed-off-by: Melanie Buehler <[email protected]>
Signed-off-by: Melanie Buehler <[email protected]>
Signed-off-by: Melanie Buehler <[email protected]>
Signed-off-by: Melanie Buehler <[email protected]>
Signed-off-by: Melanie Buehler <[email protected]>
Signed-off-by: Melanie Buehler <[email protected]>
Signed-off-by: Melanie Buehler <[email protected]>
Signed-off-by: Melanie Buehler <[email protected]>
Copy link
Collaborator

@ashahba ashahba left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks @mhbuehler @dmsuehir and @okhleif-IL for this well put together PR!

LGTM!

mhbuehler and others added 5 commits November 6, 2024 09:00
Updated main README and supported_examples with MMQA details
Fix for lvm-dependent tests and clear textbox
Signed-off-by: okhleif-IL <[email protected]>
export video_fn="WeAreGoingOnBullrun.mp4"
export caption_fn="apple.txt"

function build_docker_images() {
cd $WORKPATH/docker_image_build
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Please update the line 23 so that you are cloning the right repo/branch during tests and while this PR opea-project/GenAIComps#852 still being merged.

git clone https://github.com/opea-project/GenAIComps.git should be replaced with:

git clone --single-branch --branch="melanie/mm-rag-enhanced" https://github.com/mhbuehler/GenAIComps.git

export video_fn="WeAreGoingOnBullrun.mp4"
export caption_fn="apple.txt"

function build_docker_images() {
cd $WORKPATH/docker_image_build
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Please update the line 23 so that you are cloning the right repo/branch during tests and while this PR opea-project/GenAIComps#852 still being merged.

git clone https://github.com/opea-project/GenAIComps.git should be replaced with:

git clone --single-branch --branch="melanie/mm-rag-enhanced" https://github.com/mhbuehler/GenAIComps.git

Copy link
Collaborator

@ashahba ashahba left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Putting this PR on hold until opea-project/GenAIComps#852 is merged.

Comment on lines +23 to +25
# TODO: Revert to the clone of opea-project "main" after the merge of https://github.com/opea-project/GenAIComps/pull/852
# git clone https://github.com/opea-project/GenAIComps.git && cd GenAIComps && git checkout "${opea_branch:-"main"}" && cd ../
git clone --single-branch --branch="melanie/mm-rag-enhanced" https://github.com/mhbuehler/GenAIComps.git
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

As said in the comments this will be reverted prior to merged.

Comment on lines +23 to +25
# TODO: Revert to the clone of opea-project "main" after the merge of https://github.com/opea-project/GenAIComps/pull/852
# git clone https://github.com/opea-project/GenAIComps.git && cd GenAIComps && git checkout "${opea_branch:-"main"}" && cd ../
git clone --single-branch --branch="melanie/mm-rag-enhanced" https://github.com/mhbuehler/GenAIComps.git
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

As said in the comments this will be reverted prior to merged.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants