Skip to content

Conversation

@mhbuehler
Copy link
Collaborator

@mhbuehler mhbuehler commented Nov 5, 2024

Description

This PR adds the following new features as specified in "Phase 1" of this RFC. The related PR in GenAIComps is opea-project/GenAIComps#852 and it will need to be merged before this one in GenAIExamples.

Data prep and ingestion enhancements:

  • Accept image only
  • Accept image and text
  • Accept speech audio only

Other enhancements:

  • Allow the user to choose the embedding model and LVM when starting the services

Note that the planned query enhancement "Accept speech audio only" has been moved to Phase 2 and a PR for that phase will be submitted for the next release.

Issues

MultimodalQnA Image & Audio Support RFC

Type of change

List the type of change like below. Please delete options that are not relevant.

  • New feature (non-breaking change which adds new functionality)
  • Breaking change (fix or feature that would break existing design and interface)
  • Others (enhancement, documentation, validation, etc.)

Dependencies

No new dependencies

Tests

Updated the individual microservice's test scripts, the GenAIExamples' MultimodalQnA test scripts, and did manual testing of the UI and documented curl commands.

mhbuehler and others added 30 commits October 14, 2024 16:28
* added entrypoint override

Signed-off-by: okhleif-IL <[email protected]>

* fixed entrypoint container

Signed-off-by: okhleif-IL <[email protected]>

* removed workdir

Signed-off-by: okhleif-IL <[email protected]>

* reverted set env vars changes

Signed-off-by: okhleif-IL <[email protected]>

* added entrypoint for embedding model

Signed-off-by: okhleif-IL <[email protected]>

* added llava model to readme

Signed-off-by: okhleif-IL <[email protected]>

* removed stray \n

Signed-off-by: okhleif-IL <[email protected]>

* added entrypoint override in bridgetower hpu

Signed-off-by: okhleif-IL <[email protected]>

---------

Signed-off-by: okhleif-IL <[email protected]>
Signed-off-by: Melanie Buehler <[email protected]>
* changed variable names

Signed-off-by: okhleif-IL <[email protected]>

* changed gaudi env vars

Signed-off-by: okhleif-IL <[email protected]>

* changed vars + docs

Signed-off-by: okhleif-IL <[email protected]>

* verified + updated readme

Signed-off-by: okhleif-IL <[email protected]>

* updated readme

Signed-off-by: okhleif-IL <[email protected]>

* updated tests

Signed-off-by: okhleif-IL <[email protected]>

---------

Signed-off-by: okhleif-IL <[email protected]>
Signed-off-by: Melanie Buehler <[email protected]>
Signed-off-by: Melanie Buehler <[email protected]>
Signed-off-by: Melanie Buehler <[email protected]>
Signed-off-by: Melanie Buehler <[email protected]>
Signed-off-by: Melanie Buehler <[email protected]>
Signed-off-by: Melanie Buehler <[email protected]>
Signed-off-by: Melanie Buehler <[email protected]>
Signed-off-by: Melanie Buehler <[email protected]>
mhbuehler and others added 4 commits November 6, 2024 09:30
Updated main README and supported_examples with MMQA details
Fix for lvm-dependent tests and clear textbox
Signed-off-by: okhleif-IL <[email protected]>
Copy link
Collaborator

@ashahba ashahba left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Putting this PR on hold until opea-project/GenAIComps#852 is merged.

@lvliang-intel lvliang-intel requested a review from Spycsh November 8, 2024 01:53
@Spycsh
Copy link
Collaborator

Spycsh commented Nov 8, 2024

opea-project/GenAIComps#852 is merged now, please resolve comments from @ashahba .

Copy link
Collaborator

@ashahba ashahba left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM!

@ashahba
Copy link
Collaborator

ashahba commented Nov 8, 2024

@tileintel this one is ready to be merged.
All I need is your 👍
Thanks.

@joshuayao joshuayao linked an issue Nov 8, 2024 that may be closed by this pull request
@ftian1 ftian1 merged commit bbc95bb into opea-project:main Nov 8, 2024
@dmsuehir dmsuehir deleted the melanie/mm-rag-enhanced branch March 19, 2025 23:48
cogniware-devops pushed a commit to Cogniware-Inc/GenAIExamples that referenced this pull request Dec 19, 2025
Signed-off-by: Melanie Buehler <[email protected]>
Signed-off-by: okhleif-IL <[email protected]>
Signed-off-by: dmsuehir <[email protected]>
Co-authored-by: Omar Khleif <[email protected]>
Co-authored-by: dmsuehir <[email protected]>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Abolfazl Shahbazi <[email protected]>
Signed-off-by: cogniware-devops <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Image and Audio Support for MultimodalityQnA

6 participants