Feat/qwen speech summarization #418
base: develop
Conversation
…g BECAUSE it tries to talk to a vllm container TODO: parameterize the URL
…ither needed at build OR plumbing
… I have tested their functionality
jrobble
left a comment
@jrobble made 5 comments.
Reviewable status: 0 of 21 files reviewed, 4 unresolved discussions (waiting on @eric-mccann-pro).
a discussion (no related file):
The component is trying to reach out to huggingface:
2026-01-22 17:52:06,962 DEBUG [connectionpool.py:544] - [Job 14:video.mp4] https://huggingface.co:443 "HEAD /Qwen/Qwen3-30B-A3B-Instruct-2507-FP8/resolve/main/added_tokens.json HTTP/1.1" 404 0
2026-01-22 17:52:06,981 DEBUG [connectionpool.py:544] - [Job 14:video.mp4] https://huggingface.co:443 "HEAD /Qwen/Qwen3-30B-A3B-Instruct-2507-FP8/resolve/main/special_tokens_map.json HTTP/1.1" 404 0
2026-01-22 17:52:07,001 DEBUG [connectionpool.py:544] - [Job 14:video.mp4] https://huggingface.co:443 "HEAD /Qwen/Qwen3-30B-A3B-Instruct-2507-FP8/resolve/main/chat_template.jinja HTTP/1.1" 404 0
2026-01-22 17:52:07,337 DEBUG [connectionpool.py:544] - [Job 14:video.mp4] https://huggingface.co:443 "GET /api/models/Qwen/Qwen3-30B-A3B-Instruct-2507-FP8 HTTP/1.1" 200 4932
Please make sure this works in an environment with no Internet connectivity.
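One way to guarantee this (a sketch, not necessarily how the component should do it): huggingface_hub and transformers honor offline-mode environment variables, so setting them before the tokenizer is loaded makes any cache miss fail fast instead of hitting the network.

```python
import os

# Force Hugging Face libraries into offline mode so the component never
# reaches out to huggingface.co at job time. These variables must be set
# before the tokenizer/model is first loaded.
os.environ["HF_HUB_OFFLINE"] = "1"
os.environ["TRANSFORMERS_OFFLINE"] = "1"


def hf_offline_mode_enabled() -> bool:
    """Return True when both offline flags are set."""
    return (os.environ.get("HF_HUB_OFFLINE") == "1"
            and os.environ.get("TRANSFORMERS_OFFLINE") == "1")
```

Setting these in the Dockerfile (via ENV) would make the offline behavior part of the image rather than the component code.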
a discussion (no related file):
Please poll to ensure the server is initialized before sending it a request. This is how our YOLO component does it with Triton:
// do some check on server and model
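A minimal polling sketch, analogous to the Triton check above. The `/health` path is an assumption about the vLLM OpenAI server; the probe is injectable so the retry logic can be tested without a network.

```python
import time
import urllib.error
import urllib.request


def wait_for_server(url: str, timeout: float = 120.0, interval: float = 2.0,
                    probe=None) -> bool:
    """Poll `url` until it answers, or raise TimeoutError after `timeout` seconds.

    By default the probe issues an HTTP GET and treats any response as "up".
    """
    if probe is None:
        def probe(u):
            try:
                urllib.request.urlopen(u, timeout=5)
                return True
            except (urllib.error.URLError, OSError):
                return False
    deadline = time.monotonic() + timeout
    while time.monotonic() < deadline:
        if probe(url):
            return True
        time.sleep(interval)
    raise TimeoutError(f"Server at {url} did not become ready in {timeout}s")
```

The component would call something like `wait_for_server("http://qwen-speech-summarization-server:11434/health")` once before sending its first request.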
python/QwenSpeechSummarization/Dockerfile.vllm line 57 at r1 (raw file):
CMD [ \
    "--host", "0.0.0.0", \
    "--port", "11434", \
This generates a warning:
1 warning found:
- JSONArgsRecommended: JSON arguments recommended for CMD to prevent unintended behavior related to OS signals (line 55)
JSON arguments recommended for ENTRYPOINT/CMD to prevent unintended behavior related to OS signals
More info: https://docs.docker.com/go/dockerfile/rule/json-args-recommended/
Dockerfile.vllm:55
--------------------
54 |
55 | >>> CMD [ \
56 | >>> "--host", "0.0.0.0",\
57 | >>> "--port", "11434",\
58 | >>> ]
59 |
--------------------
Remove the trailing comma after "11434", to get rid of the warning.
I fixed this in a recent commit.
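For reference, a version of the CMD without the trailing comma (host and port values taken from the snippet above) that satisfies the JSONArgsRecommended check:

```dockerfile
CMD [ \
    "--host", "0.0.0.0", \
    "--port", "11434" \
]
```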
python/QwenSpeechSummarization/qwen_speech_summarization_component/test_data/SOURCE line 1 at r1 (raw file):
test.json is PUBLIC DOMAIN text from the US Library of Congress.
Instead of SOURCE, call this NOTICE for consistency with other components. Update the formatting to match. For example: https://github.com/openmpf/openmpf-components/blob/master/cpp/OcvYoloDetection/test/data/NOTICE
python/QwenSpeechSummarization/README.md line 40 at r1 (raw file):
NOTE: if you have an internet connection at runtime, you may use the image `vllm/vllm-openai:latest` directly in lieu of building Dockerfile.vllm. We do not support this arrangement, but it is possible with the right command on the docker service.

# Environment variables
Define all of these as algorithm properties in descriptor.json. For future reference, any algorithm property can be set in the compose file as an env. var. with the MPF_PROP_ prefix like this:
qwen-speech-summarization:
  depends_on:
    - workflow-manager
  deploy:
    mode: replicated
    replicas: 1
  environment:
    WFM_PASSWORD: mpfadm
    WFM_USER: admin
    MPF_PROP_VLLM_URI: http://qwen-speech-summarization-server:11434/v1
  image: openmpf_qwen_speech_summarization:jrobble-video
  volumes:
    - shared_data:/opt/mpf/share:rw

Those env. vars. take precedence over everything, including incoming job properties.
Rename VLLM_URI to VLLM_SERVER for consistency with https://github.com/openmpf/openmpf-components/blob/master/cpp/OcvYoloDetection/plugin-files/descriptor/descriptor.json#L72C20-L72C33
Right now you're reading these in once and assuming they will never change. When using algorithm properties, they may change with any job, so your code needs to check for this and reinitialize the client_factory if necessary.
By default they should not need to be set as env. vars. The default algorithm property values should work.
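A sketch of the reinitialization pattern described above, with hypothetical names (`ClientFactoryCache`, `make_client`); the default URI mirrors the compose snippet, and the property name follows the suggested `VLLM_SERVER` rename:

```python
class ClientFactoryCache:
    """Rebuild the vLLM client only when the job property that configures
    it actually changes between jobs."""

    def __init__(self, make_client):
        self._make_client = make_client
        self._server = None
        self._client = None

    def get(self, job_properties: dict):
        server = job_properties.get(
            "VLLM_SERVER",
            "http://qwen-speech-summarization-server:11434/v1")
        if self._client is None or server != self._server:
            # Property changed (or first job): build a fresh client.
            self._server = server
            self._client = self._make_client(server)
        return self._client
```

The component would hold one `ClientFactoryCache` instance and call `get(job.job_properties)` at the start of each job.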
jrobble
left a comment
@jrobble made 1 comment.
Reviewable status: 0 of 21 files reviewed, 5 unresolved discussions (waiting on @eric-mccann-pro).
a discussion (no related file):
When running WHISPER SPEECH DETECTION WITH QWEN SUMMARIZATION PIPELINE I was getting:
2026-01-21 21:10:45,929 ERROR [Camel (camelContext) thread #119 - JmsConsumer[MPF.JOB_ROUTER]] o.m.m.w.c.JobCompleteProcessor - [Job 2] Failed to create the output object due to: java.lang.IllegalArgumentException: Invalid range: [0..-1]
java.lang.IllegalArgumentException: Invalid range: [0..-1]
I fixed that Whisper issue in a commit I made to this PR.
@jrobble made 1 comment.
Reviewable status: 0 of 22 files reviewed, 6 unresolved discussions (waiting on @eric-mccann-pro).
python/WhisperSpeechDetection/plugin-files/descriptor/descriptor.json line 24 at r3 (raw file):
"properties": [ { "name": "TARGET_SEGMENT_LENGTH",
Prior to adding this, Whisper was processing multiple video chunks as sub-jobs from the WFM. It would process the whole video per sub-job, ignoring the frame range. These settings disable segmentation.
…into feat/qwen-speech-summarization
* Change server service name.
I believe we ultimately decided to drop a classifier track if its confidence is too low. We should make dropping them a configurable option by adding a classifier confidence threshold; if it's set to -1, we don't drop any of them.
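The proposed behavior could be sketched like this (hypothetical track shape: objects with a `confidence` attribute; the -1 sentinel disables filtering as described above):

```python
def filter_classifier_tracks(tracks, threshold: float):
    """Drop classifier tracks whose confidence is below `threshold`.

    A threshold of -1 disables filtering entirely, so all tracks pass.
    """
    if threshold == -1:
        return list(tracks)
    return [t for t in tracks if t.confidence >= threshold]
```

The threshold itself would be exposed as an algorithm property in descriptor.json so it can be overridden per job.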
jrobble
left a comment
@jrobble made 1 comment.
Reviewable status: 0 of 22 files reviewed, 8 unresolved discussions (waiting on @eric-mccann-pro).
python/QwenSpeechSummarization/Dockerfile line 58 at r4 (raw file):
# make sure the tokenizer is available offline
/opt/mpf/plugin-venv/bin/python3 -c 'from qwen_speech_summarization_component.qwen_speech_summarization_component import QwenSpeechSummaryComponent; QwenSpeechSummaryComponent()'; \
if [ "${RUN_TESTS,,}" == true ]; then pytest qwen_speech_summarization_component; fi
Also run test_slapchop.py.
jrobble
left a comment
@jrobble made 1 comment.
Reviewable status: 0 of 22 files reviewed, 9 unresolved discussions (waiting on @eric-mccann-pro).
python/QwenSpeechSummarization/qwen_speech_summarization_component/llm_util/input_cleanup.py line 31 at r4 (raw file):
import mpf_component_api as mpf

def clean_input_json(input):
There's supporting code in this PR like clean_input_json(input) and convert_to_csv(input) (possibly other files as well) that's not used by the component.
We should either remove it, or provide a script that uses it and document how to use that script in the README if it's important enough to keep and you think it will have value in the future. Right now it's dead code.
Previously, jrobble (Jeff Robble) wrote…

Never mind. I see that it runs.
jrobble
left a comment
@jrobble made 2 comments and resolved 1 discussion.
Reviewable status: 0 of 22 files reviewed, 10 unresolved discussions (waiting on @eric-mccann-pro).
a discussion (no related file):
Please address this:
/opt/mpf/plugin-venv/lib/python3.12/site-packages/qwen_speech_summarization_component/qwen_speech_summarization_component.py:49: UserWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html. The pkg_resources package is slated for removal as early as 2025-11-30. Refrain from using this package or pin to Setuptools<81.
from pkg_resources import resource_filename
Here's an example in the EAST component of how we resolved it during the Python 3.12 upgrade: 9adca03#diff-e8aac52dfacf8355656a33a662170ab6a26c9243c82e0b57969824966b4df265
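A sketch of the standard-library replacement: `importlib.resources.files()` has been available since Python 3.9 and is not deprecated. The helper name below just mirrors the old `pkg_resources.resource_filename` call site for easy substitution.

```python
from importlib import resources


def resource_filename(package: str, name: str) -> str:
    """Locate a data file bundled inside `package`, like
    pkg_resources.resource_filename(package, name) did.

    Note: for zip-installed packages, resources.as_file() should be used
    instead to get a real on-disk path; this sketch assumes a normal
    directory install.
    """
    return str(resources.files(package).joinpath(name))
```

With this in place, the deprecated `from pkg_resources import resource_filename` import can be deleted and Setuptools no longer needs pinning.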
a discussion (no related file):
Please address this:
Failed to convert the "PROMPT_TEMPLATE" key with value "" to <class 'NoneType'> due to: NoneType takes no arguments
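The error suggests an empty-string property value is being coerced into `NoneType`. One way to address it (a sketch with a hypothetical helper name): treat a missing or blank property as "no template" explicitly, before any type conversion happens.

```python
from typing import Optional


def get_optional_str(job_properties: dict, key: str) -> Optional[str]:
    """Return the property value, or None when it is missing or blank.

    Avoids trying to construct a NoneType from "" (the failure in the
    log above) by handling the empty-string case up front.
    """
    value = job_properties.get(key, "")
    return value if value.strip() else None
```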
Do this: