feat(dialogflow-cx): Example implementation of Streaming Detect Intent with continuous microphone input and audio output #13053

ffeldhaus · 2025-01-14T11:21:41Z

Description

Fixes #13041

Checklist

I have followed Sample Guidelines from AUTHORING_GUIDE.MD
Tests pass: nox -s py-3.9 (see Test Environment Setup)
Lint pass: nox -s lint (see Test Environment Setup)
Please merge this PR for me once it is approved

README.rst is not updated as it seems to be auto generated.

…rophone input and speaker output

… was received

ghost

Hi team,

I'm currently reviewing this pull request and will provide my feedback in a few minutes. In the meantime, here's a summary of the changes for you and other reviewers to quickly get up to speed:

This pull request implements a real-time bidirectional streaming audio interface with Google Cloud Dialogflow CX. The primary goal is to create an example that captures audio from a user's microphone, streams it to Dialogflow CX for intent detection, and plays back the synthesized audio responses.

Here's a breakdown of the changes:

Intent: Provide a complete example implementation of Streaming Detect Intent with continuous microphone input and audio output. This addresses issue Streaming Detect Intent with continuous microphone input #13041.
Changes:
- Added a new Python file: dialogflow-cx/streaming_detect_intent_infinite.py. This file contains the core logic for the bidirectional audio streaming with Dialogflow CX. It's a substantial addition (lines 1-727).
- Updated dialogflow-cx/requirements.txt (lines 1-7) to include necessary dependencies: termcolor==2.5.0 and pyaudio==0.2.14. The google-cloud-dialogflow-cx library was also updated to version 1.38.0.
Activity: The author, ffeldhaus, has indicated that tests and lint checks have passed.

The example is well-documented and includes instructions on setting up the environment and running the script. It also handles potential errors and provides informative logging.

I'll be checking for things like error handling, code style, and overall clarity of the example. I'll post my detailed review shortly.

And now, a little haiku to lighten the mood:

Code flows like a stream,
Dialog's voice, a clear response,
Smooth talk, all in sync.

ghost

This pull request introduces a continuous streaming example for Dialogflow CX, addressing issue #13041. The implementation is comprehensive and well-structured, demonstrating bidirectional streaming, audio input/output, and graceful exit handling. The code is generally clean and follows best practices. However, there are a few minor suggestions for improvement.

dialogflow-cx/streaming_detect_intent_infinite.py

Update example invocation Co-authored-by: code-review-assist[bot] <182814678+code-review-assist[bot]@users.noreply.github.com>

Add validation of agent name Co-authored-by: code-review-assist[bot] <182814678+code-review-assist[bot]@users.noreply.github.com>

glasnt · 2025-01-16T05:27:34Z

Internal b/389902797

snippet-bot · 2025-01-17T07:54:14Z

Here is the summary of changes.

You are about to add 1 region tag.

dialogflow-cx/streaming_detect_intent_infinite.py:54, tag dialogflow_streaming_detect_intent_infinite

This comment is generated by snippet-bot.
If you find problems with this result, please file an issue at:
https://github.com/googleapis/repo-automation-bots/issues.
To update this comment, add snippet-bot:force-run label or use the checkbox below:

Refresh this comment

dialogflow-cx/streaming_detect_intent_infinite.py

…not yet available

davidcavazos

This PR also needs tests. Since it's an "infinite" streaming CLI app, there's no need to test it end-to-end, but unit tests around the most important functions would be nice.

It's a little tricky since most functions are methods depending on the internal state of an object, so each test might need to create its own object.

I'm okay with more minimal testing around AudioIO since that's what streams from the microphone and can get tricky. However the Dialogflow code should be tested, it seems it could simply use a bytes generator with some fixed data, or maybe it could be read from a file?

dialogflow-cx/streaming_detect_intent_infinite.py

ffeldhaus · 2025-02-14T22:00:34Z

This PR also needs tests. Since it's an "infinite" streaming CLI app, there's no need to test it end-to-end, but unit tests around the most important functions would be nice.

It's a little tricky since most functions are methods depending on the internal state of an object, so each test might need to create its own object.

I'm okay with more minimal testing around AudioIO since that's what streams from the microphone and can get tricky. However the Dialogflow code should be tested, it seems it could simply use a bytes generator with some fixed data, or maybe it could be read from a file?

I added a simple test similar to the existing test in streaming_detect_intent_partial_response_test.py reusing the some hello.wav resource file as input and the same existing Dialogflow Agent.

I reused and extended the MockPyAudio from transcribe_streaming_infinite_v2_test.py.

Please review again, all comments so far should be addressed.

…pport Python 3.8

ffeldhaus and others added 7 commits January 12, 2025 22:42

feat(dialogflow-cx): Dialogflow CX infinit streaming example with mic…

7cc6181

…rophone input and speaker output

chore(dialogflow-cx) Fix restarting the stream after response message…

3334240

… was received

chore(dialogflow-cx): do not capture microphone input when playing audio

cdae068

chore(dialogflow-cx): remove unused reset_stream

f47b9be

Merge branch 'GoogleCloudPlatform:main' into main

14fc2d7

chore(dialogflow-cx): Apply Authoring Guideline improvements

32a414e

chore(dialogflow-cx): Update dependencies for audio IO streaming

c6bcab0

ffeldhaus requested review from a team as code owners January 14, 2025 11:21

ghost reviewed Jan 14, 2025

View reviewed changes

product-auto-label bot added the samples label Jan 14, 2025

blunderbuss-gcf bot assigned glasnt Jan 14, 2025

ghost reviewed Jan 14, 2025

View reviewed changes

ffeldhaus and others added 3 commits January 14, 2025 12:29

Update dialogflow-cx/streaming_detect_intent_infinite.py

50d7da6

Update example invocation Co-authored-by: code-review-assist[bot] <182814678+code-review-assist[bot]@users.noreply.github.com>

Update dialogflow-cx/streaming_detect_intent_infinite.py

d8a8dc1

Add validation of agent name Co-authored-by: code-review-assist[bot] <182814678+code-review-assist[bot]@users.noreply.github.com>

chore(dialogflow-cx): Fix invocation

0c1b10f

ffeldhaus changed the title ~~Example implementation of Streaming Detect Intent with continuous microphone input and audio output~~ feat(dialogflow-cx): Example implementation of Streaming Detect Intent with continuous microphone input and audio output Jan 14, 2025

glasnt added the do not merge label Jan 16, 2025

ffeldhaus added 3 commits January 16, 2025 17:49

chore(dialogflow-cx): Remove message on quitting with Exit and Quit

368d246

chore(dialogflow-cx): Add region tags

21e1fca

chore(dialogflow-cx): Improved and cleaned up documentation.

3f3ed12

Merge branch 'GoogleCloudPlatform:main' into main

4c14b5a

glasnt reviewed Jan 20, 2025

View reviewed changes

dialogflow-cx/streaming_detect_intent_infinite.py Outdated Show resolved Hide resolved

glasnt reviewed Jan 20, 2025

View reviewed changes

dialogflow-cx/streaming_detect_intent_infinite.py Outdated Show resolved Hide resolved

glasnt and others added 3 commits January 20, 2025 14:38

Update region tags to correct format

e1e0248

Merge branch 'GoogleCloudPlatform:main' into main

2923fd5

chore(dialoglow-cx): fix race condition when _output_audio_stream is …

da66566

…not yet available

davidcavazos requested changes Jan 27, 2025

View reviewed changes

glasnt assigned davidcavazos and unassigned glasnt Jan 28, 2025

ffeldhaus added 6 commits February 6, 2025 18:39

chore(dialogflow-cx): Use correct codeblock syntax in docstring

5b2f5c8

chore(dialogflow-cx): Code cleanup and refactoring addressing comments

66bfdf4

chore(dialogflow-cx): Code cleanup addressing code review suggestions

75e7bdd

chore(dialogflow-cx): Added default constants

5f04335

chore(dialogflow-cx): restructured audio stream creation

18ff672

feat(dialogflow-cx): Add test for streaming_detect_intent_infinite

8b43d8f

ffeldhaus requested a review from davidcavazos February 16, 2025 19:14

glasnt added kokoro:force-run and removed do not merge labels Feb 16, 2025

kokoro-team removed the kokoro:force-run label Feb 16, 2025

chore(dialogflow-cx): Add requirements specifiers for termcolor to su…

36ef022

…pport Python 3.8

glasnt added the kokoro:force-run label Feb 25, 2025

kokoro-team removed the kokoro:force-run label Feb 25, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(dialogflow-cx): Example implementation of Streaming Detect Intent with continuous microphone input and audio output #13053

feat(dialogflow-cx): Example implementation of Streaming Detect Intent with continuous microphone input and audio output #13053

ffeldhaus commented Jan 14, 2025

ghost left a comment

ghost left a comment

glasnt commented Jan 16, 2025

snippet-bot bot commented Jan 17, 2025 •

edited

Loading

davidcavazos left a comment

ffeldhaus commented Feb 14, 2025

feat(dialogflow-cx): Example implementation of Streaming Detect Intent with continuous microphone input and audio output #13053

Are you sure you want to change the base?

feat(dialogflow-cx): Example implementation of Streaming Detect Intent with continuous microphone input and audio output #13053

Conversation

ffeldhaus commented Jan 14, 2025

Description

Checklist

ghost left a comment

Choose a reason for hiding this comment

ghost left a comment

Choose a reason for hiding this comment

glasnt commented Jan 16, 2025

snippet-bot bot commented Jan 17, 2025 • edited Loading

davidcavazos left a comment

Choose a reason for hiding this comment

ffeldhaus commented Feb 14, 2025

snippet-bot bot commented Jan 17, 2025 •

edited

Loading