Skip to content

Commit

Permalink
fix: update the model name (GoogleCloudPlatform#1699)
Browse files Browse the repository at this point in the history
Updated the model name to gemini-2.0-flash-001.

---------

Co-authored-by: Holt Skinner <[email protected]>
  • Loading branch information
2 people authored and leebaro committed Feb 7, 2025
1 parent 3158d73 commit d3c5931
Show file tree
Hide file tree
Showing 2 changed files with 3 additions and 3 deletions.
4 changes: 2 additions & 2 deletions gemini/sample-apps/gemini-quart-cloudrun/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -8,7 +8,7 @@ This application demonstrates a non-blocking communication with [Quart](https://

## Application screenshot

![Demo animation](https://storage.googleapis.com/github-repo/generative-ai/sample-apps/gemini-quart-cloudrun/demo_anim.gif)
![Demo animation](https://storage.googleapis.com/github-repo/generative-ai/sample-apps/gemini-quart-cloudrun/demo_anim.png)

Interruption example with the demo chat app

Expand All @@ -27,7 +27,7 @@ Interruption example with the demo chat app

**Key Benefits building Gen AI app with Quart + Gemini Live API:**

- **Responsiveness and Natural Conversation:** Quart supports non-blocking, full-duplex WebSocket communication natively, crucial for a truly interactive Gen AI experience. It doesn't halt while waiting for Gemini, ensuring quick replies and a smooth conversation flow, especially when the app supports multimodal interaction using audio and images and is network-latency sensitive. Users can send text or voice messages in quick succession, and Quart handles them and interrupts with less delays (as shown in the demo animation above).
- **Responsiveness and Natural Conversation:** Quart supports non-blocking, full-duplex WebSocket communication natively, crucial for a truly interactive Gen AI experience. It doesn't halt while waiting for Gemini, ensuring quick replies and a smooth conversation flow, especially when the app supports multimodal interaction using audio and images and is network-latency sensitive. Users can send text or voice messages in quick succession, and Quart handles them and interrupts with less delays.
- **Concurrency and Scalability:** Handles many users and their messages simultaneously. Quart can process multiple requests and replies with Gemini concurrently, making the gen AI app faster and more efficient. Quart makes better use of server resources with the single thread event-loop design, leading to lower operational costs and better scalability.

### Flask (blocking) v. Quart (non-blocking)
Expand Down
2 changes: 1 addition & 1 deletion gemini/sample-apps/gemini-quart-cloudrun/app/app.py
Original file line number Diff line number Diff line change
Expand Up @@ -38,7 +38,7 @@
GEMINI_API_KEY: str = os.environ.get("GEMINI_API_KEY", "")
QUART_DEBUG_MODE: bool = os.environ.get("QUART_DEBUG_MODE") == "True"

GEMINI_MODEL: str = "gemini-2.0-flash-exp"
GEMINI_MODEL: str = "gemini-2.0-flash-001"

# Gemini API Client: Use either one of the following APIs
gemini_client: Client = (
Expand Down

0 comments on commit d3c5931

Please sign in to comment.