added voice recording shortcut, added markdown for responses for both…#64
Open
karansingh7773-rathore wants to merge 2 commits intoPrat011:masterfrom
🎙️ Voice Recording & Gemini Model Selection Features
📋 Overview
This PR introduces a Voice Recording feature with live transcription powered by Deepgram, and integrates with the Gemini API to analyze the transcribed discussions. It also adds dynamic Gemini Model Selection to the user interface, alongside stability improvements for API rate limits and WebSockets.
✨ New Features
1. Global "Record Voice" Shortcut (`Ctrl+Shift+R`)
- A global shortcut (`Ctrl+Shift+R`) that can be triggered from anywhere to instantly capture system and microphone audio.
2. Deepgram Live Transcription Integration
- Streams audio to Deepgram (`wss://api.deepgram.com/v1/listen`) with the `nova-3` model for live Speech-to-Text (STT).
3. Gemini Audio Analysis Pipeline
4. Dynamic Gemini Model Selector
- … `localStorage`.
🛠️ Stability & Performance Fixes
- Added a `pendingChunks` queue to buffer all early audio chunks locally. Once the Deepgram `socket.onopen` fires, the queue is flushed immediately. This ensures the WebM container headers are never silently dropped before the connection opens, so the stream can be decoded reliably.
- Uses `isRecordingRef` together with a hard mutex (`isTogglingRef`) wrapped in a `try`/`finally` block. This prevents `toggleRecording` from being double-triggered, eliminating the `Gemini 429 Too Many Requests` API spam bug.
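The two fixes above can be sketched roughly as follows. This is a minimal illustration, not the code from this PR: `SocketLike`, `BufferedSender`, and `createToggleGuard` are hypothetical names introduced here, and only `pendingChunks` and the idea behind `isTogglingRef` come from the description. It assumes chunks arrive as `Uint8Array` and that `flush()` is called from `socket.onopen`.

```typescript
// Minimal socket shape assumed by this sketch; a browser WebSocket satisfies it.
interface SocketLike {
  readyState: number;
  send(data: Uint8Array): void;
}

const OPEN = 1; // same numeric value as WebSocket.OPEN

// Buffers chunks produced before the socket opens, then flushes them in order,
// so the first chunk (which carries the WebM container header) is sent first.
class BufferedSender {
  private socket: SocketLike;
  private pendingChunks: Uint8Array[] = [];

  constructor(socket: SocketLike) {
    this.socket = socket;
  }

  // Call for every recorded chunk, whether or not the socket is open yet.
  send(chunk: Uint8Array): void {
    if (this.socket.readyState === OPEN && this.pendingChunks.length === 0) {
      this.socket.send(chunk);
    } else {
      // Not open yet, or older chunks are still queued: preserve ordering.
      this.pendingChunks.push(chunk);
    }
  }

  // Call from socket.onopen to drain the queue in arrival order.
  flush(): void {
    while (this.pendingChunks.length > 0 && this.socket.readyState === OPEN) {
      this.socket.send(this.pendingChunks.shift() as Uint8Array);
    }
  }
}

// Wraps an async toggle so that overlapping invocations are ignored.
// The returned promise resolves to true if the toggle ran, false if skipped.
function createToggleGuard(toggle: () => Promise<void>): () => Promise<boolean> {
  let isToggling = false; // plays the role of isTogglingRef.current
  return async () => {
    if (isToggling) return false; // a toggle is already in flight
    isToggling = true;
    try {
      await toggle();
      return true;
    } finally {
      isToggling = false; // released even if toggle() throws
    }
  };
}
```

In a React component the flag would typically live in a ref rather than a closure variable, so that it survives re-renders without triggering them. The guard only drops overlapping calls; it does not debounce calls that arrive after the previous toggle has finished.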