A Chrome extension that detects ASL (American Sign Language) fingerspelling via webcam, converts it to text, and uses text-to-speech to speak the words aloud. This enables deaf/HoH individuals who use ASL to communicate verbally in video calls, online meetings, and anywhere on the web.
- Real-time ASL Detection: Detects fingerspelling letters A-Z using webcam
- Word Accumulation: Automatically builds words from detected letters
- Text-to-Speech: Converts words to speech using Groq API
- Video Conferencing Support: Works on Google Meet, Zoom, Teams, Webex, and Discord
- Manual Control: "Speak Now" button for immediate speech
- Customizable Settings: Adjust pause detection, voice selection, and more
- Groq API Key: Pre-configured, no setup required
  - The extension comes with a pre-configured API key
  - No need to sign up or configure anything
- Chrome Browser: version 88 or later (required for Manifest V3)
1. Clone or Download this repository

2. Add Extension Icons:
   - Navigate to `assets/icons/`
   - Add three icon files:
     - `icon16.png` (16x16 pixels)
     - `icon48.png` (48x48 pixels)
     - `icon128.png` (128x128 pixels)
   - You can create simple icons using any image editor
3. Load Extension in Chrome:
   - Open Chrome and navigate to `chrome://extensions/`
   - Enable "Developer mode" (toggle in top right)
   - Click "Load unpacked"
   - Select the `signspeak-extension` folder
4. Ready to Use:
   - The extension is pre-configured with an API key
   - No setup required: just activate and start using it
   - Optionally configure settings (pause detection, voice, etc.)
1. Navigate to a supported site:
   - Google Meet: https://meet.google.com
   - Zoom: https://zoom.us
   - Microsoft Teams: https://teams.microsoft.com
   - Webex: https://webex.com
   - Discord: https://discord.com
2. Activate the extension:
   - Click the SignSpeak icon in the Chrome toolbar
   - Click the "Activate" button
   - Grant camera permissions when prompted
3. Start fingerspelling:
   - Position your hand in front of the webcam
   - Fingerspell letters one at a time
   - The extension detects the letters and builds words
   - After a pause (default 1.5 seconds), the word is spoken aloud
4. Manual control:
   - Click "Speak Now" to immediately speak the current word
   - No need to wait for pause detection
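The letter-to-word flow above (accumulate letters, flush on pause, flush early on "Speak Now") can be sketched roughly as follows. Class and option names here are hypothetical; the actual logic lives in `src/word-accumulator.js`.

```javascript
// Minimal sketch of pause-based word accumulation.
class WordAccumulator {
  constructor({ pauseMs = 1500, onWord = () => {} } = {}) {
    this.pauseMs = pauseMs; // pause length before the word is auto-spoken
    this.onWord = onWord;   // callback receiving each finished word
    this.letters = [];
    this.timer = null;
  }

  // Called whenever the detector reports a letter; resets the pause timer.
  addLetter(letter) {
    this.letters.push(letter);
    if (this.timer) clearTimeout(this.timer);
    this.timer = setTimeout(() => this.flush(), this.pauseMs);
  }

  // "Speak Now": emit the current word immediately, without waiting.
  flush() {
    if (this.timer) clearTimeout(this.timer);
    this.timer = null;
    if (this.letters.length === 0) return "";
    const word = this.letters.join("");
    this.letters = [];
    this.onWord(word);
    return word;
  }
}
```

In the real extension, `onWord` would hand the word to the TTS service; here it is just a callback so the accumulation logic stays testable in isolation.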
- Groq API Key: Pre-configured (no input required)
- Pause Detection: Time in milliseconds before auto-speaking (300-3000ms)
- Voice Selection: Choose from available Groq voices
- Auto-speak on pause: Toggle automatic speech on pause detection
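To illustrate how the voice and key settings feed into a TTS call, here is a hedged sketch of how `src/tts-service.js` might build its request. The endpoint path, model name, and voice name are assumptions based on Groq's OpenAI-compatible API; check Groq's documentation for current values.

```javascript
// Assumed OpenAI-compatible speech endpoint on Groq.
const GROQ_TTS_URL = "https://api.groq.com/openai/v1/audio/speech";

// Build a fetch()-ready request for the given text. Model and voice
// defaults are assumptions, not confirmed values from this extension.
function buildTTSRequest(text, { apiKey, voice = "Fritz-PlayAI", model = "playai-tts" } = {}) {
  return {
    url: GROQ_TTS_URL,
    options: {
      method: "POST",
      headers: {
        "Authorization": `Bearer ${apiKey}`,
        "Content-Type": "application/json",
      },
      body: JSON.stringify({ model, voice, input: text, response_format: "wav" }),
    },
  };
}
```

In the extension, `fetch(req.url, req.options)` would return audio bytes to play through an `<audio>` element; keeping request construction separate from the network call makes it easy to unit-test.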
- Manifest V3: Uses Chrome Extension Manifest V3
- Content Scripts: Injected into video conferencing sites
- Background Service Worker: Manages state and coordinates components
- ASL Detection: Uses basic image processing for hand detection (self-contained, no external dependencies)
- TTS: Groq API for high-quality text-to-speech
Note: The current ASL detection uses a basic image processing approach. For production use with higher accuracy, consider integrating a trained machine learning model.
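One common way to stabilize a noisy per-frame classifier like this is a majority vote over recent frames: a letter is only committed once it wins most of a sliding window. This is an illustrative technique, not necessarily what `src/asl-detector.js` implements.

```javascript
// Commit a letter only when it appears in at least `minVotes` of the
// last `windowSize` per-frame classifications; otherwise return null.
function majorityVote(frames, windowSize = 7, minVotes = 4) {
  const recent = frames.slice(-windowSize);
  const counts = {};
  for (const letter of recent) {
    counts[letter] = (counts[letter] || 0) + 1;
  }
  let best = null;
  for (const [letter, n] of Object.entries(counts)) {
    if (n >= minVotes && (best === null || n > counts[best])) best = letter;
  }
  return best; // null means "no letter is stable enough yet"
}
```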
```
signspeak-extension/
├── manifest.json            # Extension manifest
├── popup/                   # Extension popup UI
│   ├── popup.html
│   ├── popup.js
│   └── popup.css
├── content/                 # Content scripts
│   └── content.js
├── background/              # Background service worker
│   └── service-worker.js
├── src/                     # Core modules
│   ├── config.js            # Configuration
│   ├── asl-detector.js      # ASL detection logic
│   ├── word-accumulator.js  # Word building logic
│   └── tts-service.js       # TTS integration
├── assets/
│   └── icons/               # Extension icons
└── README.md
```
The extension is self-contained and does not require external JavaScript libraries:
- No CDN dependencies: All code runs locally
- Basic image processing: Uses built-in browser APIs for hand detection
- Groq API: Only external dependency is the TTS API (requires internet connection)
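As an illustration of "basic image processing with built-in browser APIs", one crude dependency-free approach is to scan the RGBA bytes from a canvas `getImageData()` call with a rough skin-color rule. The thresholds below are a common heuristic, not necessarily what `src/asl-detector.js` uses.

```javascript
// `data` is the flat RGBA byte array from ctx.getImageData(...).data.
// Returns the fraction of pixels matching a crude RGB skin rule.
function skinPixelRatio(data) {
  let skin = 0;
  const total = data.length / 4;
  for (let i = 0; i < data.length; i += 4) {
    const r = data[i], g = data[i + 1], b = data[i + 2];
    // Red-dominant, moderately bright pixels count as "skin".
    if (r > 95 && g > 40 && b > 20 && r > g && r > b && r - Math.min(g, b) > 15) {
      skin++;
    }
  }
  return total ? skin / total : 0;
}
```

A detector built on this would track where skin pixels cluster frame to frame; it works entirely locally, which is why no CDN or library download is needed.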
The current implementation uses basic image processing. For better accuracy:
1. Train a machine learning model:
   - Collect ASL fingerspelling data (images or hand landmarks)
   - Train a model (TensorFlow.js, ONNX, etc.) to classify it into letters
   - Bundle the model with the extension
   - Update `src/asl-detector.js` to use your trained model
2. Use MediaPipe Hands (requires bundling):
   - Bundle the MediaPipe Hands library locally
   - Use it for accurate hand landmark detection
   - Combine it with a trained classification model
3. Use a pre-trained model:
   - Look for open-source ASL detection models
   - Bundle one with the extension
   - Integrate it into the detection pipeline
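Whichever route is taken, a landmark-based classifier usually needs its input normalized first. A typical preprocessing step, sketched here as an assumption rather than existing code, translates the hand landmarks so the wrist is the origin and scales them into a unit box, making the classifier invariant to hand position and distance from the camera.

```javascript
// `landmarks` is an array of [x, y] pairs; in MediaPipe's layout,
// landmark 0 is the wrist. Returns wrist-centered, unit-scaled pairs.
function normalizeLandmarks(landmarks) {
  const [wrist] = landmarks;
  const shifted = landmarks.map(([x, y]) => [x - wrist[0], y - wrist[1]]);
  const maxAbs = Math.max(
    ...shifted.flatMap(([x, y]) => [Math.abs(x), Math.abs(y)]),
    1e-6 // guard against division by zero for degenerate input
  );
  return shifted.map(([x, y]) => [x / maxAbs, y / maxAbs]);
}
```

The normalized pairs can then be flattened into a feature vector and fed to whatever letter classifier is bundled with the extension.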
To add support for additional video conferencing sites:
- Add the site URL to `host_permissions` in `manifest.json`
- Add the site pattern to `content_scripts.matches` in `manifest.json`
- Add the site domain to `SUPPORTED_SITES` in `src/config.js`
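For example, adding Jitsi Meet would mean extending the site list in `src/config.js` along these lines. The exact shape of that file is an assumption; only the `SUPPORTED_SITES` name comes from the steps above.

```javascript
// Hypothetical shape of SUPPORTED_SITES in src/config.js after adding
// Jitsi Meet as a new supported video conferencing site.
const SUPPORTED_SITES = [
  "meet.google.com",
  "zoom.us",
  "teams.microsoft.com",
  "webex.com",
  "discord.com",
  "meet.jit.si", // newly added site
];
```

The matching `manifest.json` changes would add the corresponding match pattern (e.g. `https://meet.jit.si/*`) to both `host_permissions` and `content_scripts.matches`.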
**Camera not working:**
- Ensure camera permissions are granted
- Check that no other application is using the camera
- Try refreshing the page after granting permissions

**Letters not being detected:**
- Ensure good lighting
- Position your hand clearly in front of the camera
- Check the browser console for detection errors
- Try adjusting the confidence threshold in the code if needed

**Speech not playing:**
- Verify the Groq API key is valid
- Check the browser console for API errors
- Ensure you have API credits/quota available
- Check your internet connection

**Extension not working on a site:**
- Ensure you're on a supported site
- Check that the content script is loading (see the browser console)
- Try reloading the extension in `chrome://extensions/`
- Camera Access: Only used locally for hand detection
- API Key: Stored securely in Chrome's local storage
- Data Processing: Hand detection happens locally; only text is sent to Groq API
- No Data Collection: The extension does not collect or store personal data
- Detection Accuracy: Current rule-based classifier has limited accuracy. A trained model is recommended for production use.
- Internet Required: TTS requires internet connection for Groq API
- Single Hand: Currently optimized for single-hand fingerspelling
- Browser Support: Chrome/Chromium only (Manifest V3)
Contributions are welcome! Areas for improvement:
- Better ASL detection models
- Support for more video conferencing platforms
- Additional TTS providers
- Performance optimizations
- UI/UX improvements
[Specify your license here]
For issues, questions, or contributions, please [create an issue or contact information].
- MediaPipe for hand landmark detection
- Groq for TTS API
- TensorFlow.js community
- ASL community for inspiration