A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Updated Jul 1, 2024 - Python
Convert speech to text using HuggingFace, comparing Wav2Vec2 versus OpenAI Whisper
Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation and splits sentences. It offers several other basic preprocessing steps, such as changing case, all of which you can use to make your text suited for further processing such as indexing, part-of-speech tagging, or machine translation. Ucto comes with tokenisation rules …
A guide to the fundamentals of technical writing in American English
Training a model to detect fake news articles, then identifying the text features that indicate fake news.
Fix grammar and punctuation in markdown using GPT
Remove Punctuation is a tool that helps you strip all punctuation marks and symbols from a text document or input string.
SPA that strips the words from input text, leaving only the punctuation
🤏 Tiny & versatile 🔥 Node.js library for in-depth text analysis, manipulation and data extraction.
Spam mail detection is the process of identifying and filtering out unwanted or unsolicited emails, commonly referred to as "spam," from a user's inbox.
A small seq2seq punctuator tool based on DistilBERT
LinTO Platform punctuation service.
🈶 Useful punctuation marks and symbols for live coding interviews
Windows keyboard layouts (made with Microsoft Keyboard Layout Creator) for macrons (ā), breves (ă), and punctuation that I find useful.