Universal Twitch Reader

Universal Twitch Reader (UTR) is an engine which takes in a recording of a twitch stream and produces a log of time-stamped text from the stream separated into user-informed categories.

It works in a few phases.

First, tesseract is used to find regions with high-probability of containing text
Then, a merging algorithm is applied to group very adjacent boxes together
Next, MobileNet is used to extract features and the boxes are clustered into categories based off targeted input from the user. Meta-data about the boxes such as position on screen and size is also used to improve semantic clustering.
Finally, tesseract is applied on the individual boxes and changes are logged between frames and timestamped in the final output file.

The input is a list of files to analyze.

The output is a series of files that look like:

// A.json
{
  texts: [
    ["00:00:03", "Hellu everyone!"],
    ["00:00:04", "*Hello "],
    ["00:00:11", "glhf"],
    ...
  ]
}

where "A" is the name of a type of on-screen text as inputted by the user.

Documentation

Documentation is split into files in the docs folder for readability.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
analysis		analysis
data		data
docs		docs
exploration		exploration
output/moist		output/moist
.DS_Store		.DS_Store
.gitignore		.gitignore
README.md		README.md
classify.py		classify.py
engine.py		engine.py
preprocess.py		preprocess.py
process.py		process.py
requirements.txt		requirements.txt
vid2frames.py		vid2frames.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Universal Twitch Reader

Documentation

Setup

System Overview

Results

Pitfalls / Future Work

About

Releases

Packages

Languages

mfpekala/universal-twitch-reader

Folders and files

Latest commit

History

Repository files navigation

Universal Twitch Reader

Documentation

Setup

System Overview

Results

Pitfalls / Future Work

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages