Speech Data Collection

This is a set of tools for building speech corpora. Users can record speech samples in an Android app according to instructions that are managed from a web-based admin dashboard. Completed recordings are uploaded automatically and accessible from the dashboard.

There are 3 main components:

Android App - native app written in Kotlin, distributed to end-users to record audio samples
Admin Dashboard - Vue/Vuetify single-page application for managing contents of the app and uploaded recordings
Back-end - Node/Express/Postgres based REST API to serve both the app and the dashboard

Repository structure

Some of the more important folders are:

/app/ - Android app project
/app/src/main/java/.../speechdatacollection/ - app main activity and startup logic
/app/src/main/java/.../speechdatacollection/fragments/ - sub-views to the main activity
/app/src/main/java/.../speechdatacollection/network/ - service to call back-end with data structures
/backend/admin/ - admin dashboard Vue project
/backend/admin/src/views/ - Vue components for dashboard views (pages)
/backend/admin/src/components/ - subcomponents for views
/backend/server/ - back-end server Node project
/backend/server/public/ - static HTML/CSS page for providing app download
/backend/server/src/db/ - file storage and database services
/backend/server/src/db/sql/ - database schema and queries
/backend/server/src/routes/api/ - REST API routes
/backend/server/src/middleware/ - oauth2 authentication for API calls

Further details are available at https://digikogu.taltech.ee/en/Item/cb25a4f4-2243-4766-bd78-e6fccad12b18 (in Estonian).

Name		Name	Last commit message	Last commit date
Latest commit History 119 Commits
app		app
backend		backend
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Speech Data Collection

Repository structure

About

Uh oh!

Languages

License

volkhu/speech-data-collection

Folders and files

Latest commit

History

Repository files navigation

Speech Data Collection

Repository structure

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Languages