TEQST: Tool to easily quench speechdata thirst
A mobile-first web platform to create audio recordings of texts (e.g. ASR training data). I’m very proud of this acronym.
Pitch
You need a few people to record a few sentences for you? You need a bunch of people to record large amounts of ASR data? In any case, TEQST is what you’re looking for. It’s designed for easy, intuitive, and efficient audio recording by your speakers. Just upload your texts and share a link with them. Then you can track you speakers’ progress, hours, and audio quality.
I needed a small ASR test dataset for a group project once, and with TEQST we were able to quickly produce ca. 1000 samples spending a collective 2h on recording - barely more than the audio data itself.
Features
- 3 Roles: Publisher, Speaker, Listener
- Publisher can create folder structure and upload texts in leaf folders
- Publisher can share folders with speakers
- Speaker can record texts
- Publisher can listen to recordings and see statistics
- Publisher can download data
- Publisher can share folder with listener
- Listener can listen to recordings
Architecture
- Django backend with rest-like API
- Angular + Ionic frontend
Dev story
This was PSE (Practical Software Engineering) project in my 3rd Semester at Uni, developed in a team of 5. We built the initial version in one semester and continued to work on it for the following 2 years while maintaining a live production version for the university institute. I was mainly involved in the backend, later more in the frontend. We’ve used it with Postgres and SQLite databases.