Keyword Based ASR

Repository implementing a keyword-based automatic speech recognition (ASR) system.

Setup

Create a virtual environment and install the requirements from requirements.txt.
NOTE: The NeMo toolkit is not supported on Windows, so WSL or a UNIX-based OS is required; see the docs or GitHub.
The minimal necessary data is already in the repo. To reproduce the training process and/or test other keywords, download the full datasets from this link (if the link doesn't work, please contact me via email: [email protected]) and put them in the Data directory (check the Data README for the structure).
Modify the config file to match your setup: all paths with the suffix DATA_DIR should be changed to point to locations on your machine.
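
After editing the config, it can help to confirm that every DATA_DIR path actually exists before running the notebooks. The following is a minimal sketch, not part of the repo: it assumes the settings can be loaded into a plain dictionary and that data paths are the entries whose names end in DATA_DIR. The function name find_missing_data_dirs and the example keys are hypothetical.

```python
from pathlib import Path


def find_missing_data_dirs(config: dict) -> list:
    """Return names of *DATA_DIR settings whose path is not an existing directory."""
    missing = []
    for name, value in config.items():
        # Assumption: only settings whose name ends with DATA_DIR hold data paths.
        if name.endswith("DATA_DIR") and not Path(str(value)).is_dir():
            missing.append(name)
    return missing


if __name__ == "__main__":
    # Hypothetical settings; replace with the values from your own config file.
    example_config = {
        "KEYWORDS_DATA_DIR": "Data/keywords",
        "SAMPLE_RATE": 16000,
    }
    for name in find_missing_data_dirs(example_config):
        print(f"{name} does not point to an existing directory")
```

Any setting reported here still points to a location that does not exist on your machine and should be updated.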
