A full-stack application that allows users to upload audio files (like call recordings, meetings, or lectures) and search for keywords or phrases within them.

LunaticPrakash/Audio-Query

AudioQuery

A full-stack application that allows users to upload audio files (like call recordings, meetings, or lectures) and search for keywords or phrases within them. Matching audio files are displayed to the user, who can listen to them directly or read the transcribed text with the matching phrase highlighted.

The app automatically transcribes audio into text using OpenAI Whisper and stores the transcript in a database, so future searches are fast and the same file is not processed twice.
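
The caching idea above can be sketched as follows. This is a minimal illustration, not the project's actual code. It assumes a file is identified by the SHA-256 hash of its bytes, uses the standard-library `sqlite3` module instead of SQLAlchemy to stay self-contained, and takes the transcriber as a plain callable (in the real app this would wrap a Whisper call). The table name `transcripts` and the functions `open_store` and `get_or_transcribe` are hypothetical.

```python
import hashlib
import sqlite3
from typing import Callable, List, Tuple


def open_store(path: str = ":memory:") -> sqlite3.Connection:
    """Open (or create) a hypothetical transcript store."""
    conn = sqlite3.connect(path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS transcripts ("
        "file_hash TEXT PRIMARY KEY, filename TEXT, text TEXT)"
    )
    return conn


def get_or_transcribe(
    conn: sqlite3.Connection,
    filename: str,
    audio_bytes: bytes,
    transcribe: Callable[[bytes], str],
) -> Tuple[str, bool]:
    """Return (transcript, was_cached); call `transcribe` only for new content."""
    file_hash = hashlib.sha256(audio_bytes).hexdigest()
    row = conn.execute(
        "SELECT text FROM transcripts WHERE file_hash = ?", (file_hash,)
    ).fetchone()
    if row is not None:
        # Same bytes were processed before: reuse the stored transcript.
        return row[0], True

    text = transcribe(audio_bytes)
    conn.execute(
        "INSERT INTO transcripts (file_hash, filename, text) VALUES (?, ?, ?)",
        (file_hash, filename, text),
    )
    conn.commit()
    return text, False


# Usage with a stand-in transcriber (assumption: the real one would call Whisper).
calls: List[bytes] = []


def fake_transcribe(data: bytes) -> str:
    calls.append(data)
    return "hello from the recruiter"


conn = open_store()
first = get_or_transcribe(conn, "call1.mp3", b"abc", fake_transcribe)
second = get_or_transcribe(conn, "call1_copy.mp3", b"abc", fake_transcribe)
```

Hashing the content rather than comparing filenames means a renamed copy of the same recording is still detected as a duplicate.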

Reason for building

A colleague of mine started getting around 30-35 calls per day from recruiters, and after a few days it became hard to tell which phone number belonged to which company's recruiter. So I built this Audio Query website, which lets her upload many call recordings at once and have them transcribed to text. She can then use the search feature to look for a phrase or keyword, and the site displays the recordings that contain it. She can also read the text version of a recording right away instead of listening to it.


Status:

Deployed at: Audio Query


Screenshots:

i. Home Page

Image

ii. Model Selection

Image

iii. Language Selection

Image

iv. All uploaded audio files and matching searched audio

Image

v. Popup View for details (the transcription is not very accurate because the base model was used)

Image


Technologies Used:

  • Backend: Python, Flask, SQLAlchemy, OpenAI-Whisper, Pydub
  • Frontend: ReactJS, Vite
  • Database: SQLite

Features:

  • Upload recordings – Supports multiple audio files in one go.
  • Automatic transcription – Converts speech to searchable text using Whisper.
  • Smart search – Find specific keywords or phrases across all transcribed recordings.
  • Audio playback – Play back uploaded recordings directly in the browser.
  • Manage recordings – View all recordings and delete unwanted ones.
  • Duplicate check – Avoids reprocessing if the same file already exists.
  • Preprocessing – Standardizes audio before transcription (mono, 16 kHz).
  • Multi-language support – Supports multiple languages such as English, Hindi, Telugu, and French.
  • Highlighted phrase/keyword – Matching keywords and phrases are highlighted in the transcription so they are easy to find.
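
The search and highlighting features can be illustrated with a short sketch. It is an assumption about how such a feature might work, not the project's implementation: it does a case-insensitive literal match over an in-memory dictionary of transcripts and wraps matches in `<mark>` tags. The function name `search` and the result keys are hypothetical.

```python
import re
from typing import Dict, List


def search(transcripts: Dict[str, str], phrase: str) -> List[Dict[str, str]]:
    """Return recordings whose transcript contains `phrase`, with matches marked."""
    needle = phrase.strip()
    if not needle:
        return []
    # re.escape treats the phrase as literal text, not as a regex pattern.
    pattern = re.compile(re.escape(needle), re.IGNORECASE)
    return [
        {
            "filename": name,
            # Keep the original casing of each match inside the markers.
            "highlighted": pattern.sub(
                lambda m: "<mark>" + m.group(0) + "</mark>", text
            ),
        }
        for name, text in transcripts.items()
        if pattern.search(text)
    ]


# Usage with made-up transcripts.
transcripts = {
    "call1.mp3": "Hi, I am calling from Acme Corp about the role.",
    "call2.mp3": "Your refund has been processed.",
}
results = search(transcripts, "acme corp")
```

If the highlighted text is rendered as HTML in the browser, the transcript should be HTML-escaped before the markers are inserted.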

Real-life Use Cases:

  • Call centers – Quickly locate calls mentioning “refund,” “complaint,” or “policy.”
  • Meetings & lectures – Upload long recordings and search for important discussions later.
  • Legal & compliance – Easily review sensitive phrases across large volumes of audio logs.
  • Personal productivity – Search through voice notes or interviews without replaying them fully.

Dev:

Prakash Gupta
