📝 Transcription API – Setup & Usage Guide

✅ Requirements

Python 3.12 (⚠️ Do not use 3.13 – compatibility issues)
FFmpeg (required for Whisper to process audio)

⚙️ Setup Instructions

1. Set Python Version (Optional if using `pyenv`)

pyenv local 3.12.3  # ensures 3.12.x is used in this directory

2. Create & Activate Virtual Environment

python3.12 -m venv env
source env/bin/activate

3. Install FFmpeg

brew install ffmpeg  # For macOS
# OR
sudo apt install ffmpeg  # For Ubuntu/Debian

4. Install Python Dependencies

pip install -r requirements.txt

🌐 Postman Collection

For testing the API endpoints, you can use the following Postman collection:

RuxAiLab Transcription Tool APIs Postman Collection

🚀 Run the API Server

uvicorn app.main:app --reload

Swagger UI: http://localhost:8000/docs
ReDoc UI: http://localhost:8000/redoc

🧺 Running Tests

Make sure your virtual environment is activated before running tests.

Run All Tests

pytest

Unit Tests Only

pytest ./tests/unit

Integration Tests Only

pytest ./tests/integration

🔊 Audio Sample Links (For Testing)

You can use sample audio files from:

🔗 https://thevoiceovervoice.co.uk/female-voice-over-samples/

🛠️ Deployment Guide

Deploy a Dockerized FastAPI service to Google Cloud Run with NVIDIA L4 GPUs. Images are stored in Artifact Registry and built with Cloud Build.

Prerequisites

A Google Cloud project (e.g. ruxailab-develop)
gcloud CLI installed: Install guide
Billing enabled on the GCP project

Set your active project & region

# Project / region / registry
PROJECT_ID="ruxailab-develop"     # your-gcp-project
REGION="europe-west4"             # choose a region near you / with GPU
REPO="containers"                 # Artifact Registry repo name

# Image naming
IMAGE="transcription-api"
TAG="gpu-v1"                      # Change per New Releases :D

# Cloud Run service name
export SERVICE="transcription-api-gpu"

Authenticate & set project/region

gcloud auth login

# Set your active project & region
gcloud config set project "$PROJECT_ID"
gcloud config set run/region "$REGION"

Enable required APIs

gcloud services enable   artifactregistry.googleapis.com   run.googleapis.com   cloudbuild.googleapis.com

Create Artifact Registry (Docker)

gcloud artifacts repositories create "$REPO"   --repository-format=docker   --location="$REGION"

Build & Push the Image (Cloud Build)

gcloud builds submit   --tag "$REGION-docker.pkg.dev/$PROJECT_ID/$REPO/$IMAGE:$TAG" .

Deploy to Cloud Run with GPU (L4)

gcloud beta run deploy "$SERVICE"   --image "$REGION-docker.pkg.dev/$PROJECT_ID/$REPO/$IMAGE:$TAG"   --region "$REGION"   --allow-unauthenticated   --gpu 1   --gpu-type nvidia-l4   --cpu 4   --memory 16Gi   --concurrency 1   --no-cpu-throttling   --port 8000   --set-env-vars "DEVICE=cuda,OPENAI_API_KEY=YOUR_API_KEY_HERE"

Updating to a New Version

export TAG="gpu-v2"
gcloud builds submit   --tag "$REGION-docker.pkg.dev/$PROJECT_ID/$REPO/$IMAGE:$TAG" .

gcloud beta run deploy "$SERVICE"   --image "$REGION-docker.pkg.dev/$PROJECT_ID/$REPO/$IMAGE:$TAG"   --region "$REGION"   --allow-unauthenticated   --gpu 1   --gpu-type nvidia-l4   --cpu 4   --memory 16Gi   --concurrency 1   --no-cpu-throttling   --port 8000

Optional: CPU-only Deployment

export TAG="v1"
gcloud builds submit   --tag "$REGION-docker.pkg.dev/$PROJECT_ID/$REPO/$IMAGE:$TAG" .

gcloud run deploy "transcription-api"   --image "$REGION-docker.pkg.dev/$PROJECT_ID/$REPO/$IMAGE:$TAG"   --region "$REGION"   --allow-unauthenticated   --cpu 2   --memory 2Gi   --port 8000  --set-env-vars "DEVICE=cuda,OPENAI_API_KEY=YOUR_API_KEY_HERE"

GSoC Docs

This repository is part of the Google Summer of Code (GSoC) 2025 program.

Contributor: Basma Elhoseny
Mentors: Karine - Marc

🔗 Useful Links

🧠 GSoC'25 Project Page: Transcription Tool for Usability Testing GSoC 25 Program

🧾 Proof of Work: gsoc_2025_summary.md

License

This software is licensed under the MIT License. See the LICENSE file for more information.

Name		Name	Last commit message	Last commit date
Latest commit History 30 Commits
app		app
docs		docs
samples/raw		samples/raw
tests		tests
.dockerignore		.dockerignore
.env.example		.env.example
.gitignore		.gitignore
.python-version		.python-version
Deployment Steps.txt		Deployment Steps.txt
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
gsoc_2025_summary.md		gsoc_2025_summary.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

📝 Transcription API – Setup & Usage Guide

✅ Requirements

⚙️ Setup Instructions

1. Set Python Version (Optional if using `pyenv`)

2. Create & Activate Virtual Environment

3. Install FFmpeg

4. Install Python Dependencies

🌐 Postman Collection

🚀 Run the API Server

🧺 Running Tests

Run All Tests

Unit Tests Only

Integration Tests Only

🔊 Audio Sample Links (For Testing)

🛠️ Deployment Guide

Prerequisites

Set your active project & region

Authenticate & set project/region

Enable required APIs

Create Artifact Registry (Docker)

Build & Push the Image (Cloud Build)

Deploy to Cloud Run with GPU (L4)

Updating to a New Version

Optional: CPU-only Deployment

GSoC Docs

🔗 Useful Links

License

About

Uh oh!

Releases

Packages

Languages

License

ruxailab/transcription-api

Folders and files

Latest commit

History

Repository files navigation

📝 Transcription API – Setup & Usage Guide

✅ Requirements

⚙️ Setup Instructions

1. Set Python Version (Optional if using pyenv)

2. Create & Activate Virtual Environment

3. Install FFmpeg

4. Install Python Dependencies

🌐 Postman Collection

🚀 Run the API Server

🧺 Running Tests

Run All Tests

Unit Tests Only

Integration Tests Only

🔊 Audio Sample Links (For Testing)

🛠️ Deployment Guide

Prerequisites

Set your active project & region

Authenticate & set project/region

Enable required APIs

Create Artifact Registry (Docker)

Build & Push the Image (Cloud Build)

Deploy to Cloud Run with GPU (L4)

Updating to a New Version

Optional: CPU-only Deployment

GSoC Docs

🔗 Useful Links

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

1. Set Python Version (Optional if using `pyenv`)

Packages