mirrorstage

mirrorstage is a one-shot AI livestreaming platform that generates automated talking-head videos in response to user input or chat messages.

features

  • modular service architecture - swap between different LLM, TTS, and video sync providers
  • real-time chat ingestion - automatically responds to pump.fun chat messages
  • obs integration - seamless streaming with dynamic video switching
  • concurrent processing - handles multiple requests with configurable queue limits (see the sketch after this list)
  • vision analysis - can analyze screenshots from obs (currently disabled)
  • character customization - define custom AI personalities and prompts
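
The bounded-queue idea behind those limits looks roughly like the sketch below. This is illustrative only: the class and method names are hypothetical, and the real queue lives in the service code; only the PIPELINE_CONCURRENT_LIMIT and MAX_QUEUE_SIZE settings come from this repo.

// illustrative sketch of a bounded queue with a concurrency cap,
// mirroring PIPELINE_CONCURRENT_LIMIT and MAX_QUEUE_SIZE from .env.
// all names here are hypothetical, not the repo's actual implementation.
type Task = () => Promise<void>;

class BoundedQueue {
  private queue: Task[] = [];
  private running = 0;

  constructor(
    private concurrentLimit: number, // e.g. PIPELINE_CONCURRENT_LIMIT
    private maxQueueSize: number,    // e.g. MAX_QUEUE_SIZE
  ) {}

  enqueue(task: Task): boolean {
    if (this.queue.length >= this.maxQueueSize) return false; // reject when full
    this.queue.push(task);
    this.drain();
    return true;
  }

  private drain(): void {
    while (this.running < this.concurrentLimit && this.queue.length > 0) {
      const task = this.queue.shift()!;
      this.running++;
      task().finally(() => {
        this.running--;
        this.drain();
      });
    }
  }
}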

supported services

text generation (llm)

  • openai
  • openrouter

text-to-speech (tts)

  • zonos (local/api)
  • elevenlabs

video synchronization

  • latentsync (local)
  • fal api (latentsync/pixverse)
  • sync labs

how to use

  1. clone this repo

  2. install dependencies

    npm install
    # or
    yarn install
  3. set up assets

    • add base_video.mp4 to _assets/ (30 seconds, front-facing human, minimal movement)
    • add base_audio.wav to _assets/ (voice sample for tts reference)
  4. configure character

    • edit server/prompts/character-file.ts to define your AI personality
  5. set up environment

    cp .env.example .env
    # edit .env with your api keys and configuration
  6. configure obs (a minimal connection check follows this list)

    • install obs studio
    • enable websocket server in obs (tools → websocket server settings)
    • default port: 4455
    • set password if desired (update in .env)
  7. run the service

    npm run dev
    # or for production
    npm run build && npm start
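
If step 6 gives you trouble, you can verify the obs websocket connection outside the app with a standalone script. This sketch assumes the obs-websocket-js v5 client, which is a common choice for node obs integrations; it is not necessarily what OBSStream.ts uses internally.

// standalone obs connection check (assumes obs-websocket-js v5;
// may differ from what OBSStream.ts actually uses)
import OBSWebSocket from 'obs-websocket-js';

const obs = new OBSWebSocket();

async function main() {
  const url = process.env.OBS_WEBSOCKET_URL ?? 'ws://localhost:4455';
  const password = process.env.OBS_WEBSOCKET_PASSWORD;
  const { obsWebSocketVersion } = await obs.connect(url, password);
  console.log(`connected to obs-websocket ${obsWebSocketVersion}`);
  await obs.disconnect();
}

main().catch((err) => {
  console.error('obs connection failed:', err);
  process.exit(1);
});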

configuration

key environment variables:

# api keys
OPENAI_API_KEY=your-key
ELEVENLABS_API_KEY=your-key
FAL_KEY=your-key

# obs configuration
OBS_WEBSOCKET_URL=ws://localhost:4455
OBS_WEBSOCKET_PASSWORD=your-password

# file paths
BASE_VIDEO_PATH=./_assets/base_video.mp4
BASE_AUDIO_PATH=./_assets/base_audio.wav
OUTPUT_DIR=./_outputs

# processing settings
PIPELINE_CONCURRENT_LIMIT=2
MAX_QUEUE_SIZE=10
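
Loading these variables looks roughly like the sketch below; the real logic lives in server/config.ts and may differ in shape and defaults.

// illustrative sketch of reading the variables above; the actual
// implementation is in server/config.ts and may differ.
function requireEnv(name: string): string {
  const value = process.env[name];
  if (!value) throw new Error(`missing required env var: ${name}`);
  return value;
}

export const config = {
  openaiApiKey: requireEnv('OPENAI_API_KEY'),
  obsWebSocketUrl: process.env.OBS_WEBSOCKET_URL ?? 'ws://localhost:4455',
  obsWebSocketPassword: process.env.OBS_WEBSOCKET_PASSWORD ?? '',
  baseVideoPath: process.env.BASE_VIDEO_PATH ?? './_assets/base_video.mp4',
  baseAudioPath: process.env.BASE_AUDIO_PATH ?? './_assets/base_audio.wav',
  outputDir: process.env.OUTPUT_DIR ?? './_outputs',
  pipelineConcurrentLimit: Number(process.env.PIPELINE_CONCURRENT_LIMIT ?? 2),
  maxQueueSize: Number(process.env.MAX_QUEUE_SIZE ?? 10),
};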

usage modes

cli mode

npm run cli:dev
# type messages directly to test the pipeline

chat ingestion mode

# set PUMP_FUN_URL in .env
# the service will automatically monitor pump.fun chat
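
The ingestion details live inside the service. As a rough sketch of the pattern only, assuming the chat endpoint speaks websocket and using the ws package (both assumptions, not confirmed by this repo):

// rough sketch of the ingestion pattern. assumes PUMP_FUN_URL points at a
// websocket chat endpoint and that the 'ws' package is installed; the
// repo's actual ingestion code may work differently.
import WebSocket from 'ws';

const url = process.env.PUMP_FUN_URL;
if (!url) throw new Error('set PUMP_FUN_URL in .env');

const ws = new WebSocket(url);

ws.on('message', (data) => {
  const text = data.toString();
  // hypothetical hand-off point into the pipeline queue
  console.log('chat message received:', text);
});

ws.on('error', (err) => console.error('chat connection error:', err));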

architecture

input sources → evaluation → text generation → tts → video sync → obs stream
     ↓              ↓             ↓              ↓         ↓           ↓
  cli/chat    priority filter   llm api      audio     talking    broadcast
                               response    generation    head
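
In code, the pipeline reads as a chain of async stages. The sketch below shows the shape only; every name is hypothetical, and the actual orchestration lives in server/app.ts.

// illustrative shape of the pipeline; all names are hypothetical and
// the actual orchestration lives in server/app.ts.
interface PipelineServices {
  generateText(prompt: string): Promise<string>;   // llm api
  synthesizeSpeech(text: string): Promise<string>; // returns audio path
  syncVideo(audioPath: string): Promise<string>;   // returns video path
  broadcast(videoPath: string): Promise<void>;     // switch obs source
}

async function runPipeline(services: PipelineServices, message: string) {
  const reply = await services.generateText(message);
  const audioPath = await services.synthesizeSpeech(reply);
  const videoPath = await services.syncVideo(audioPath);
  await services.broadcast(videoPath);
}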

development

project structure

mirrorstage/
├── server/
│   ├── app.ts              # main pipeline orchestrator
│   ├── config.ts           # configuration management
│   ├── services/           # modular service implementations
│   │   ├── interfaces.ts   # service interfaces
│   │   ├── OBSStream.ts    # obs integration
│   │   ├── PipelineInitializer.ts
│   │   └── ...
│   ├── prompts/            # ai prompts and character definitions
│   └── utils/              # utilities and helpers
├── _assets/                # base video/audio files
├── _outputs/               # generated content
└── package.json

adding new services

  1. implement the appropriate interface from server/services/interfaces.ts
  2. add service initialization in PipelineInitializer.ts
  3. update environment configuration in config.ts
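
For example, a new tts provider for step 1 might look roughly like this. The interface shape here is assumed for illustration; check server/services/interfaces.ts for the real one.

// hypothetical example of step 1; the real interface in
// server/services/interfaces.ts will differ in its exact shape.
interface TTSService {
  synthesize(text: string): Promise<string>; // returns path to audio file
}

class MyTTSService implements TTSService {
  constructor(private apiKey: string) {}

  async synthesize(text: string): Promise<string> {
    // call your provider's api here, write the result under _outputs/,
    // and return the file path
    throw new Error('not implemented');
  }
}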

linting

npm run lint

troubleshooting

obs connection issues

  • ensure obs websocket server is enabled
  • check port and password match your .env settings
  • verify obs is running before starting the service

video generation failures

  • check api keys are valid
  • ensure base video/audio files exist
  • verify output directory has write permissions
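
A small preflight script can catch the last two issues before a stream starts. This is a standalone sketch, not part of the repo, using node's built-in fs module.

// standalone preflight sketch (not part of the repo): verifies the base
// assets exist and the output directory is writable before streaming.
import { accessSync, constants } from 'node:fs';

function check(label: string, path: string, mode: number) {
  try {
    accessSync(path, mode);
    console.log(`ok: ${label} (${path})`);
  } catch {
    console.error(`FAIL: ${label} (${path})`);
  }
}

check('base video readable', process.env.BASE_VIDEO_PATH ?? './_assets/base_video.mp4', constants.R_OK);
check('base audio readable', process.env.BASE_AUDIO_PATH ?? './_assets/base_audio.wav', constants.R_OK);
check('output dir writable', process.env.OUTPUT_DIR ?? './_outputs', constants.W_OK);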
