EchoKit Server is the central component that manages communication between the EchoKit device and AI services. It can be deployed locally or connected to preset servers, allowing developers to customize LLM endpoints, craft LLM prompts, configure speech models, and integrate additional AI features such as MCP servers.
You will need an EchoKit device, or you can build your own ESP32 device and flash it with the EchoKit firmware.
EchoKit Server powers the full voice–AI interaction loop, making it easy for developers to run end-to-end speech pipelines with flexible model choices and custom integrations.
Seamlessly connect ASR → LLM → TTS for real-time, natural conversations. Each stage can be configured independently with your preferred models or APIs.
- ASR (Speech Recognition): Works with any OpenAI-compatible API.
- LLM (Language Model): Connect to any OpenAI-spec endpoint, local or cloud.
- TTS (Text-to-Speech): Use any OpenAI-spec voice model, or ElevenLabs in streaming mode, for flexible deployment.
Out-of-the-box support for:
- Gemini — Google’s multimodal model
- Qwen Real-Time — Alibaba’s powerful open LLM
- Deploy locally or connect to remote inference servers
- Define your own LLM prompts and response workflows
- Configure speech and voice models for different personas or use cases
- Integrate MCP servers for extended functionality
git clone https://github.com/second-state/echokit_server
Edit config.toml to customize the VAD, ASR, LLM, and TTS services, as well as prompts and MCP servers. Many examples are available in the repository.
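The real schema comes from the example files shipped with the repository; the sketch below only illustrates the general shape of such a file, and every section name, key, and URL in it is an assumption rather than the actual EchoKit schema:

```toml
# Illustrative sketch only: all section names, keys, and URLs below are
# assumptions -- copy a working example from the repository and adapt it.
[vad]
url = "http://localhost:9094/"   # e.g., a local silero_vad_server

[asr]
url = "http://localhost:8080/v1/audio/transcriptions"   # OpenAI-compatible ASR
lang = "en"

[llm]
url = "http://localhost:8081/v1/chat/completions"       # any OpenAI-spec endpoint
model = "llama-3.1-8b"
system_prompt = "You are a friendly voice assistant. Keep answers short."

[tts]
url = "http://localhost:8082/v1/audio/speech"           # OpenAI-compatible TTS
voice = "alloy"
```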
cargo build --release
The config.toml can use any combination of open-source or proprietary AI services, as long as they offer OpenAI-compatible API endpoints. Here are instructions for starting open-source AI servers for use with the EchoKit server:
- VAD: https://github.com/second-state/silero_vad_server
- ASR: https://llamaedge.com/docs/ai-models/speech-to-text/quick-start-whisper
- LLM: https://llamaedge.com/docs/ai-models/llm/quick-start-llm
- Streaming TTS: https://github.com/second-state/gsv_tts
Alternatively, you can use Google Gemini Live services for VAD + ASR + LLM, and optionally TTS as well. See the config.toml examples.
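For the Gemini Live route, the configuration might look roughly like the sketch below; again, the section and key names are assumptions, so consult the repository's config.toml examples for the real schema:

```toml
# Hypothetical Gemini Live configuration -- key names are assumptions.
[gemini]
api_key = "YOUR_GEMINI_API_KEY"       # placeholder; use your own key
model = "gemini-2.0-flash-live-001"
# With Gemini Live handling VAD + ASR + LLM (and optionally TTS),
# the separate [vad]/[asr]/[llm] sections above would not be needed.
```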
You can also configure MCP servers to give the EchoKit server tool-use capabilities.
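A hypothetical MCP entry could look like the following; the table name and keys are assumptions for illustration only, so check the repository's examples for the actual format:

```toml
# Hypothetical MCP server entry -- names and keys are assumptions.
[[mcp_server]]
name = "search"
url = "http://localhost:3000/mcp"   # endpoint of an MCP server exposing tools
```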
The hello.wav file on the server is sent to the EchoKit device when it connects. It is the voice prompt the device plays to tell the user that it is ready.
export RUST_LOG=debug
nohup target/release/echokit_server &
Go here: https://echokit.dev/chat/
Click the link to save the index.html file to your local disk.
Double-click the local index.html file to open it in your browser.
In the web page, set the URL to your own EchoKit server address, and start chatting!
Go to https://echokit.dev/setup/ and use Bluetooth to connect to the GAIA ESP32 device.
Configure the WiFi and server settings:
- WiFi SSID (e.g., MyHome)
- WiFi password (e.g., MyPassword)
- WebSocket server URL for echokit_server:
  - US: ws://indie.echokit.dev/ws/
  - Asia: ws://hk.echokit.dev/ws/
- Chat: Press the K0 button once or multiple times until the screen shows "Listening ...". You can now speak, and the device will answer.
- Record: Long-press the K0 button until the screen shows "Recording ...". You can now speak, and the audio will be recorded on the server.
- Config: Press RST. While the device is restarting, press and hold K0 to enter configuration mode. Then open the configuration UI to connect to the device via Bluetooth.

