Soundmind is an Ubuntu desktop assistant for system audio. It captures the current output monitor, transcribes speech in near real time, detects likely questions, and lets you generate answers, summaries, and commentary while keeping session history and agent setup locally in SQLite.
Current release: v0.2.1
- Captures desktop audio through PulseAudio or PipeWire's Pulse compatibility layer
- Streams audio to ElevenLabs Scribe Realtime for transcription
- Uses OpenAI for manual answer, summary, and commentary actions
- Shows live transcript and assistant output in a Tauri desktop UI
- Flags detected questions explicitly in the UI
- Lets you configure a default interview-assistant instruction
- Lets you upload priming documents such as your CV, job description, or notes
- Stores sessions, transcript segments, settings, priming documents, and assistant output locally
- Supports tray controls, global shortcuts, history, export, and
systemd --userinstallation
Download the latest soundmind_*_amd64.deb from GitHub Releases, then:
sudo apt install ./soundmind_0.2.1_amd64.deb
soundmind-setup-user
soundmindThe package installs the desktop app, backend, config example, desktop entry,
icon, and a systemd --user service unit. soundmind-setup-user prepares
~/.config/soundmind and enables the backend for the current user.
- Copy the config:
cp config.example.toml config.toml- Put your provider keys in
keys.env:
OPENAI_API_KEY=...
ELEVENLABS_API_KEY=...- Start the backend:
cargo run -p app_backend- Start the desktop UI:
cargo run -p app_ui- In the desktop UI, open Agent Configuration to:
- edit the default interview-assistant instruction
- upload priming documents for the model to use
Best results come from text or markdown files. PDF upload also works when
pdftotext is installed on the machine.
- If you want the terminal debug client instead:
cargo run -p app_ui --bin terminal_ui./scripts/install-user-service.sh
soundmindThis installs release binaries under ~/.local, writes config to
~/.config/soundmind, and enables soundmind-backend.service with
systemctl --user.
Ctrl+Alt+Shift+M: show or hide the main windowCtrl+Alt+Shift+A: answer the detected or last questionCtrl+Alt+Shift+S: summarise the last minuteCtrl+Alt+Shift+C: comment on the current topic
If another app already owns one of these shortcuts, Soundmind still launches.
The current build includes:
- real system-audio capture
- real ElevenLabs realtime STT integration
- OpenAI-backed manual answer, summary, and commentary actions
- question detection surfaced in the desktop UI
- local settings, history, export, and privacy status
- configurable interview instruction plus uploaded priming documents
- tray integration and best-effort global shortcuts
- packaging helpers and
systemd --userassets - Debian release packaging and GitHub release tooling
