WebUI speech-to-text with a proxy #21134

kiwixz · 2026-03-28T23:24:33Z

kiwixz
Mar 28, 2026

It would be nice if we could tell the WebUI to transcribe text before feeding it to the LLM; for example by setting up the URL of a live whisper.cpp server.
My understanding is that currently it only accepts audio for model that natively supports it. The models I like are multimodal for image/documents but for some reason are all missing audio!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

WebUI speech-to-text with a proxy #21134

Uh oh!

{{title}}

Uh oh!

Replies: 0 comments

Select a reply

Uh oh!

WebUI speech-to-text with a proxy #21134

Uh oh!

kiwixz Mar 28, 2026

Replies: 0 comments

kiwixz
Mar 28, 2026