Speech-to-Text¶
Speech-to-Text (STT) lets you dictate a chat message from the input bar instead of typing it. Lumiverse supports the browser's built-in Web Speech API and OpenAI-compatible transcription connections such as Whisper.
Setting Up STT¶
Open Settings > Voice > Speech-to-Text and choose a provider.
| Provider | Best For | Notes |
|---|---|---|
| Web Speech API | Fast browser-native dictation | Availability depends on your browser. Chrome and Edge usually work best. |
| STT Connection | Whisper and OpenAI-compatible transcription models | Requires an STT connection with an API key and transcription model. |
For an STT connection:
- Open the Connections drawer
- Go to STT Connections
- Click New STT Connection
- Enter a name, API key, and transcription model such as
gpt-4o-transcribe,whisper-1, or your provider's equivalent - Return to Settings > Voice and select that connection
OpenAI-compatible endpoints
STT connections use OpenAI-compatible /audio/transcriptions APIs. Leave API URL empty for OpenAI, or enter your proxy/self-hosted endpoint if it implements that route.
Dictating a Message¶
- Open a chat
- Click the microphone button in the input bar
- Speak your message
- Click the microphone again to finish, or use auto-submit after silence if enabled
When transcription finishes, Lumiverse places the dictated text into the chat flow.
By default, a completed STT transcript is queued as a user message. If you want Lumiverse to send it immediately and start generation, end your dictation with send message.
Example
Saying I gently open the door send message sends I gently open the door immediately.
Auto-Submit After Silence¶
For STT connections, enable Auto-submit after silence if you want Lumiverse to stop recording automatically after you finish speaking.
This is useful for Whisper-style providers because they do not stream interim words back to the browser. Lumiverse listens for confirmed speech, then waits for a sustained pause before sending the audio to transcription.
Use this when:
- You want hands-free dictation
- Your messages were being cut off by stopping the mic too early
- You prefer Lumiverse to decide when the utterance is complete
Leave it off when:
- You want full manual control over when recording ends
- You often pause for long stretches while thinking mid-sentence
- Your microphone or room noise makes silence detection unreliable
Silence detection happens before transcription
Whisper receives one completed audio recording. The silence detector decides when that recording is complete; Whisper then transcribes the whole clip.
Command Words¶
Lumiverse recognizes a small set of spoken commands while normalizing STT transcripts.
Message Action¶
| Say | Result |
|---|---|
send message at the end |
Sends the dictated message immediately instead of only queueing it |
send message only works as a command at the end of the transcript. If you say it in the middle, it remains part of the message text.
Formatting and Punctuation¶
| Say | Inserts |
|---|---|
quote start |
" |
quote end |
" |
open quote |
" |
close quote |
" |
single quote |
' |
apostrophe |
' |
thought start |
* or ** |
begin thought |
* or ** |
thought end |
* or ** |
end thought |
* or ** |
asterisk |
* |
em dash |
— |
Thought markers nest. The first thought start inserts *; a second nested thought inserts **. Matching thought end commands unwind that nesting.
Example
Saying thought start I should be careful thought end becomes *I should be careful*.
Tips for Better Transcription¶
- Speak a little past your final word before stopping the mic manually.
- Use Auto-submit after silence for Whisper/STT connections if you frequently clip the end of messages.
- Keep the microphone close enough that speech is clearly louder than room noise.
- If auto-submit triggers too early, turn it off and stop the recording manually.
- Use
send messageonly when you are sure the dictated message should start generation immediately.
Troubleshooting¶
| Problem | What to Try |
|---|---|
| The microphone button is disabled | Check browser microphone permissions and make sure your selected STT provider is available. |
| Web Speech is unavailable | Switch to an STT connection, or use a browser with Web Speech support. |
| Whisper transcription fails | Verify the STT connection API key, API URL, and model name. |
| Recording stops too soon | Enable Auto-submit after silence, or wait a moment after finishing your sentence before stopping manually. |
| Auto-submit never stops | Check for background noise, move closer to the mic, or stop manually. |