docs: add voice dictation plan and roadmap entry
Add docs/voice-dictation-plan.md, a ready-to-implement design for server-side Whisper transcription via the existing per-workspace AI provider, with a mic button in both the AI agent chat and the page editor. The doc consolidates four parts: STT provider credentials (full parity with the LLM and embedding credentials, including the encrypted stt_api_key_enc column and both provider-field whitelists), the getTranscriptionModel builder and /transcribe endpoint, the ai.dictation visibility toggle, and client-side capture (useDictation + MicButton). It also covers edge cases, security notes, an implementation order, and the full list of affected files.
@@ -112,6 +112,7 @@ community feature, with no enterprise license. Open it from the page header; the
- 🔭 **Windows / Linux app** — native desktop app for Windows and Linux.
- 🔭 **Mobile app** — native mobile application.
- 🔭 **Offline mode** — offline sync & PWA support.
- 🔭 **Voice dictation** — microphone button in the AI agent chat and the page editor; audio is transcribed server-side (Whisper / OpenAI-compatible STT) via the workspace AI provider, with an admin toggle to show/hide it. See [docs/voice-dictation-plan.md](docs/voice-dictation-plan.md).
- 🔭 **Editor & UX improvements** — blocks inside tables (lists, to-do items), column layout, additional heading levels, highlight blocks, custom emoji in callouts, floating images, anchor links for page mentions, toggles (shared-page width, aside/sidebar, spellcheck, ligatures), sanitized space-tree export, and mentions in breadcrumbs.
## Getting started