docs: add voice dictation plan and roadmap entry

Add docs/voice-dictation-plan.md, a ready-to-implement design for server-side Whisper transcription via the existing per-workspace AI provider, with a mic button in both the AI agent chat and the page editor. The doc consolidates four parts: STT provider credentials (full parity with the LLM and embedding credentials, including the encrypted stt_api_key_enc column and both provider-field whitelists), the getTranscriptionModel builder and /transcribe endpoint, the ai.dictation visibility toggle, and client-side capture (useDictation + MicButton). It also covers edge cases, security notes, an implementation order, and the full list of affected files.
2026-06-18 01:48:49 +03:00
parent a7f244053b
commit 8bd2001877
3 changed files with 371 additions and 0 deletions
--- a/README.md
+++ b/README.md
@@ -112,6 +112,7 @@ community feature, with no enterprise license. Open it from the page header; the
 - 🔭 **Windows / Linux app** — native desktop app for Windows and Linux.
 - 🔭 **Mobile app** — native mobile application.
 - 🔭 **Offline mode** — offline sync & PWA support.
+- 🔭 **Voice dictation** — microphone button in the AI agent chat and the page editor; audio is transcribed server-side (Whisper / OpenAI-compatible STT) via the workspace AI provider, with an admin toggle to show/hide it. See [docs/voice-dictation-plan.md](docs/voice-dictation-plan.md).
 - 🔭 **Editor & UX improvements** — blocks inside tables (lists, to-do items), column layout, additional heading levels, highlight blocks, custom emoji in callouts, floating images, anchor links for page mentions, toggles (shared-page width, aside/sidebar, spellcheck, ligatures), sanitized space-tree export, and mentions in breadcrumbs.

 ## Getting started