gitmost

Author	SHA1	Message	Date
vvzvlad	0c46f60ddf	Merge gitea/develop into feat/public-share-assistant Resolve conflicts with the independently-merged ai-agent-roles feature: - ai-chat.module.ts: keep BOTH AiAgentRolesModule and the public-share wiring (Share/Search modules, PublicShareChatController, services). - ai.service.ts: take develop's getChatModel ChatModelOverride superset, which already covers the public-share model-id-only override. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-20 18:40:58 +03:00
claude_code	4c1d1aa2ee	Merge pull request 'feat(ai-chat): agent roles (admin persona + optional model)' (#11 ) from feat/ai-agent-roles into develop	2026-06-20 18:31:10 +03:00
vvzvlad	4b31128e24	fix(ai-roles): harden model override, role-name uniqueness, id validation, list least-privilege Follow-up fixes on the agent-roles feature: - ai.service: a cross-driver override to the ollama driver (when the workspace driver is not ollama) now fails with an explicit 503 instead of silently reusing the workspace base URL, which belongs to a different provider. Same-driver ollama and openai/gemini overrides are unchanged. - migration: add a partial unique index on (workspace_id, name) WHERE deleted_at IS NULL so role names are unique per workspace without soft-deleted rows blocking re-creation; map Postgres 23505 to a 409 ConflictException on create/update. - dto: validate the role id as @IsUUID instead of @IsString. - roles list: do not expose instructions/modelConfig to non-admin members. The list endpoint now returns a picker view (id/name/emoji/description/ enabled) to members and the full view only to admins (same gate as the CRUD endpoints). Client IAiRole fields made optional accordingly. Adds tests for the cross-driver-ollama throw, the 23505->409 mapping, and the non-admin picker-view security invariant. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-20 18:30:33 +03:00
vvzvlad	45cf4140eb	Merge branch 'develop' into feat/ai-chat-review-followups Integrate the already-merged step-limit work from develop. Only conflict was ai-chat.service.spec.ts: both sides appended a describe block and edited the import line. Resolved as a union — keep compactToolOutput + the assistantParts/ serializeSteps/rowToUiMessage suites (this branch) AND the prepareAgentStep suite (develop), importing all symbols from ai-chat.service. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-20 18:09:17 +03:00
claude code agent 227	ec128d54b4	test(ssrf): add IP-level bypass-vector cases (ported from GLM branch) Adds explicit isIpAllowed cases for the CGNAT, ULA (fd00::/8) and IPv4-mapped IPv6 loopback (::ffff:127.0.0.1) sample addresses from the parallel safety-coverage branch. The mapped-loopback case is genuinely new (the existing table only covered the mapped private variant); CGNAT and ULA ranges were already covered with other samples and are kept here as explicit regression guards for these specific addresses. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-20 18:00:43 +03:00
claude code agent 227	cedea4072b	refactor(ai-chat)!: unify provider error formatting via describeProviderError Behaviour change (split out of the test commit per review, and now covered). Both the stream onError log line and the error text streamed to the client were formatted by separate inline blocks that only emitted "<status>: <message>". Route both through the shared describeProviderError() so formatting stays in one place. BEHAVIOUR CHANGE: describeProviderError additionally appends a single-line, 300-char-truncated snippet of the provider responseBody/text. So the log line AND the user-facing stream error now include that snippet (e.g. the HTML error page from a misconfigured endpoint), which previously neither did. This is intentional — it makes a misconfigured external endpoint diagnosable — and is safe: the API key travels in the Authorization header and is never echoed in the response body (see the util's docstring). A `fallback` param is added so each call site keeps its own default ('AI stream error' for the stream). Adds ai-error.util.spec.ts covering the formatter, including the appended / truncated body snippet, so this behaviour is no longer untested. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-20 17:59:55 +03:00
claude code agent 227	f1980cf425	test(ai-chat): safety-critical coverage + a11y + pure refactors Unit tests for the safety-critical paths: crypto secret-box (round-trip, tamper detection, wrong key), the SSRF guard (blocked ranges + DNS-rebinding), the ai-chat tools service, the page-embedding repo, and the assistant-parts/serialization helpers. Those server helpers (assistantParts, rowToUiMessage, serializeSteps) are exported ONLY for the tests — no runtime change. Also: keyboard a11y on the chat history header and conversation rows (role/tabIndex/Enter+Space), and DRY refactors that move shared logic into one place (isToolPart -> tool-parts util; buildInitialValues in the MCP form). The behaviour-changing edits that previously rode along in this commit are split out into the following two commits, per review. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-20 17:58:44 +03:00
claude_code	965cbb32e5	Merge pull request 'feat(ai-chat): step cap 8→20 + forced final text answer' (#9 ) from feat/ai-chat-step-limit into develop	2026-06-20 17:47:37 +03:00
vvzvlad	0b969c8675	test(ai-chat): pin step-limit boundary + note AI SDK v7 system->instructions Port two refinements from the GLM variant onto the Claude base: - prepareAgentStep: add a comment note that AI SDK v7 renames the per-step `system` field to `instructions` (v6 ^6.0.134 still uses `system`), so it gets updated correctly on the next SDK bump. - ai-chat.service.spec: add an explicit off-by-one boundary test for prepareAgentStep, expressed via MAX_AGENT_STEPS instead of a hardcoded 18/19 so it tracks the constant if the cap changes. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-20 17:47:16 +03:00
claude_code	b20ffd1b91	Merge pull request 'feat(tree): Expand all / Collapse all for the space page tree' (#23 ) from feat/tree-expand-collapse-all-agent227 into develop	2026-06-20 17:40:29 +03:00
vvzvlad	234ae759f5	refactor(tree): borrow cleanups from the sibling expand-all impl - extract collectAllIds / collectBranchIds into tree/utils and use them in space-tree.tsx instead of inline closures - drop the duplicate SidebarPageTreeDto, reuse the existing SidebarPageDto for the /pages/tree endpoint - type the getSpaceTree client call as api.post<{ items: IPage[] }> Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-20 17:39:34 +03:00
claude code agent 180	36ae4bd3d3	feat(page-tree): gate compact tree density behind COMPACT_PAGE_TREE flag Make the denser page-tree layout opt-in instead of hardcoded, so row density can be toggled per deployment via the COMPACT_PAGE_TREE runtime config flag. - doc-tree: extract ROW_HEIGHT_STANDARD (32) / ROW_HEIGHT_COMPACT (26); default the virtualizer row stride to STANDARD density. - client: isCompactPageTreeEnabled() in lib/config (reads COMPACT_PAGE_TREE, default true); used by space-tree and shared-tree to choose the row height. - server: EnvironmentService.isCompactPageTreeEnabled() and expose COMPACT_PAGE_TREE through the window runtime config (static.module). Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-20 16:54:09 +03:00
claude code agent 227	9aff427ad8	harden(public-share): sliding cluster-wide token cap; testable access seam Release-cycle review: the per-workspace cost cap was fixed-window + per-instance (allowed ~2x at a window boundary and K*cap behind K instances) on an anonymous endpoint that spends the owner's provider budget. Rewrite it as a sliding-window, CLUSTER-WIDE Redis limiter: one atomic Lua EVAL does ZREMRANGEBYSCORE (age out) -> ZCARD -> ZADD with PEXPIRE, so concurrent instances share one budget and the true rate over any trailing window is <= cap. Fails OPEN on a Redis error (logged) — it's a cost backstop, not access control (the funnel gates + per-IP throttle still apply), so a Redis blip must not take the assistant offline. Per-IP @Throttle kept; commented that it needs an XFF-rewriting trusted proxy to be meaningful. Extract deriveShareAccess (resolvedShareId===requestedShareId + isSharingAllowed + !restricted, equality-only, never widening) and filterShareTranscript into pure helpers, and add tests: limiter sliding-window + boundary-burst + fail-open; access derivation; and red-team boundary locks (cross-share/cross-workspace swap rejected, forged shareId can't widen tool scope, transcript injection filtered). Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-20 15:04:26 +03:00
claude code agent 227	20a1780977	test(ai-roles): cover role-resolution, CASL gate, model override; hide disabled badge Release-cycle test audit found the role feature's security-critical paths untested. Adds real unit tests (against the actual functions): - resolveRoleForRequest invariants: role comes from chat.roleId not body.roleId (no per-turn swap), lookup scoped to workspace.id, disabled/soft-deleted role -> null, new-chat uses body.roleId, stale chatId falls back. - CASL admin gate: non-admin create/update/delete -> Forbidden and service not called; admin delegates with workspace.id; list() is member-reachable. - roleModelOverride: unknown driver dropped (never reaches getChatModel's throwing default), valid override passes through, blanks ignored. - getChatModel override success path (cross-driver fetch + decrypt; chatModel- only reuse), and service update/remove cross-workspace 'not found' guards + modelConfig tri-state. Tiny fix: findByCreator badge left-join now also requires enabled=true, so a disabled role (downgraded to universal by resolveRoleForRequest) no longer shows a misleading chat-list badge. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-20 14:20:08 +03:00
claude code agent 227	cac7abc395	fix(ai-roles): guard update() re-fetch against concurrent soft-delete Release-cycle review: update() re-read the role via findById (filters deleted_at IS NULL) and passed it straight to toView(updated as AiAgentRole). A concurrent soft-delete between the UPDATE and the re-fetch makes findById return undefined, and toView(undefined) dereferences row.id -> opaque 500. Add the same 'Role not found' guard remove() already uses. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-20 14:03:03 +03:00
claude code agent 227	932a4080f7	fix(public-share): block restricted descendants in the anonymous assistant Release-cycle red-team found getShareForPage joins only the shares table, so it does not exclude restricted descendants. The public share VIEW (getSharedPage) compensates with hasRestrictedAncestor, but the assistant's getSharePage tool and the controller funnel did not — so an anonymous caller could read a restricted descendant's content (tool) or surface its title into the system prompt (funnel) within an includeSubPages share. - getSharePage: after the share-membership check and before returning content, reject with the generic 'not part of this published share' message when hasRestrictedAncestor(page.id) is true (page.id is the resolved UUID, so slugId inputs work). Inject PagePermissionRepo. - funnel: resolve the OPENED page to its UUID and treat a restricted opened page as not-in-share (same uniform 404, fail closed if unresolvable) so its title never reaches buildShareSystemPrompt. search/list already exclude restricted subtrees (getPageAndDescendantsExcludingRestricted), so these were the only two bypasses. Generic messages keep restricted indistinguishable from not-in-share. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-20 13:16:32 +03:00
claude code agent 227	acf3df9e9d	feat(ai): anonymous AI assistant on public shares Lets an unauthenticated viewer of a published share ask an AI scoped strictly to that share's page tree. The authenticated agent is untouched; the security boundary is the tool scope (no identity), and nothing is persisted. Server: - workspace toggle settings.ai.publicShareAssistant (default off) + optional settings.ai.provider.publicShareChatModel (cheap model id; reuses the chat driver/baseUrl/key). getChatModel(workspaceId, override) substitutes only the model id, falling back to chatModel. - POST /api/shares/ai/stream (@Public, SSE). Guardrail funnel, each failing before streaming: toggle off -> 404; share missing/wrong-workspace/sharing off -> 404; pageId not in share tree -> 404; provider unconfigured -> 503; per-IP (5/min) and per-workspace (300/h, IP-independent) rate limits -> 429. Uniform 404s never confirm a private page's existence. - forShare read-only in-process toolset: searchSharePages (existing shareId FTS branch, no spaceId/userId), getSharePage (getShareForPage gate + share.id check, content via the public sanitizer), listSharePages. No write/ comment/history/cross-space/external-MCP tools. - Locked share system prompt + immutable safety block; stepCountIs(5). - /shares/page-info exposes an aiAssistant flag (gated behind isSharingAllowed). Client: an ephemeral, text-only Ask-AI widget on the public shared page, shown only when the flag is set; useChat -> /api/shares/ai/stream, credentials omit. Admin toggle + model field in Settings -> AI. Also adds a jest moduleNameMapper for src/-rooted imports (fixes pre-existing unresolvable specs; additive). Implements docs/public-share-assistant-plan.md. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-20 07:59:56 +03:00
claude code agent 227	30c3189220	feat(ai-chat): agent roles (admin-defined persona + optional model) Reusable, workspace-shared agent roles for the built-in AI chat. A role is a named persona (system-prompt instructions) + optional model override; a chat is bound to a role at creation and applies it every turn. Backend: - migration 20260620T120000: ai_agent_roles table + ai_chats.role_id (FK ON DELETE SET NULL); hand-merged types into db.d.ts/entity.types.ts (db.d.ts is hand-curated here, full codegen would clobber it). - core/ai-chat/roles: CRUD module. list = any workspace member; create/ update/delete = admin (Manage Settings ability, like ai-settings/mcp). All repo queries scoped by workspace_id; soft-delete (deleted_at). - buildSystemPrompt gains roleInstructions: role REPLACES the persona base (admin prompt / DEFAULT_PROMPT) but SAFETY_FRAMEWORK + context are always still appended. - stream(): role resolved from ai_chats.role_id for existing chats (never the request body -> no per-turn role swap); body.roleId only on creation. Disabled (enabled=false) and soft-deleted roles fall back to universal. - getChatModel(workspaceId, override): role model_config can swap model id / driver; a driver without configured creds throws 503 with a clear message naming the driver+role, resolved BEFORE response hijack. Client: - new-chat role picker (enabled roles only, default Universal assistant), roleId sent only on the first message; role badge (emoji+name) in the chat header and conversation list; admin Agent-roles management section in Settings -> AI (add/edit/delete, MCP-form pattern). Tests: ai-chat.prompt.spec (role layering + safety always present, incl. jailbreak); ai.service.spec (override on unconfigured driver -> 503). Implements docs/ai-agent-roles-plan.md. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-20 06:30:06 +03:00
claude code agent 227	b197cbedef	feat(ai-chat): raise agent step cap 8->20, force a final text answer A narrow research question could burn all 8 steps on tool calls and end the turn with no assistant text (empty turn). Two changes: - MAX_AGENT_STEPS = 20 (was a magic stepCountIs(8)) so multi-search turns aren't cut off mid-investigation. - prepareStep reserves the LAST allowed step for a text-only synthesis: toolChoice 'none' + a FINAL_STEP_INSTRUCTION appended to (not replacing) the system prompt, so a tool-heavy turn always ends with a real answer. Logic extracted into the pure, exported prepareAgentStep(stepNumber, system) for unit testing; earlier steps return undefined (default behavior). Implements docs/backlog/ai-chat-step-limit-and-forced-final-answer.md. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-20 05:38:13 +03:00
claude code agent 227	b81819ef63	feat(tree): Expand all / Collapse all for the space page tree Adds a server-authoritative whole-tree endpoint and sidebar menu commands so a deep space tree can be expanded in one request instead of a per-level BFS storm. Server: - POST /pages/tree (SidebarPageTreeDto: spaceId \| pageId), same CASL space scoping as /sidebar-pages. Returns the whole space tree / subtree as a flat list in the sidebar item shape (id, slugId, title, icon, position, parentPageId, spaceId, hasChildren, canEdit), ordered by position (collate C byte order), content never fetched. - page.service.getSidebarPagesTree reproduces getSidebarPages' two-branch permission model: open space -> spaceCanEdit; restricted space -> seed the full descendant set then prune via filterAccessibleTreePages + filterAccessiblePageIdsWithPermissions (keeps restricted-but-granted pages, prunes inaccessible subtrees). hasChildren is derived from the final filtered set so it can never reveal inaccessible children. - page.repo.getSpaceDescendants: recursive CTE seeded by space roots. Client: - SpaceTree is forwardRef exposing expandAll/collapseAll/isExpanding; expandAll fetches the whole tree once, replaces current-space nodes, opens every branch (current space only), aborts on space switch, surfaces real errors; collapseAll collapses only current-space ids (shared open-map). - SpaceMenu gains Expand all / Collapse all items (no admin gate). Implements docs/backlog/tree-expand-collapse-all.md. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-20 05:31:34 +03:00
claude_code	732aaf54f8	refactor(import): remove non-functional DOCX/PDF/Confluence import stubs These import paths relied on the private EE module that was deleted from the repo. In the community build they either threw 'enterprise license' (DOCX/PDF) or silently no-op'd (Confluence). The frontend buttons were already removed in 38064064; this cleans up the dead backend stubs. - import.service.ts: drop processDocx/processPdf methods, their dispatcher branches, the pageId computation + insertPage spread, and the now-unused moduleRef param/ModuleRef import - file-import-task.service.ts: drop the Confluence branch and the now-unused moduleRef param/ModuleRef import - import.controller.ts: restrict file extensions to .md/.html and zip sources to generic/notion; update the error message accordingly - file.utils.ts: remove Confluence from the FileImportSource enum - features.ts: remove the unused CONFLUENCE_IMPORT/DOCX_IMPORT/PDF_IMPORT feature keys The isConfluenceImport logic in import-attachment.service.ts is intentionally left in place (real shared attachment-parsing code, not a stub); its removal is a separate, riskier refactor.	2026-06-20 04:05:29 +03:00
vvzvlad	3d03417c73	fix(import): surface real error cause in /pages/import instead of generic 400 The two catch blocks in importPage() threw an opaque "Error processing file content" / "Failed to create imported page" BadRequest, hiding the real cause from the HTTP response. This made a production 400 regression impossible to diagnose without server log access, and violated the project convention that errors must never be swallowed. Extract `${err.name}: ${err.message}` into both the log (full err object kept for the stack) and the thrown BadRequestException. Inner processMarkdown/ processHTML rethrowing catches and the EE processDocx/processPdf license catches are left unchanged. Local reproduction of the happy-dom 14->20 theory failed (full import chain + 22 edge cases pass on happy-dom@20.8.9), so the root cause is still pending the now-visible reason from a recurring 400. Diagnostic script test-import.tsx added; backlog doc updated with findings.	2026-06-19 16:25:12 +03:00
vvzvlad	1e7a306f96	feat(mcp): add hierarchical tree mode to list_pages list_pages gains an opt-in `tree` parameter on both surfaces (the @docmost/mcp server tool and the AI-chat agent tool), which share the same DocmostClient.listPages. Default behavior (recent-by-updatedAt flat list) is unchanged. - client.ts: listPages(spaceId?, limit=50, tree=false); when tree is true it requires spaceId (throws a specific error otherwise), walks the sidebar tree via the existing bounded/cycle-safe enumerateSpacePages, and returns a nested tree; limit is ignored in tree mode. - lib/tree.ts: new pure buildPageTree() — lean nodes { id, slugId, title, children? }, children sorted by position (code-unit order), orphans promoted to roots, cycle-safe. - index.ts + ai-chat-tools.service.ts: expose `tree` in the tool schemas and descriptions; docmost-client.loader.ts: mirror the new signature. - tests: add packages/mcp/test/unit/tree.test.mjs (nesting, ordering, lean shape, orphan promotion, cycle/self-reference safety). - rebuild @docmost/mcp (build/ is tracked and loaded at runtime).	2026-06-18 20:30:00 +03:00
vvzvlad	f96df1c540	feat(ai-chat): show current context size instead of total tokens spent The floating AI-chat header badge summed metadata.usage (AI SDK totalUsage, all steps) across every assistant row, showing the cumulative tokens SPENT — which grows each turn as history is re-sent. Replace it with the conversation's CURRENT context size. - server: persist metadata.contextTokens in streamText onFinish from the final-step `usage` (inputTokens + outputTokens ≈ current context window occupancy); keep usage: totalUsage for back-compat/fallback - client: derive the badge from the most recent assistant row's contextTokens (fallback to that row's usage total for older chats) instead of summing all rows - types: add metadata.contextTokens to IAiChatMessageRow - i18n: rename badge label "Tokens used in this chat" -> "Current context size" (en-US) No DB migration needed (metadata is a JSON column).	2026-06-18 19:54:34 +03:00
vvzvlad	01a5a4b5d2	refactor(ai): explicit STT request format instead of OpenRouter host-sniffing Replace the implicit `hostname endsWith openrouter.ai` detection with an explicit, admin-chosen provider field `sttApiStyle` ('multipart' = OpenAI- compatible multipart /audio/transcriptions; 'json' = OpenRouter-style JSON + base64 input_audio). The transcription path now branches on the stored field, not on the URL — nothing hidden from the admin. - ai.types: add SttApiStyle + STT_API_STYLES; field on AiProviderSettings and MaskedAiSettings (resolved via ResolvedAiConfig). - update-ai-settings.dto: validate sttApiStyle with @IsIn(STT_API_STYLES). - ai-settings.service: plumb sttApiStyle through resolve()/getMasked() and the non-secret update whitelist; workspace.repo: add it to the ALLOWED array so it persists. - ai.service: drop isOpenRouter(); transcribe() branches on cfg.sttApiStyle; rename helper to transcribeJsonBase64 with provider-neutral error text and a BadRequestException (400) when the base URL is missing for the JSON style. - client: SttApiStyle type on IAiSettings/IAiSettingsUpdate; "Request format" Select on the Voice/STT settings card; i18n.	2026-06-18 19:40:05 +03:00
vvzvlad	77249d59c6	feat(ai): OpenRouter STT support + real error surfacing + STT endpoint test - ai.service: route *.openrouter.ai STT to its JSON+base64 /audio/transcriptions API; keep the OpenAI multipart path (AI SDK) for OpenAI/self-hosted whisper. Unify transcription behind transcribe(). - /transcribe controller: surface the real provider/transport reason (describeProviderError) instead of an opaque 500; preserve HttpException. - testConnection: add an 'stt' capability (silent-WAV probe) + DTO; client gets a Test endpoint button and status dot on the Voice/STT card. - useDictation: log full errors to the console and show the real reason (mic start + transcription paths); handle NotReadable/Abort and missing mediaDevices. - docs(CLAUDE.md): require full error logging + specific user-facing messages.	2026-06-18 19:26:35 +03:00
vvzvlad	5af40e0ee5	refactor(db): replace STT credentials migration Remove the outdated 20260618T130000 migration file and add the updated 20260618T160000 version to correct the timestamp and ensure proper ordering.	2026-06-18 18:54:24 +03:00
vvzvlad	874bdd021c	feat(ai): server-side voice dictation (STT) with mic in chat and editor Add push-to-talk voice dictation that transcribes recorded audio on the server via the workspace's OpenAI-compatible AI provider (Whisper / gpt-4o-transcribe / self-hosted whisper), then inserts the text. Backend: - New `stt_api_key_enc` column + migration; STT creds parity with chat/ embeddings (sttModel/sttBaseUrl/sttApiKey, write-only key, fallbacks to chat baseUrl/key). Both provider whitelists updated (service + repo). - AiService.getTranscriptionModel + AiTranscriptionService. - Gated POST /ai-chat/transcribe (dictation flag → 403, JWT + workspace scope + throttle, 25MB cap, MIME whitelist, never logs audio/key). - New `settings.ai.dictation` workspace flag (DTO + service + audit). Frontend: - Wire up the Voice/STT settings card (model/base URL/key) and the Voice-dictation toggle. - New `features/dictation`: useDictation (MediaRecorder state machine), MicButton, transcribe service; integrated into the chat composer and a new editor-toolbar dictation group, both gated by ai.dictation.	2026-06-18 18:45:33 +03:00
vvzvlad	c6b878c514	0.91.0 Bump root, client and server package versions 0.90.1 -> 0.91.0 to match the v0.91.0 release tag. packages/mcp keeps its independent 1.0.0 version.	2026-06-18 18:07:54 +03:00
vvzvlad	a945b47749	fix(mcp): verifiable mutation results + refuse formatting edits in edit_page_text edit_page_text reported "success" when asked to change formatting (e.g. remove strikethrough): the markdown-strip fallback matched the bare text, the replace preserved marks, and the tool returned success — so the agent believed it had fixed something that never changed. Two fixes, both in the shared @docmost/mcp DocmostClient so they reach BOTH the standalone MCP server and the in-app AI chat (which loads @docmost/mcp): - Verifiable result for every content mutator: mutatePageContent now computes a `verify` change-report (text inserted/deleted, blocks changed, per-mark-type delta, integrity/structure delta) via summarizeChange() and returns it on all mutators (incl. replaceImage via mutateLiveContentUnlocked). diffDocs is text-only, so the mark/structure delta is what surfaces formatting changes. - edit_page_text hard-refuses formatting edits: applyTextEdits rejects an edit whose find/replace differ only in markdown markers (via stripBalancedWrappers, which strips balanced wrappers/links without trimming whitespace/emoji, so plain-text edits like trailing-space trims, snake_case, math are NOT refused). A fully-refused batch errors instead of silently succeeding. Also updated the model-facing edit_page_text descriptions in BOTH tool layers (packages/mcp/src/index.ts and ai-chat-tools.service.ts) to drop the misleading "strip-and-retry tolerated" wording and point formatting changes to patch_node. New unit tests: test/unit/diff-verify.test.mjs, test/unit/json-edit-refuse.test.mjs.	2026-06-18 05:46:13 +03:00
vvzvlad	87d6bdfbd9	feat(ai): redesign AI settings page with per-endpoint test buttons Rebuild the workspace AI settings page into card-based "Endpoints" (Chat / Embeddings / Voice) matching the new design, and split the single connection test into independent per-endpoint Test buttons. - server: testConnection(workspaceId, capability) probes only the requested capability ('chat' \| 'embeddings'); add TestAiConnectionDto and wire it through the /workspace/ai-settings/test controller - client: testAiConnection(capability) + capability-typed mutation; two independent test mutation instances so Chat/Embeddings results are isolated - client: full rewrite of ai-provider-settings into Endpoints section — drop the provider dropdown (driver is always openai, base URL + key always shown), move the "AI chat" and surface the "Semantic search" feature toggles into card headers, system message behind an Edit modal, pgvector/reindex footer, and a disabled Voice/STT stub - client: restyle external MCP tools and the MCP server section; collapse the AI sections in workspace-settings; remove the standalone ai-chat-settings component - toggles now surface the server error message (e.g. missing pgvector) - i18n: add new English strings Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-18 04:20:33 +03:00
vvzvlad	c8e41e8916	feat(ai): hybrid RRF retrieval, heading-breadcrumb chunks, merged search tool Improve agent RAG quality with three changes, plus a roadmap doc for the rest. - Indexer: prefix each chunk with its heading path ("Page > H1 > H2"), built by walking the ProseMirror JSON (heading nodes) so a `#` inside a fenced code block is never mistaken for a heading. Falls back to plain-text chunking on any error. buildChunkRows: drop indexOf-against-source offsets (breadcrumb prefixes break verbatim matching) for a cumulative cursor — offsets are provenance-only. - Hybrid search: new migration adds a generated `fts` tsvector column + GIN index to page_embeddings (same english+f_unaccent config as pages.tsv). New PageEmbeddingRepo.hybridSearch fuses cosine + full-text rankings via Reciprocal Rank Fusion (k=60, equal weights) in one SQL query at chunk granularity. - Tools: collapse semanticSearch + searchPages into one hybrid `searchPages` tool with a query-rewrite-oriented description; gracefully falls back to the REST full-text path when embeddings are unconfigured. Access control (space scope + page-permission post-filter) preserved. Add a query-rewrite hint to the default system prompt. - docs/rag-improvements-plan.md: record what shipped and the deferred backlog (reranker, attachment indexing, eval harness, tuning). Note: requires a corpus reindex to populate breadcrumbs on existing pages.	2026-06-18 03:43:01 +03:00
vvzvlad	91a63f0b2c	fix(ai): stop RAG coverage bar sticking below 100% on empty pages "Indexed N of M pages" stayed at e.g. "27 of 34" forever even after a successful full reindex. The numerator counted pages that have embeddings while the denominator counted ALL non-deleted pages, so empty / text-less pages (which legitimately store zero embeddings) could never be reached. Add PageRepo.countEmbeddablePages: counts non-deleted pages that have non-empty textContent OR already have a stored embedding row, and use it as the totalPages denominator in AiSettingsService.getMasked. The "has embeddings" clause covers pages indexed from the content JSON (null textContent) and guarantees indexedPages <= totalPages. No DB migration.	2026-06-18 03:33:38 +03:00
vvzvlad	80c900eb54	fix(ai): make RAG indexer observable and bound hung embedding calls The bulk embedding reindex could hang on a single page forever ("Indexed 27 of 34 pages") with zero log output: - all progress logs were debug-level, suppressed in production (pino info); - embedMany() had no timeout, so a slow/hung embeddings endpoint blocked the sequential per-page loop indefinitely. Changes: - ai.service.embedTexts: bound embedMany with AbortSignal.timeout (configurable via AI_EMBEDDING_TIMEOUT_MS, default 120000ms); on timeout throw a clear, greppable message, classified by both signal.aborted and the error name (TimeoutError/AbortError/ResponseAborted) so a real provider error racing the timer keeps its diagnostics. - embedding-indexer.reindexWorkspace: promote lifecycle/progress logs to info; log "[i/N] indexing page <id>" BEFORE the await so a hang names the stuck page; warn on slow pages (>30s); add timing + final summary. - .env.example: document AI_EMBEDDING_TIMEOUT_MS.	2026-06-18 03:07:02 +03:00
vvzvlad	b46aed53e3	feat(ai): surface provider error bodies + probe embeddings in test connection A misconfigured embeddings endpoint failed the RAG indexer with an opaque "Invalid JSON response" and was not caught by "Test connection" (which only probed the chat model), so it only surfaced silently during background indexing. - add describeProviderError(): formats AI SDK errors as "<statusCode>: <message> \| response body: <truncated one-line snippet>" (statusCode/message/responseBody never carry the API key) - use it in the bulk-reindex catch and the embedding processor's formatter so the real cause (e.g. an HTML 404 from a wrong base URL) is visible in logs - testConnection now probes chat AND embeddings independently: skips a probe when that capability is unconfigured, returns ok:false with a Chat:/Embeddings: prefix on real failure, "not configured" when neither is set	2026-06-18 02:35:01 +03:00
vvzvlad	52e19fe678	feat(ai): wire up workspace RAG bulk reindex + manual "Reindex now" The WORKSPACE_CREATE_EMBEDDINGS / WORKSPACE_DELETE_EMBEDDINGS jobs were enqueued (on AI Search enable/disable) but had no AI_QUEUE handler, so existing pages were never indexed ("Indexed 0 of N pages") and disabling never purged embeddings. - EmbeddingProcessor: handle WORKSPACE_CREATE_EMBEDDINGS (bulk reindex all live pages) and WORKSPACE_DELETE_EMBEDDINGS (purge workspace embeddings) - EmbeddingIndexerService: add reindexWorkspace() (skips when embeddings unconfigured; per-page error isolation) and removeWorkspace() - PageRepo.getIdsByWorkspace(), PageEmbeddingRepo.deleteByWorkspace() - AiSettingsService.reindex() + admin-only POST /workspace/ai-settings/reindex - Frontend: "Reindex now" button, service call and mutation - Stable per-workspace jobId with remove-before-add so a stale job can't block future reindexes; cancel the delayed purge on enable/reindex so it can't wipe freshly-built embeddings	2026-06-18 02:15:18 +03:00
vvzvlad	a7f244053b	feat(ai): separate base URL and API key for chat vs embedding model Per-workspace AI provider config previously shared a single base URL and a single API key between the chat model and the embedding model. Add dedicated, optional embedding endpoint/token that fall back to the chat values when empty, preserving backward compatibility. - db: new migration adds nullable `embedding_api_key_enc` to `ai_provider_credentials`; chat key stays in `api_key_enc` - repo: add `upsertEmbeddingKey` / `clearEmbeddingKey` (on-conflict touches only its own column, so chat/embedding keys never overwrite) - ai-settings.service: store non-secret `embeddingBaseUrl`; resolve() applies fallback (embeddingBaseUrl \|\| baseUrl; embedding key \|\| chat key); getMasked() exposes raw `embeddingBaseUrl` + `hasEmbeddingApiKey`, never the key; update() handles the embedding key write-only - ai.service: getEmbeddingModel() builds openai/gemini/ollama with the embedding-specific URL/key; chat path unchanged - client: new "Embedding base URL" and "Embedding API key" fields with fallback hints and a clear-key action Requires running the DB migration on deploy.	2026-06-18 01:33:45 +03:00
vvzvlad	41dfeeb77a	perf(ai-chat): compact large tool outputs before persisting them Read tools (getPage, getPageJson, getNode, diffPageVersions, exportPageMarkdown) return whole pages with no size cap. Their outputs were stored verbatim in metadata.parts and the tool_calls column, and metadata.parts is replayed to the provider on every later turn via convertToModelMessages. After reading a couple of large pages the prompt grew by full page bodies each turn — rising token cost, latency and DB row size. Add compactToolOutput(): a pure, recursive, size-bounded compactor used in assistantParts() and serializeSteps(). It preserves the value's kind and small scalar fields (id/title/pageId, which the client reads to build citations on reload) while truncating long strings, capping long arrays with a marker, and collapsing subtrees past a depth limit. Small outputs are returned unchanged by identity. Tool inputs are left intact so replayed tool_use arguments keep their object shape. Compaction runs only at persistence time (onFinish/onAbort), so the live stream and the current turn's multi-step reasoning still see full bodies. Add unit tests for compactToolOutput.	2026-06-17 23:44:51 +03:00
vvzvlad	1f2d20244e	feat(ai-chat): show RAG indexing coverage in AI settings Display "Indexed N of M pages" on the AI provider settings page so admins can see how much of the wiki is covered by vector-RAG semantic search. - page-embedding.repo: add countIndexedPages() — distinct non-deleted pages that have stored embeddings in the workspace - page.repo: add countByWorkspace() — total non-deleted pages - ai-settings.service: compute both counts in getMasked() (Promise.all) and return them with the masked settings; inject PageEmbeddingRepo + PageRepo - MaskedAiSettings / IAiSettings: add indexedPages + totalPages - ai-provider-settings: render a dimmed coverage line under "Embedding model" - i18n: add the "Indexed {{indexed}} of {{total}} pages" key (en-US, ru-RU)	2026-06-17 23:18:51 +03:00
vvzvlad	cb2b7a9851	fix(ai-chat): rebrand default agent persona to Gitmost The default system prompt introduced the assistant as embedded in "Docmost". Update DEFAULT_PROMPT to say "Gitmost" so the agent presents itself with the correct product name.	2026-06-17 23:12:34 +03:00
vvzvlad	551f975886	fix(collab): use '-' instead of ':' in agent page-history jobId BullMQ rejects custom job IDs containing ':' (Redis key separator), throwing "Custom Id cannot contain :" inside the onStoreDocument hook for every agent edit. This broke agent-driven page saves (MCP create_page runs as actor='agent') with HTTP 400. Switch the agent dedup suffix from `${page.id}:agent` to `${page.id}-agent`. The jobId is only used as a BullMQ dedup key and is never parsed by the history processor; page.id is a UUID, so the hyphenated id cannot collide with a human job whose id is a bare page.id.	2026-06-17 17:38:32 +03:00
vvzvlad	afd2248a75	feat(ai-chat): tolerate markdown in edit_page_text/insert_node locators Locators (edit_page_text `find`, insert_node `anchorText`) are matched against the document's plain text, so a model-supplied locator carrying markdown wrappers (bold, italic, `code`, [t](url)) or trailing emoji never matched and the edit/insert failed. Add stripInlineMarkdown() and a fallback: try the locator verbatim first (exact match wins, so literal asterisks/underscores still work), and only on zero matches retry with a markdown-stripped form. The ambiguity guard runs on the post-fallback count, and `replace` / inserted node content are never stripped, so no formatting is lost. Failed edits gain an atom-aware reason plus a bounded "closest block text" hint; the insert_node "anchor not found" error now points at plain-text anchors / anchorNodeId. New packages/mcp/src/lib/text-normalize.ts (+ unit tests); wired into json-edit.ts and node-ops.ts; tool descriptions updated. Tests: 212 pass.	2026-06-17 15:44:19 +03:00
vvzvlad	fc9088b74d	fix(ai-chat): cross-mark text edits, partial batches, JSON-string node parity edit_page_text (applyTextEdits) now matches at the inline-block level instead of per text node, so a find/replace may cross bold/italic/link boundaries; the replacement inherits marks from the unchanged common prefix/suffix via a diff splice. Atom (non-text inline) slots can never be part of a match, making the U+FFFC placeholder collision-safe, and inserted text never inherits an atom's marks. The edit batch is no longer all-or-nothing: applyTextEdits returns { doc, results, failed } and applies what it can; editPageText writes only on a real change (no spurious history version for a no-op) and throws an aggregated, actionable error only when nothing applied. The AI-chat insert_node / patch_node / update_page_json tools now JSON.parse a node/content argument that arrives as a string, matching the standalone MCP server (this is what made insert_node fail under OpenAI tool calls). Tool descriptions gain concrete ProseMirror examples and reflect the new edit_page_text behavior. Adds/updates json-edit unit tests (183 pass). Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-17 06:57:58 +03:00
vvzvlad	0a9788e89a	feat(collab): separate agent edits from human edits in page history Page-history snapshots are debounced/coalesced (one per 1–5 min window, jobId=page.id). A human edit followed by an agent edit in the same window collapsed into a single snapshot, losing both the pre-agent human state and a deterministic record of the agent's result. Two provenance-aware boundaries now bracket an agent intervention: - Before: on a user->agent transition, onStoreDocument synchronously pins the current (pre-agent) human content as its own history version tagged 'user', inside the page-write transaction, before the agent overwrites it. - After: agent stores enqueue an immediate (delay 0), source-keyed history job (jobId=`${pageId}:agent`) so the agent's result snapshots deterministically as 'agent' and a later human edit (jobId=page.id) cannot coalesce/retag it. Also add an `id desc` tie-break to findPageLastHistory so "last history" stays deterministic when two snapshots share a created_at, consistent with findPageHistoryByPageId. Known trade-offs (Variant 1): the delay-0 worker re-reads the row, leaving a millisecond mis-tag window; multiple agent edits in one turn may yield multiple versions. The reverse agent->human boundary is intentionally out of scope.	2026-06-17 06:40:28 +03:00
vvzvlad	b0997cb749	feat(ai-chat)!: drop updateComment from the agent toolset Editing an existing comment's text is irreversible (not version-tracked), which breaks the agent's "only reversible operations" invariant. Remove the updateComment tool that was added in the toolset-expansion change, leaving the agent at 40 tools (comments: create/resolve only). - Remove the updateComment tool from forUser(). - Remove updateComment from the DocmostClientLike interface. - Reword SAFETY_FRAMEWORK: comments are create/resolve only; drop the comment-text-edit exception (keep the public-sharing one); keep the no-permanent-deletion guarantee and anti-prompt-injection rules. - Tests: assert updateComment is NOT exposed (mirrors the deleteComment guard). - docs(ai-agent-chat-plan): move updateComment to the "not exposed" list.	2026-06-17 06:03:19 +03:00
vvzvlad	6ec91c8a2c	feat(ai-chat): expose full Docmost toolset to the in-app agent Grow the agent tool registry in forUser() from 10 to 41 tools, wiring all remaining @docmost/mcp client capabilities: reads (workspace/spaces/pages/ sidebar/outline/json/node/table/comments/shares/history/diff/export) and reversible writes (editPageText, patch/insert/delete node, updatePageJson, table ops, copy/import content, share/unshare, restorePageVersion, updateComment, transformPage). Deliberately NOT exposed: deleteComment (irreversible hard delete) and the filePath-based image tools (uploadImage/insertImage/replaceImage — useless and unsafe for a server-side agent). transformPage omits the deleteComments option from its schema and never passes it, so the comment-deletion path is unreachable from the agent. - Extend DocmostClientLike with the new method signatures. - Update SAFETY_FRAMEWORK to describe the broader toolset while keeping the no-permanent-deletion guarantee and anti-prompt-injection rules; flag that comment-text edits are not version-tracked and sharing is public. - Add guardrail tests: no deleteComment tool; transformPage schema rejects deleteComments. - docs(ai-agent-chat-plan): record the toolset expansion and a backlog item to support image insertion by URL via the existing SSRF guard. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-17 05:14:45 +03:00
vvzvlad	65f0713a70	fix(ai-chat): live streaming, open-page context, any-dimension embeddings" -m "- streaming: give useChat a STABLE store id (chatId ?? per-mount generated) so the v6 hook stops re-creating its store every render on a new chat (which wiped the optimistic user message + streamed deltas, so nothing showed until the turn finished). Also send X-Accel-Buffering:no + flushHeaders. - context: client sends the currently-open page {id,title}; the system prompt tells the agent which page 'this page' refers to (it reads it via its CASL-scoped getPage tool; id is prompt-context only, no server-side fetch). - embeddings: make page_embeddings.embedding dimension-agnostic (drop the HNSW index + ALTER to vector), remove the hard 1536 guard, filter search by model_dimensions — so 3072-dim (and any) models index instead of being skipped. Seq-scan <=> search (wiki scale); existing pages reindex on next edit.	2026-06-17 04:58:06 +03:00
vvzvlad	a4b7919753	fix(ai-chat): OpenAI Chat Completions for multi-turn + provider settings, stream UX & errors" -m "Live-stand fixes (OpenRouter / OpenAI-compatible): - openai provider: use .chat() (Chat Completions) instead of the default callable (Responses API), which gateways reject on multi-turn -> 400. - updateAiProviderSettings: assemble settings.ai.provider via jsonb_build_object with ::text-cast bound params + jsonb_typeof self-heal (postgres.js was double-encoding it into an array; the ::text cast avoids 'could not determine data type of parameter'). - chat agent: drop the hard maxOutputTokens cap (truncated complex tool calls); keep a tiny cap only on the test-connection ping. - testConnection + chat stream: surface the real provider error (statusCode+message) to logs and the UI instead of generic masks; never log the API key. - chat UI: typing indicator, incremental streaming render, tool 'running' status, Stop. Also bundled (prior uncommitted ai-chat work): - history 'AI agent' provenance badge; vector RAG (pgvector image + page_embeddings + AI_QUEUE indexer + space-scoped semanticSearch); external MCP servers backend (@ai-sdk/mcp client, SSRF IP-pinning, encrypted headers, admin CRUD/Test); yjs duplicate-instance fix via pnpm patch (single CJS instance server-side).	2026-06-17 04:28:29 +03:00
vvzvlad	44b340dc1a	feat(ai-chat): agent write tools, provenance wiring, chat panel + provider settings UI" -m "Backend: - Add reversible write tools to the per-user agent toolset (page create/update/ move/soft-delete; comment reply + resolve), exposed under the user's JWT and enforced by Docmost CASL; no permanent/force delete (D3). - Non-spoofable agent provenance: sign actor/aiChatId into the access and collab tokens (TokenService), propagate via jwt.strategy onto the request, and set pages.last_updated_source/last_updated_ai_chat_id on REST create/update/move and comments.created_source/resolved_source/ai_chat_id. - packages/mcp: add an optional getCollabToken provider (content-edit provenance) and guard against empty tokens; service-account /mcp path unchanged. Frontend: - Admin 'AI / Models' settings section: provider/model/embedding/base URL, a write-only API key field, system prompt, and Test connection. - AI chat panel (useChat + DefaultChatTransport): conversation list, streamed messages, tool-call action log and page citations; header entry point gated on settings.ai.chat. Compile-verified (server nest build + client tsc/vite); not yet live-tested. Known gaps: history 'AI agent' badge (C3), vector RAG (D), external MCP (E); chat tool-card citation links pending a fix. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-17 02:39:26 +03:00
vvzvlad	683da7a4c5	feat(ai-chat): per-user AI agent backend — LLM config, read-only agent, provenance schema WIP checkpoint of the gitmost AI-chat backend (plan stages A + B1 + B3a). The agent acts under the requesting user's JWT (Docmost CASL enforces page access); the external service-account /mcp endpoint is untouched. LLM provider config (A2-A4): - integrations/crypto: AES-256-GCM SecretBoxService (key derived from APP_SECRET, per-record salt/iv; clear error on rotation instead of crashing). - ai_provider_credentials table/repo/types: encrypted API key stored outside workspace settings/baseFields, write-only (never returned by any endpoint). - integrations/ai: per-workspace AI SDK v6 provider driver (openai/gemini/ollama), admin-gated GET(masked)/PATCH(write-only key)/Test endpoints; settings.ai.provider holds non-secret config incl. systemPrompt. Removed unused AI_* env getters (DB is the single source of truth). Chat module (A1, A5-A8): - ai_chats/ai_chat_messages repos (workspace-scoped, soft-delete, tsv never selected). - core/ai-chat: CRUD + POST /ai-chat/stream (Fastify hijack + AI SDK v6 pipeUIMessageStreamToResponse, abort on disconnect, persist user/assistant msgs). - Agent loop: streamText + stepCountIs(8); read tools searchPages/getPage via a per-request DocmostClient over loopback REST under the user's minted access token. - Gate settings.ai.chat (+ 503 when provider unconfigured); buildSystemPrompt with a non-removable safety/anti-prompt-injection framework. Per-user rate limit. Per-user auth (B1): - @docmost/mcp DocmostClient gains an additive getToken variant (carry a user JWT, re-fetch on 401) and exports DocmostClient; the email/password service-account path (external /mcp, stdio) is unchanged. Agent-edit provenance backbone (B3a): - Migration: pages/page_history (last_updated_source, last_updated_ai_chat_id) and comments (created_source, ai_chat_id, resolved_source). - Signed actor/aiChatId claim in the collab token; onAuthenticate propagates it, onStoreDocument writes it with a sticky agent marker, saveHistory copies it. Migrations auto-run on boot (additive). Write tools, frontend, RAG and external MCP servers are not in this checkpoint. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-17 01:36:41 +03:00

1 2 3 4 5 ...

591 Commits