gitmost

Author	SHA1	Message	Date
agent_coder	f555fc87da	refactor(#345 step 2): server markdown IMPORT via canonical parser + normalizer Move every SERVER Markdown->ProseMirror path off the editor-ext markdown layer (`markdownToHtml`, a second marked-based parser) onto the canonical `@docmost/prosemirror-markdown` package, and add a foreign-markdown normalizer at the import boundary. Code: - `ImportService.processMarkdown` (single `.md` upload) now parses `markdownToProseMirror(normalizeForeignMarkdown(md))` directly — no HTML hop. - `PageService.parseProsemirrorContent` markdown case (page create/update with `format: 'markdown'`) same. - `FileImportTaskService` (zip import) parses markdown with the package, then serializes to HTML (`jsonToHtml`) so the SHARED HTML attachment / internal-link pipeline (processAttachments + formatImportHtml + processHTML) keeps handling `.md` and `.html` imports uniformly. The markdown PARSE — the drift source — no longer goes through editor-ext; the PM->HTML->PM hop that follows is lossless plumbing for attachment resolution, not a second parse. - `canonicalizeFootnotes` stays as an idempotent #228 safety net for the HTML path (a no-op on the already-canonical markdown output). Normalizer (`integrations/import/utils/foreign-markdown.ts`): a TEXT pre-pass, NOT a parser fork. The strict canonical parser does not accept GFM `[^id]` reference footnotes (and would misread `[^id]: def` as a CommonMark link-ref definition, silently corrupting the ref into a bogus link), so the normalizer rewrites reference footnotes into canonical inline `^[def]` before parsing. Callout surfaces (`:::type` and `> [!type]`) are intentionally NOT touched — the canonical parser already accepts BOTH natively, so normalizing them would be redundant and risk degrading its nesting/code-fence-aware handling. Fixtures-first: foreign-markdown.spec pins the normalizer and the end-to-end acceptance (no literal `[^id]`/`:::` leaks; re-export is canonical). The two footnote-canonicalize specs are updated to the canonical output — the parser assigns fresh `fn-*` ids, so they now assert by definition BODY order (still reference-ordered, deduped, orphan-free). FINAL CHECK: `grep -rn "htmlToMarkdown\\|markdownToHtml" apps/server/src` (non -test) is now empty — both editor-ext markdown-layer functions are gone from the server. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-07-05 03:21:07 +03:00
vvzvlad	f665f6fdd2	Merge pull request 'feat(ai-chat): autonomous agent runs — phase 1: durable detached runs (#184 )' (#234 ) from feat/184-autonomous-agent-runs into develop Reviewed-on: #234	2026-07-05 00:40:26 +03:00
agent_coder	5d8364bb5f	fix(#355 review round-2 F9-F11): register-gate test + shutdown idle-close + DB-path metrics gate - F10 [stability]: closeMetricsServer() now calls server.closeIdleConnections() + server.unref() after server.close(). server.close()'s callback doesn't fire until keep-alive sockets drain, and the scraper (VictoriaMetrics/vmagent) holds an idle keep-alive socket — so onModuleDestroy's awaited close would hang until the scraper disconnects or the orchestrator SIGKILLs on the kill-grace window. closeIdleConnections() drops idle keep-alive sockets so shutdown completes immediately (Node 22, per the Dockerfile base). - F9 [test]: client-telemetry.module.spec.ts pins the E1=B register() gate — the core of the "public endpoint OFF by default" decision: flag unset / any non- "true" value ("false"/""/"0"/…) → empty controllers+providers (route absent); "true"/"TRUE" → registers VitalsController + VitalsService. A flag-inversion or truthiness regression that reopened the anonymous disk-fill surface now fails. - F11 [regression/perf]: the db_query_duration_seconds token work (firstSqlToken regex + Set lookup) is now gated on isMetricsEnabled() in database.module.ts, so a non-metrics deployment pays NOTHING per query (previously observeDbQuery no-op'd but the token was still computed on every query). Also hoisted the 13-element known-token Set to a module const (KNOWN_SQL_TOKENS) so it's built once, not per query. Gate: server tsc 0; metrics + vitals + client-telemetry suites pass (incl. the new register-gate test). Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-07-05 00:20:26 +03:00
agent_coder	d3209b5aab	fix(#355 review E1=B + F1-F8): gate client telemetry OFF by default + throttler/lifecycle/overflow fixes Maintainer resolved E1 as variant B: the public vitals sink + client collection must be OFF by default (else client_metrics grows unbounded on a self-host deploy with no external pruner, via an unauthenticated public endpoint). - F1: new operator flag CLIENT_TELEMETRY_ENABLED (default OFF), SEPARATE from METRICS_PORT (Grafana reads the table directly, independent of the scrape port). ClientTelemetryModule.register() provides VitalsController ONLY when the flag is true (route absent otherwise); the flag reaches the client via window.CONFIG (config.ts isClientTelemetryEnabled), and initVitals() early-returns when off. - F2/F3 [throttler]: this repo's ThrottlerGuard applies EVERY named throttler to every guarded route unless skipped. The new VITALS bucket therefore (a) newly bound collab-token → 429 behind shared/NAT IPs, and (b) the vitals route didn't skip the stricter public-share-ai (5/min) bucket → effective 5/min not 120. Fix (additive, global config unchanged): vitals.controller @SkipThrottle the other buckets + @Throttle VITALS 120/min; collab-token adds VITALS_THROTTLER to its existing @SkipThrottle (restoring its prior effectively-unthrottled state). - F4: metrics node:http server is closed on shutdown (MetricsServerLifecycle OnModuleDestroy → closeMetricsServer(), fired by enableShutdownHooks). - F5: docSize outside [0, int4-max] drops to null (keeping the event) instead of overflowing int4 and failing the WHOLE batch insert (+ 2 tests). - F6: .env.example documents METRICS_PORT (no default — unset = subsystem OFF) + CLIENT_TELEMETRY_ENABLED; fixed the inaccurate "default 9464" wording. - F7: disabled/non-sampled sessions install ZERO observers — isVitalsActive() (enabled && sampled) gates reportClientMetric AND the page-editor measurePageOpen + dispatchTransaction wrapping. - F8: kept db.d.ts hand-added (wontfix) — this repo HAND-CURATES db.d.ts (verified across recent fork migrations a32fba63/8c5b57eb/fdeede00); codegen would be the deviation. The ClientMetrics interface maps the migration 1:1. Gate: server tsc 0, client tsc 0, server metrics/vitals/telemetry/throttle 21 tests, client route-template 5. No new deps. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-07-05 00:00:03 +03:00
agent_coder	68899a2c2e	feat(ai-chat): durable detached agent runs — phase 1 (#184/#234) Squashed for a clean rebase onto develop (was 19 commits; the reviewer approved the net diff at `fb246080`). Detaches an agent run from the HTTP request/browser window: a run is a first-class lifecycle object (ai_chat_runs), a browser disconnect no longer kills it, a concurrent-run insert-gate prevents double runs, and a reopened chat live-follows a still-running run via a polled observer merge. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-07-04 23:35:26 +03:00
agent_coder	b9f3de80f5	feat(observability): dev-side perf metrics — /metrics + client vitals (#355 ) The metrics INFRA is already deployed (VictoriaMetrics scraping docmost:9464, Grafana dashboards, alerts) with a target `gitmost-app` that is red because the app half didn't exist. This is that half. The contract (metric names, port, table, endpoint) is FIXED by the deployed infra and matched exactly. Server (prom-client): - A bare node:http `/metrics` server on METRICS_PORT (default 9464), SEPARATE from the Fastify :3000 listener so /metrics never exists publicly; the whole subsystem is OFF when METRICS_PORT is unset. - collectDefaultMetrics() + http_request_duration_seconds{method,route,status} via a Fastify onResponse hook using the ROUTE TEMPLATE (req.routeOptions.url, never the raw URL — bounded cardinality; 404 -> "unknown"), EXCLUDING SSE/ streaming responses (would record the connection lifetime and poison p95). - db_query_duration_seconds (Kysely log callback, labelled by the leading SQL token), bullmq_queue_depth{queue} (getJobCounts every 15s) + bullmq_job_duration_seconds{queue} (worker completed/failed), collab_store_duration_seconds (around onStoreDocument). - POST /api/telemetry/vitals — PUBLIC (sendBeacon) but IP-throttled; ~16KB body cap, <=50 events/batch, metric-name + rating whitelist, attr truncated to 120 chars, batch insert; malformed/foreign/oversized silently dropped and 200'd (no browser retry). New migration `client_metrics` (schema byte-identical to the contract, both indexes, conditional grafana_ro GRANT; no app-side retention — the maintenance container prunes >90d). Client (web-vitals): - initVitals() decides sampling ONCE per session (25%, sessionStorage) BEFORE subscribing; onINP/onLCP/onCLS/onTTFB (attribution) buffered + flushed via navigator.sendBeacon on visibilitychange:hidden and a timer (not fetch-per- metric). Custom: editor_tx_ms (dispatchTransaction sync-part timer, >8ms, with doc_size), page_open_ms, longtask_ms. Route labels are templates only; no titles/slugs/text. Gate: server + client tsc 0, frozen install 0 (added prom-client + web-vitals + regenerated the lock), server metrics/vitals tests 11, client route-template 5, and the migration verified valid against real Postgres. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-07-04 23:10:29 +03:00
agent_coder	68caf8157a	test(ai-chat): document AI_CHAT_DEFERRED_TOOLS + pin ON-path & catalog completeness (#341 review F1-F3) - F1: document AI_CHAT_DEFERRED_TOOLS in .env.example (AI_* section) — default ON = deferred loading (compact catalog + loadTools), =false restores the old "all tools always active" behavior. - F2: integration test of the ON path in ai-chat-stream.int-spec.ts — a deferred tool activated via loadTools is active on the SAME turn's next step but a fresh turn starts cold (CORE + loadTools only), proving the per-turn activatedTools Set does not leak across turns/chats. Drives the real streamText loop with a MockLanguageModelV3 and inspects recorded per-step activeTools-filtered tools. - F3: replace the magic toHaveLength(28) in tool-tiers.spec.ts with a two-way partition against the LIVE in-app toolset (AiChatToolsService.forUser keys): every non-core tool must appear in buildInAppDeferredCatalog and every catalog entry must map to a real non-core tool — so a future tool forgotten in INLINE_TOOL_TIERS fails the suite instead of silently vanishing from the agent. No production logic change (mechanism was already reviewed correct). Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-07-04 20:34:42 +03:00
claude code agent 227	e431b33bb1	feat(ai-chat): deferred tool loading (tiers + loadTools meta-tool) (#332 ) The in-app AI agent shipped all ~41 tool schemas on every model step. This adds a two-tier catalog: core tools (frequent or one-line) stay always-active; the rest are advertised as a compact catalog and their full schema is fetched on demand via the loadTools meta-tool, wired through ai@6 prepareStep's per-step activeTools. - tools/tool-tiers.ts: CORE_TOOL_KEYS, INLINE_TOOL_TIERS, applyLoadTools, catalog builders (+ tool-tiers.spec.ts, 13 cases). - ai-chat.service.ts prepareAgentStep: returns activeTools = [...CORE_TOOL_KEYS, loadTools, ...activatedTools]; per-turn activated Set. - ai-chat.prompt.ts: buildToolCatalogBlock renders the deferred catalog. - mcp/tool-specs.ts: tier + catalogLine metadata (external snake_case /mcp transport unchanged). - EnvironmentService.isAiChatDeferredToolsEnabled(): AI_CHAT_DEFERRED_TOOLS, default ON per issue intent (kill-switch =false restores old behavior). Gate: server ai-chat 631/631, tool-tiers 13/13, mcp 472/472, tsc clean. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-07-04 19:57:11 +03:00
vvzvlad	4369bbc53d	Merge pull request 'refactor(converter): единый пакет @docmost/prosemirror-markdown + канон форматов, git-sync и mcp переключены (#293 , шаги 2–5)' (#333 ) from feat/293-B-prosemirror-markdown-pkg into develop Reviewed-on: #333	2026-07-04 19:35:53 +03:00
claude code agent 227	d7fa6738e5	fix(comment): transactional childless-delete race fix + client dismiss gate + DB int-spec (#329 review round 2) F4 [critical] — the anti-join `DELETE … WHERE NOT EXISTS(child)` was still racy under Postgres READ COMMITTED: a reply INSERT holds FOR KEY SHARE on the parent; the DELETE's start snapshot doesn't see the uncommitted child (NOT EXISTS true), blocks on the reply's lock, and when the reply commits the parent was only LOCKED (not modified) so EvalPlanQual does NOT re-check → the DELETE proceeds and CASCADE destroys the just-committed reply. Replaced with a transaction: SELECT the parent FOR UPDATE (conflicts with the reply's FOR KEY SHARE → serializes the concurrent reply), re-check for a child with a FRESH statement in the same tx (a new RC snapshot sees a just-committed reply), delete only if still childless (return 1) else return 0 (caller resolves). The FOR UPDATE lock is held to end-of-tx so no reply can insert between the re-check and the delete. Signature unchanged, so the service + its mocked unit tests are untouched; docstrings updated. F5 [warning] — the client Dismiss button was gated only on canComment, but the server now gates dismiss on owner-or-space-admin, so a non-owner non-admin saw a button the server 403s. `canShowDismiss` now also requires `isOwnerOrAdmin = currentUser?.user?.id === comment.creatorId \|\| userSpaceRole === "admin"` (the same gate the comment delete-menu already uses); threaded into both call sites. F6 [warning] — added a REAL-DB int-spec (apps/server/test/integration/comment-delete-if-childless.int-spec.ts, + a createComment seeder): (a) childless → returns 1, row gone; (b) committed reply → returns 0, parent+reply survive; (c) CONCURRENCY — a second connection inserts a reply (FOR KEY SHARE) and commits mid-operation while deleteCommentIfChildless blocks on FOR UPDATE → asserts it returns 0 and both rows survive (a blind anti-join would lose the reply here). Ran against live Postgres — 3/3 pass. server tsc clean; comment jest 53 + int-spec 3 (live Postgres) pass. client tsc clean; comment vitest 56 pass. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-07-04 19:17:52 +03:00
claude code agent 227	e6d8eda8e5	fix(comment): dismiss owner/admin authz + atomic conditional delete + 404-only onError (#329 review) Maintainer escalation decision (B) + reviewer findings on the ephemeral- suggestion PR. Authz (decision B): POST /comments/dismiss-suggestion now gates the destructive branch on owner-OR-space-admin, mirroring POST /comments/delete exactly (same SpaceCaslAction.Manage / SpaceCaslSubject.Settings, same owner short-circuit, same ForbiddenException). A non-owner non-admin who tries to dismiss another's childless suggestion gets Forbidden before the service runs. Apply stays on canEdit (accepting an edit is the editor's semantics), unchanged. F1 [blocking] — atomic conditional delete closes the hasChildren→delete race. New repo `deleteCommentIfChildless(id)` runs a single `DELETE FROM comments WHERE id=:id AND NOT EXISTS (SELECT 1 FROM comments child WHERE child.parent_comment_id = comments.id)` (verified by compiling the Kysely expression to SQL — the correlated subquery references the OUTER comments.id). deleteEphemeralSuggestion strips the mark first, then the conditional delete: if it removed the row → commentDeleted + outcome 'deleted'; if a reply raced in (0 rows) → fall back to resolveComment (outcome 'resolved') so the discussion and the new reply survive. No reply can be cascade-deleted anymore. F2 [warning] — the apply/dismiss onError success-noop is narrowed from 404\|\|400 to 404 ONLY. A 400 means the comment is ALIVE (apply's 400 = the thread was resolved-not-applied), so it now shows a real error (surfacing the server message) and KEEPS the comment in cache instead of a false "applied" + dropping a live thread. F3 [suggestion] — the 404-race client tests assert the success toast fired. Tests: server — dismiss authz (owner ok / non-owner-non-admin Forbidden / space-admin ok), the delete→resolve race (hasChildren=false but conditional delete returns 0 → resolve, no commentDeleted), delete-path asserts switched to deleteCommentIfChildless; client — apply-400 and dismiss-400 (kept in cache, red, not success) + the toast assertions. server tsc clean, comment+collaboration jest green; client tsc clean, comment vitest 54 passed. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-07-04 19:17:19 +03:00
claude code agent 227	8d8ecaed82	feat(comment): ephemeral suggestion-edits — Apply/Dismiss remove the comment (#329 ) Agent suggestion-edits (comments with suggestedText, #315) piled up: Apply auto-resolved the thread, cluttering the resolved tab, and the anchors stayed in the document. Make them ephemeral: resolving (Apply OR the new Dismiss) makes the comment DISAPPEAR — hard-delete + remove the Yjs `comment` mark — UNLESS the thread has replies, in which case resolve it (preserve the discussion). Manual Resolve is unchanged. Scope: only comments with `suggestedText`. Server: - New collab event `deleteCommentMark` (collaboration.handler) mirroring resolveCommentMark, wiring the existing removeYjsMarkByAttribute to strip the anchor from the doc. - `finalizeAppliedSuggestion` forks on `hasChildren`: replies → apply + resolve (outcome 'resolved'); none → apply + hard-delete + mark removal (outcome 'deleted'). - New `dismissSuggestion` (validates top-level + suggestedText + not applied/not resolved) with the same fork; permission `canComment` (NOT canEdit — dismiss doesn't change page text); audit COMMENT_SUGGESTION_DISMISSED. New POST /comments/dismiss-suggestion; apply stays canEdit. - Both return `{ outcome: 'deleted' \| 'resolved' }` so the client picks the optimistic action. Data-integrity (review F1): the shared `deleteEphemeralSuggestion` removes the anchor mark FIRST and FATALLY, then deletes the DB row only on success. The row delete is irreversible, so a mark-removal failure — including the COLLAB_DISABLE_REDIS "no live instance" hard-error — must abort the whole operation (→ 5xx, repeatable) rather than swallow the error and leave a permanent orphan anchor pointing at a deleted comment. `deleteCommentMark` is no longer best-effort (unlike resolve, where the row is kept and a failed mark is recoverable). Client: - `canShowDismiss` (canComment) alongside `canShowApply` (canEdit); a "Dismiss" button next to Apply in the suggestion block. - `useApplySuggestionMutation`/`useDismissSuggestionMutation` reconcile the cache on `outcome` ('deleted' → remove; 'resolved' → relocate to the resolved tab). - Idempotent races (review F2): BOTH apply and dismiss onError reduce 404/400 to success (comment already gone/resolved), dropping it from the cache instead of a red error — restores the #315 apply idempotency the ephemeral delete would otherwise break. - i18n Dismiss / "Не применять" (ru/en). Not done (flagged): deleteCommentMark on the normal /comments/delete path — left out (would change every non-suggestion delete + needs gateway injection; the interactive client already strips the mark via unsetComment). Out of scope per the issue. Tests: server — apply/dismiss delete-vs-resolve fork, all four dismiss state guards, the deleteCommentMark handler, controller authz (dismiss=canComment, apply=canEdit), AND a mark-removal-failure test proving the row is NOT deleted + the error propagates (F1). client — Dismiss show-conditions, outcome cache reconciliation, and 404 idempotent race for BOTH dismiss and apply (F2). Verified: server tsc clean; comment+collaboration jest 144 passed. client tsc clean; vitest 905 passed \| 1 expected-fail. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-07-04 19:17:19 +03:00
claude code agent 227	eacc1c4811	Merge branch 'develop' of https://gitea.vvzvlad.xyz/vvzvlad/gitmost into feat/293-B-prosemirror-markdown-pkg # Conflicts: # packages/mcp/build/client.js # packages/mcp/build/index.js # packages/mcp/build/tool-specs.js	2026-07-04 19:02:52 +03:00
claude code agent 227	40d42d61e6	feat(mcp): search_in_page tool — in-page substring/regex search for the agent (#330 ) Editorial roles (Corrector/Factchecker) brute-forced `get_node` block-by-block to find occurrences (unquoted «ё», straight quotes, «т.е.»), burning tokens. New `search_in_page(pageId, query, {regex?, caseSensitive?, limit?})` reads the page's ProseMirror JSON via the existing getPageRaw and searches it IN MEMORY — no server endpoint, no DB/schema change, no touch to the packages/mcp/src/lib schema mirror. New pure `searchInDoc(doc, query, opts)` (packages/mcp/src/lib/page-search.ts): recursive descent to each TEXT CONTAINER (paragraph/heading/table-cell paragraph), glues its inline text via `blockPlainText` (a match survives inline-mark boundaries — e.g. «т.е.» split across bold/italic), searches literal (indexOf) or regex, and returns `{ total, truncated, matches:[{ nodeId, blockIndex, type, before, match, after }] }`. `nodeId` is the container's attrs.id or the `#<topLevelIndex>` of the enclosing top-level block — the SAME ref format get_node/patch_node/comment-anchoring accept (verified identical to getNodeByRef), so the agent goes straight from a hit to a targeted comment; `before`/`after` are ~40-char windows for a unique selection. `total`/`truncated` always reported (never silent truncation). Lives in the SHARED_TOOL_SPECS registry → exposed in BOTH transports (external /mcp + in-app AI-chat), with a SERVER_INSTRUCTIONS line and a DocmostClientLike signature + contract-test entry. Corrector/Factchecker prompts get a one-line "use search_in_page first" hint (versions bumped, catalog hash lock refreshed). Guards: empty/whitespace query → clear error; invalid regex → clear error (not a generic 500); zero-length regex matches (`\b`, `a*`) skipped with lastIndex advanced (no loop/flood); MAX_PATTERN_LENGTH=1000, MAX_CONTAINER_TEXT=100k bound each exec; limit clamped [1,200] (default 50). Tests: new page-search.test.mjs (17) — literal+regex, case-sensitivity, mark-boundary glue, nodeId for paragraph/heading (attrs.id) and table-cell (#<index> fallback), context bounds, limit/total/truncated + clamp, invalid regex/empty/over-long errors, zero-length skip, empty-doc null-safety. mcp: tsc clean; node --test 467 passed (+17). apps/server: tsc --noEmit clean (DocmostClientLike + wiring). catalog check.mjs OK. Known limitations (from internal review, non-blocking): - Residual ReDoS: a crafted catastrophic-backtracking pattern (e.g. `(a+)+$`) against a large single container can hang the event loop — JS regex is not interruptible, so the length caps bound the base but not the backtracking. Realistic exposure is low (containers are small; the pattern is supplied by the authenticated model). Candidate for a follow-up hardening (safe-regex validation or a worker+timeout) if it matters. - Case-insensitive LITERAL search folds via toLowerCase; a char whose lowercase differs in length (e.g. Turkish İ) BEFORE a match could shift the context window — negligible for the RU/EN editorial scenario. - On a `#<index>` table-cell fallback, `type` is the inline container ("paragraph") while nodeId addresses the top-level block — addressing is correct; the field is documented as the container's type. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-07-04 15:51:34 +03:00
claude code agent 227	bcd194ee5d	feat(mcp): hide resolved-comment anchors + feed from the agent (#328 ) The AI agent (MCP + in-app chat) saw ALL comments incl. resolved via two channels, cluttering its context and breaking fragment search. Default now: the agent sees only ACTIVE discussions; resolved is opt-in. Active anchors and threads are always kept. Channel 1 — resolved comment anchors on agent reads (converter option): `convertProseMirrorToMarkdown(content, options?)` gains `options.dropResolvedCommentAnchors` (default false — zero change for every existing caller incl. git-sync). Both `case "comment"` emitters (top-level and the raw-HTML inlineToHtml path) emit BARE text (no `<span data-comment-id>`) when `resolved && the flag`; active anchors keep their wrapper. mcp `getPage` passes the flag; `export_page_markdown` does NOT (lossless export must preserve resolved anchors — that is why it is an opt-in option, not unconditional); `get_page_json` is untouched (lossless PM JSON). Built on the #293 package converter. Channel 2 — `list_comments` default active-only: `listComments(pageId, includeResolved=false)` now returns `{ items, resolvedThreadsHidden }` (was a bare array). By default a RESOLVED top-level thread is hidden wholesale — the root AND every reply anchored to it (a thread is gated only by its root's resolvedAt; a resolved reply under an ACTIVE root stays). `resolvedThreadsHidden` counts hidden threads so the agent knows to re-query. `includeResolved:true` returns everything. The `includeResolved` param is added to both tool registrations (MCP index.ts + in-app ai-chat-tools.service.ts); `DocmostClientLike` signature updated. Server `findPageComments` is NOT touched — the web UI's tabs depend on the full feed; filtering is only at the mcp-client level. All internal call sites (export_page_markdown / checkNewComments / transformPage) updated to `.items` with `includeResolved:true` to keep their full-feed behavior. The comment model is assumed FLAT (a reply's parentCommentId points at the thread root) — documented in the filter; a future reply-of-reply model would need a root-walk there. Tests: resolved-comment-anchors.test.ts (6 — anchor dropped with flag / kept without, for BOTH emitters; active always kept); list-comments-resolved.test.mjs (4 — resolved thread+reply hidden + counter; includeResolved:true returns all; an ACTIVE thread with a RESOLVED reply is NOT hidden). package vitest: 664 passed; tsc clean. mcp: node --test 458 passed; tsc clean. apps/server + git-sync: tsc clean (converter option default-off). NOTE: based on feat/293-B (#293/#326 STEP 5) — the converter lives in the package; this PR is stacked on #333 and its base retargets to develop once #333 merges. mcp/build is gitignored (not committed). Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-07-04 15:26:43 +03:00
agent_vscode	351615e5bc	prompt(mcp): fix inaccurate and misleading tool descriptions Audit of all 41 tool descriptions against the actual implementation found factually wrong or misleading texts: - list_comments claimed '(paginated)' — it takes only pageId and returns ALL comments in one call (internal pagination); now also states that RESOLVED threads are included and how to filter them. In-app twin synced. - search claimed the limit default is 'applied by the client' — the client deliberately omits it so the SERVER applies its default. - create_page's '(automatically moves it to the correct hierarchy)' said nothing useful — now documents parentPageId nesting semantics; move_page drops the stale 'essential for organizing pages created via create_page'. - share_page now warns the page becomes accessible to ANYONE with the URL. - get_page (both transports) now explains inline <span data-comment-id> tags are comment anchors (incl. resolved) — markup, not page text. - patch_node/delete_node/insert_node pointed only at the expensive page-JSON view for block ids — now route through the cheap page outline first. - docmost_transform marks 'Примечания переводчика' as the DEFAULT notesHeading, overridable for non-Russian pages. Checks: @docmost/mcp tests 450/450 (incl. the server-instructions guard); server ai-chat-tools spec 20/20; mcp build/ artifacts rebuilt. Co-Authored-By: Claude Fable 5 <noreply@anthropic.com>	2026-07-04 07:00:16 +03:00
claude_code	588596fb2f	prompt(agents): teach agent prompts to use comment suggestedText fixes (#315 ) - editorial roles (ru/en): proofreader and line editor attach suggestedText replacements to targeted fixes; fact-checker ALWAYS attaches the ready correction for [Incorrect] verdicts; structural editor and narrator get a light-touch rule for in-place rewordings; role versions bumped and the content-hash lock refreshed - MCP SERVER_INSTRUCTIONS: route 'propose a concrete text fix for one-click human approval' to create_comment with suggestedText (unique-selection reminder); build/ artifacts rebuilt - AI-chat SAFETY_FRAMEWORK: mention the comment-suggestion capability so the default assistant offers ready fixes instead of only describing changes Checks: catalog check.mjs OK; @docmost/mcp tests 448/448; server ai-chat.prompt spec 28/28. Co-Authored-By: Claude Fable 5 <noreply@anthropic.com>	2026-07-03 23:22:37 +03:00
vvzvlad	33d22ff164	Merge pull request 'feat(comment): предложения правок агента + кнопка «Применить» (server-side atomic apply, #315 )' (#318 ) from feat/315-comment-suggestions into develop Reviewed-on: #318	2026-07-03 21:29:28 +03:00
vvzvlad	b861266ff8	Merge pull request 'fix(ai-chat): резолв slugId→uuid в bound-chat — 500 (22P02) на открытии страницы (#312 )' (#313 ) from fix/312-bound-chat-slug into develop Reviewed-on: #313	2026-07-03 21:27:14 +03:00
claude code agent 227	48c1ec46f7	fix(comment): store the real anchored substring as expectedText + pin authz (#318 F1/F2) F1 [blocking]: a suggestion whose anchor matched via normalization could never be applied (spurious 409). The comment mark lands on the doc's ACTUAL text (Docmost auto-converts to typographic quotes/dashes/nbsp), but the stored selection — used as expectedText at apply — was the raw ASCII agent input (+substring(0,250)). So replaceYjsMarkedText's strict joined!==expectedText always failed and threw "text changed" though nobody edited. Fix: new pure getAnchoredText(doc, selection) reconstructs the exact raw doc substring the mark covers (slicing identical to spliceCommentMark); on the suggestion path client.createComment stores THAT as selection, so expectedText equals the marked text and apply returns applied:true. Live anchoring still uses the raw agent selection (normalization still finds the anchor). Truncation raised 250->2000 (+ DTO @MaxLength(2000)) so the anchored substring is never cut below the mark span. Ordinary comments unchanged. AI-chat shares client.createComment, so covered. Regression tests: getAnchoredText raw-vs-ASCII; create payload selection is the typographic substring; apply with typographic expectedText -> applied. F2 [blocking]: added comment.controller.spec.ts pinning that validateCanEdit runs before applySuggestion (Forbidden -> applySuggestion never called; happy path -> called; missing comment -> 404 without authorizing). MCP 448 pass; server comment+yjs 54 pass. MCP build/ rebuilt. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-07-03 20:29:42 +03:00
claude code agent 227	cd539558ed	feat(agent-tools): suggestedText on create_comment with strict anchor uniqueness (#315 phase 6) Agents can attach a suggested replacement when creating an inline comment, via both the MCP create_comment tool and the AI-chat createComment tool. Because applying a suggestion edits the EXACT anchored text, an ambiguous anchor would let Apply corrupt the wrong occurrence. So when suggestedText is set the selection must occur EXACTLY ONCE: - new countAnchorMatches(doc, selection) counts occurrences across all blocks (same normalization/traversal as canAnchorInDoc), counting occurrences (2 in one block => 2) — stricter than block-count, never under-counting distinct occurrences (false-unique is the dangerous direction). - client.createComment gains suggestedText: a pre-check (getPageJson + countAnchorMatches: 0 => not-found, >=2 => ambiguity error) before create, and an AUTHORITATIVE live check inside the anchoring mutation that recomputes on the live doc and, if != 1, aborts and rolls back the just-created comment (reusing the existing safeDeleteComment "anchor not found" path). Ordinary comments keep first-occurrence behavior unchanged. - suggestedText is rejected on a reply or without selection in all three layers (MCP handler, MCP client, AI-chat tool), mirroring the server DTO/service. - filterComment surfaces suggestedText/suggestionAppliedAt/suggestionAppliedById. - DocmostClientLike.createComment signature updated. MCP build/ rebuilt. Tests: countAnchorMatches (0/1/N, within/across/nested block, span nodes, quote normalization); createComment (ambiguous refused pre-create, reply and no-selection rejected, unique succeeds and forwards suggestedText, filterComment surfaces it); ai-chat schema accepts suggestedText. MCP 443 pass; ai-chat 601 pass. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-07-03 19:35:47 +03:00
claude code agent 227	ec542a924b	feat(comment): store suggestedText + POST /comments/apply-suggestion (#315 phase 4) Server side of agent comment suggestions. - CreateCommentDto gains optional suggestedText (<=2000). CommentService.create accepts it ONLY for a top-level inline comment with a non-empty selection, requires it be non-empty and differ from selection (else BadRequest), and stores it. - POST /comments/apply-suggestion (ApplySuggestionDto { commentId }): authorizes with validateCanEdit (applying edits page text) BEFORE any structural check or mutation, then CommentService.applySuggestion: - runs the phase-3 collab event applyCommentSuggestion on `page.<pageId>` to atomically check-and-replace the marked text, returning { applied, currentText }; - applied → stamp suggestion_applied_at/by, auto-resolve the thread, ws commentUpdated, audit COMMENT_SUGGESTION_APPLIED; - already-applied (DB) → idempotent success (no re-apply), self-healing the resolve if it was missed — satisfies the issue's double-click / two-user race requirement; - collab verdict applied:false && currentText===suggestedText → idempotent success (crash between doc mutation and DB write); - text changed → 409 ConflictException carrying currentText; - gateway undefined/throw → hard error, never a silent success. - audit-events: COMMENT_SUGGESTION_APPLIED. Tests: create validation (reply/no-selection/equal-to-selection rejected; valid stored) + applySuggestion verdict branches incl. both idempotent paths. jest src/core/comment: 33 passed. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-07-03 19:09:23 +03:00
claude code agent 227	0df6242128	fix(ai-chat): resolve page slugId to uuid in bound-chat, fixing 22P02 500 (#312 ) POST /api/ai-chat/bound-chat 500'd with Postgres 22P02 because the client sends a page slugId (10-char nanoid) in the request `pageId` field, which the server passed straight into the UUID `page_id` column. The chat-to-document binding silently broke (client fail-softs to a new chat) and every slug-URL page open logged a 500. Fix: resolve the incoming id to a real page UUID on the server. PageRepo.findById already accepts both a uuid and a slugId (isValidUUID→slugId fallback), so boundChat now resolves the page first, guards it against a foreign/unknown workspace (returns {chatId:null} before any chat lookup — no cross-workspace probe), and looks up the latest chat by the resolved page.id (real uuid). Client: renamed the local pageId→slugId for clarity (the value is a slugId); the wire body key stays `pageId` so the DTO is unchanged. DTO left @IsString() (a @IsUUID() would only turn the 500 into a 400 and still break binding). Test: bound-chat spec asserts a slugId resolves and findLatestByPage is called with the real uuid; a foreign-workspace page → {chatId:null} without a chat lookup (no leak); an unknown id → {chatId:null}, no throw. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-07-03 18:07:23 +03:00
vvzvlad	36b3539571	Merge pull request 'refactor(ai-chat): move patch_node/insert_node into the shared tool-spec registry (#294 )' (#305 ) from refactor/294-tool-spec-registry into develop Reviewed-on: #305	2026-07-03 18:02:40 +03:00
agent_coder	86c1307ed2	fix(#300 review): drop stray symlink, re-fetch enriched on comment update, cover history mapping (F1/F2/F3) F1: remove an accidentally-committed self-referential symlink packages/mcp/node_modules/node_modules -> an absolute build-machine path (leaked a dev home path, a pnpm artifact useless in the repo), and add a targeted ignore so it can't recommit. F2: the commentUpdated broadcast re-emitted the caller's pre-loaded comment mutated in place, so the {agent,launcher} stack survived only because the controller happened to load it with includeCreator:true — the fragile coupling that let the stack vanish on edit once already. update() now RE-FETCHES the enriched comment before broadcasting, symmetric with create()/resolveComment() (the row is already persisted), so all three broadcasts carry the stack regardless of any caller's pre-load. Adds a caller-contract test asserting all three broadcasts emit agent/launcher for an agent comment and neither for a non-agent one, spotlighting the update path (non-vacuous vs the old re-emit). F3: add a direct test of the page-history attachPageHistoryAgent mapping (its distinct lastUpdatedSource/lastUpdatedAiChatId/lastUpdatedBy column set): role / no-role / MCP / non-agent, and that the internal agentRole join column is stripped. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-07-03 06:38:25 +03:00
agent_coder	f720151c63	refactor(ai-chat): move patch_node/insert_node metadata into the shared tool-spec registry (#294 ) The same tool metadata (zod schema + model-facing description) was hand-duplicated between the standalone MCP server and the in-app AI-chat agent, so every tweak had to land in two places and copies drifted (a materialized parity bug). The shared transport-agnostic registry (packages/mcp/src/tool-specs.ts) already de-duplicates 14 tools; this migrates two more genuinely-identical ones — patch_node/patchNode and insert_node/insertNode. The canonical description is a strict SUPERSET of both originals (keeps MCP's "without resending the whole document" + table-structure/anchor guidance AND the in-app "reversible via page history" / "exactly one of anchorNodeId or anchorText" framing — no model-facing guidance dropped); the schema is identical (the in-app side just gains MCP's .min(1) on ids, a safe tightening). Each transport keeps its own execute/auth wrapper, and the in-app parseNodeArg node-arg normalization is unchanged. The three table tools are intentionally NOT merged (a real param-name divergence: table vs tableRef) — documented on both sides. Other per-transport divergences (search/share/create_comment/transform/list_pages) are left separate with a short comment explaining why (the issue asked to flag these as intentional). DocmostClientLike stays a hand-mirror (the ESM/CJS boundary blocks a compile-time type import; a runtime drift-guard already pins it). Also fixes a latent contract-spec bug: derive `required` from `instanceof z.ZodOptional` (matches the emitted JSON schema) instead of `isOptional()`, which wrongly reported z.any() fields as optional. Partially addresses #294. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-07-03 05:55:11 +03:00
agent_coder	438ef091f9	fix(#288 review): markdown-safe-escape the untrusted page title in chat export F1: pc.title (untrusted cross-user page title) was interpolated raw into the markdown export heading. Reusing escapeAttr alone (the prompt sink's XML-attribute sanitizer, strips < > ") is insufficient here because the sink is MARKDOWN: link /image syntax survives, so a title like ![x](http://evil) or [phish](http://evil) injects a remote image / clickable link into the downloaded .md disguised as a trusted system annotation. Add markdownHeadingSafe() = escapeAttr() + backslash- escape [ and ] (disables both [text](url) and ![text](url); a bare (url) is inert). F2: cover the title branch — a title that collapses to empty via escapeAttr falls to the bare heading (no ("")), and a link/image-injection title is neutralized (non-vacuous vs the escapeAttr-only version). Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-07-02 15:46:44 +03:00
claude_code	c39fab70c1	feat(ai-chat): persist page-change diff to history and harden stale-page note The #274 page_changed marker lived only in the ephemeral system prompt, so the diff the agent saw was invisible in the chat export/history, and the note was too weak — the agent still overwrote the user's manual edits with a full-page replace. - Persist the diff the agent saw as metadata.pageChanged on the assistant row (flushAssistant), threaded into all five flush call sites in stream(). Model replay (rowToUiMessage/rowParts) reads only metadata.parts, so the sibling never re-injects the note into the model context on later turns. - Render the persisted diff as a labelled block (en/ru) before the message body in the server-side Markdown export (chat-markdown.util.ts). - Strengthen PAGE_CHANGED_NOTE: mandate a fresh getPage re-read and targeted edits (editPageText/patchNode/insertNode/deleteNode) instead of a whole-page replace, and never revert or overwrite the user's edits. Tests: prompt, export and service specs updated; 114 pass, tsc clean. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-07-02 14:31:41 +03:00
agent_coder	2f3d5d3783	docs: fix escapeAttr comment count (three, not four) (#274 review) The regex strips three attribute-breaking chars (" < >); the JSDoc said four. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-07-02 06:19:26 +03:00
agent_coder	6e681a9c66	fix(#274 ): escape page_changed injection surface, drop dead content_hash (review F1-F5) F1: escape the collaborative page title before interpolating into <page_changed page="..."> (and the pre-existing openedPage attr) — strip <>" and collapse whitespace, so a crafted title can't break out of the attribute into the system prompt (cross-user injection). F2: neutralize <page_changed>/</page_changed> occurrences inside the diff body so a crafted line can't close the block early. F3: remove the dead content_hash column (written every turn, never read) — migration, repo, service hashing + crypto import, db.d.ts, spec asserts. F4: test the best-effort catch branches (detectPageChange / snapshotOpenPage swallow errors and don't break the turn). F5: soften the overstated 'diff cannot smuggle instructions' comment to defense-in-depth framing referencing the F1/F2 mitigations + safety sandwich. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-07-02 05:43:46 +03:00
agent_coder	8c5b57ebfa	feat(ai-chat): notify the agent of user page edits between turns (closes #274 ) The agent rebuilds context from DB each turn and didn't know the user manually edited the open page since its last response, so it could overwrite those edits. Add a per-turn ephemeral <page_changed> note in the system prompt (twin of INTERRUPT_NOTE, self-clearing) carrying a unified Markdown diff of what changed since the END of the agent's previous turn. - New ai_chat_page_snapshots table (migration + hand-declared db.d.ts/entity types) storing the page Markdown per (chat,page) at each turn's end. - Pure computePageChange util (whitespace-normalized unified diff via the existing jsdiff dep, 6KB cap + getPage hint). - Turn start: if the open page's updatedAt moved past the snapshot, diff current vs snapshot; non-empty -> PAGE_CHANGED_NOTE in the safety sandwich. - Turn end: upsert the snapshot on EVERY terminal path (onFinish/onError/onAbort, once) so the agent's own edits are excluded by construction even on aborted turns. All best-effort (never breaks/latency-regresses a turn); fast path when updatedAt is unchanged. Server-only. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-07-02 01:54:00 +03:00
vvzvlad	22ea387495	Merge pull request 'feat(#246 ): inline spoiler mark (blur + click-reveal, lossless Markdown)' (#259 ) from feat/246-spoiler into develop Reviewed-on: #259	2026-06-30 01:47:46 +03:00
vvzvlad	7e6dd457a4	Merge pull request 'refactor(#193 ): tool-host drift-guard + staged plan (shared spec registry already merged)' (#249 ) from refactor/193-tool-spec-registry into develop Reviewed-on: #249	2026-06-30 01:47:13 +03:00
vvzvlad	a8a7fad850	Merge pull request 'test(#244 ): Part B backlog — editor-ext/mcp/client/server unit+contract tests + findBreadcrumbPath mutation fix' (#257 ) from test/244-part-b into develop Reviewed-on: #257	2026-06-30 01:36:00 +03:00
vvzvlad	d38a39e3e5	Merge pull request 'fix(ai): show live reindex progress so the embeddings counter resets to 0 and climbs' (#242 ) from fix/embeddings-reindex-progress into develop Reviewed-on: #242	2026-06-29 23:44:13 +03:00
claude code agent 227	188c5f506c	feat(editor): inline spoiler mark (blur + click-reveal, lossless Markdown) (#246 ) Add an inline spoiler (Telegram/Discord-style hidden text): a TipTap mark `spoiler` rendered as <span data-spoiler="true" class="spoiler">, blurred via CSS and revealed on click (UI-only is-revealed class, never persisted). - packages/editor-ext: the Spoiler mark (inclusive:false, set/toggle/unset commands, \|\|text\|\| input rule), exported; a lossless turndown rule emitting raw inline HTML; round-trip test. - apps/client: SpoilerView mark-view (ReactMarkViewRenderer, Link pattern), registration in extensions, bubble-menu toggle button (editable only), CSS (blur + @media print reveal), en/ru i18n. - apps/server: register Spoiler in collaboration.util tiptapExtensions so the mark survives HTML<->JSON export/index/import/Yjs; a test proving the public share keeps the spoiler (it isn't stripped with comments). No keyboard shortcut: the proposed Mod-Shift-s collides with Strike (and Mod-Shift-h with Highlight); the \|\|text\|\| input rule + the bubble-menu button cover ergonomics. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-29 23:22:30 +03:00
claude code agent 227	d0eae69086	fix(ai): raise reindex pre-seed TTL to the client poll cap; cover predicate clause; align docs (F11-F13) F11: PRE_SEED_TTL_SECONDS 45->120 (= client REINDEX_POLL_CAP_MS). At concurrency 1 a queued reindex can wait past the old 45s; if the pre-seed expired while pending, getMasked fell back to the COUNT and reported done, so the client stopped polling and missed the climb. Tie the pre-seed TTL to the client cap. F12: extend the lockstep integration spec — insertPage takes content; a text_content=null + text-node-content page is IN and a math-only page is OUT, pinning the structural "type":"text" clause (and the jsonb space-after-colon). F13: list all three embeddable clauses in the reindex JSDoc/inline comments. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-29 16:12:36 +03:00
claude code agent 227	f9b58a0e3d	test(server): SSRF guardedFetch, decryptHeaders fail-open, yjs.util, tool-spec parity, storage delegation guardedFetch blocks loopback/private/link-local/metadata IPs and never calls fetch; decryptHeaders fails open (returns undefined, warns once, no blob leak). yjs.util setYjsMark/removeYjsMarkByAttribute/updateYjsMarkAttribute on real Y.Docs. SHARED_TOOL_SPECS<->in-app parity (name/desc/input-schema; a dropped or renamed wiring fails). Replace the tautological storage.service spec with driver-delegation checks across every public method. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-29 04:49:56 +03:00
vvzvlad	4a72ee1681	Merge pull request 'refactor(agent-roles-catalog): YAML catalog with block-scalar instructions (#229 )' (#231 ) from feat/229-catalog-yaml into develop Reviewed-on: #231	2026-06-29 01:20:40 +03:00
claude code agent 227	82af0c5291	test(catalog): tighten + isolate real shipped catalog-file checks Apply review suggestions to the real-files block in ai-agent-roles-catalog.provider.spec.ts (test-only): 1. Fix inaccurate comment: there are 5 content YAML files (index + four per-bundle/lang files), not 6. 2. Improve isolation: read/parse the real index lazily inside tests (via loadRealIndex) instead of in the describe body, so a broken real file fails only these catalog tests, not collection of the whole spec (incl. the unrelated mocked-remote provider tests). 3. Add the symmetric slug check: each language file's slug set must equal the declared slug set (no undeclared/extra roles), matching scripts/check.mjs's exact two-way correspondence. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-28 23:59:41 +03:00
claude_code	62eb7d082f	test(ai-chat): stub sandboxStore.asSink in AiChatToolsService spec The blob-sandbox feature (#243/#250) made AiChatToolsService.forUser() eagerly call this.sandboxStore.asSink() while wiring the stash tool, but the spec still passed an empty {} as the sandboxStore constructor arg. That object has no asSink method, so all 19 tests in the suite failed in CI with 'TypeError: this.sandboxStore.asSink is not a function'. Replace the stale {} mock at all 4 constructor sites with a no-op sink exposing asSink() -> { put, has, evict } (jest.fn()). These tests never execute the stash tool, so a no-op sink is sufficient for forUser() to wire successfully. Test-only change; production code is unchanged. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-28 23:45:06 +03:00
claude code agent 227	997e4395c6	test(agent-roles-catalog): pin the real shipped YAML files (#231 F1) Provider tests only exercised synthetic stringifyYaml fixtures, so a hand-conversion error in one of the 6 real catalog files (index.yaml, bundles/{editorial,research}/{en,ru}.yaml) — a stray quote/colon in a description, a broken emoji/arrow, a block-scalar indent slip that silently changes or drops instructions — was caught by no automated test. scripts/check.mjs is the only other guard and is wired into no CI/turbo/husky step. Add a real-files test block that reads each shipped file off disk, parses it with the SAME options the provider uses (strict: true, maxAliasCount: 100), and validates it through the provider's own exported type guards (isCatalogIndex / isCatalogBundleFile / isCatalogRole). It is driven from the real index so new bundles/langs are auto-covered, asserts the editorial bundle still ships fact-checker, and requires every declared role to be present with non-empty instructions/name in each language file. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-28 23:44:49 +03:00
claude code agent 227	85b38d6946	fix(ai): address reindex-progress review round 1 (PR #242 ) F1: clear the "Reindex now" spinner once the poll cap fires. Gate the reindexing part of the button's loading state on the active poll window (reindexDeadline !== null) so a run that outlives the 120s cap no longer leaves the button stuck-disabled with a stale `reindexing: true`; the admin can restart. F2: rewrite reindexWorkspace JSDoc to describe the EMBEDDABLE page set (text OR existing embeddings), matching getEmbeddablePageIds / countEmbeddablePages instead of the old "every non-deleted page". F3: extract the shared embeddable-content predicate into a private PageRepo.embeddablePredicate helper, called by both countEmbeddablePages and getEmbeddablePageIds, removing the verbatim duplication. Behavior is identical (lockstep int-spec stays green). Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-28 23:39:20 +03:00
claude_code	8842bc8bf3	fix(sandbox): address PR #250 follow-up review — XSS hardening, eviction reconcile, doc sync (#243 ) Security (must-fix): - sandbox.controller: the anonymous GET /api/sb/:id response now sets X-Content-Type-Options: nosniff, a restrictive CSP, and Content-Disposition= attachment for any mime outside a raster-image allowlist (png/jpeg/gif/webp/ avif). entry.mime is attacker-controlled, so an evil.svg/evil.html could otherwise execute script inline on the Docmost origin (stored XSS). Mirrors the public attachment route's hardening. Stability: - client.stashPage: reconcile mirrors AFTER the final document put, not only before it. The doc blob is the newest entry and FIFO eviction drops the oldest = this stash's own images, so the stored doc could reference an evicted blob (consumer 404) and over-report images.mirrored. A bounded loop now reverts doc-put-evicted mirrors, drops the stale doc blob, and re-puts until stable. Regenerated packages/mcp/build/. - sandbox.controller: emit Cache-Control on the 304 branch too (ttlSeconds is computed before the conditional check). Docs: - Bump the MCP tool count 39 -> 40 across all READMEs and AGENTS.md (the registry now exposes exactly 40 tools). Refactor: - SandboxStore.asSink() centralizes the {put,has,evict} sink + uri<->id mapping; the embedded-MCP and in-app agent-tools wiring sites share it. Tests: - security headers (inline vs attachment, nosniff, CSP), 304 Cache-Control, putAndLink URL form, has()/remove(), asSink() round-trip, getSandboxPublicUrl (trailing-slash trim + APP_URL fallback), and a stash test where the doc put itself evicts a mirrored image. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-28 19:08:06 +03:00
claude_code	6eb335d5e3	fix(sandbox): address PR #250 review — SSRF guard, eviction safety, cleanup (#243 ) Security: - stash_page: reject path-traversal / percent-encoded srcs before the authed loopback fetch (resolveInternalFilePath), closing an SSRF/exfiltration hole where a crafted node.attrs.src could read an arbitrary internal GET endpoint into the anonymous sandbox. Stability: - stash_page: revert + recount mirrors FIFO-evicted by a later put in the same stash (no dangling sandbox refs, honest images.mirrored/failed); free image blobs if the final document put throws. - Reject/clamp non-positive SANDBOX_TTL_MS to the 1h default (warn once). - Log mirror failures unconditionally (console.warn, no blob bodies). Cleanup / architecture: - Remove dead expiresAt from SandboxPutResult. - Centralize the /api/sb route in SANDBOX_ROUTE_SEGMENT/SANDBOX_API_PATH and move URL composition into SandboxStore.putAndLink; drop the duplicated sink closures and the now-unused EnvironmentService injection from McpService and AiChatToolsService. - Un-export isInternalFileUrl; document the process-local (instance-bound) sandbox limitation in the tool description and .env.example. Docs/tests: - README/README.ru: 38 -> 39 tools + stash_page entry. - Add traversal/normalize/recursion unit tests, stash self-eviction + doc-put-throw + empty/octet-stream mock tests, controller If-None-Match (wildcard/weak/list) + Cache-Control tests, and SANDBOX_TTL_MS validation tests. Regenerate packages/mcp/build. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-28 18:02:46 +03:00
claude code agent 227	2fe4ca8537	feat(sandbox): in-RAM blob sandbox for out-of-band page transfer (#243 ) Add an ephemeral, process-local blob store so the in-app agent (and the embedded MCP) can hand a large page document and its images to an external consumer WITHOUT routing the bytes through the model context or Docmost auth. - SandboxStore (@Injectable singleton): Map<uuid,{buf,mime,sha256,expiresAt}> in RAM only. put() picks a per-blob cap by mime (image vs doc), enforces a total-bytes RAM guard with oldest-first eviction, and stamps a TTL; get() lazily expires. sha256 computed at put() doubles as the strong ETag. An unref'd sweep interval clears expired entries and is cleared on destroy. - GET /api/sb/:uuid anonymous controller: serves raw bytes with Content-Type, Content-Length and ETag=sha256; 404 on missing/expired/non-UUID (anti- traversal), 304 on a matching If-None-Match. No tokens, no 401 — the capability is the unguessable UUID + short TTL + TLS. Auth-exempt the same way as /api/files/public (no JwtAuthGuard) plus an /api/sb entry in main.ts's workspace-resolution preHandler so a remote consumer with no workspace host is not rejected. - stash_page tool in both layers (MCP resource_link + in-app {uri,size,sha256, images}). client.stashPage serializes the get_page_json shape, mirrors every INTERNAL file/image src (type-agnostic, covers drawio/excalidraw/video/file) into the sandbox under Docmost auth and rewrites src to the sandbox URL; external http(s) srcs are left untouched; dedup by src; a failed image fetch is counted, never aborts the doc. - SANDBOX_PUBLIC_URL / SANDBOX_TTL_MS / SANDBOX_MAX_BYTES / SANDBOX_MAX_IMAGE_BYTES / SANDBOX_MAX_TOTAL_BYTES wired through the environment service + validation + .env.example. - SandboxModule (@Global) provides the shared store to the controller, McpService and AiChatToolsService (same instance for put and get). Tests: SandboxStore (round-trip, sha256, TTL lazy + sweep, caps, eviction), SandboxController (200+ETag+CT+CL, 404 missing/expired/non-UUID, 304), and a mock-HTTP stashPage test (mirror+rewrite internal, keep external, dedup, failed image counted, returns only a link). Interoperates with the vvzvlad/habr-mcp consumer's anonymous-GET + sha256-ETag + resource_link contract. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-28 15:13:11 +03:00
claude code agent 227	d0ca127d83	refactor(ai-chat): drift-guard the DocmostClientLike hand-mirror (#193 ) Issue #193's tool-half has two open items. The shared, zod-agnostic tool-spec registry (SHARED_TOOL_SPECS) for the identical tools is already merged (`f3fa15e7`) and consumed by both layers, so that subset is done. The remaining items are: (a) deriving the layer-3 hand-mirror `DocmostClientLike` from the real client type, and (b) folding more tools into the registry. Both were deferred as risky, and that deferral still holds (verified, see below) — so this change ships the safest concrete increment instead of forcing the risk. What this adds (behaviour-neutral, test-only + a doc comment): - packages/mcp/test/unit/client-host-contract.test.mjs: pins the layer-3 contract from the ESM side, where the real DocmostClient is importable. It asserts every method the in-app `DocmostClientLike` mirror declares exists as a function on a real DocmostClient instance (constructor is side-effect-free). A rename/removal in client.ts now fails this test instead of silently shipping a runtime "x is not a function" into an agent tool call. Negative-case verified (a bogus method name is detected). - docmost-client.loader.ts: replaces the vague mirror comment with a pointer to the guard test and a concrete, empirically-grounded staged plan for the full type-derivation. Verified blockers kept it deferred: @docmost/mcp emits no .d.ts (no `declaration`, no `types` export) and the server has no path mapping for it, so there is no type to import today; and the real methods' inferred CONCRETE return types conflict with the in-app adapter's loose Record<string,unknown> + `as`-cast result handling (deriving the exact type breaks the build / forces pervasive double-casts and full-surface test stubs). Out of scope (noted in the issue): the PM<->Markdown converter unification. Verified: server tsc clean; mcp tsc clean; mcp tests 369 pass (367 + 2 new); ai-chat tools specs 51 pass. No behaviour change; committed mcp build untouched (no mcp src changed). Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-28 15:07:43 +03:00
claude code agent 227	38a863e5f7	refactor(agent-roles-catalog): store catalog as YAML with block-scalar instructions (#229 ) The agent-roles catalog content files move from JSON to YAML so each role's long `instructions` system prompt is stored as a literal block scalar (`\|-`): editing one sentence now produces a line-by-line diff and the prompt is editable as plain multi-line text instead of a single escaped JSON string. Data: - `index.json` -> `index.yaml`, `bundles/<id>/<lang>.json` -> `<lang>.yaml` (old `.json` deleted). Converted programmatically via the `yaml` library with `lineWidth: 0`; round-trip verified deepEqual against the old JSON, so the resolved role content is byte-for-byte identical (the only `version` bump is fact-checker v2->3, carried over from develop during the rebase; see below). Server (`AiAgentRolesCatalogProvider`): - parse with `yaml`'s safe default (JSON-compatible) schema instead of `JSON.parse` — `strict: true` (rejects duplicate keys) and `maxAliasCount: 100` (billion-laughs guard); no custom `!!` tags / no code execution. Fetched paths become `index.yaml` / `<lang>.yaml`. The streaming 1 MB size cap, `redirect: 'error'`, 10s timeout and `^[a-z0-9-]+$` path-traversal/SSRF guard are unchanged; the hand-written type guards are untouched (`instructions` is still a string after parsing). - add `yaml` as a direct server dependency (already in the lockfile as a transitive dep). Catalog tooling: - `scripts/check.mjs` parses the catalog as YAML (lockfile stays JSON); pin `yaml` as a devDependency of the catalog package. Tests: - provider spec fixtures serialized with `yaml`; new tests for the block-scalar `instructions` round-trip (exact multi-line string), malformed YAML and strict duplicate-key rejection -> BadGateway; size-cap and path-traversal cases retargeted to the `.yaml` paths. Docs: README, `.env.example`, `catalog-types.ts` comments and CHANGELOG updated to the YAML layout. `AI_AGENT_ROLES_CATALOG_URL` base-URL contract unchanged. Rebase onto develop + review (PR #231, comment 2509): - semantic conflict: develop's `89edddc5` bumped fact-checker v2->3 (flags errors instead of confirming facts) in the now-deleted `.json`. Resolved the modify/delete by taking the deletion and porting develop's v3 `description` + `instructions` (en + ru) into the YAML and setting `version: 3` in index.yaml. Verified by `node scripts/check.mjs` going green against develop's unchanged content-hash lock (the ported YAML hashes byte-identically to the v3 JSON). - doc fix: ai-agent-roles.service.ts catalog comment "untrusted JSON" -> YAML. - doc fix: parseYaml docstring no longer claims `strict: true` rejects unknown custom tags (yaml@2.8.x warns + resolves to a plain scalar, then the type guard rejects it); the duplicate-key claim is kept. - doc: note in check.mjs that `yaml` resolves from the repo-ROOT node_modules (via shamefully-hoist), not the catalog package's own pinned devDependency. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-28 04:38:50 +03:00
a	95d07d8d6f	fix(ai): align reindex live denominator with the steady-state count Review fixes for the reindex-progress counter (#242): 1. Denominator jump (478 -> 500 -> 478): reindexWorkspace iterated getIdsByWorkspace() (ALL non-deleted pages) but the seed/status use countEmbeddablePages (text OR existing-embedding), so the live total exceeded the steady-state total whenever empty/text-less pages existed. Add PageRepo.getEmbeddablePageIds() that selects the IDs of the EXACT same set countEmbeddablePages counts (deletedAt IS NULL AND (text_content matches a non-whitespace char OR an EXISTS non-deleted pageEmbeddings row)), and have reindexWorkspace iterate THAT set with total = its length. Iteration set and count source change together, so done reaches exactly total == the steady-state denominator. Dropping text-less pages is correct (reindexPage no-ops on them; a page that lost its text but still has stale embeddings is in the set via the EXISTS clause and still gets its stale rows cleared). Removed the contradictory "worker overwrites with the real page count" / "denominator matches" comment. 2. Mid-run re-trigger reset: reindex() unconditionally re-seeded done=0 before an enqueue that de-dupes a running job, so a second click/admin/tab reset the visible counter while the worker kept incrementing. Now seed only when get(workspaceId) === null; the worker's own start() remains the single authoritative reset. 3. TTL: documented that it is intentionally tied to write progress (start/increment) and never refreshed on get(), so a dead worker's record can't be kept alive forever by client polling. Tests: new embedding-reindex-progress.service.spec.ts (fake ioredis: hash -> ReindexProgress, malformed/missing/non-numeric -> null, non-finite startedAt -> 0, hgetall throws -> null, start/increment issue hset/hincrby+expire and swallow Redis errors); reindex() seed order + no-reseed-when-active guard; getMasked live test now uses progress.total=500 vs DB 478 to pin the progress branch; indexer specs updated to mock getEmbeddablePageIds. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-28 04:32:36 +03:00
a	72bb03918d	fix(ai): show live reindex progress in semantic-search settings The "Indexed X of Y pages" counter stayed stuck at "478 of 478" during a manual "Reindex now" run instead of resetting to 0 and climbing. The status reports indexedPages = countIndexedPages (DISTINCT pages with >=1 embedding row), but reindex hard-replaces each page in its OWN small transaction, so nearly all pages always have rows -> the count never drops. Add a per-workspace live reindex-progress record in Redis (reusing the existing global ioredis client via RedisService, no new Redis config): - EmbeddingReindexProgressService: start/increment/clear/get over a Redis hash with a 1h TTL self-clean; all best-effort/cosmetic so a Redis failure degrades to the existing DB-count behavior. - AiSettingsService.reindex seeds {total, done:0, startedAt} at enqueue time so the very first poll already reports done=0. - EmbeddingIndexerService.reindexWorkspace overwrites total with the real page count at start, increments done per processed page (success or handled failure), and clears the record in a finally (covers success, fatal abort, and the unconfigured early-return) so a failed run never sticks. - AiSettingsService.getMasked returns the live run numbers when a progress record is active (plus an optional reindexing flag), else falls back to countIndexedPages/countEmbeddablePages. Per-page edits (reindexPage) never touch the workspace progress record, and no mass up-front delete is introduced (search availability preserved). Tests: indexer sets/increments/clears progress (incl. fatal abort and unconfigured early-return); status reports run progress when active and falls back when not. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-28 04:32:36 +03:00

1 2 3 4 5 ...

453 Commits