open-design

mirror of https://github.com/nexu-io/open-design.git synced 2026-06-01 03:14:35 +07:00

Author	SHA1	Message	Date
shangxinyu1	95bbdbb734	[codex] test(e2e): harden settings and entry regressions (#2578 ) * test(e2e): harden settings and entry regressions * test(e2e): align entry chrome coverage with current UI Generated-By: looper 0.6.0 (runner=fixer, agent=codex) * fix(web): refresh saved media providers from daemon Generated-By: looper 0.6.0 (runner=fixer, agent=codex) * test(web): align media provider reload expectations Generated-By: looper 0.6.0 (runner=fixer, agent=codex) * fix(web): keep daemon media-provider reloads authoritative Generated-By: looper 0.6.0 (runner=fixer, agent=codex) * fix(web): make media-provider reload precedence depend on dialog edits Generated-By: looper 0.6.0 (runner=fixer, agent=codex) * fix(web): preserve pending media-provider edits across stale autosaves Generated-By: looper 0.6.0 (runner=fixer, agent=codex)	2026-05-22 10:04:12 +08:00
Eli-tangerine	edb736edbf	Preserve home example prompt selections (#2611 ) Co-authored-by: qiongyu1999 <2694684348@qq.com>	2026-05-22 01:52:08 +08:00
PerishFire	ec96585a8f	fix: harden Windows packaged updater flow (#2595 )	2026-05-21 22:32:02 +08:00
Ghxst	7631539808	Add project bulk delete feedback (#2481 ) * Add project bulk delete feedback * Restart bulk delete toast timer --------- Co-authored-by: Ghxst <200635707+GHX5T-SOL@users.noreply.github.com>	2026-05-21 20:57:19 +08:00
Ghxst	8269b7df90	Keep live artifact preview mounted across tabs (#2482 ) Co-authored-by: Ghxst <200635707+GHX5T-SOL@users.noreply.github.com>	2026-05-21 20:38:58 +08:00
lefarcen	6690dbd5bb	feat(analytics): PostHog + Langfuse instrumentation for assistant feedback (#1558 ) * feat(analytics): PostHog + Langfuse instrumentation for assistant feedback Re-bases the original three-commit PR onto release/v0.8.0. The web-side feedback UI instrumentation (surface_view / ui_click / feedback_submit_result) landed on main while this branch was open, so on this rebase that wiring is taken from main; the remaining net additions are: - Contracts: TrackingFeedback* enums and the four dedicated assistant_feedback_* event payload types (click, reason_view, reason_click, reason_submit), plus normalizeCustomReason helper. The new event-name variants are added to TrackingEventName and the AnalyticsEventPayload discriminated union next to the existing surface_view/ui_click variants — both wire formats coexist. - POST /api/runs/:id/feedback in apps/daemon/src/chat-routes.ts: thin route that validates rating, allowlists reasonCodes through a simple string filter, and fire-and-forgets into the daemon's reportFeedback hook. - apps/daemon/src/langfuse-bridge.ts reportRunFeedbackFromDaemon forwards the rating + reasonCodes into Langfuse as user_rating (NUMERIC ±1) + user_rating_reason (CATEGORICAL, one per code) score-create entries. Gates on telemetry.metrics + telemetry.content. - apps/web/src/providers/daemon.ts reportChatRunFeedback (fire-and-forget fetch) and apps/web/src/components/ProjectView.tsx wiring so each thumbs-up/down + reason submission posts the side-channel. Conflicts resolved (release/v0.8.0 vs the branch's old base): - packages/contracts/src/analytics/events.ts: keep main's file_upload_result / feedback_submit_result / settings_* event variants alongside the new assistant_feedback_* additions. - apps/daemon/src/server.ts: keep DNS-aware validateExternalApiBaseUrl, add reportFeedback closure wired into registerChatRoutes telemetry. - apps/daemon/src/chat-routes.ts: keep both /tool-result and the new /feedback routes; merge RegisterChatRoutesDeps to include both 'paths' and 'telemetry'. Drop PR's chat-routes-local reconcileAssistantMessageOnRunEnd helper (main has the equivalent in server.ts). - apps/web/src/components/ChatPane.tsx & AssistantMessage.tsx & ProjectView.tsx: keep main's projectKindForTracking prop name and its existing emission of surface_view / ui_click / feedback_submit_result; the PR's analyticsCtx-based reason_view/click/submit emission is dropped in this rebase since it would duplicate the existing wire format. - apps/web/tests/components/: rename projectKind → projectKindForTracking to match ChatPane's current prop name. Outstanding review feedback (from the pre-rebase round, will be addressed in a follow-up commit): - AssistantMessage tests not yet passing the new feedback context to the direct render path. - ProjectView clear-feedback path skips reportChatRunFeedback, leaving stale Langfuse user_rating scores. - buildFeedbackPayload has no deletion path for previously-submitted user_rating_reason scores when the user switches thumbs. - POST /api/runs/:id/feedback always returns {status:'accepted'} even when consent is off; needs to surface skipped_consent / skipped_no_sink. - reasonCodes are filtered to string[] but not allowlisted against ChatMessageFeedbackReasonCode or deduped. fix(analytics): address review on assistant feedback rebase Picks up the in-scope correctness items from the prior review round and the rebase residue without rewriting history: - chat-routes.ts: `/feedback` now awaits the daemon's preflight outcome and echoes it as the response. The contract was already shaped as `accepted \| skipped_consent \| skipped_no_sink`, but the previous handler always returned `accepted` because the network send was fire-and-forget. The consent + sink decision is local (a small file read and an env-var lookup); the actual Langfuse upload still runs as a detached promise. - chat-routes.ts: reasonCodes are now allowlisted against the contract's reason-code union and deduplicated before reaching Langfuse, so a stale or replayed client can't poison the Langfuse score table with unknown categorical values or duplicate stable ids in the same batch. - langfuse-bridge.ts: split the consent + sink resolution from the fire-and-forget network send so the route can claim `accepted` honestly. The legacy `skipped_no_sink` return on app-config read failure is preserved. Contracts + comment hygiene: - TrackingFeedbackReasonCode in packages/contracts/src/analytics/events.ts drifted from ChatMessageFeedbackReasonCode in packages/contracts/src/api/chat.ts; add `followed_design_system` and `missed_design_system` so the analytics wire format stays aligned with the persistence shape. - langfuse-trace.ts buildFeedbackPayload: the docblock claimed the raw custom-reason text is bucketed before send. Product reversed that on 2026-05-13 (raw text now ships, consent-gated). Replace the stale comment with the real semantics + a note that there is no tombstone path for reason codes the user removes in a follow-up submission (left as scope for a later PR). - AssistantMessage.tsx: remove the now-unused `AssistantFeedbackAnalyticsCtx` interface and a stray blank-line delete from the rebase; restore the analytics-context comment above the feedback hook. Left as follow-up (intentional, documented in code): - Sending a tombstone score when the user clears their rating — ProjectView still skips reportChatRunFeedback on `change===null`, so Langfuse retains the previous rating until the user re-submits. The PostHog event captures the clear separately. - Removing reason-code scores when the user re-submits with a smaller set — buildFeedbackPayload only overwrites the codes present in the current payload. * feat(analytics): wire PR's dedicated assistant_feedback_* events The four dedicated event types (`assistant_feedback_click` / `_reason_view` / `_reason_click` / `_reason_submit`) the PR added to contracts were sitting unused after the rebase because main's umbrella `surface_view` / `ui_click` / `feedback_submit_result` emissions covered the same user gestures. Wire the dedicated events alongside the umbrella ones so both wire formats fire on every feedback action — dashboards / evals can pick whichever schema they were built against without losing signal. Each dedicated event has stricter typing than its umbrella sibling (`project_id` / `project_kind` / `conversation_id` are non-null), so the new emissions are guarded behind a presence check and skipped on test renders that mount AssistantMessage without project context. The umbrella emissions retain their nullable fallbacks unchanged. Pairing: - surface_view (feedback reason panel) ↔ assistant_feedback_reason_view - ui_click (feedback button) ↔ assistant_feedback_click - ui_click (reason submit button) ↔ assistant_feedback_reason_click - feedback_submit_result ↔ assistant_feedback_reason_submit Reason click + submit share the existing `requestId` so PostHog can stitch click→result across both schemas, matching the spec.	2026-05-21 19:28:51 +08:00
shangxinyu1	10e2019c59	Fix plugin publish and Open Design PR workflow UX (#2564 ) * Fix plugin publish and PR workflow UX * Update plugin workflow test expectations * Fix fake gh repo view verification path * Fix plugin publish headless tests and preserve PATH in shell wrappers. The publish-repo flow needs real git commits and fake gh auth output that matches gh auth status parsing. Login shells no longer drop PATH so test fakes and agent wrappers stay visible to nested gh/git calls. Co-authored-by: Cursor <cursoragent@cursor.com> * Restore plugin action card when share-task startup fails. If startGeneratedPluginShareTask rejects before a task is created, clear hiddenAssistantPluginActionPaths so the assistant action card reappears. Co-authored-by: Cursor <cursoragent@cursor.com> * Make daemon vitest self-contained for publish-github CLI shell-outs. Build dist/cli.js in tests/setup.ts when missing and set OD_DAEMON_CLI_PATH before server.ts resolves OD_BIN, so headless plugin tests pass from a clean checkout without a prior manual daemon build. Co-authored-by: Cursor <cursoragent@cursor.com> --------- Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-21 19:21:17 +08:00
Eli-tangerine	255bbc1d06	Show published user design systems on home (#2572 ) Co-authored-by: qiongyu1999 <2694684348@qq.com>	2026-05-21 19:21:05 +08:00
Ghxst	93036afa2e	Fix sketch default color in dark mode (#2488 ) Co-authored-by: Ghxst <200635707+GHX5T-SOL@users.noreply.github.com>	2026-05-21 19:18:32 +08:00
Siri-Ray	3a33a7b475	fix(web): localize quick brief prompt (#2520 ) * fix(web): localize quick brief prompt Generated-By: looper 0.8.1 (runner=worker, agent=codex) * fix(web): pass locale from design system chat Generated-By: looper 0.8.1 (runner=fixer, agent=codex) * fix(web): preserve task-type routing options Generated-By: looper 0.8.1 (runner=fixer, agent=codex) * fix(web): preserve task-type routing options Generated-By: looper 0.8.1 (runner=fixer, agent=codex)	2026-05-21 19:18:13 +08:00
Marc Chan	047d2bdd95	fix(orbit): respect selected app language (#2522 ) * fix(orbit): respect selected app language Generated-By: looper 0.8.1 (runner=worker, agent=opencode) * fix(orbit): respect selected app language Generated-By: looper 0.8.1 (runner=fixer, agent=opencode)	2026-05-21 19:17:53 +08:00
nettee	0108d3a655	fix(web): localize Chinese integrations copy (#2081 ) (#2563 ) Generated-By: looper 0.8.1 (runner=worker, agent=opencode)	2026-05-21 19:07:45 +08:00
Marc Chan	fe709870ec	fix(web): make PPTX export prompt availability-safe (#2546 ) * fix(web): make PPTX export prompt availability-safe Generated-By: looper 0.8.1 (runner=worker, agent=opencode) * fix(web): restore audited PPTX export fallback Generated-By: looper 0.8.1 (runner=fixer, agent=opencode)	2026-05-21 19:07:25 +08:00
Marc Chan	5f20487942	test(web): cover Continue transcript regression (#2547 ) * fix(ci): catch nix hash drift before merge * test(web): cover Continue transcript regression Generated-By: looper 0.8.1 (runner=worker, agent=opencode)	2026-05-21 19:07:07 +08:00
kami	3c1195106a	fix(web): align HomeHero prompt overlay metrics (#2331 ) Co-authored-by: multica-agent <github@multica.ai>	2026-05-21 18:49:07 +08:00
lefarcen	fab172b782	feat(analytics): emit file_upload_result from all three upload entries (#2459 ) * feat(analytics): emit file_upload_result from all three upload entries `file_upload_result` was wired only on the Design Files Upload button in FileWorkspace. The chat composer paperclip (project page) and the home hero composer paperclip uploaded files silently — PostHog dashboards saw upload activity from one of three real entry points, so per-surface funnels were invisible and totals undercounted. Three problems are fixed together: 1. `FileUploadResultProps` hard-coded `page_name: 'file_manager' / area: 'file_manager'`, which prevented the other two surfaces from type-checking. Widened to a discriminated union over the three v2 doc surfaces (`file_manager` / `chat_panel` chat_composer / `home` chat_composer). 2. `HomeChatComposerClickProps.element` was missing `'attachment'`, so the home composer paperclip had no usable click value even if we wanted to instrument it. Added the literal, mirroring the chat_panel composer. 3. Three call sites for `file_upload_result` would duplicate the per-file mime + total-bytes cohort math. Extracted to `apps/web/src/analytics/upload-tracking.ts#deriveUploadCohort` so FileWorkspace, ChatComposer, and the App.tsx Home submit path all compute the same `file_count` / `file_type` / `file_size_bucket` triplet. FileWorkspace's inline math is replaced with the shared helper to prevent drift. Call-site wiring: - HomeHero attach button: `ui_click` (`element='attachment'`) at click time. The actual upload is deferred to submit, so the `file_upload_result` for this surface fires from App.tsx after `uploadProjectFiles` resolves. - ChatComposer.uploadFiles: `file_upload_result` on success / failed / throw branches; existing `ui_click` (`element='attachment'`) at the paperclip stays as-is. - FileWorkspace.uploadFiles: refactored to use `deriveUploadCohort`; behavior unchanged. * test(analytics): cover deriveUploadCohort matrix Reviewer flagged that deriveUploadCohort silently fans out to three upload entry points (file_manager / chat_panel / home) but has no focused coverage, so a regression in zip detection, mixed-type collapsing, or the 1/10/100 MB thresholds would skew analytics without breaking any visible UI behavior. Adds homogeneous-image, zip-by-mime, zip-by-extension, mixed-type, empty-batch, bucket-boundary (1/10/100 MB), and defensive empty-mime cases.	2026-05-21 18:28:56 +08:00
lefarcen	b2b94dbde7	feat(desktop): follow OS language in packaged builds (cherry-pick of #2544 into release/v0.8.0) (#2560 ) * feat(desktop): follow OS language in packaged builds Packaged Electron currently shows Open Design in en-US regardless of the OS language setting, because the renderer's i18n picks its locale from `navigator.language` and Chromium hard-codes that to en-US unless the host process intervenes. Browser users and `tools-dev` users are unaffected because their `navigator.language` already reflects the OS / browser preference. This change: - Adds `applyOsLocaleSwitch(app)` in `@open-design/desktop/main`. It reads `app.getPreferredSystemLanguages()[0]` and (when called before Electron's `ready` event) points Chromium's `--lang` flag at it, so the renderer's `navigator.language` follows the OS. Safe to call more than once: `appendSwitch` is a no-op once `app.isReady()`. - Calls the helper from both Electron entries: `apps/packaged` before its own `whenReady`, and `runDesktopMain` for tools-dev parity. - Forwards the resolved locale through `BrowserWindow.webPreferences.additionalArguments` as `--od-os-locale=<bcp-47>`, parsed by the preload and exposed at `window.__od__.client.osLocale`. The host bridge type (`OpenDesignHostClient.osLocale`) is extended accordingly. - Updates `detectInitialLocale` in `apps/web/src/i18n/index.tsx` to read that field as a new step between the existing localStorage and navigator fallbacks. Browser/web continues to fall through to `navigator.languages` unchanged. The explicit `osLocale` channel exists in addition to `--lang` because some `app.getPreferredSystemLanguages()` strings (e.g. `zh-Hant-TW`, `pt-PT`) need to round-trip through `resolveSystemLocale` to land on the right supported locale, which Chromium's `navigator.language` cannot do on its own. * fix(web): route OS locale read through getOpenDesignHost The first cut of detectInitialLocale read `window.__od__.client.osLocale` directly, which trips `tests/host-boundary.test.ts` — that guard test keeps web source from referencing preload globals by name so the boundary stays single-source. Switch to `getOpenDesignHost()` from `@open-design/host`, and rewrite the i18n test to install the host via `installMockOpenDesignHost` instead of poking the global directly. * fix(tools-pack): unblock mac packaged build on pnpm workspaces Two independent issues prevented `pnpm tools-pack mac build --to all` from completing on a clean macOS workspace, both unrelated to the desktop OS-locale change in this PR but bundled here because verifying that change end-to-end required the packaged pipeline to actually finish. 1. `apps/web/.next/standalone/node_modules/.pnpm/node_modules/<pkg>` contained dangling symlinks left by Next's nft trace (e.g. a `semver -> ../semver@5.7.2/node_modules/semver` link to a `.pnpm/<pkg>@<ver>` directory pnpm never created). The downstream `cp { dereference: true }` aborted the whole packaged pipeline with ENOENT. Walk every artifact tree before copy and unlink symlinks whose target doesn't resolve. Targets that do resolve stay untouched. 2. Next 16's standalone build under pnpm workspaces does not hoist peer-dep packages (react, react-dom, styled-jsx) into `<standalone>/apps/web/node_modules`. The downstream `web-standalone-after-pack.cjs` audit then does `createRequire(server.js).resolve('react/package.json')`, whose module walk falls out of the standalone tree and aborts the electron-builder phase. Add a `hoistStandaloneNextPeerDeps` step for the web standalone artifact only: it locates the `<pkg>@<version>` (not peer-resolved sibling) directory under `.pnpm` and symlinks it into `apps/web/node_modules/<pkg>`. The subsequent `cp { dereference: true }` then writes the real directory into the cache so the packaged tree stays self-contained. Verified by `pnpm tools-pack mac build --to all` succeeding end-to-end (zip + dmg + app), then `pnpm tools-pack mac install` and `pnpm exec tools-pack mac inspect --expr` reading the desired `__od__.client.osLocale` from the packaged renderer. * feat(desktop): fold encodeURIComponent + manual locale source + pet window from #2554 Three defensive improvements lifted from @Eli-tangerine's parallel implementation on #2554, kept consistent with the OS-locale chain already on this branch: - The argv value crossing main → preload is now wrapped with encodeURIComponent / decodeURIComponent so a locale string with `;`, `=`, or any other Chromium argv special char round-trips cleanly. BCP-47 region tags don't carry those today, but the renderer parser no longer has to assume it. - `setLocale` now also writes `open-design:locale-source = "manual"` to localStorage, and `detectInitialLocale` only treats the stored locale as winning when that marker is present. An untagged value (left over from a future auto-write path, or a stale install) no longer pins the app to an old language once the host injects a fresh OS locale. Today `setLocale` is the only writer so the marker has no behaviour difference yet — this is a defensive net. - `createDesktopPetWindow` now receives `osLocale` and forwards the same `additionalArguments` as the main `BrowserWindow`, so the pet renderer's `__od__.client.osLocale` is consistent with the main window's instead of being silently undefined. Co-authored idea credit: changes mirror the locale-piece of @Eli-tangerine on #2554 — that PR is closing in favour of this one. Tests: detect-initial-locale gets a new "untagged localStorage value loses to host locale" case. desktop 62/62, host 13/13, web i18n + host-boundary 15/15 stay green. * feat(web): fold onboarding view styles from #2554 Pulls the 747-line addition to `apps/web/src/styles/home/entry-layout.css` from @Eli-tangerine's #2554 — the visual layer for the global onboarding flow (`/onboarding` view, Connect / About-you / Design-system steps). The view itself was already plumbed through `EntryShell.tsx`; this adds the styling that makes it shippable on v0.8.0. #2554 is closing in favour of this branch, so the CSS lands here so the onboarding work doesn't get dropped on the floor. Co-authored idea credit: @Eli-tangerine — original styling on #2554. * fix(tools-pack): make hoistStandaloneNextPeerDeps idempotent across builds Addresses non-blocking review by @PerishCode on #2560: the previous `if (await pathExists(linkPath)) continue;` guard uses `access()`, which follows symlinks. A stale symlink from a previous build whose `.pnpm/<pkg>@<version>` target moved (e.g. after a react/react-dom version bump that invalidates the workspace-build cache key and forces a re-run) reports as missing through `pathExists`, then `symlink()` rejects with EEXIST and the unhandled rejection aborts the packaged build. Switch to `lstat` (which does not follow the link) so we can tell "genuinely empty slot", "real directory left by Next" and "stale symlink" apart, then unlink stale entries before re-creating. Also move `stripBrokenSymlinks` ahead of `hoistStandaloneNextPeerDeps` in `copyWorkspaceBuildArtifactsToCache` so any leftover dangling links that survived a previous run are cleared before hoist tries to write.	2026-05-21 18:23:20 +08:00
PerishFire	526c7f7c26	Fix packaged auto-update release validation (#2565 ) * fix: tighten packaged updater flow * test: prune noisy extended ui coverage * fix: hide unpublished release artifacts * test: validate release updater channels * fix: align prerelease release namespaces	2026-05-21 18:15:53 +08:00
Eli-tangerine	a56f559f9e	Merge pull request #2550 from nexu-io/codex/fix-home-example-prompts [codex] fix home example prompt presets	2026-05-21 17:59:52 +08:00
Siri-Ray	b236b37b7d	Remove resume conversation button (#2562 )	2026-05-21 17:55:03 +08:00
qiongyu1999	7fe4c3f581	fix(web): preserve live artifact preset metadata	2026-05-21 17:47:26 +08:00
qiongyu1999	bdc57655d6	localize handoff editor button	2026-05-21 17:25:09 +08:00
qiongyu1999	48f3404051	fix home example prompt presets	2026-05-21 17:25:09 +08:00
lefarcen	81eaf596b0	fix(web): show feedback prompt on every successful assistant turn (#2529 ) (#2532 ) * fix(web): show feedback prompt on every successful assistant turn Previously the thumbs-up/down widget only appeared when the turn produced an `<artifact>`, wrote a file via Write/Edit, or emitted a live_artifact event. That left whole categories of completed turns — image/video generation via `generate_image` / `generate_video`, MCP tool runs, plain text answers — without any way for the user to rate them. Drop the `hasArtifactWork` gate from `isFeedbackEligible` and remove the helper functions that fed it. The remaining filters (streaming in progress, empty response, unfinished todos, runStatus !== "succeeded") still suppress the widget for genuinely incomplete turns. The PostHog `has_produced_files` field is still emitted so the analytics side can keep slicing feedback by whether the turn produced artifacts. * test(web): flip artifact-only feedback assertion in AssistantMessage.test.tsx Companion test file to `chat-feedback.test.tsx` — the same artifact-only visibility rule was duplicated here and asserted the text-only success case stays hidden. Flip that case to assert the widget now appears, and update the file-level comment so the new rule reads cleanly. The streaming / failed-run / empty-response cases keep their existing "hidden" assertions; those are still excluded under the new rule.	2026-05-21 15:39:30 +08:00
lefarcen	6bb0f0fd91	feat(observability): web lifecycle telemetry + stable installationId migration (#2527 ) * feat(observability): web lifecycle telemetry + stable installationId migration Two intertwined safety-telemetry additions for the 0.8.0 release. Web lifecycle observability --------------------------- New `apps/web/src/observability/` module installed at module load via client-app.tsx — alongside the existing error-tracking exception hooks from #2521. Reuses error-tracking's direct-fetch transport (the same consent-bypass + early-buffer guarantees) so every event flows even when the user has opted out of general analytics: - client_long_task PerformanceObserver longtask >100ms (real "feels janky" signal, FPS proxy) - client_white_screen app fails to mount after 5s; MutationObserver cancels the timer the moment the React root renders so a normal boot is zero events - client_resource_error capture-phase window.error catches failed <script>/<link>/<img>/<iframe> loads (chunk-load failures, broken artifact refs) - client_boot_timing navigationStart → load timings via Navigation Timing v2 - client_visibility_change visibilitychange + page lifetime - client_session_summary real foreground duration emitted on pagehide - client_run_stuck 5min watchdog on SSE runs that don't progress (#2464 / #2405 / #1451 in data form) - client_iframe_error FileViewer iframe load failures (iframe errors don't bubble to window, so the global resource-error observer can't see them) - desktop_renderer_crash Electron main observes render-process-gone and forwards to daemon /api/observability/event - daemon_uncaught_exception daemon_unhandled_rejection process-level handlers on the daemon error-tracking.ts is generalised: `reportSafetyEvent(name, props)` now exposes the same buffer + direct-fetch transport that `reportHandledException` used, with identical $exception wire shape preserved for the existing exception path. Daemon cross-process bridge --------------------------- New `AnalyticsService.captureSafety()` skips the consent re-check and posts via posthog-node with installationId as distinct_id. Wired into: - `POST /api/observability/event` for desktop main and any future helper process that needs to ship a safety event (no consent check — same contract as web's direct-fetch path) - `process.on('uncaughtException')` / `unhandledRejection` on the daemon itself Stable installationId across reinstalls (critical for 0.8.0 rollout) -------------------------------------------------------------------- installationId previously lived in `<namespace>/data/app-config.json`, so a packaged reinstall that churned the namespace token (or any future namespace-scoped data wipe) rotated the id and the user showed up as a brand-new PostHog person. This is the immediate trigger: when 0.8.0 ships, every 0.7.x user upgrading would silently double the user count. New module `apps/daemon/src/installation.ts` reads/writes `<installationDir>/installation.json` at the channel root. The daemon gets the path from `OD_INSTALLATION_DIR`, set by `apps/packaged/src/sidecars.ts` to `paths.installationRoot` (one level above `namespaces/` — e.g. `~/Library/Application Support/Open Design Nightly/` on mac). `readAppConfig` transparently merges: if installation.json has an id it wins; if only app-config.json has one (the 0.7.x state), it gets mirrored to installation.json on the next read. `writeAppConfig` mirrors any explicit installationId write, including the null-clear path used by Settings → "Delete my data". 7 call sites of readAppConfig keep their signatures unchanged. Survives: - same-channel reinstall (DMG drag-replace, NSIS reinstall) - namespace churn between packaged builds - per-namespace data reset (future installer that clears `<ns>/data/`) Still rotates (intentionally): - explicit "Delete my data" - manual `rm -rf "~/Library/Application Support/Open Design <Channel>/"` - different channel (Stable vs Nightly stay distinct because userData paths differ; that's the existing channel-isolation contract) What this changes for posthog-js -------------------------------- client.ts had `capture_exceptions: false` from #2521; nothing else changes. autocapture / $pageview / $autocapture / track() / daemon analyticsService.capture() — all unchanged. New events are additive. Validation ---------- - pnpm guard pass - pnpm typecheck whole repo pass - pnpm --filter @open-design/web test 200 files / 1824 tests - pnpm --filter @open-design/daemon test 251 files / 2981 tests (includes 10 new tests in installation.test.ts pinning the 0.7.x → 0.8.0 migration, namespace-wipe survival, delete-my-data clear, and fresh-id rotation) - pnpm --filter @open-design/packaged test 9 files / 89 tests - Pre-existing baseline: apps/desktop/src/main/updater.ts has typecheck references to RELEASE_CHANNEL_NAMES.PREVIEW/NIGHTLY on release/v0.8.0; unrelated to this PR. * fix(observability): preserve fatal exit on uncaught + skip loading shell in white-screen check Addresses codex review on PR #2527 (Siri-Ray). 1) Daemon process handlers must keep Node fatal semantics Installing an uncaughtException listener silences Node's default crash/exit; Node 15+ does the same for unhandledRejection when a listener is present. The previous handlers logged telemetry and let control return to the event loop, leaving a corrupted daemon serving requests instead of letting the supervisor restart it cleanly. triggerFatalShutdown() now: - dispatches captureSafety once (guarded against re-entry from cascading faults) - races posthog-node's shutdown against a 1s bounded timeout so a slow flush can't keep the process alive - calls process.exit(1) after the race resolves Both uncaughtException and unhandledRejection route through it. apps/daemon/tests/uncaught-fatal-shutdown.test.ts pins: - captureSafety is invoked exactly once even on repeated faults - exit(1) fires on the happy path - exit(1) still fires when shutdown hangs past the timeout - exit(1) still fires when captureSafety itself throws 2) White-screen detector treated the loading shell as a successful mount apps/web/app/[[...slug]]/client-app.tsx renders the dynamic-import fallback as <div class="od-loading-shell">Loading Open Design…</div> whose visible text (19 chars) exceeded the previous 10-char floor. monitorMount() would therefore cancel the 5s timer the instant Next swapped the loading shell in, completely missing the white-screen signal the observer is meant to add. isAppMounted() now: - primary signal: <html data-od-app-mounted="1"> set by App.tsx's first useEffect — authoritative because once App has mounted at least once, any later tree crash is an $exception story, not a white-screen story - fallback: only counts children of the root container whose classList does NOT include known loading-shell markers (od-loading-shell). Their visible text drives the > MIN_VISIBLE_TEXT check, so the loading sentinel can never be mistaken for a mount. apps/web/tests/observability/white-screen.test.ts pins: - fires client_white_screen when only the loading shell is present after the timeout - does NOT fire when data-od-app-mounted is set before the timeout - cancels the timer the moment a real workspace-shell child appears alongside the loading shell - still fires when only sub-MIN_VISIBLE_TEXT non-shell content is present (effectively blank) Validation: - pnpm guard pass - pnpm typecheck pass - pnpm --filter @open-design/daemon test 252 files / 2985 tests - pnpm --filter @open-design/web test 201 files / 1828 tests * fix(observability): await captureSafety enqueue before fatal shutdown flush Addresses second-pass codex review on PR #2527 (Siri-Ray, 3279268246). The previous fatal-shutdown path called `analyticsService.captureSafety()` synchronously and immediately raced `analyticsService.shutdown()` against the bounded timeout. captureSafety in apps/daemon/src/analytics.ts does its real `client.capture()` call only inside an async IIFE after `await readInstallationIdSafe()` — so shutdown could win the race, drain an empty posthog-node queue, and let `process.exit(1)` run BEFORE the daemon crash event ever got enqueued. We'd then preserve the process-lifecycle contract but lose the exact signal this PR is adding. Changes: - AnalyticsService.captureSafety now returns Promise<void>. The async IIFE is gone; the body awaits readInstallationIdSafe directly so the returned promise resolves only AFTER client.capture() has been invoked (which is when posthog-node's local buffer contains the event). - server.ts triggerFatalShutdown awaits captureSafety, then calls shutdown, and races that whole sequence against the 1s bounded timeout. Capture failures still don't block exit (try/catch around the await). - NOOP_SERVICE.captureSafety becomes `async () => undefined` to match the new signature. - Fire-and-forget callers (/api/observability/event) are unaffected; voiding the returned promise keeps them non-blocking. apps/daemon/tests/uncaught-fatal-shutdown.test.ts adds the reviewer- requested fixture: - 'waits for the captureSafety promise to settle before invoking shutdown' — gives capture a 50ms delay and shutdown a separate 50ms delay so the intermediate "capture done / shutdown not yet" state is observable. - 'still aborts and exits if captureSafety hangs past the bounded timeout' — captureSafety never resolves; the outer 1s timeout still forces process.exit(1). Validation: - pnpm guard pass - pnpm typecheck whole repo pass - pnpm --filter @open-design/daemon test 252 files / 2987 tests	2026-05-21 15:37:48 +08:00
lefarcen	abfe7ba008	fix(web): show feedback prompt on every successful assistant turn (#2529 ) * fix(web): show feedback prompt on every successful assistant turn Previously the thumbs-up/down widget only appeared when the turn produced an `<artifact>`, wrote a file via Write/Edit, or emitted a live_artifact event. That left whole categories of completed turns — image/video generation via `generate_image` / `generate_video`, MCP tool runs, plain text answers — without any way for the user to rate them. Drop the `hasArtifactWork` gate from `isFeedbackEligible` and remove the helper functions that fed it. The remaining filters (streaming in progress, empty response, unfinished todos, runStatus !== "succeeded") still suppress the widget for genuinely incomplete turns. The PostHog `has_produced_files` field is still emitted so the analytics side can keep slicing feedback by whether the turn produced artifacts. * test(web): flip artifact-only feedback assertion in AssistantMessage.test.tsx Companion test file to `chat-feedback.test.tsx` — the same artifact-only visibility rule was duplicated here and asserted the text-only success case stays hidden. Flip that case to assert the widget now appears, and update the file-level comment so the new rule reads cleanly. The streaming / failed-run / empty-response cases keep their existing "hidden" assertions; those are still excluded under the new rule.	2026-05-21 15:23:59 +08:00
lefarcen	e149616dbe	fix(web): decouple privacy banner from onboarding and Settings lifecycles (#2525 ) * fix(web): decouple privacy banner from onboarding and Settings lifecycles The first-run privacy banner used to be tightly bound to two unrelated surfaces: it was hidden whenever Settings was open, and the onboarding panel only navigated in after the user had resolved the banner. The coupling existed because the banner's z-index sat below modal backdrops, so showing both at once collided visually, and the banner+onboarding were linearized to avoid a "two unfinished things on screen" feel. This change makes the three surfaces independent: - Lift `.privacy-consent-banner` z-index above the modal-backdrop layer so the banner stays visible (and clickable) when Settings is open. The banner is already `pointer-events: none` with opt-in on its actionable children, so it does not steal clicks from the layer below. - Drop the `!settingsOpen` guard from `showPrivacyConsent`. - Drop the `privacyDecisionAt != null` guard from the bootstrap onboarding route; first-run users land on `/onboarding` purely based on `!onboardingCompleted`, and the banner sits on top in parallel. - Drop the `navigate(... onboarding)` side effect from the banner's `onAccept` — the banner only persists the privacy decision now. Bootstrap also had to be reshaped: the merged config is now computed outside the `setConfig` updater so navigation can happen synchronously after the state update. Calling `navigate` inside the updater triggered a React "setState while rendering" warning, and reading a captured flag after `setConfig` was unreliable because React 18+ batches the updater to the next render — the navigate condition was never observed. Existing test that asserted the old coupling ("banner unmounts while Settings is open") is inverted to lock in the new contract. * fix(web): defer privacy banner until onboarding is done and user lands on home Product feedback on the previous lifecycle change: the banner should not appear during the welcome panel. It should surface only: - immediately after the user Skips onboarding (lands directly on home), or - after the user finishes the design-system step and later returns to a home view from the project view they were dropped into. To capture both paths with a single rule, the banner now requires: 1. Daemon config hydrated (unchanged). 2. No privacy decision recorded yet (unchanged). 3. onboardingCompleted === true. 4. The current route is a home route (route.kind === 'home'). The Skip path already routes through finishOnboarding, which calls onCompleteOnboarding() + changeView('home') — that satisfies all four gates the moment Skip is clicked. The finish path (step 2: create design system) previously navigated to a project view without marking onboardingCompleted. This commit mirrors the Skip path by calling handleCompleteOnboarding() from the App-level renderDesignSystemCreation onCreated callback (the onboarding-specific use of DesignSystemCreationFlow). The shared DesignSystemFlow component is left untouched so the create-from-Settings entry point keeps its existing semantics. The route gate keeps the banner suppressed while the user is reading their just-created design system project. As soon as they navigate back to the entry shell (home route), the banner appears. Tests: - "withholds the privacy banner until onboarding completes" — covers gate 3 (onboardingCompleted=false while still on onboarding/home). - "withholds the privacy banner outside the home route" — covers gate 4 (user is on a project route, onboardingCompleted=true). - Existing "keeps the first-run privacy banner mounted while settings is open" still passes; the Settings/banner z-index relationship is independent of these gates. * fix(web): allow privacy banner to surface on non-home routes after onboarding Follow-up to the previous lifecycle change. After exercising the design- system finish path end-to-end, product wants the banner to appear in the project view the user is dropped into — the first generation is running in the background and the user is already waiting, so the disclosure can be acknowledged inline rather than being held back until they navigate back to a home view. The Skip path is unchanged: Skip routes the user to home and the banner appears there. This drops the `route.kind === 'home'` guard and the matching test, and adds a contract test that locks in banner visibility on a project route when `onboardingCompleted=true` and no privacy decision has been made.	2026-05-21 14:51:59 +08:00
lefarcen	50a4dc8a62	Merge origin/main into release/v0.8.0	2026-05-21 13:17:52 +08:00
Eli-tangerine	10940ab7b6	[codex] Update home starter template categories (#2501 ) * Update home starter template categories * fix(web): address PR #2501 review follow-ups - usePluginFacets.clearFacets() now also resets `mode` to 'all' so the empty-state "Clear filters" CTA escapes Saved mode in one click. A fresh browser clicking Saved (or Saved combined with a zero-match search) was previously stranded on the empty view because the CTA only cleared `selection`/`query`. - Restore the "Link code folder" item in the composer Tools -> Import panel. The earlier refactor dropped its `onLinkFolder` wiring and left every remaining ImportItem disabled, so chats without linked dirs lost the only enabled entry point to `openFolderDialog()` / `patchProject({ metadata.linkedDirs })`. Adds regressions: - plugins-home-section: Clear filters from the Saved empty state - ChatComposer.import-menu: folder-link click-through hits the folder dialog and patches `linkedDirs` --------- Co-authored-by: qiongyu1999 <2694684348@qq.com> Co-authored-by: lefarcen <935902669@qq.com>	2026-05-21 13:14:08 +08:00
lefarcen	88dee44892	feat(analytics): always-on $exception capture with early window hooks (#2521 ) PostHog Error tracking was missing the vast majority of real exceptions: 1. posthog-js's capture_exceptions: true is silenced by opt_out_capturing, so every opted-out user vanished from the error feed even though we could perfectly safely keep collecting their stacks (the consent toggle's user copy gates analytics, not safety telemetry). 2. posthog-js is dynamically imported only after /api/analytics/config resolves AND the user has consented. Errors thrown during the first 1-2 seconds (React hydration, early effects) had no listener to catch them. Net effect: 14d $exception count was 54 events / 10 users across ~5k DAU, producing the misleading 99.93% crash-free curve in PostHog's dashboard. This PR makes exception capture independent of both gates: - apps/web/src/analytics/error-tracking.ts (new): own window.error + unhandledrejection handlers, in-memory buffer (capped at 50 entries), direct fetch to https://<host>/i/v0/e/ with the public phc_ key. Same scrub layer as the posthog-js path so file paths still get redacted. - apps/web/app/[[...slug]]/client-app.tsx: installErrorHandlers() at module-load, before React or any feature code can throw. - apps/web/src/analytics/provider.tsx: bootstrapExceptionTracking() in the identity useEffect, parallel to getAnalyticsClient() — runs regardless of consent state, fetches /api/analytics/config, hands the phc_ key + host + distinctId to the error tracker so buffered events can flush. - apps/web/src/analytics/client.ts: capture_exceptions: false so posthog-js stops also emitting $exception (would have produced duplicate events server-side); also re-bridges the error-tracking context inside the loaded() callback so future events inherit the fully-resolved appVersion / sessionId. - apps/daemon/src/server.ts + packages/contracts: /api/analytics/config now returns key + host even when consent=false. enabled still reflects only the analytics consent toggle (posthog-js full autocapture stays off when enabled=false), but the always-on error tracker can read key directly. Forks without POSTHOG_KEY still get key=null and the whole pipeline becomes a no-op — fork-safe by construction. - apps/web/src/analytics/scrub.ts: regex fix so packaged-mac paths like /Applications/Open Design.app/Contents/Resources/apps/web/... (which contain a space) get fully rewritten to app://apps/web/...; previously the [^\s] guard stopped at 'Open' and leaked the install dir. Validation: - pnpm --filter @open-design/web typecheck: pass - pnpm --filter @open-design/web test: 199 files / 1823 tests pass (includes 8 new error-tracking.test.ts cases for buffer cap, hook install, scrub, and direct dispatch) - pnpm --filter @open-design/daemon test: 250 files / 2971 tests pass - pnpm guard: pass After release/v0.8.0 ships and rolls out, expect the crash-free curve to drop from the artificial 99.93% to a realistic 95-98% — that's not a regression, it's the first time we're measuring it.	2026-05-21 13:07:26 +08:00
lefarcen	c4a891b184	Merge origin/main into release/v0.8.0	2026-05-21 11:56:39 +08:00
chaoxiaoche	6359132d04	Polish Design Systems settings gallery controls (#2392 ) * Polish design systems gallery settings * Address design systems settings review --------- Co-authored-by: chaoxiaoche <chaoxiaoche@chaoxiaochedeMacBook-Pro.local>	2026-05-21 11:30:15 +08:00
lefarcen	f5f8937421	Merge origin/main into release/v0.8.0 Conflict resolved by taking origin/main: - apps/web/src/components/EntryNavRail.tsx design-systems rail button icon name palette-filled (release-side) -> blocks (main); main's icon swap is part of the more recent design-systems rail pass.	2026-05-21 10:52:08 +08:00
Eli-tangerine	ce95266586	[codex] Polish home composer working-directory controls (#2468 ) Some checks failed visual-baseline / Capture visual baselines (push) Waiting to run Details ci / Detect CI change scopes (push) Successful in 1s Details nix-check / build (push) Failing after 3s Details ci / Preflight (push) Failing after 2s Details ci / Core package tests (push) Failing after 1s Details ci / Tools workspace tests (push) Failing after 1s Details ci / Daemon workspace tests (1/2) (push) Failing after 1s Details ci / Daemon workspace tests (2/2) (push) Failing after 1s Details ci / Web workspace tests (push) Failing after 1s Details ci / E2E vitest (push) Failing after 1s Details ci / Playwright critical (starters) (push) Failing after 1s Details ci / Playwright critical (core) (push) Failing after 1s Details ci / Build workspaces (push) Failing after 1s Details ci / App workspace tests (push) Failing after 0s Details ci / Validate workspace (push) Failing after 0s Details ci / Runtime trace (push) Has been skipped Details * Polish design system home flows * Polish home prompt presets * Polish home working directory controls * test: align home hero chrome smoke * fix: stabilize home composer ci checks --------- Co-authored-by: qiongyu1999 <2694684348@qq.com>	2026-05-21 00:22:46 +08:00
lefarcen	722ddfa235	Merge origin/main into release/v0.8.0 Conflicts resolved by taking origin/main on both files. Root cause: main's PR #2460 (fix(landing): align logo.webp with brand icon) changed HomeHero.tsx's .home-hero__brand-mark to render <img src=/app-icon.svg> instead of an inlined <HeroBrandIcon /> SVG, and bundled the matching CSS (26px round badge with bg-panel + border + padding 2px) plus a gap/font-size tune. The release-side visual-refresh CSS still targeted the SVG layout (38px square, transparent, inset SVG selector). Keeping release's CSS would leave main's <img> unstyled. - apps/web/src/styles/home/home-hero.css three blocks, all taken from main: .home-hero__brand gap 8px, .home-hero__brand-mark redesigned for <img> child, .home-hero__brand-name font-size 16px. - apps/web/src/index.css two blocks, both taken from main: workspace tab close column 22px and .workspace-tab__close 18x18 (paired tune-down of tab UI spacing).	2026-05-20 22:28:38 +08:00
Eli-tangerine	8193981511	Keep PR 2400 changes without folder pickers (#2462 ) * feat(daemon): add project working directory management and editor hand-off functionality - Introduced new flags for project commands to manage working directories, including `--working-dir` and `--dir`. - Implemented API routes for listing available editors and opening projects in selected editors. - Added a hand-off button in the ChatPane header to facilitate opening project folders in local applications. - Enhanced the HomeHero component to include working directory and design system settings, improving user experience in project creation. - Created HomeHeroSettingsChips component for inline management of working directory and design system selection. * feat(chat): implement voice transcription proxy and enhance UI components - Added a new API route for voice transcription using OpenAI's `/audio/transcriptions` endpoint, allowing users to send audio blobs directly for transcription. - Integrated multer for handling audio file uploads in memory, ensuring efficient processing without disk storage. - Updated the HomeHero component to include example prompt suggestions for plugins, enhancing user interaction. - Introduced the EditorIcon component to visually represent different editors in the hand-off menu, improving the user experience. - Refined the HandoffButton component to utilize the new EditorIcon, providing a more cohesive interface for selecting editors. - Enhanced CSS styles for various components to improve layout and responsiveness, including adjustments to tab and button sizes for better usability. * style(workspace-shell): enhance layout and overflow handling - Updated CSS for .workspace-shell to ensure full viewport width and height, with proper overflow management. - Adjusted grid layout to prevent content overflow and maintain responsiveness. - Modified styles for .workspace-tabs-chrome to improve width handling and prevent overflow issues. * refactor(chat): remove voice transcription proxy and related components - Deleted the voice transcription proxy implementation, including the associated API route and multer configuration. - Removed the MicButton component from the ChatComposer and HomeHero components to streamline the UI. - Updated HomeHero to include example suggestions without the voice input functionality. - Adjusted CSS styles for various components to maintain layout consistency after the removal of the MicButton. * feat(daemon): implement minting of HMAC tokens for working directory management - Added a new function `mintImportTokenFromCurrentSecret` to generate HMAC tokens bound to a specified base directory, enhancing security for working directory operations. - Updated the `desktop-auth.ts` file to include the new token minting functionality, which returns structured errors when the desktop auth secret is cleared. - Introduced new IPC message types for minting import tokens in the sidecar protocol, allowing seamless integration with the daemon's working directory management. - Enhanced the `WorkingDirPill` component to utilize the new token minting flow for secure directory selection in desktop builds. - Updated CSS styles for the HomeHero component to accommodate new example suggestion features and maintain layout consistency. * fix(HomeView): import HOME_HERO_CHIPS constant for improved chip management - Updated the HomeView component to import the HOME_HERO_CHIPS constant from the chips module, enhancing the management of hero chips within the component. * feat(daemon): implement mintImportTokenViaSidecar for secure working directory management - Introduced the `mintImportTokenViaSidecar` function to facilitate the minting of HMAC tokens for desktop-import operations via the daemon's sidecar IPC. This allows CLI commands to bypass authentication when the desktop-auth gate is active. - Updated the CLI to utilize the new token minting function when setting the working directory, ensuring secure access to trust-gated API endpoints. - Enhanced the sidecar server to handle minting requests and return structured error messages for improved user feedback. - Added tests to validate the new token minting functionality and its integration with the working directory management process. - Refactored related components to support the new token flow, improving overall security and user experience. * feat(HomeHero): enhance UI components and styles for improved user experience - Updated HomeHero component to replace active dot indicators with Plug icons for better visual representation of active plugins. - Adjusted CSS styles for various elements, including padding and dimensions, to enhance layout consistency and responsiveness. - Introduced new styles for active type icons and improved hover effects for buttons. - Updated HomeHeroSettingsChips to change button titles and icons for clarity. - Added tests to ensure proper rendering and functionality of updated components. * feat(ProjectDesignSystemPicker): enhance design system selection with preview functionality - Updated the ProjectDesignSystemPicker component to include a preview feature for design systems, allowing users to see a preview of the selected design system. - Implemented hover functionality to update the preview based on the hovered design system. - Added fullscreen preview capability for a more immersive experience. - Enhanced CSS styles for the design system picker to improve layout and responsiveness. - Introduced tests to validate the new preview functionality and ensure proper interaction within the component. * feat: refactor project metadata handling and enhance design system picker - Updated the default scenario plugin ID retrieval to use project metadata, improving the logic for determining the appropriate plugin based on project intent. - Enhanced the ProjectDesignSystemPicker and related components to support localized design system summaries and categories, improving user experience. - Introduced new translations for working directory and design system picker components, ensuring better accessibility and usability across different locales. - Added a new 'live-artifact' project type to the HomeHero chips, expanding the functionality for users creating refreshable artifacts. - Updated tests to validate the new project metadata handling and design system picker functionalities. * feat: enhance localization and styling for design system components - Added French translations for working directory and design system picker components, improving accessibility for French-speaking users. - Updated CSS styles for the pet task item to ensure consistent padding and layout. - Introduced a new test suite for HomeHeroSettingsChips to validate localization and design system selection functionality. - Enhanced ProjectDesignSystemPicker tests to ensure proper localization and interaction with design system categories. * fix: update .gitignore to include all claude-sessions directories and remove specific session files - Modified .gitignore to ensure all claude-sessions directories are ignored by using a wildcard pattern. - Deleted two specific claude-sessions markdown files to clean up unnecessary session data. * fix: repair home automation ci regressions * fix: stabilize artifact consistency e2e * Remove folder picker changes from PR 2400 --------- Co-authored-by: pftom <1043269994@qq.com> Co-authored-by: qiongyu1999 <2694684348@qq.com>	2026-05-20 22:07:30 +08:00
lefarcen	255c3058c5	fix(analytics): app_version=0.0.0 + media providers clicks + lock run_finished error_code (#2453 ) * fix(analytics): use state for runtime app version so PostHog gets the real value `useAppVersion()` stored the fetched `/api/version` result in a `useRef`, but ref writes do NOT trigger a re-render. The hook therefore kept returning '0.0.0' forever and the downstream `useEffect` that calls `client.register({ app_version, ui_version })` never re-ran with the real version. PostHog dashboards then showed `app_version=0.0.0` and `ui_version=0.0.0` on every event ever shipped from the web client. Switching to `useState` lets the resolved version flow through React's render cycle so the register-on-change effect picks it up. The boot placeholder still ships as '0.0.0' for the first events before the fetch resolves (we don't re-emit those), but every event after init now carries the real daemon-pinned version. Adds a red-spec at apps/web/tests/analytics-app-version.test.tsx that went red on the `useRef` shape (`expected '0.0.0' to be '1.2.3'`) and green on the `useState` shape, so a future refactor can't silently regress it. * feat(analytics): wire media providers click events + lock run_finished error_code invariant Two analytics gaps shipped together because both came out of the same PostHog spot-check after PR #2390 landed: 1. Settings → Media providers (CSV row "client_type=desktop / mason / media_providers") wasn't emitting any ui_click events. The contract type `SettingsMediaProvidersClickProps` and helper `trackSettingsMediaProvidersClick` were defined but no call site used them, so the dashboard showed zero traffic on every element. Added the four v2 elements: - `reload` on the "Reload from daemon" button - `key_input` on every per-provider API key field (onFocus, mirrors the BYOK key field pattern in this same dialog) - `url_input` on every per-provider base-URL field - `clear` on each row's Clear button (fires before the confirm dialog so the intent signal is recorded even if the user backs out) Each event carries `providers_id` (provider.id) and `is_configured` (truthy when the row has a stored entry). 2. `run_finished` with `result=failed` was reported as missing `error_code` on PostHog. Audited every failure path: the daemon's `child.on('close', ...)` handler has several branches that call `runs.finish('failed', code, signal)` directly without first emitting an SSE `error` event (ACP fatal, agentStreamError fall through, child close without diagnostic), leaving `run.errorCode === null` in the status body. The existing fallback in `server.ts` already derives `AGENT_SIGNAL_` / `AGENT_EXIT_` / `AGENT_TERMINATED_UNKNOWN` from `signal` / `exitCode` for those cases, so the wire emission should never blank out — but the logic was inline and had no unit coverage. Extracted the result/error_code derivation into `apps/daemon/src/run-result.ts` and added 12 unit tests covering: - explicit errorCode forwarding - signal-only failures - exit-code-only failures - clean (code=0) failures (ACP fatal shape) - cancelled runs (with and without stamped code) - empty-string errorCode defensive case - status→result mapping for succeeded/canceled/failed/unknown All 12 pass — confirming the invariant "result=failed always carries error_code" holds for every failure shape the daemon produces. The refactor pins that invariant so a future change loses test coverage rather than silently regressing on PostHog. If `error_code` still looks empty on a live event, share the PostHog event JSON + the agent id and I'll dig further — at this point the daemon emission itself is exercised end-to-end.	2026-05-20 21:50:11 +08:00
lefarcen	1cfe274a90	Merge origin/main into release/v0.8.0 Conflicts resolved by taking origin/main on all six points: - apps/web/src/components/HomeHero.tsx:479-487 brand div removed (main dropped the .home-hero__brand wrapper; the release-side visual refresh still had it). - apps/web/src/components/HomeHero.tsx:894-898 attach Icon size 18 (main's update) replaces 20 from release. - apps/web/src/components/HomeHero.tsx:913-927 submit button uses <Icon name="arrow-up" size={22} /> (main's component refactor) instead of the release-side inline SVG. - apps/web/src/components/EntryShell.tsx:578-582 Discord Icon size 14 (main) instead of 16 (release). - apps/web/src/styles/home/home-hero.css drop .home-hero__brand / __brand-mark / __brand-name rules — main removed both the component div and these CSS rules together; keeping the CSS would be dead code. - apps/web/src/styles/home/entry-layout.css Discord badge icon color #5865f2 (main, the brand color introduced by PR #2386) instead of release's neutral var(--text-strong).	2026-05-20 20:59:00 +08:00
Siri-Ray	0ee363f23e	Polish design systems library layout (#2421 )	2026-05-20 20:31:10 +08:00
lefarcen	41a33aed9e	Reapply "fix(web): demote Plugins and Integrations to nav rail footer (#1806 )" (#2360 ) (#2397 ) * Reapply "fix(web): demote Plugins and Integrations to nav rail footer (#1806)" (#2360) This reverts commit `1ab8758045`. * fixup: align EntryHelpMenu Discord URL with #2386 update The revert of #2360 brought back EntryHelpMenu.tsx as #1806 originally added it, with DISCORD_URL = 'https://discord.gg/BYShPgWpq'. #2386 later rotated the Discord invite to mHAjSMV6gz, but only in the places that existed on main at the time (EntryShell avatar dropdown + the e2e test); EntryHelpMenu didn't exist then, so it never got updated. The e2e test the revert reintroduced asserts the new URL, so the component must match.	2026-05-20 18:27:48 +08:00
Chris Seifert	59c8d72ae4	feat(tweaks): bind toolbar toggle to artifact panel (#2348 ) * feat(tweaks): bind toolbar toggle to artifact panel Wires the host viewer's Tweaks toggle to artifact panels via two protocols: postMessage __edit_mode_* for agent-generated .twk-panel artifacts, and the existing class-based bridge for tweaks-skill .tw-panel artifacts. Toggle disables itself when the artifact exposes no panel, syncs state when the user closes the panel locally, and no longer regenerates srcdoc on toggle (no iframe remount). - bridge: emit od:tweaks-available + MutationObserver-driven od:tweaks-panel-state so host learns availability and mirrors local closes; transform-based hide preserves the artifact's transition - host: listen for both od:tweaks-* (bridge) and __edit_mode_* (artifact) dialects, send both on toggle, disable button when no panel, reset state on file change - skill: document the two protocols as a host integration contract - i18n: add fileViewer.tweaksUnavailable for all 13 locales * fix(tweaks): keep srcDoc path for `.tw-panel` artifacts The class based tweaks template (`.tw-panel` / `.tw-hidden`) needs the bridge `injectTweaksBridge` emits, but the default plain HTML preview URL loads the iframe, which bypasses buildSrcdoc entirely. Without the bridge there is no `od:tweaks-available` ping, so the toolbar toggle stays disabled on first load of a tweaks template artifact unless an unrelated mode (palette, inspect, etc.) coincidentally forces srcDoc. Add a `tweaksBridge` flag to `shouldUrlLoadHtmlPreview` and detect the fixed `.tw-panel` / `.tw-hidden` template selectors in the artifact source via a new `hasTweaksTemplate` helper. FileViewer passes the detected flag through so tweaks template artifacts pick the srcDoc render path on first load. Tests in `file-viewer-render-mode.test.ts` cover the new disqualifier, the helper positive and negative cases, and combinations with the existing flags. * fix(tweaks): resolve v0.7 UI ambiguity between toolbar toggle and palette After rebasing onto v0.7, three problems surfaced. v0.7 ships a palette popover button also labeled `Tweaks` with the same sliders icon as the new toolbar toggle. Toggling that popover flipped render mode (URL load to srcDoc) and reloaded the iframe, flashing the preview. The resulting iframe remount caused agent protocol artifacts to re-announce `__edit_mode_available`, which flipped the toolbar toggle back on without user input. Rename the palette popover button to `Themes` (button label, dialog title, `aria-label`) and swap its icon to a new `paint-bucket` glyph. The artifact tweaks toggle keeps the `tweaks` sliders icon. Internal identifiers (`data-testid="palette-tweaks-toggle"`, CSS classes, the `PaletteTweaks` component name) stay stable so existing tests and styles still target the same elements. Drop the auto set true on `__edit_mode_available`; that signal now flips `tweaksAvailable` only. `syncBridgeModes` posts the current `tweaksMode` to the artifact (both the bridge dialect and `__activate_edit_mode` / `__deactivate_edit_mode`) on every iframe load so the panel matches the toolbar. Mount both URL load and srcDoc iframes simultaneously, absolutely positioned and overlapping, with CSS visibility flipping between them. Toggling render mode no longer reloads the iframe so there is no flash. `isOurIframe(source)` accepts messages from either iframe so startup announcements from the hidden iframe are not lost; six receive filter sites switch from `iframeRef.current?.contentWindow` to the helper. Sends still target `iframeRef.current`, kept aligned with the active iframe via a `useEffect`, and a `syncBridgeModesRef` pushes current bridge state to the now visible iframe whenever the render mode flips. Tests that previously asserted exclusive render mode (`url-load` vs. `srcdoc` presence) now assert the active `data-testid` sits on the expected iframe via a co-attribute regex. The Draw bar element picking test switches from a cached frame reference to a `getFrame()` helper since `data-testid` follows the active iframe across toggles. Add a `paint-bucket` entry to `Icon` (Lucide style stroke icon). * fix(tweaks): scope `od:tweaks-available` to the active iframe The dual iframe setup mounts both the URL load and srcDoc iframes at once and accepts postMessage events from either via `isOurIframe`. The srcDoc iframe always carries the always injected tweaks bridge, which runs `document.querySelector('.tw-panel')` on mount and posts `{ type: 'od:tweaks-available', available: false }` for any artifact that does not ship the class based panel. For an agent protocol artifact (`.twk-panel`, `__edit_mode_`), the URL load iframe correctly announces `__edit_mode_available` and the host sets `tweaksAvailable = true`. The hidden srcDoc iframe's `available: false` ping arrives shortly after and overrides that to false, silently disabling the toolbar button. Scope `od:tweaks-available` to the active iframe only by re-checking `ev.source === iframeRef.current?.contentWindow` before applying it. `__edit_mode_available` and `__edit_mode_dismissed` stay accepted from either iframe so the artifact's own announcement still drives the toolbar toggle across render mode flips. Spotted by Siri-Ray on PR #1643. fix(tweaks): start the toolbar toggle ON when the artifact mounts its panel visible Both tweaks dialects (the class-based `.tw-panel` skill template and the `.twk-panel` agent-generated edit-mode protocol) mount their panel visible by default. Before this change the toolbar `Tweaks` toggle started in the OFF state regardless, so the user saw the panel but had to click toggle-on → toggle-off to actually hide it — confusing because the toggle disagreed with what they could plainly see in the preview. Two changes wire the initial state through to the toolbar: - `srcdoc.ts` (class-based dialect): the tweaks bridge's `onReady` now fires `postState()` alongside `postAvailability()`. `postState()` reads `!panel.classList.contains('tw-hidden')` and posts the artifact's actual initial visibility, so the host's existing `od:tweaks-panel-state` handler picks it up and mirrors it into `tweaksMode`. Previously only MutationObserver-driven changes were posted, so the host never learned the artifact's initial state. - `FileViewer.tsx` (twk-panel dialect): the agent dialect's `__edit_mode_available` carries no visibility payload, so we infer default-open from the fact that the artifact bothered to announce availability at all (the SDK pattern is `useState(true)`). Mirror that into `tweaksMode` exactly once per file (tracked by `firstEditModeAvailableSeenForFileRef`), so an iframe remount triggered by, e.g., flipping render mode through the Themes popover does not snap a user-driven OFF back to ON. Also fix a runtime `ReferenceError: panel is not defined` regression this same change introduced when first written with backticks inside the new code comment — the comment lived inside a `\`...\``-delimited script template literal, so the embedded backticks closed and reopened the outer literal and broke the bridge's JS body. Replaced with plain text. Validation: web typecheck clean, 1597/1597 tests pass. Manually verified with a `.twk-panel` artifact: open file → tweaks toggle is ON, panel visible → one click hides both. * fix(tweaks): seed bridge state from the panel's authored class, not from the host hidden attribute The bridge installs `data-od-tweaks-hidden` on `<html>` synchronously in `<head>` so the panel never flashes on initial paint. That attribute is therefore always present by the time `onReady()` fires, which meant the previous `applyClassesToPanel(!hasAttribute(...))` call unconditionally forced `.tw-hidden` onto the panel, the follow-up `postState()` read that forced-hidden class, and the host saw `visible: false` even when the artifact had authored the panel as default-visible. The PR-1643 attempt at "start ON when the artifact mounts visible" therefore still reported OFF for the class-based template path. Read the panel's authored class state first (the artifact body has just parsed, so the panel's class is what the artifact wrote and nothing else has touched it yet), then drive the attribute, the applied class, and the `od:tweaks-panel-state` post from that captured value: ```ts var panel = panelEl(); var initialVisible = !!panel && !panel.classList.contains('tw-hidden'); document.documentElement.toggleAttribute('data-od-tweaks-hidden', !initialVisible); applyClassesToPanel(initialVisible); attachObserver(); postAvailability(); postState(); ``` A default-visible `.tw-panel` now reports `visible: true` on mount, the host mirrors that into `tweaksMode = true`, and the toolbar Tweaks toggle starts in the ON state instead of disagreeing with what the user sees in the preview. The `.twk-panel` agent-protocol path is unaffected; its initial-state mirror still goes through the `firstEditModeAvailableSeenForFileRef` guard in `FileViewer.tsx`. Surfaced by Siri-Ray in https://github.com/nexu-io/open-design/pull/1643#discussion_r3263571196. Validation: web typecheck clean, 1597/1597 tests pass. * fix(tweaks): re-mirror __edit_mode_available default-open state when switching .twk-panel files The once-per-file guard that mirrors a `.twk-panel` artifact's default-open state into the toolbar `tweaksMode` lives inside a `window.addEventListener('message', ...)` handler installed in a `useEffect(..., [])` with an empty dep list. The handler therefore closed over the first-render `file.name`. After opening one `.twk-panel` artifact, `firstEditModeAvailableSeenForFileRef.current` got set to that first file; switching to a second `.twk-panel` file left the message listener still comparing the new artifact's `__edit_mode_available` against the stale captured name, so the guard never re-fired and the toolbar stayed OFF while the new artifact's panel was clearly visible — exactly the mismatch the guard was supposed to prevent on initial load. Add `file.name` to the listener effect's dep list so the handler gets a fresh closure on every file switch. The bridge-message setters (`setTweaksAvailable`, `setTweaksMode`), `isOurPreviewIframeSource`, and `firstEditModeAvailableSeenForFileRef` are stable across renders, so re-binding the listener has no other side effects beyond updating the captured `file.name`. Surfaced by Siri-Ray in https://github.com/nexu-io/open-design/pull/1643#discussion_r3266838151. Red-spec regression test added: `FileViewer tweaks toolbar > mirrors __edit_mode_available default-open state for each switched-to .twk-panel file`. Verified to go red on the bug (deps `[]`) and green on the fix (deps `[file.name]`). Validation: web typecheck clean, 1598/1598 tests pass (was 1597). * i18n(tweaks): add fileViewer.tweaksUnavailable to the remaining 6 locales The toolbar's disabled-Tweaks tooltip key landed in 13 locale files but 6 were missed (ar, fr, id, it, th, uk). Those locales were still falling through to the English string via the `...en` spread, which contradicts the repo convention that every key be defined explicitly in each locale. Add the translation alongside the existing `fileViewer.tweaks` entry so the full set of 19 locales now ships native copy for the disabled state. Surfaced by Siri-Ray in https://github.com/nexu-io/open-design/pull/1643#discussion_r3267654385. * fix(tweaks): respect default-closed dynamic panels in __edit_mode_available Protocol A in `design-templates/tweaks/SKILL.md` documents that the artifact may default the panel to either open or closed and the host should sync its toolbar toggle to whichever state the artifact reports. The previous handler ignored that and unconditionally mirrored availability into `tweaksMode = true`, so a default-closed dynamic artifact would be force-opened the moment `syncBridgeModes` ran and fired `__activate_edit_mode` — the artifact could not stay closed even though the contract said it could. Extend the message shape so the artifact can report its initial state on the same payload: { type: '__edit_mode_available', visible?: boolean } The host now reads `data.visible`: - omitted → treat as `true` (back-compat: existing artifacts emitting the legacy zero-arg shape mount with the panel already on screen, which is the SDK pattern `useState(true)`). - `visible: true` → toolbar starts ON. - `visible: false` → toolbar starts OFF, panel stays closed; the user opts in by clicking the toggle, which then fires `__activate_edit_mode` via the existing `syncBridgeModes` path. Update `design-templates/tweaks/SKILL.md` to document the new optional field alongside the legacy shape. Surfaced by Siri-Ray in https://github.com/nexu-io/open-design/pull/1643#discussion_r3269955351. Red-spec regression test added: `FileViewer tweaks toolbar > respects __edit_mode_available { visible: false } for default-closed dynamic artifacts`. Verified red without the fix (always-true mirror) and green with the fix (`data.visible !== false`). Validation: web typecheck clean, 1599/1599 tests pass.	2026-05-20 18:00:49 +08:00
Mason	1855b22d24	Improve desktop updater ready UI (#2403 )	2026-05-20 17:29:28 +08:00
Siri-Ray	f4b8fbece2	Fix template project creation flow (#2399 )	2026-05-20 17:18:24 +08:00
Eli	a5e43ae2a4	add discord feedback entry points (#2386 )	2026-05-20 16:22:45 +08:00
shangxinyu1	71044bd3d6	test(e2e): harden extended coverage state assertions (#2245 ) * test(e2e): harden extended coverage contracts * docs(testing): add e2e hardening status * fix(web): persist artifact chips after daemon runs * ci: install playwright browsers for e2e vitest * Fix daemon run recovery across reloads Pin daemon-created runs to assistant messages immediately so hard reloads before the create response can reattach. Replay terminal and active run events from the beginning on reload so restored turns keep assistant text, thinking events, produced files, and artifacts. Fixes #2366 Fixes #2368 Fixes #2371 * test(e2e): preserve fake runtime selection across reload * fix(web): scope daemon run recovery to daemon mode * fix(e2e): remove duplicate delayed smoke flag * fix(web): scope replay artifact recovery to current run * fix(daemon): remove duplicate run-create pin	2026-05-20 16:21:01 +08:00
lefarcen	c80acfefeb	fix(daemon,web): block pitch-deck placeholder publishes and unbreak framework decks (#2384 ) Two preview-time bugs surfaced ahead of 0.8.0: 1. Pitch-deck example (#2215): the official html-ppt-pitch-deck prompt asked the agent to confirm three facts first, but the manifest had no structured `od.inputs`, so the platform's required-input gate had no fields to enforce and the run could publish HTML that still contained unresolved fundraising placeholders (`Name to confirm`, `$X.XM`, `Replace this panel with`, ...). Add structured required inputs to the manifest and a daemon-side publication guard that rejects HTML/deck artifact writes whose body still contains those placeholders. Scope is the file-write boundary only (no assistant-text scanning), so the guard cannot trip on the agent's chat prose mid-clarification. 2. Framework deck preview off-screen: `injectDeckBridge` injected `place-content: center !important` on `.deck-shell` for every deck-mode srcdoc, which forced the framework's `display: grid` shell to re-center its implicit track. The framework's `fit()` already centers a `transform-origin: top left` stage with an explicit `translate(tx, ty)` that assumes the stage's natural layout position is (0, 0); the two centerings stacked and the scaled stage landed ~1000px off-screen, so the preview showed a sliver of slide content in the top-left with the rest black. Skip the override when the framework's `id="deck-stage"` marker is in the doc, and drop the dead `display: grid; place-items: center` from the deck framework template so future drift can't re-introduce the same stack.	2026-05-20 16:20:34 +08:00
lefarcen	d03b8fd095	fix(plugins): stop recommending raw publish CLIs from authoring summary (#2380 ) QA repro (transcript shared on PR #2363): the agent finishes the Home "Create plugin" → plugin-authoring chip flow, validates / packs / installs the generated plugin, then in its summary turn freeform- recommends shell commands like: od plugin publish ./generated-plugin --to open-design cd generated-plugin && git init && git add . && git commit -m "..." gh repo create lefarcen/<name> --public --source=. --push Two things go wrong: 1. The summary recreates the exact flows the plugin-folder card buttons already drive end-to-end (PR #2363 wired the buttons through the right gh + git sequences with auth gates, jq fallback, and retry rules baked in). The freeform suggestions drop those guarantees — they're the source of bug #2332 where `od plugin publish --to open-design` was the recommended action even though it produces an issue URL rather than creating a public repo. 2. The prompt explicitly hinted at `od plugin publish` in step 3 ("Then run or prepare the CLI path: ... and `od plugin publish` when the user is ready to open a registry PR"). The agent dutifully repeated that as the next step, even though the publish work belongs to the plugin-folder card button prompts now. Rewrites `PLUGIN_AUTHORING_PROMPT_TEMPLATE` to: * Keep the original scaffolding instructions (generated-plugin/ + SKILL.md + open-design.json + plugin.repo + inputs). * Replace the loose "od plugin whoami/login through gh, and od plugin publish when the user is ready" sentence with a precise local- validation chain: `od plugin validate` → `od plugin pack` → `od plugin install --source <abs-path>`. * Tell the agent to STOP after the summary, and explicitly ban follow-up suggestions of `od plugin publish`, `od plugin publish --to open-design`, `gh repo create`, `git init` / `git remote add` / `git push`. The ban names the workarounds it has actually emitted in QA transcripts. * Point the user at the plugin-folder card buttons (Add to My plugins / Publish repo / Open Design PR) and call out that those prompts encode the auth gates and fallback rules the summary suggestions would skip. * Same jq guidance carried over from PR #2363: do not assume the standalone `jq` binary, prefer the built-in Read tool / cat / node -e, and disambiguate from `gh ... --jq` (which is fine because gh ships its own embedded library). Tests: * `apps/web/tests/components/home-hero/plugin-authoring-prompt.test.ts` (new) — nine assertions covering: goal-placeholder interpolation, generated-plugin scaffolding mentions, local validation chain, the follow-up CLI ban list (`od plugin publish --to open-design`, `gh repo create`, `git push`), the plugin-folder card hand-off, the jq fallback + gh-flag carve-out, and the round-trip helpers (`buildPluginAuthoringInputs` / `buildPluginAuthoringPromptForInputs` / `createPluginAuthoringHandoff`). Validation: * `pnpm --filter @open-design/web exec vitest run tests/components/home-hero/plugin-authoring-prompt.test.ts` → 9/9 passed * `pnpm --filter @open-design/web exec vitest run tests/components/HomeView.prefill.test.tsx` → 11/11 passed (no regression in the adjacent HomeView prompt-prefill suite; that test references the template literal directly so a silent shape change would surface here) Related: PR #2363 fixes the same source-of-bug shape on the plugin-folder card "Publish repo" button. That PR is the canonical home for the publish flow; this PR makes sure the authoring summary hands off to it rather than re-implementing it inline.	2026-05-20 15:39:42 +08:00
lefarcen	c617e30e27	fix(plugins): make Publish repo actually create the author's repo (#2332 ) (#2363 ) * fix(plugins): make Publish repo actually create the author's repo (#2332) QA repro from 0.8.0 preview: clicking "Publish repo" on a generated plugin's `DesignFilesPanel` card ran the agent down a path that produced an Open Design registry-submission URL but never created the author's GitHub repo. After the action finished, `gh repo view nuomi/cat` still returned 404 and `git ls-remote https://github.com/nuomi/cat.git HEAD` failed with "Repository not found". Root cause is the action prompt at `apps/web/src/components/design-files/pluginFolderActions.ts:11-12`: publish: 'Use the supported `od plugin publish` or repository-publish flow after confirming the manifest.' That sentence let the agent pick the legacy registry-link CLI (`od plugin publish --to open-design`), which mirrors the path the "Open Design PR" button takes and emits an issue URL instead of creating a public repo. The button label said "Publish repo" but the behavior collapsed onto the registry-submission flow. This PR rewrites the `publish` prompt in the same shape PR #2182 used for `contribute` — a numbered gh + git sequence that drives the real action end-to-end: 1. Pre-flight `gh --version` / `gh auth status`. Invalid / expired tokens are treated the same as not-logged-in (the bug-report agent kept going past an "invalid token" warning). 2. Read manifest, capture `name`, `version`, `description`, `plugin.repo`. Fall back to `https://github.com/<gh-login>/<name>` when `plugin.repo` is missing and write it back into the manifest. 3. `gh repo view <owner>/<name>` to decide create-vs-update. 4a. Repo does not exist → `git init` + commit + tag + `gh repo create <owner>/<name> --public --source . --push`. 4b. Repo exists → reuse the remote, `git add -A` + `git commit -m "Update: <name> v<version>"` (skip if working tree is clean), `git tag v<version>` (skip if already published), `git push`. 5. Verify with `gh repo view <owner>/<name> --json url,nameWithOwner`. 6. Hand off the resolved `https://github.com/<owner>/<name>` URL to chat. End the turn. Hard constraints encoded in the prompt: * Do NOT call `od plugin publish --to open-design` (or any `--to <catalog>` variant). That is the registry-submission flow. * Do NOT call `AskUserQuestion` — fire-and-forget, same as the `contribute` flow's stall fix. * Do NOT auto-install gh/git. Detect-and-instruct only. * Do NOT force-push or overwrite a published tag. * Do NOT retry a failed step. Report and stop. Refactor: pulled `publish` out of the shared `ACTION_TITLES`/ `ACTION_NOTES` template into its own `buildPublishPrompt(folderPath)` function (mirrors `buildContributePrompt` from PR #2182). `install` keeps the simple shared template — that action stays inferrable from the manifest and doesn't need the same blast radius. Tests: * `apps/web/tests/components/pluginFolderActions.test.ts` — extends the existing contract suite with seven new `publish` assertions: targets `plugin.repo` not the registry catalog, drives the full gh + git command list, handles both new-repo and existing-repo branches, explicit ban on the registry-submission CLI, hard-bans on AskUserQuestion / auto-install / force-push / retry, "invalid token" treated as STOP, `${folderPath}` interpolation guard, ends by handing the repo URL back to chat. Validation: * `pnpm --filter @open-design/web exec vitest run tests/components/pluginFolderActions.test.ts` → 16/16 passed (was 9/9 before this PR; +7 new publish-flow assertions, the old generic "mentions od plugin publish" assertion replaced with the precise contract above) (Local `pnpm --filter @open-design/web typecheck` fails on `tests/runtime/exports.test.ts` because `packages/host/dist/testing` isn't built in this checkout — pre-existing breakage from `2c128e0e refactor desktop host bridge` on main, unrelated to this prompt change. CI runs a fresh install and was green on the four previous prompt-only PRs that touched the same module.) Closes #2332. * fix(plugins): don't assume standalone jq when reading the manifest QA repro from the Open Design PR button (transcript shared with the PR #2363 thread): the agent reached step 2 of the contribute prompt, ran `jq '{name,title,description,version}' generated-plugin/open-design.json`, got `zsh:1: command not found: jq`, and stopped per the prompt's "stop on first hard failure" rule. No fork, no branch, no PR. jq is not part of the OD agent runtime baseline — default macOS and Windows shells don't ship it. The agent reached for it first because "jq" is the default JSON tool in claude/codex's shell training distribution, not because the prompt asked for it. The prompt just said "Load and capture", which the agent interpreted as "shell out to the most common JSON parser". Updates both step-2 instructions (contribute + publish prompts) to: - List portable manifest-read alternatives in priority order: the built-in Read tool (always available); `cat` + manual JSON parsing; `node -e 'JSON.parse(...)'` as the shell-only fallback. - Add an explicit "Do not assume the standalone `jq` binary is installed" guard with the macOS / Windows shell rationale. - Disambiguate the standalone `jq` CLI from `gh ... --jq`. The gh flag uses an embedded library and is fine — without the disambiguation the agent reads the ban literally and stops using `gh api user --jq .login` at step 3. Tests: * `apps/web/tests/components/pluginFolderActions.test.ts` — two new contract assertions: - publish prompt: warns against assuming standalone jq is installed; lists cat and node -e as alternatives. Closes the regression on its own surface. - shared block: both contribute and publish prompts disambiguate standalone jq from `gh ... --jq`. One assertion guards both flows so a future prose edit can't drop the carve-out on one side. Validation: * `pnpm --filter @open-design/web exec vitest run tests/components/pluginFolderActions.test.ts` → 18/18 passed (was 16/16 on the previous PR #2363 commit; +2 new jq-guidance assertions) Continues PR #2363. Same source-of-bug shape as the registry-submission fallback issue this PR was opened to fix: agent picks a tool the prompt didn't actually ask for because the prompt was loose.	2026-05-20 15:38:29 +08:00
chaoxiaoche	25f977c84c	fix(web): rename FileViewer Share button label to Export (#2233 ) * fix(web): rename FileViewer Share button label to Export The toolbar button in FileViewer triggers a menu where 5 of 8 items are Export/Download actions and only Deploy to Vercel + Copy link are real "share" semantics. The previous "Share" label mismatched user mental model (Share = send a link to someone, Export = save a file locally), which likely contributed to a steep studio_view -> studio_click drop in the artifact-export funnel (~18% step conversion). Rename only the i18n value for fileViewer.shareLabel across all 18 locales so the button reads "Export" (and the locale-equivalent). Keys, component code, icon, and menu contents are unchanged. The unused fileViewer.share key and the unrelated examples.shareMenu / preview.shareMenu keys are left intact. Co-authored-by: Cursor <cursoragent@cursor.com> * Improve export CTA and BYOK test feedback * Fix BYOK provider review regressions * Fix settings locale regressions * Fix design system import header layout --------- Co-authored-by: chaoxiaoche <chaoxiaoche@chaoxiaochedeMacBook-Pro.local> Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-20 15:22:32 +08:00
shangxinyu1	5fc27f8923	Fix daemon run recovery across reloads (#2374 ) * Fix daemon run recovery across reloads Pin daemon-created runs to assistant messages immediately so hard reloads before the create response can reattach. Replay terminal and active run events from the beginning on reload so restored turns keep assistant text, thinking events, produced files, and artifacts. Fixes #2366 Fixes #2368 Fixes #2371 * Fix ProjectView daemon run recovery tests	2026-05-20 15:10:23 +08:00
Ethan Guo	38b59fa4d9	fix(web): confirm before deleting saved template in New Project (#2330 ) * fix(web): confirm before deleting saved template in New Project EntryShell and NewProjectModal were dropping the onDeleteTemplate prop, so the trash button in the template picker was inert; thread it through and gate it behind an alertdialog confirm. Fixes #2237 * fix(web/i18n): translate deleteTemplate keys across 18 locales The initial commit shipped English placeholders for the 4 new newproj.deleteTemplate* keys; replace with localized strings matching each locale's existing designs.deleteConfirm style. * fix(web): gate confirm-dialog backdrop on in-flight delete + style error message Backdrop now no-ops while `deleting` is true (matching the Cancel and Delete button gating), preventing a false-cancellation race where the backgrounded DELETE keeps running and leaks a stale `deleteError=true` into the next dialog open; also adds the missing `.modal-confirm-error` rule so the inline failure copy renders as a destructive state instead of default body styling.	2026-05-20 14:50:36 +08:00
Bryan	c530d163f8	feat(web): "Resume conversation in new chat" UI — #462 Commit B (companion to #1718 ) (#2264 ) * feat(contracts): add handoff request/response DTOs Adds HandoffRequest, HandoffResponse, and HANDOFF_SCHEMA_VERSION for the upcoming POST /api/projects/:id/handoff synthesis endpoint. Mirrors the finalize.ts subpath pattern (package.json#exports + esbuild entry + index re-export) so daemon and web can import @open-design/contracts/api/handoff. Refs nexu-io/open-design#462. * feat(daemon): add handoff synthesis pipeline (buildHandoffPrompt + synthesizeHandoffPrompt) Adds `apps/daemon/src/handoff-design.ts` exposing the resume-conversation synthesis primitives the upcoming `POST /api/projects/:id/handoff` route will call into. - `buildHandoffPrompt({ projectId, transcriptJsonl, transcriptMessageCount, now })` returns the system + user prompts. System prompt asks Claude to emit a structured Markdown body with Context / Decisions made / Open questions / Current focus / Provenance, with Provenance bullets explicitly flat (no Markdown emphasis on labels) to preempt the PR #1584 round-2 parser bug. - `synthesizeHandoffPrompt(db, projectsRoot, projectId, options)` reuses the existing finalize-design pipeline pieces: `exportProjectTranscript` → `truncateTranscriptForPrompt` → `buildHandoffPrompt` → `callAnthropicWithRetry` → `extractDesignMd`, but without the lockfile, disk write, design-system, or artifact-resolution paths. - Promotes `DEFAULT_TIMEOUT_MS` in finalize-design.ts to `export const` so handoff shares the same 120s upstream-call bound. Refs nexu-io/open-design#462. * feat(daemon): wire POST /api/projects/:id/handoff route Adds the handoff HTTP route and registers it in server.ts. Validation block + error-mapping shape mirror registerFinalizeRoutes (BYOK payload, upstream-error → ApiErrorCode mapping, redactSecrets on the raw upstream body). Handoff has no lockfile, so the CONFLICT branch is omitted. `res.on('close')` is wired to flip an AbortController whose signal is threaded into synthesizeHandoffPrompt, so a UI-side cancel actually aborts the daemon-side Anthropic call rather than letting it keep running after the client walks away (mirrors the PR #974 fix for finalize). - `apps/daemon/src/handoff-routes.ts` — new, exports registerHandoffRoutes + RegisterHandoffRoutesDeps. - `apps/daemon/src/server-context.ts` — adds handoff slot to ServerContext. - `apps/daemon/src/route-context-contract.ts` — adds RegisterHandoffRoutesDeps to the compile-time coverage assertion. - `apps/daemon/src/server.ts` — imports synthesizeHandoffPrompt + registerHandoffRoutes, builds handoffDeps, registers the route next to finalize. - `apps/daemon/tests/handoff-route.test.ts` — 12 HTTP-layer tests: validation (400/403/404), happy path, upstream error mapping (401/429/502/502 non-JSON), api-key redaction. - `apps/daemon/tests/handoff-route-abort.test.ts` — client-disconnect aborts the daemon-side controller. Refs nexu-io/open-design#462. * fix(daemon): map TranscriptExportLockedError to 409 CONFLICT on handoff route `exportProjectTranscript` acquires a per-project `.transcript.lock` internally (apps/daemon/src/transcript-export.ts:131-163) and throws `TranscriptExportLockedError` on EEXIST. Concurrent handoff requests — or a handoff that races `/api/projects/:id/finalize/anthropic` — lost that lock and surfaced as 500 INTERNAL_ERROR through the route's generic catch. - `apps/daemon/src/handoff-routes.ts` — catch `TranscriptExportLockedError` and return `409 CONFLICT` ahead of the generic 500 branch, mirroring the existing `FinalizePackageLockedError → 409 CONFLICT` mapping at `apps/daemon/src/import-export-routes.ts:603-605`. - `apps/daemon/src/server.ts` — thread `TranscriptExportLockedError` through `handoffDeps` so the route can match without a direct import. - `apps/daemon/src/handoff-design.ts` — correct the module header comment that incorrectly claimed "no lockfile (concurrent handoff calls are safe)" — handoff does not add its own lock, but it does transitively acquire `.transcript.lock` via the transcript-export call. - `apps/daemon/tests/handoff-route.test.ts` — regression test that pre-acquires `.transcript.lock` on disk via `fs.openSync(lockPath, 'wx')` before firing a handoff request, asserts 409 CONFLICT. Refs nexu-io/open-design#462 — addresses @nettee's blocking review on PR #1718 (comment 3242251338). * fix(daemon): keep handoff request timeout armed through the response body read `synthesizeHandoffPrompt` cleared the upstream-call timeout in a `finally` that ran as soon as `callAnthropicWithRetry` returned. But `fetch()` resolves once the upstream sends headers — so the subsequent `await response.json()` body read ran with no timeout. A response that sends headers and then stalls its body could hang `/api/projects/:id/handoff` indefinitely instead of failing. - `apps/daemon/src/handoff-design.ts` — move `clearTimeout(timeoutId)` into a single outer `finally` spanning both the call and the `response.json()` body parse, so the timeout stays armed until the body is fully consumed. - `apps/daemon/src/handoff-design.ts` — the body-parse catch now re-throws `AbortError` as-is, mirroring the call-phase catch. Without this a body-phase timeout would surface as `502` "non-JSON body"; re-throwing lets the route map it to the intended `503` "handoff timed out" (`handoff-routes.ts:122-124`). - `apps/daemon/tests/handoff-design.test.ts` — regression test: a `fetchImpl` returning a `Response` whose body never closes after headers, raced against a 500ms deadline, asserts the call aborts (not hangs) and rejects with `AbortError`. Refs nexu-io/open-design#462 — addresses @nettee's round-2 blocking review on PR #1718 (`handoff-design.ts:196`). * fix(daemon): map upstream 400 to 400 BAD_REQUEST on handoff route `callAnthropicWithRetry` preserves a non-retryable upstream status, so an Anthropic HTTP 400 (`invalid_request_error` — unknown model, invalid maxTokens, malformed body) reached the route's `FinalizeUpstreamError` branch and fell through to `502 UPSTREAM_UNAVAILABLE`. That reported deterministic caller input as a transient server outage, inviting pointless retries and hiding which field was wrong. - `apps/daemon/src/handoff-routes.ts` — special-case `err.status === 400` to `400 BAD_REQUEST` with the redacted upstream detail, ahead of the generic 502. Also refresh the route docblock: it claimed the 409 branch was omitted (stale since the R1 TranscriptExportLockedError fix) and that error mapping fully mirrors finalize (now diverges on 400). - `apps/daemon/tests/handoff-route.test.ts` — route test driving an Anthropic `400 invalid_request_error`: asserts 400 BAD_REQUEST, the upstream detail is surfaced, and an echoed key is redacted. - `packages/contracts/tests/package-runtime.test.ts` — import `@open-design/contracts/api/handoff` through the package `exports` map and assert `HANDOFF_SCHEMA_VERSION`, covering the built publish surface (esbuild entry + exports map + root re-export) that the source-only `handoff-contract.test.ts` does not exercise. Refs nexu-io/open-design#462 — addresses @nettee's round-3 blocking review on PR #1718. * fix(daemon): await the now-async external base-URL validator on handoff route Main's #1176 (`9a64fccd`) made `validateExternalApiBaseUrl` DNS-aware and asynchronous (`validateBaseUrlResolved`) and updated the proxy and finalize callers to `await` it. The handoff route — added on this branch in parallel, against the old synchronous validator — still called it without `await`, so `validated` was a Promise: `validated.error` / `validated.forbidden` were `undefined`, the SSRF / malformed-URL guard silently no-opped, and a bad `baseUrl` fell through to the upstream call and surfaced as 502. A semantic merge break — no textual conflict, green on the branch in isolation, red once CI re-merged latest main. - `apps/daemon/src/handoff-routes.ts` — `await validateExternalApiBaseUrl(...)`, mirroring the finalize route (`import-export-routes.ts:561`). The handler is already `async`. The existing `handoff-route.test.ts` cases "400 BAD_REQUEST when baseUrl is not a valid URL" and "403 FORBIDDEN when baseUrl points at a private internal IP" already encode this — red against branch + latest main, green now. Refs nexu-io/open-design#462 — PR #1718 CI fix. * chore(daemon): list handoff in the assertServerContextSatisfiesRoutes literal The `assertServerContextSatisfiesRoutes({...})` call in `server.ts` enumerates every route registrar's deps but omitted `handoff`. Adding `handoff: handoffDeps` makes the literal complete and consistent with the other route deps. This was not a typecheck break: route-dep coverage is guaranteed by the `Assert<ServerContext extends AllRegisteredRouteDeps>` type in `route-context-contract.ts` — and `AllRegisteredRouteDeps` already includes `RegisterHandoffRoutesDeps` — not by this assertion-call literal. The literal has omitted `handoff` since this branch's first push (`806db576`) through green CI throughout; `tsc -p tsconfig.json --noEmit` is clean before and after. Refs nexu-io/open-design#462 — addresses @nettee's round-4 review note on PR #1718. * feat(web): add "Resume conversation in new chat" action (#462) Adds a Resume control to the chat header, next to "New conversation". Clicking it synthesizes a handoff prompt from the current transcript via POST /api/projects/:id/handoff, opens a fresh conversation, and auto-sends the synthesized prompt as its first user message — so a drifted session resumes without the user replaying context by hand. The old conversation is preserved. - synthesizeHandoff() web-state wrapper in apps/web/src/state/projects.ts - resume-conversation icon button in ChatPane (onResumeConversation / resumeConversationDisabled props) - handleResumeConversation + pendingResumeRef + auto-send effect in ProjectView; effect gates on messagesConversationId so the prompt cannot fire before the new conversation's message read settles - chat.resumeConversation i18n key across all 19 locales Commit B of #462; Commit A is the daemon endpoint (PR #1718). This branch is stacked on feat/handoff-endpoint so the web code resolves @open-design/contracts/api/handoff. * fix(daemon): scope handoff to one conversation + reject empty transcripts (#462) Addresses the review on #1718 and #2264: - mrcfps (#2264): the handoff endpoint exported the whole project's transcript, so a multi-conversation project blended unrelated chats into the synthesized prompt. HandoffRequest now carries a required conversationId; the route validates it belongs to the project (404 CONVERSATION_NOT_FOUND), and exportProjectTranscript takes an optional conversationId filter so only that conversation is exported. - nettee (#1718): a zero-message conversation still called Anthropic and fabricated a handoff. synthesizeHandoffPrompt now throws EmptyTranscriptError on messageCount === 0; the route maps it to 400 EMPTY_TRANSCRIPT before any BYOK tokens are spent. HANDOFF_SCHEMA_VERSION bumped to 2 (conversationId is a new required request field). Regression tests: a two-conversation scoping test, an empty-conversation route + pipeline test, and a transcript-export conversationId-filter unit test. * feat(web): send conversationId with the resume handoff request (#462) Follows the handoff endpoint becoming conversation-scoped. The resume flow now passes the active conversationId to POST /handoff so the synthesized prompt summarizes only the conversation being resumed. handleResumeConversation bails when there is no active conversation; synthesizeHandoff and the resume tests carry the new field. * feat(daemon): add `od project handoff` CLI + register handoff error codes (#462) Addresses the second-round review on #1718 and #2264: - mrcfps (#2264): per AGENTS.md "Capability exposure (UI/CLI dual-track)", a user-facing capability must be reachable through the `od` CLI, not only the web UI. Adds `od project handoff <id> --conversation <id> --api-key <key> --model <model> [--base-url] [--max-tokens] [--json]`, driving the same POST /api/projects/:id/handoff endpoint. The logic lives in a testable handoff-cli.ts sibling module (mirrors artifacts-cli.ts) so cli.ts's import-time dispatch stays out of tests. - nettee (#1718): the route emitted CONVERSATION_NOT_FOUND and EMPTY_TRANSCRIPT, which were absent from the shared API_ERROR_CODES union. Both are now registered in packages/contracts/src/errors.ts, with a contract test pinning them so the route and contract cannot drift again. A CLI contract test covers the conversation-scoped request shape, --json output, flag validation, and daemon-error surfacing. * fix(daemon): fail `od project handoff` on a malformed 2xx response (#462) Addresses nettee's review on #1718: runProjectHandoff treated any 2xx response as success, so a broken daemon/proxy 200 with malformed or shape-invalid JSON would print `undefined` (or `{}` under --json) and still exit 0 — breaking the fail-fast contract scripts rely on. It now validates the body is a well-formed HandoffResponse via an isHandoffResponse type guard and fails fast otherwise. Regression tests cover a shape-invalid and an unparseable 200 body. * feat(web): surface the daemon's classified handoff error in the resume toast (#462) Addresses mrcfps's non-blocking note on #2264: synthesizeHandoff returned null for every non-2xx response, so RATE_LIMITED, EMPTY_TRANSCRIPT, and an upstream 400 with provider detail all collapsed into one generic "check your API key" toast — even though handoff-routes.ts had already classified and sanitized them. synthesizeHandoff now returns the daemon's structured `{ error }` on a classified failure; `null` stays reserved for a transport failure or an unparseable body. handleResumeConversation surfaces error.message plus redacted details for the `{ error }` case, and a distinct daemon-unreachable message for null. * fix(web): omit empty baseUrl from the resume handoff request (#462) Addresses mrcfps's review on #2264: the default Anthropic config normalizes baseUrl to '' (config.ts), and the handoff route 400s an explicit empty baseUrl — so the Resume action failed before synthesis for every user who never set a custom base URL. handleResumeConversation now forwards baseUrl only when config.baseUrl is a non-empty string, matching the contract's optional-field semantics. Tests: the default-config path asserts baseUrl is absent from the request, and a new case covers a custom baseUrl being forwarded. * refactor(daemon): dispatch `od project handoff` before the generic project parser (#462) Addresses nettee's non-blocking note on #1718: runProject ran the shared parseFlags(PROJECT_*) before reaching the handoff switch case, so a malformed `od project handoff` invocation (`--unknown`, `--max-tokens` with no value) threw out of the generic parser instead of hitting handoff-cli's structured fail() — the entrypoint behaved differently from the unit-tested runProjectHandoff helper. The handoff sub now short-circuits before parseFlags / projectDaemonUrl, so `od project handoff` runs exactly runProjectHandoff with no intervening parsing. handoff-cli.test.ts gains unknown-flag and missing-value cases covering the structured fail path. --------- Co-authored-by: DevForgeAI CI/CD Engineer <devforge-ai@development.ai>	2026-05-20 13:28:27 +08:00
lefarcen	204599a7ae	feat(analytics): ship PostHog v2 event schema (#2285 ) * feat(analytics): ship PostHog v2 event schema Aligns the PostHog wire format with the product team's v2 tracking spec (Open Design 埋点文档 2.0). The previous v1 catalogue defined a flat per-page event name (home_view / studio_click / settings_view…); v2 collapses everything to four core events identified through the page_name + area + element triplet so dashboards can group by surface without owning a separate event per page. Key changes - packages/contracts/src/analytics: collapse to page_view / ui_click / surface_view / _result event names; bump EVENT_SCHEMA_VERSION to 2; rename the wire field anonymous_id → device_id (value unchanged); promote the configure-state triplet (has_available_configure_cli / configure_type / configure_availability) to a global PostHog register so every event inherits it without per-helper boilerplate. - apps/web/src/analytics: rewrite the 43 trackXxx helpers behind the new typed catalogue; opt out of PostHog's built-in UA bot filter so legitimate embedded webviews, fingerprinted browsers, and the Playwright-based e2e runs ingest captures (the Privacy → "Share usage data" toggle remains the single consent gate). - apps/web components: wire P0/P1/P2 click + view + result surfaces end-to-end — left nav, toolbar, home chat composer, recent projects, new project modal, plugins / design systems / integrations / automations pages, file manager, artifact toolbar/header/share popup, feedback panel, settings sidebar / language / appearance / notifications / pets / privacy / connectors. Fixes the v1 feedback bug where action=clear_feedback_rating shipped rating=null instead of the rating being cleared. - apps/daemon: extend run_created / run_finished with the v2 context (entry_from / project_kind / target_platforms / fidelity / connectors / etc.), add explicit error_code classification on result=failed (run.errorCode → AGENT_SIGNAL_ → AGENT_EXIT_* → AGENT_TERMINATED_UNKNOWN), and read device_id from the new x-od-analytics-device-id header. Also moves the run_created / run_finished emission to the canonical /api/runs handler in server.ts; the chat-routes copy was shadowed by Express's earlier registration and never executed, which also meant run.clientType never made it to Langfuse — fixed in the same move. Verification - pnpm guard / pnpm typecheck clean for daemon, web, and contracts. - pnpm --filter @open-design/web test: 1645/1645 passing. - End-to-end smoke through Playwright + local PostHog ingest project 420348: every page_view (home/projects/automations/design_systems/ plugins/integrations/chat_panel/file_manager), every nav element, the new_project_modal surface_view + tab + create flow, the plugin_replacement_modal surface_view, settings_view across nine sections, settings_cli_test_result (codex CLI), the project_create_result success path, and run_created + run_finished (result=failed, error_code=AGENT_EXIT_1) all reached PostHog with the v2 schema and the expected device_id / page_name / area / element / fidelity / target_platforms props. The remaining _result events (artifact_export / feedback_submit / file_upload / plugin_replacement / settings_byok_test / settings_connector_auth) are wired in code; production traffic will trigger them. fix(analytics): preserve style category on design-systems surface chip switch The merge resolution in DesignSystemsTab incorrectly re-introduced a `setCategory('All')` call alongside the new `trackDesignSystemsTopClick` emit. main intentionally keeps the active style category when the surface filter refines within it; the regression was caught by the existing "keeps the style category when a surface chip refines within it" test in tests/components/DesignSystemsTab.test.tsx. * fix(analytics): address review — senseaudio passthrough + daemon-side configure-state Two follow-ups from the v2 schema review on #2285: 1. `byokProtocolToTracking()` was still falling through to `null` for `senseaudio` even though the v2 BYOK provider enum now lists it. Every `SettingsDialog` BYOK call site guards on `if (byokProviderId)`, so a user on SenseAudio was silently dropping the provider-option, field-focus, and test-result captures. Added the missing case so SenseAudio gets the same analytics coverage as the other providers. 2. The daemon-authoritative `run_created` / `run_finished` events were missing the configure-state triplet (`has_available_configure_cli` / `configure_type` / `configure_availability`) that v2 promotes to a global register on the web side. Daemon captures don't go through the PostHog global register, so dashboards couldn't segment run lifecycle by execution setup after the migration. The fix derives the triplet server-side from `detectAgents()` and the request's `agentId` before `design.analytics.capture(...)`: - has_available_configure_cli: any CLI on PATH reports installed - configure_type: 'local_cli' when the run targets an installed CLI, otherwise 'unknown' (daemon can't see BYOK keys, which live in web-client storage) - configure_availability: 'available' / 'unavailable' / 'unknown' based on the requested agent's install status, with a fallback to 'available' when any CLI is installed This keeps the v2 schema consistent across both daemon-side and web-side captures. * fix(analytics): wire setConfigureGlobals so browser events carry fresh state Third follow-up from the v2 schema review on #2285. The previous fix addressed senseaudio + daemon-side configure-state, but reviewer flagged that `setConfigureGlobals` was still defined-only — no caller — so every browser-side capture inherited the boot defaults (`has_available_configure_cli=false`, `configure_type='unknown'`, `configure_availability='unknown'`). PostHog dashboards therefore could not segment the new `page_view` / `ui_click` / `surface_view` events by execution setup after a user configured their environment. Changes: - `packages/contracts/src/analytics/events.ts` — add a pure `deriveConfigureGlobals(mode, agentId, agents, byokConfigured)` helper so the web client and the daemon can derive the triplet from the same source of truth. The helper covers all 5 `configure_type` buckets (`local_cli` / `byok` / `both` / `none` / `unknown`) and the 3 `configure_availability` buckets (`available` / `unavailable` / `unknown`). - `apps/web/src/App.tsx` — add a useEffect that re-derives the triplet whenever the user changes execution mode, selects a new CLI, saves a BYOK key, or the detected-agent list refreshes, then pushes it to PostHog via `analytics.setConfigureGlobals(...)`. The setter goes through the provider so the analytics module stays the single source of truth. - `apps/web/src/analytics/provider.tsx` — expose `setConfigureGlobals` on the analytics context and the test stub so consumers route through the provider boundary. - `apps/daemon/src/server.ts` — switch the daemon-side derive in `/api/runs` to the shared `deriveConfigureGlobals` helper so the authoritative run_created/run_finished captures match the web-side payload. BYOK credentials live in the web client and stay invisible to the daemon, so the daemon arm passes `byokConfigured: undefined` and falls back to the installed-CLI signal. - `apps/web/tests/analytics-configure-globals.test.ts` — new regression test that pins the derive behavior across all branches and confirms the setter actually mutates the client-side store. Locks the wire-up so a future refactor can't silently turn the setter back into a no-op. Verification: pnpm guard clean; daemon / web typecheck clean; web tests 1703/1703 passing (up from 1696 — 7 new tests in the configure-globals suite). * fix(analytics): emit projects page_view + drop misattributed chat_panel source Fourth review pass on PR #2285. Two follow-ups from mrcfps: 1. DesignsTab (projects landing) was emitting click events but no matching page_view. Opening /projects without clicking anything left the surface invisible in PostHog. Added a once-per-mount trackPageView({ page_name: 'projects' }) with the same ref-keyed pattern HomeView / PluginsView use. 2. ChatComposer was hard-coding source: 'recent_project' on every chat_panel page_view. The web router currently only carries projectId / conversationId / fileName, so we cannot distinguish a New-project launch from a template-pick or a Recent-projects click from this layer. A false constant would over-attribute every chat launch to 'recent_project' and break the funnel slice this schema was meant to unlock. Dropped the field for now — better no source than the wrong source — until the router grows a launch-source channel; the field is still defined as optional on PageViewProps so the channel can land in a follow-up PR. Verification: web typecheck clean; web tests 1703/1703 passing. * fix(analytics): correct plugin-replacement async result + heterogeneous upload + missing requestId Three follow-ups from the fifth review pass on PR #2285: 1. plugin_replacement_result emitted before the apply settled (`apps/web/src/components/HomeView.tsx`). The modal's confirm action was a synchronous wrapper around an async `usePlugin(...)` call, so the surrounding try/catch never observed real failures and every attempt was reported as `result=success`. Changed `PendingReplacement. confirm` to return `Promise<void>`, made the wrapper return the underlying promise, and moved the analytics emit into an async IIFE in the click handler so the success/failure branches reflect the actual outcome. 2. file_upload_result mis-typed heterogeneous batches (`apps/web/src/components/FileWorkspace.tsx`). The earlier implementation only inspected `picked[0]`, so a mixed batch like `image.png + demo.mp4` reported `file_type=image`. Per the comment above the block ("mixed batches collapse to other"), the implementation now maps every file to a tracking type, collapses to `other` when more than one distinct type is present, and falls back to the single type otherwise. 3. project_create_result lost the click→result correlation id (`apps/web/src/components/NewProjectPanel.tsx`). The click event no longer carried the locally-generated `requestId` that `project_create_result` keeps, so the two could not be joined. `trackNewProjectModalElementClick()` now accepts an optional `{ requestId }`, mirroring the other helpers, and the create-button click threads the same id used for the result. Verification: web typecheck clean; web tests 1703/1703 passing. * fix(analytics): gate configure-state on agents probe + drop unsent run_created fields Two follow-ups from the sixth review pass on PR #2285: 1. Cold-start configure-state was stamped before fetchAgents() landed (`apps/web/src/App.tsx`). The useEffect that pushes the v2 triplet into the PostHog global register fired on first paint with `agents=[]`, so the first home/projects/plugins page_view reported `has_available_configure_cli=false` / `configure_availability= unavailable` even on machines that did have an installed CLI. The effect now waits on `agentsLoading === false` and leaves the boot defaults ('unknown'/'unknown') in place until the probe resolves. 2. Daemon read run-context fields the web never sends (`apps/daemon/src/server.ts`). The daemon-side run_created / run_finished baseProps read `projectKind`, `entryFrom`, `projectSource`, `targetPlatforms`, `companionSurfaces`, `fidelity`, `connectors`, `useSpeakerNotes`, `includeAnimations`, `referenceTemplate`, and `aspect` from `req.body`, but `packages/contracts/src/api/chat.ts` and `apps/web/src/providers/daemon.ts` don't carry those keys on the wire. Reading them therefore always produced null/undefined. Dropped the unsent fields from the daemon capture; a follow-up can extend the create payload to thread the real context through. The `design_system_id` field stays because the chat contract does send it. Tests: added 3 regression tests in `tests/analytics-configure-globals. test.ts` covering the boot-time gating contract (empty agents + daemon mode → unavailable / local_cli; installed agent → available; undefined agents list → unavailable). Verification: web typecheck clean; daemon typecheck clean; web tests 1706/1706 passing (up from 1703 — 3 new cold-start tests). * fix(analytics): pin mode='daemon' so missing-agent run reports unavailable Eleventh review pass on PR #2285. mrcfps flagged that `apps/daemon/src/server.ts` was calling `deriveConfigureGlobals(...)` without `mode`, so the helper fell through to the generic branch. Result: a run for an uninstalled agent was tagged `configure_availability: 'available'` whenever any OTHER CLI was on PATH, because the generic branch only looks at the cohort-wide "any installed?" signal. That precisely undermines the slice the daemon emit is trying to power. The daemon's /api/runs handler is always a daemon-mode capture (daemon is the local CLI runner — BYOK lives in the web layer), so we now pin `mode: 'daemon'` on the call site. The helper then judges `configure_availability` from the REQUESTED agent's install status and reports `unavailable` when the user picked an agent that is not installed, even if peers are. Added a regression case in `tests/analytics-configure-globals.test.ts`: `{ mode: 'daemon', agentId: 'codex', agents: [{claude,true},{codex,false}] }` → `{ has_available_configure_cli: true, configure_type: 'local_cli', configure_availability: 'unavailable' }`. Verification: daemon typecheck clean; web tests 1707/1707 passing (up from 1706 — 1 new regression test). * fix(analytics): hoist chat_panel page_view + thread requestId - Move chat_panel page_view emit from ChatComposer to ProjectView so it survives activeConversationId-driven ChatPane remounts. ProjectView keys the dedupe ref by project.id; the composer drops its duplicate. - Thread { requestId } into trackAssistantFeedbackReasonSubmitClick so the click pairs with the existing feedback_submit_result on the same request id (mirrors the trackNewProjectModalElementClick pattern). * fix(analytics): keep v2 super-props alive across reset and stamp design_system_source - Snapshot the register payload in client.ts on PostHog init and re-register it from applyConsent(true) and applyIdentity() so a privacy-toggle or Delete-my-data rotation does not resume capture without event_schema_version / device_id / session_id / locale / configure-state globals. setConfigureGlobals() also patches the cache so a later restore picks up the current configure state. - Stamp design_system_source on daemon-side run_created / run_finished (it is required by RunCreatedProps / RunFinishedProps). Daemon can't tell default vs user_selected vs inherited from the wire, so it derives 'unknown' when designSystemId is present, 'not_applicable' otherwise — a follow-up that threads designSystemSource through CreateRunRequest can replace this with the precise source.	2026-05-20 13:04:20 +08:00
PerishFire	15d08d4158	feat: add windows packaged auto update flow (#2362 )	2026-05-20 12:56:14 +08:00
lefarcen	1ab8758045	Revert "fix(web): demote Plugins and Integrations to nav rail footer (#1806 )" (#2360 ) This reverts commit `a38e09f931`.	2026-05-20 12:43:49 +08:00
Ethan Guo	41e0e26d86	fix(web): stop re-syncing composer scroll on every keystroke (#2282 ) The `draft` dep on the overlay scroll-snap effect triggered setComposerScrollTop on every keystroke, looping through the v0.8.0 mention overlay state and crashing the chat composer with Maximum update depth exceeded. Fixes #2097	2026-05-20 11:15:59 +08:00
kami	d72d689430	fix(web): preserve daemon run reattach across project switch (#2317 ) Co-authored-by: multica-agent <github@multica.ai>	2026-05-20 11:14:25 +08:00
Sid	ba6497b399	fix(web): gate srcDoc transport activation on shell ready (#2320 ) Opening Tweaks could leave the HTML preview iframe blank when the host posted `od:srcdoc-transport-activate` before the lazy transport shell had registered its message listener. The dropped activation was then suppressed by the dedupe check in subsequent onLoad calls, stranding the iframe on the 536-byte empty shell. The race surfaces after the iframe re-mounts: closing Tweaks increments `srcDocTransportResetKey`, which re-mounts the srcDoc iframe with a fresh shell. Re-opening Tweaks immediately fires the `useUrlLoadPreview`-driven `activateSrcDocTransport()` while the new shell is still loading. The post lands in a window with no listener yet, but `activatedSrcDocTransportHtmlRef.current` is marked to the current srcDoc, so the iframe's onLoad call later short-circuits. Fix: - The lazy transport shell now posts `od:srcdoc-transport-ready` to its parent once its activate-listener is installed. - FileViewer tracks `srcDocShellReady`, reset on iframe re-mount and set when ready arrives (with onLoad as belt-and-suspenders). - Activation is funneled through a new pure helper `canActivateSrcDocTransport()` that adds the ready gate to the existing pre-conditions. When ready flips later, the `activateSrcDocTransport` useCallback regenerates and the enclosing useEffect re-fires the activation cleanly. Tests cover the shell handshake (red on main: shell never posts ready) and the gating helper (red on main: function did not exist). Fixes #2253	2026-05-20 11:13:40 +08:00
kami	dea07840f3	fix: stop stale pinned todos after terminal runs (#2321 ) Co-authored-by: multica-agent <github@multica.ai>	2026-05-20 11:13:20 +08:00
Sid	8b16d21785	fix(web): coalesce chokidar rewrite bursts before refreshing files (#2326 ) Agent rewrites surface to chokidar as `unlink` + `add` (+ optional `change`) within a single tick. ProjectView refreshes the file list on every event, so the open tab's active file vanishes for one frame between the `unlink` and the `add`. FileWorkspace's `activeFile` resolver returns null when the active name disappears from `visibleFiles`, and the preview falls back to an empty state mid-run. Add a trailing-coalesce around the file-changed refresh: - First event arms an 80ms quiet window. - Subsequent events inside the window reset it. - A 250ms maxWait cap ensures a sustained edit storm still flushes. - The intermediate `unlink` is absorbed by the next `add` and the UI only sees a single, file-present refresh. The coalesce is a small `useCoalescedCallback` hook so the timing contract is unit-testable without standing up ProjectView. Tests cover the rewrite burst, isolated single triggers, the maxWait cap under sustained triggers, latest-callback semantics, and unmount cleanup. Fixes #2195	2026-05-20 11:12:53 +08:00
Bryan	b426f614f5	fix(web): scope design-systems surface chip counts to active style filter (#2141 ) On the Design systems page the built-in library surface chips (All / Web / Image / …) counted the whole catalog, so applying a style category left them showing stale totals while the grid shrank. Clicking a chip also called setCategory('All'), discarding the user's style filter instead of refining within it. Surface chip counts and the grid now derive from a query-scoped set (style category + search) over librarySystems; the chip click only switches surface. An unscoped surfaceTotals count still backs the "surface is empty" effect guard so it reacts to the catalog, not a transient filter. The chip row always keeps "all" and the active surface chip rendered, so a transient search can never hide the active filter. Re-applied onto the DesignSystemsTab restructure from #2187. Adds 4 tests covering scoped counts, category preservation on chip click, empty-chip hiding, and active-chip persistence under a zero-result search. Fixes #2062 Co-authored-by: DevForgeAI CI/CD Engineer <devforge-ai@development.ai>	2026-05-19 23:42:45 +08:00
mzl163	210b94069a	feat(senseaudio): BYOK chat with image + video generation tools (#2065 ) * feat(senseaudio): BYOK chat with image + video generation tools Adds SenseAudio as a first-class BYOK chat protocol and wires the daemon's chat proxy with a tool loop so BYOK users can generate images and videos without dropping to a CLI agent. - BYOK protocol: new senseaudio tab + /api/proxy/senseaudio/stream route + connection-test + provider-models discovery (OpenAI-compatible wire) - Tool loop: generate_image (synchronous /v1/image/sync) and generate_video (async /v1/video/create + 5s polling /v1/video/status, 10-min ceiling, periodic progress log every 30s) - Settings dropdown + chat-composer dropdown for the BYOK image model default; generate_image's model enum lets the LLM override per call - Seed-on-success: a successful BYOK chat call idempotently mirrors the key into media-config (preserves env-resolved + already-stored keys) - Generated artifacts land in <projectsRoot>/<projectId>/ so FileViewer, DesignFilesPanel, and project export pick them up automatically; legacy /api/byok-image/:id route kept for old conversation links - Markdown renderer learns ![alt](url) image syntax with a scheme allowlist (http(s) / data:image/ / blob: / relative paths) - i18n key settings.byokImageModel across all 19 locales - 3 SenseAudio image models registered (2.0, 1.0, doubao-seedream-5.0); 1 video model (doubao-seedance-2.0) - Tests: byok-tools (29), media-senseaudio-image (8), media-config seed (7), proxy-routes (47), markdown image rendering (8) * fix(senseaudio): unblock image gen + design file preview switching - SenseAudio /v1/image/sync rejected the previous size mapping with `参数错误：size` (1664x936, 936x1664, 1280x960, 960x1280 are not in the gateway's accepted set). Switched to standard HD / SD sizes that every aspect bucket can hit: 1024×1024, 1280×720, 720×1280, 1024×768, 768×1024. Kept the byok-tools and media.ts tables in sync so the BYOK chat tool and the CLI agent path both stop failing on non-square aspects. - DesignFilesPanel's <DfPreview> was missing a key prop, so React reused the same iframe DOM node when the user picked a different file — the src prop changed but the iframe never navigated. Added key={previewFile.name} so the previous preview unmounts cleanly. - Updated byok-tools + media-senseaudio-image tests for the new size expectations. * docs(senseaudio): clear stale provider hint + update README - Settings → Media → SenseAudio: clear the auto-promoted "Image · TTS · 70+ voices · clone" hint; the provider label alone is enough now that the BYOK chat surface covers image + video tooling. - README: list the new senseaudio (and missing ollama) proxy routes so the BYOK section reflects what the daemon actually serves, and mention the generate_image / generate_video chat tools that ship with the SenseAudio path. * fix(senseaudio): address PR #2065 review feedback Three non-blocking review notes from @PerishCode on PR #2065: 1. Drop the dead /api/byok-image/:id route. The PR description claimed it was "legacy fallback for old chat history" but that storage layout never existed on main, so the route can only ever 400 or 404 — never 200. Removed the handler, the isSafeByokImageId export, the unused createReadStream / stat / path / Request / Response imports, and the two byok-image regression tests. 2. Add rejectProxyPluginContext guard to the senseaudio proxy handler so it matches the invariant the other five proxy paths already enforce (plugin runs must go through /api/runs for snapshot pinning). Extended the existing "API fallback rejects plugin runs" describe to also cover /api/proxy/senseaudio/stream with the 409 PLUGIN_REQUIRES_DAEMON expectation. 3. Wrap the secondary image / video downloads (the URLs the SenseAudio gateway hands back in /v1/image/sync .url and /v1/video/status .video_url) in validateBaseUrlResolved so a malicious gateway can't point us at 169.254.169.254 (AWS / Azure metadata) or RFC1918 hosts via the response payload. Also passed `redirect: 'error'` on both fetches to match the SSRF posture the primary proxy fetch already uses. The new assertExternalAssetUrl helper lives next to executeGenerateImage so future tool downloads can reuse it. Tests: 120/120 daemon tests pass; guard + typecheck green. * fix(senseaudio): mirror SSRF guard onto renderSenseAudioImage CLI path Follow-up to `01b1260a` — the chat-tool fix in byok-tools.ts wasn't mirrored onto the parallel renderSenseAudioImage path in media.ts. Same attacker-controllable shape (gateway-returned `data.url`), same one-line fix. - Hoist assertExternalAssetUrl from byok-tools.ts into connectionTest.ts next to validateBaseUrlResolved so both call sites (the BYOK chat tool loop AND the CLI agent media dispatcher) share one helper. Made the error strings provider-agnostic so a future caller doesn't get a misleading "senseaudio" attribution for a Volcengine / Grok / etc. download. - renderSenseAudioImage now runs the response url through assertExternalAssetUrl before fetching bytes, and passes redirect: 'error' to block a 3xx hop into private space. Scope intentionally limited to the senseaudio path PerishCode flagged; the other unguarded fetch(entry.url) call sites in media.ts (OpenAI / Volcengine / Grok / Nano-Banana) are pre-existing patterns and belong in a separate follow-up if the daemon wants defense-in-depth across every provider. Tests: 127/127 daemon tests pass; guard + typecheck green. --------- Co-authored-by: unknown <mazeliang@sensetime.com>	2026-05-19 23:14:56 +08:00
Eli	431a5e2d79	[codex] Add global onboarding flow without AMR (#2272 ) * Add global onboarding flow * Remove AMR from onboarding variant * Add onboarding role question	2026-05-19 22:00:40 +08:00
PerishFire	ad37fd30cf	Add desktop updater UI flow (#2270 )	2026-05-19 21:36:51 +08:00
Eli	e94663bfbd	Add connector memory extraction flow (#2265 )	2026-05-19 21:27:41 +08:00
kami	d1eb8b7bef	Fix editorial deck navigation (#2173 )	2026-05-19 19:30:30 +08:00
Neha Prasad	716f06cb73	fix(web): allow Comment/Inspect picker to select iframe-backed components (#2254 )	2026-05-19 19:26:11 +08:00
Neha Prasad	e6d0f4eeab	fix(web): restore scrolling in New project modal for tall tabs (#2032 )	2026-05-19 19:09:23 +08:00
PerishFire	2c128e0e91	refactor desktop host bridge (#2246 )	2026-05-19 18:27:05 +08:00
Neha Prasad	c6b7c44424	confirm before clearing memory extraction history (#2241 )	2026-05-19 17:56:14 +08:00
Zellux Wang	52b702648e	Enable LAN web dev access (#1947 )	2026-05-19 17:50:50 +08:00
Tom Huang	86ec951fb9	[codex] Add automation templates and proposal workflows (#2193 ) * feat(web): introduce Automations tab with dual-track capability for routines This commit adds a new Automations tab that consolidates routines, schedules, and live artifacts, allowing users to manage automations seamlessly. The tab features a modal for creating and editing automations, which supports various scheduling options (hourly, daily, weekdays, weekly) and project modes (create_each_run, reuse). The CLI is also updated to expose automation commands, ensuring consistency between the web UI and CLI interfaces. Key changes include: - New `NewAutomationModal` component for automation creation and editing. - Updated `TasksView` to integrate the new Automations functionality. - Enhanced styling for the Automations tab to improve user experience. This implementation aligns with the dual-track capability exposure policy, ensuring all features are accessible via both the web UI and CLI. * feat(daemon): enhance automation context handling and CLI commands This commit introduces several improvements to the automation context management and updates the CLI commands accordingly. Key changes include: - Added support for new context fields (`plugin`, `mcp`, `connector`) in automation commands. - Updated the CLI to reflect new target options (`new-project`). - Enhanced error messages for invalid target inputs. - Introduced functions to handle context selection and normalization for routines, including the ability to parse and store context data in the database. - Updated the database schema to include a new `context_json` field for routines. - Improved the handling of context in routine routes and the web interface, ensuring that selected contexts are properly managed and displayed. These changes aim to provide a more robust and flexible automation experience, aligning with the recent enhancements in the web UI. * feat(web): enhance TasksView with automation run history and status indicators This commit introduces several new features to the TasksView component, including: - Added functionality to display automation run history for each routine, showing metadata such as status, timestamps, and project details. - Implemented status indicators for routine runs, providing visual feedback on their current state (succeeded, failed, running, queued). - Enhanced the UI to allow users to expand and view detailed run history, including the ability to open the corresponding project conversation. - Updated styles to improve the presentation of automation statuses and history. These changes aim to provide users with better insights into their automation routines and improve overall usability. * feat(daemon): implement automation ingestion and proposal management This commit introduces several new features related to automation ingestion and proposal management within the daemon. Key changes include: - Added new modules for handling automation source packets and proposals, allowing for the storage, retrieval, and management of automation-related data. - Implemented functions to list, create, and apply automation proposals, enhancing the automation workflow. - Introduced new CLI commands for interacting with memory entries and automation sources, providing users with more control over their automation processes. - Enhanced the server routes to support automation source and proposal APIs, enabling seamless integration with the existing system. These changes aim to improve the overall automation experience, making it easier for users to manage and utilize automation proposals and ingestions effectively.	2026-05-19 16:35:28 +08:00
Siri-Ray	eb127e0f79	Remove live artifact home chip (#2221 )	2026-05-19 16:35:09 +08:00
Eli	4376d8a8ec	[codex] Add pet task center and desktop pet (#1833 ) * feat: add pet task center and desktop pet * Fix pet task center review regressions	2026-05-19 15:38:39 +08:00
kami	6990291217	fix: escape chat transcript role delimiters (#2156 ) Co-authored-by: multica-agent <github@multica.ai>	2026-05-19 15:37:35 +08:00
lefarcen	387dc83b27	fix(plugins): wire Open Design "Open Design PR" button end-to-end (#2182 ) Some checks failed ci / Detect CI change scopes (push) Failing after 2s Details landing-page-ci / Validate landing page (push) Failing after 1s Details landing-page-deploy / Deploy landing page (push) Has been skipped Details nix-check / build (push) Failing after 1s Details ci / Preflight (push) Has been skipped Details ci / Core package tests (push) Has been skipped Details ci / Tools workspace tests (push) Has been skipped Details ci / Daemon workspace tests (push) Has been skipped Details ci / Web workspace tests (push) Has been skipped Details ci / E2E vitest (push) Has been skipped Details ci / Playwright critical (push) Has been skipped Details ci / Build workspaces (push) Has been skipped Details ci / App workspace tests (push) Has been skipped Details ci / Validate workspace (push) Failing after 0s Details Two-part fix for the Plugin folder card's "Open Design PR" button. The flow was broken end-to-end: the CLI emitted a 404 URL, and the agent prompt under-specified the contribution steps so the agent stalled mid-turn or fell back to the legacy issue-URL path. Catalog target — `apps/daemon/src/plugins/publish.ts`: `buildPublishLink({ catalog: 'open-design' })` hardcoded a submission URL at `github.com/open-design/plugin-registry`, the dedicated registry repo proposed in docs/plans/plugin-registry.md §1.2. That repo doesn't exist yet (P3.1 notes "creating the external GitHub repo is an operational launch step, not a code blocker"), so every generated URL 404'd. Retarget the catalog at the live `nexu-io/open-design` monorepo and update the target-path hint in the PR body to `plugins/community/<plugin-name>/` (the actual layout under main). Plan §1.2 stays the long-term goal — see the code comment. Also updates the matching `od plugin yank` issue URL in cli.ts. Agent prompt — `apps/web/src/components/design-files/pluginFolderActions.ts`: The `contribute` action prompt only said "use the supported `od plugin publish` Open Design registry flow", which produces an issue URL (the legacy path) and left the agent to invent the remaining steps. The agent ended up calling `AskUserQuestion` mid-turn waiting for input that the DesignFilesPanel buttons couldn't satisfy, then stalled for 600s. Rewrite the `contribute` prompt to drive the full PR flow via raw `gh` commands: 1. preflight `gh --version` / `gh auth status` (detect-and-instruct on missing CLI, never auto-install anything) 2. read manifest, resolve author login 3. `gh repo fork nexu-io/open-design --remote=false` 4. clone fork, branch, cp plugin into `plugins/community/<name>/`, commit/push using author's git identity (not a hardcoded bot) 5. `gh pr create ... --web` so the author reviews and clicks Create Hard bans on `AskUserQuestion`, retry-on-failure, and the legacy `od plugin publish --to open-design` CLI keep the turn fire-and-forget. The `install` / `publish` action prompts are unchanged. Tests: * `apps/daemon/tests/plugins-publish.test.ts` — assert URL host + catalogLabel match `nexu-io/open-design`, body contains the new path hint. * `apps/web/tests/components/pluginFolderActions.test.ts` (new) — lock the contribute prompt's command surface, the `--web` review window, the `AskUserQuestion` ban, the install-tool ban, and folder-path interpolation. Nine assertions covering the prompt contract without coupling to exact wording. Validation: * `pnpm --filter @open-design/daemon exec vitest run tests/plugins-publish.test.ts` → 11/11 passed * `pnpm --filter @open-design/web exec vitest run tests/components/pluginFolderActions.test.ts` → 9/9 passed * `pnpm --filter @open-design/web typecheck` clean * E2E: button click in DesignFilesPanel under namespace=e2e-pr-test ran `gh --version` → fork → clone → branch → commit/push → ended at `gh pr create --web` opening the GitHub PR draft in browser. No AskUserQuestion stall this time. Doesn't touch: `5f71968f` server-side endpoints (`/contribute-open-design` + `/publish-github` still in main as dead code — separate cleanup), plan/spec docs (per maintainer call to keep the dedicated-repo as the long-term goal), or `registry-backends.test.ts` (plan terminal-state fixture; no production caller).	2026-05-19 15:26:59 +08:00
lefarcen	9596a0ccd5	feat(privacy): collapse first-run consent banner to a single "I get it" button (#2202 ) * feat(privacy): collapse first-run banner to a single "I get it" button Replaces the first-run privacy disclosure's two-button decision picker ("Share usage data" / "Don't share") with a single "I get it" acknowledgement. Clicking it accepts the same default telemetry surface the previous "Share usage data" path enabled — the banner shifts from binary consent picker to informed disclosure. To keep the surface honest, the banner footer is rewritten to spell out the new default and point at the off switch: "Data sharing is on by default. You can turn it off any time in Settings → Privacy. We never upload the contents of your generated artifact files." Settings → Privacy (PrivacySection.tsx) is unchanged — that surface still exposes both Share and Don't share buttons so users who arrive there later (or come back to flip the choice) keep the explicit picker. Mechanics: * `PrivacyConsentModal.tsx`: drop `onDecline` prop and the second button; rename `onShare` → `onAccept` to match the new semantic. Footer hint now reads from `settings.privacyConsentBannerFooter` (new key) so the banner copy can speak in single-button voice without disturbing the reused `settings.privacyConsentFooter` that PrivacySection still displays. * `App.tsx`: drop the `onDecline` handler. Single `onAccept` handler applies the same opt-in payload as the previous `onShare` branch (`telemetry.metrics = true`, `telemetry.content = true`, fresh `installationId`), so the wire format daemon-side is unchanged. * `i18n/types.ts` + `locales/en.ts`: two new keys — `settings.privacyConsentAccept` ("I get it") and `settings.privacyConsentBannerFooter` (the default-on disclosure copy). * `i18n/locales/.ts` (all 18 non-en dictionaries): added the two new keys. zh-CN and zh-TW are translated; the remaining 16 locales follow the project convention of leaving the EN string as a fallback for later contributor passes (same shape used by privacyConsentShare / Decline today). `tests/components/PrivacyConsentModal.test.tsx`: rewritten. The four new tests lock the new contract — single "I get it" button, no Share/Decline labels, default-on disclosure text in the footer, the external privacy-policy link, and onAccept firing on click. Replaces the prior "equal-prominence" tests, which only made sense for the two-button shape. Validation: * `pnpm --filter @open-design/web exec vitest run tests/components/PrivacyConsentModal.test.tsx` → 4/4 passed * `pnpm --filter @open-design/web exec vitest run tests/i18n/locales.test.ts` → 5/5 passed (every locale aligned with English keys + placeholders; the new keys ship to all 19 dictionaries) * `pnpm --filter @open-design/web typecheck` clean * test(privacy): update App.connectors fixture to match single-button banner `App.connectors.test.tsx > does not show first-run privacy consent until daemon config hydration finishes` hardcoded the previous "Share usage data" affirmative button label. The single-button banner now renders "I get it", so the assertion was looking for a button that no longer exists. CI signal: 26081467446 → Web workspace tests → `× does not show first-run privacy consent until daemon config hydration finishes`. The App workspace tests + Validate workspace failures cascaded from this one — both are aggregator jobs. Local: vitest run tests/components/App.connectors.test.tsx → 5/5 passed.	2026-05-19 15:26:56 +08:00
Eli	3cecb1c881	Polish project workspace UI (#2201 )	2026-05-19 15:14:16 +08:00
chaoxiaoche	a38e09f931	fix(web): demote Plugins and Integrations to nav rail footer (#1806 ) * fix(web): demote Plugins and Integrations to nav rail footer Plugins and Integrations are platform-configuration surfaces, not daily-use destinations. Moving them to the footer section of the left nav rail — separated from primary items by a thin divider — keeps them reachable while giving the primary four items (Home, Projects, Automations, Design Systems) the visual weight they deserve. - EntryNavRail: remove Plugins/Integrations from the main __group and place them in the __footer above the help launcher - entry-layout.css: add __divider rule (1 px separator) to visually mark the boundary between primary and secondary nav regions Co-authored-by: Cursor <cursoragent@cursor.com> * fix(web): remove settings dropdown, gear opens settings directly The gear/cog button previously opened a dropdown that mixed three unrelated concerns: community links (X, Discord), preference quick- access (Language, Appearance), a feature shortcut (Use everywhere), and a redundant Settings entry — creating two separate paths to the same Settings dialog and duplicating Language/Appearance relative to the Settings sidebar. Changes: - Gear button now directly opens the Settings dialog (no intermediate dropdown layer) - Follow @nexudotio on X and Join Discord moved to the Help menu at the bottom of the nav rail, where community/external links belong - Language and Appearance remain exclusively in the Settings dialog - Use everywhere remains exclusively in the topbar chip - Remove dead state (avatarMenuOpen, languageExpanded, appearanceExpanded), ref, outside-click effect, and module-level constants (APPEARANCE_THEMES, APPEARANCE_LABEL, describeModelChip) that were introduced solely to support the dropdown Co-authored-by: Cursor <cursoragent@cursor.com> * feat(web): move output-type chips above chat input as tab bar The home hero previously had two chip rows below the chat input that mixed two unrelated dimensions: output type (Prototype, Slide deck, Image…) and workflow source (Create plugin, From Figma, From folder, From template). Users had no visual cue that these were different categories. This change separates the two dimensions clearly: - Output type (create group) becomes a tab bar positioned above the input card. Tabs share the same chip data and onPickChip dispatch, so plugin selection, active state, and pending state are unchanged. Active tab shows a colored underline; the bar border visually connects to the input card below. - Workflow source (migrate group) stays as the chip row below the input card, now standing alone with unambiguous "how to start" semantics. - Subtitle updated from "Pick a plugin below" to "Pick a type" to match the new placement. Co-authored-by: Cursor <cursoragent@cursor.com> * fix(web): skip replace-prompt confirmation when switching output-type tabs Output-type tab clicks (create group) are mode-selection gestures; the user expects the prompt to update immediately without a dialog. The confirmation is still shown for migrate-group chips (From Figma etc.) where the replacement carries meaningful user-provided content. Co-authored-by: Cursor <cursoragent@cursor.com> * fix(web): use brand logo as Home destination, drop redundant brand mentions The entry nav rail previously rendered the brand logo and a separate Home icon back-to-back; both invoked `onViewChange('home')`, so the Home button was pure duplicate affordance. The hero pane also displayed a third "Open Design" lockup that competed with the rail logo and the rebranded title. This collapses those affordances: * `EntryNavRail` drops the dedicated Home `NavButton`. The brand logo button now carries `aria-current="page"` and an `is-active` visual state when the home view is showing, and its tooltip reads "Open Design · Home" off-home so the navigation behavior stays discoverable for new users. * `entry-layout.css` adds the matching `.entry-nav-rail__logo.is-active` accent treatment so the "you are here" cue reads at parity with primary rail buttons. * `HomeHero` removes the inline `home-hero__brand` lockup and the associated CSS, then retunes the title/subtitle/type-tab spacing so the headline group still pairs tightly with the type tabs and input card below. * `entry-chrome-flows` is updated to assert the logo carries the active page treatment and that no `entry-nav-home` test id resurfaces by mistake. Co-authored-by: Cursor <cursoragent@cursor.com> * fix(web): auto-grow home chat input, disable internal scroll and drag-resize The home hero textarea was capped at a fixed `min-height: 84px` with `resize: vertical`, so users had to either drag the bottom-right corner to enlarge it or scroll the textarea internally to read longer prompts. That hid context (loaded plugin templates routinely overflow three lines) and exposed a manual grip whose state was easy to leave in an awkward height. This change makes the chat box grow with its content: * `HomeHero` adds a `useLayoutEffect` that, on every prompt change, resets the textarea height to `auto` so the browser can measure a smaller content, then writes back `scrollHeight` in pixels. The effect uses layout phase (not effect phase) to avoid a one-frame flash at the previous height when a plugin loads a long example prompt. * `home-hero.css` swaps `resize: vertical` for `resize: none` and adds `overflow: hidden`, so the manual drag grip disappears and the textarea never scrolls internally. `min-height: 84px` is kept so the empty input still reads as a chunky chat box. The outer page handles overflow when the prompt is genuinely very long. Co-authored-by: Cursor <cursoragent@cursor.com> * fix(web): make output-type tabs read as folder-tabs attached to chat box The previous tab bar used a small icon + label per tab and signaled the active tab with a 2px accent underline sitting on a horizontal divider line. That divider visually broke the relationship between the tabs and the input card below — the active tab looked like a free-floating header, not a flap of the chat surface, so users had to do extra work to mentally connect "I picked Prototype" with the prompt area immediately underneath. This switches the tab bar to a folder-tab pattern: * `HomeHero` drops the per-tab `Icon` element. The seven labels (Prototype, Live artifact, Slide deck, Image, Video, HyperFrames, Audio) already disambiguate at the type sizes used here, and the icons were primarily decorative. * `home-hero.css` rewrites the tab styling: - The bar no longer paints its own baseline border; the input card's top edge serves as the baseline. - Each tab is a rounded-top container with a 1px transparent border and a `-1px` bottom margin, so its bottom edge overlaps the card's top border by exactly one pixel. - The active tab borrows the card's panel background and border color, and overrides its bottom border with the panel color so it visually erases the card's top border for the tab's width. The result reads as one continuous "Prototype is this chat box" surface. - The card's prior `margin-top: 8px` is removed so the tab bar bottom and card top sit at the same y coordinate, which is the geometric precondition for the 1px overlap to land cleanly. Co-authored-by: Cursor <cursoragent@cursor.com> * fix(web): enlarge home chat attach and submit buttons for legibility The attach (paper-clip) and submit (arrow-up) controls on the home chat input rendered at 32px diameter with 14px and 18px glyphs respectively. After stroke antialiasing the icons read at roughly 11–12px on typical displays — small enough that users reported the glyphs were illegible and had to be discovered by trial-and-error. This bumps both controls to 38px circles and grows their glyphs (attach 14 → 18, submit 18 → 22). The primary call-to-action now has clearly more visual weight than the surrounding muted hint text, and the paper-clip is recognisable at a glance. Co-authored-by: Cursor <cursoragent@cursor.com> * fix(web): paint a chevron on inline select slots in the home prompt Inline select slots in the home prompt overlay (e.g. the `high-fidelity` / `wireframe` fidelity picker on the Live artifact template) rendered as plain pill highlights, visually identical to free-text and read-only text slots. The slot's `appearance: none` strips the browser's default chevron, so users had no affordance hinting the value was switchable. This wraps select-type slots in an `inline-flex` span and overlays an explicit chevron-down `Icon` against the slot's trailing padding (bumped from 18px to 22px to make room). The chevron inherits the accent color via the wrap's `color` property so it themes correctly in light and dark modes. Click semantics are unchanged because the chevron is `pointer-events: none`. Co-authored-by: Cursor <cursoragent@cursor.com> * fix(web): size inline number slots to their value, not the whole row Number-type inline slots in the home prompt (e.g. the slide-count spinner on the Slide deck template) inherited the generic slot `min-width: 8ch` and then stretched to the browser's default `<input type=number>` width, so a two-digit value like "10" ate the entire remaining line and pushed the native spinner buttons to the far right edge — far away from the value they control. This sizes the number input to its actual content plus four characters of trailing room for the spinner buttons (clamped to 6–14ch), and adds a `home-hero__prompt-slot-input--number` modifier that overrides the slot `min-width` to 4ch. The spinner now sits flush against the value and the slot reads as a compact inline pill matching the surrounding text/select slots. Co-authored-by: Cursor <cursoragent@cursor.com> * fix(web): widen prompt line-height so slot pills do not collide vertically Each interactive slot and mention in the home prompt paints a 2px outline ring via `box-shadow`. At the previous `line-height: 1.55` on a 15px font, two lines were ~23px apart while a single pill occupied ~20px of vertical space (text box + ring on both sides), leaving roughly 2px of clear space between rows. When the prompt wrapped onto multiple lines — common for the Image template's "Generate a {kind} of {subject}. Style: {style}. Aspect: {ratio}." example — the rings from line N and line N+1 visually merged into a single bar, making it ambiguous which pill the user was about to click or edit. This bumps the line-height of both the highlight overlay and the underlying editable textarea to 1.85 (~28px per line), restoring ~8px of clear space between pill rows. The two values must stay identical so the overlay glyphs continue to track the textarea caret positions exactly; a brief comment in each rule documents this coupling for future edits. Co-authored-by: Cursor <cursoragent@cursor.com> * fix(web): paint dropdown chevron on select element with neutral color The previous chevron treatment wrapped each select slot in an inline-flex span and laid an accent-orange `<Icon>` over the trailing padding. At the prompt's normal display size the orange chevron blended into the orange pill ring corners, so users perceived "a bunch of dropdown arrows" across every highlighted slot, even though only the actual select rendered one. Move the chevron onto the `<select>` itself as a small neutral- gray `background-image` SVG. The grey contrasts clearly with the pill's accent ring and the chevron lives inside the select's own padding, so it can never visually overlap the value text and can never appear on non-select slots. Co-authored-by: Cursor <cursoragent@cursor.com> * fix(web): shrink select-slot dropdown caret so it stops reading as a check-mark The 8×6 stroked chevron looked like a check-mark in zoomed views because the stroke width was a large fraction of the glyph. Swap for a small (9×5) filled triangle drawn as a `background-image`, with `background-size` pinned so browsers can't scale it to its intrinsic SVG box. The caret is now unambiguously a dropdown indicator without crowding the value text. Co-authored-by: Cursor <cursoragent@cursor.com> * fix(web): make read-only prompt slots visually distinct from editable pills Multi-word context values are rendered as plain <span>s with pointer-events: none, but they inherited the same orange pill + ring treatment as the truly editable <input>/<select> slots. Users couldn't tell which pills they could click into. Strip the pill background, the ring, the radius, and the padding from `.home-hero__prompt-slot-text` and replace them with a subtle dashed bottom border. The orange foreground keeps the slot family link, but read-only highlights now clearly read as "context value spliced into the prompt" rather than as "interactive control". Co-authored-by: Cursor <cursoragent@cursor.com> * fix(web): scale select-slot dropdown caret up to 12px wide for legibility The 9×5 caret was too quiet at the prompt's font size to read as a dropdown affordance. Bump to a 12×6 filled triangle and pad the select's trailing space (24px) so the value text still has clear breathing room from the caret. Co-authored-by: Cursor <cursoragent@cursor.com> * fix(web): collapse plugins search and filter strips into one bar The plugins-home gallery laid out search, count, mode (Featured), total-in-catalog, clear-filters, the main category row, and the sub-category row as four floating clusters. Search lived up in the section header, far from the chips it actually scopes; the mode strip wedged "Featured + 386 in catalog + Clear filters" between the header and the category row with no obvious ownership; and the two clear-filter affordances duplicated each other (chip strip + empty-state). Fold the mode strip into the category row: Featured is now the leading chip on the same line as the category pills, and the search field, the result count, and a compact Clear link sit as a right-aligned tools cluster on that same row. The header shrinks back to just the title, subtitle, and Browse registry link. The sub-category row stays as the contextual second line when a category is active. Tests: keeps the existing data-testids (plugins-home-chip-featured, plugins-home-row-category, plugins-home-clear, plugins-home-count, plugins-home-search) so the existing 11-case section spec passes without modification. Co-authored-by: Cursor <cursoragent@cursor.com> * fix(web): hide Recent projects rail on first-run home when empty A brand-new user landing on Home saw an empty dashed box with the copy "No projects yet — type a prompt to start one." sitting above the plugin gallery. The hero already prompts the user to type, so the empty rail just adds vertical noise without telling them anything new. Return null from RecentProjectsStrip when the recent list is empty (loading or not) so the rail appears only when there is actually something to show. Update the entry-chrome e2e to assert toHaveCount(0) on first run — codifying the new contract that the rail is conditional, not always-mounted. Co-authored-by: Cursor <cursoragent@cursor.com> * fix(web): hide structured inputs form when every field is in the prompt The PluginInputsForm below the chat textarea surfaced every plugin input — even the ones the prompt template already substitutes inline as highlighted slot pills. For a plugin like Prototype that's five identical labelled inputs duplicating the five slot pills above them, making the chat box look like it has grown a second composer. Compute the set of placeholder keys actually referenced in the prompt template (via INPUT_PLACEHOLDER_PATTERN) and skip those when deciding what the structured form needs to render. The form still appears for plugins that expose inputs not referenced in the prompt text (e.g. a "Run in background" toggle), but template-only plugins collapse the redundant second editor. Update the picker spec accordingly: when every field is in the template the form is now expected to be absent rather than to render a duplicate of the inline slot. Co-authored-by: Cursor <cursoragent@cursor.com> * fix(web): drop redundant filtered-count and Clear link beside the search The combined plugins filter bar trailed the search field with "59 / 386" and an inline "Clear" link, but both repeat what the chip strip already shows: every chip carries its own count (Slides 59, All 386, …) and clicking the All chip already resets the filters. Users had no way to know what the bare "59 / 386" fraction or "Clear" referred to without inference. Strip the trailing tools cluster down to just the search input. The empty-results message at the bottom of the gallery still exposes a contextual "Clear filters" button when a stacked filter yields no matches, so the affordance isn't lost — just removed from a position where it didn't read as actionable. Co-authored-by: Cursor <cursoragent@cursor.com> * fix(web): drop default subtitle copy from the Official starters strip The Home gallery rendered a two-line explanatory subtitle under "Official starters" — "Ready-to-use Open Design workflows bundled with this runtime. Pick one to load a starter prompt, or browse the registry for more." — every visit. Returning users skip it, new users get the same message from the section title plus the Browse registry link plus the visual card grid itself; the prose read as filler chrome above the chip strip. Default the subtitle prop to undefined and only render the <p> when a caller passes an explicit string. Other surfaces that mount PluginsHomeSection with their own copy keep their subtitle; the bare Home gallery loses one row of vertical noise. Co-authored-by: Cursor <cursoragent@cursor.com> * Fade out empty workflow lanes in Plugins home filter Empty top-level lanes (Deploy 0, Refine 0, etc.) and empty sub-categories (Vercel 0 under Deploy, Figma 0 under Import, etc.) used to look identical to populated ones, so users couldn't tell at a glance which chips were real catalog buckets and which were "contribute a plugin" invites. The strip kept all lanes visible by design — the workflow shape (Import / Create / Export / Share / Deploy / Refine / Extend) is part of what we want users to see — so we keep them clickable, but tag count-zero chips with data-empty='true' and give them a faded, dashed-border treatment. "All" pills stay solid since their count reflects the parent lane, not their own emptiness. Co-authored-by: Cursor <cursoragent@cursor.com> * fix(web): update home nav test expectations * fix(e2e): align critical smoke with entry chrome --------- Co-authored-by: chaoxiaoche <chaoxiaoche@chaoxiaochedeMacBook-Pro.local> Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-19 14:58:15 +08:00
Eli	18b947c25f	[codex] Land design system GitHub intake handoff (#2187 ) * Add Claude-style design system workflow * Merge design system workflow into main * Restore design system workflow UI styles * Fix design system setup scrolling * Fix design system setup connector button * Preserve connector auth link after popup block * Simplify connected GitHub setup state * Open generated design system workspace project * Summarize design system auto prompt in chat * Add bounded GitHub connector design intake * Prefer path-scoped GitHub intake tools * Restore branch GitHub design context intake * Restore design system review workspace * Restore design system manager tab * Let design system workflow routes own details * Open editable design systems as projects * Restore design system workspace coverage * Fix bounded GitHub connector intake * Hide design system review while generating * Suppress design system generation questions * Constrain GitHub design intake to bounded command * Tolerate oversized GitHub metadata during intake * Rebuild daemon CLI when sources change * Fallback when GitHub connector snapshots are rate limited * Allow GitHub intake without Composio * Use native GitHub auth for design intake * Remove design system review group heading * Improve design system extraction evidence * Align design system scaffold with Claude output * Add evidence inventory for design system intake * Add local design system evidence intake * Add design system package audit gate * Allow auditing Claude Design reference packages * Audit design system package content quality * Migrate legacy design system artifacts * Clean migrated design system artifacts * Require modular design system UI kits * Reject thin design system UI kits * Prioritize core design evidence intake * Require role-based design system UI kits * Clean stale design system manifest references * Require representative preserved design assets * Warn on generic design system visuals * Enforce design system quality warnings * Audit connected design system UI kits * Require mounted design system UI kits * Require composed design system app shells * Require runnable JSX design system kits * Require browser globals for design system components * Infer design system names from source URLs * Require source examples in design system packages * Bind preserved fonts in design system tokens * Require skill frontmatter in design system packages * Preserve build icons in design system packages * Require real assets in brand previews * Require substantive source examples * Require product overview in design system README * Require reusable UI kit README * Require reusable design system skill docs * Seed Claude-style UI kit entry contract * Preserve runtime build assets in design packages * Audit design system packages after generation * Audit design system first-run output * Audit source-backed preview cards * Align design system UI kit scaffolds * Materialize design evidence package artifacts * Show project chat during design system setup * Hand off design system setup to project chat * Auto-repair design system audit failures * Harden design system evidence preservation * Tighten design system package guidance * Add targeted design system repair guidance * Bound design system audit auto repair * Use connector statuses in design system setup * Audit design system preview manifests * Require README preview manifests for design systems * Fix design system GitHub intake handoff * Fix daemon prompt CI assertions	2026-05-19 14:30:17 +08:00
PerishFire	bd48c597b0	chore: pin dependency versions and harden CI caches (#2189 ) * chore: pin dependency versions * ci: enforce pinned dependency specs * ci: fix pnpm executable invocation	2026-05-19 13:58:27 +08:00
kami	68b0e0258d	fix(web): scope daemon transcript to active agent (#2157 )	2026-05-19 13:49:34 +08:00
kami	b838e94b88	fix(web): expand skill mention picker (#2170 )	2026-05-19 13:49:15 +08:00
shangxinyu1	f650a043d9	test(e2e): align entry coverage with redesigned flows (#2101 ) * Migrate entry E2E coverage and split CI * test(e2e): relax connectors auth error assertions * ci: route scenario registry changes to extended e2e * ci: decouple packaged smoke jobs from validate gate * ci: restore pre-split workflow * test(e2e): slim critical ui smoke coverage * test(e2e): move broader entry flows out of critical * test(e2e): restore entry chrome coverage to ci * ci: parallelize workspace validation jobs * test(web): stabilize media palette bridge assertion * ci: cache e2e playwright browsers	2026-05-19 11:26:40 +08:00
Caprika	34f66113a0	Implement Home audio essential workflow (#2104 ) Some checks failed ci / Packaged mac smoke (push) Blocked by required conditions Details ci / Packaged windows smoke (push) Blocked by required conditions Details ci / Detect PR change scopes (push) Failing after 1s Details ci / Validate workspace (push) Has been skipped Details landing-page-ci / Validate landing page (push) Failing after 1s Details landing-page-deploy / Deploy landing page (push) Has been skipped Details nix-check / build (push) Failing after 1s Details ci / Packaged linux headless smoke (push) Has been skipped Details * fix(web): align Home prompt overlay with textarea so caret lands on click Picking a chip such as Slide deck or Image loaded a default prompt into the Home textarea and rendered an overlay with `{{key}}` placeholders as interactive `<input>` / `<select>` controls. The overlay controls and the underlying textarea text were laid out independently: - Inputs declared `min-width: 8ch` and `Math.max(displayValue.length + 1, 10)ch` of width. - Selects added 18px of right-padding for the dropdown arrow. - The textarea kept the raw substituted string in a proportional font. The two layouts no longer matched column-for-column, so every slot shifted the textarea text to the left of where it appeared in the overlay, compounding across the line. Clicking on visible prose to position the caret hit a different character offset in the textarea and subsequent typing or deletion landed in the wrong place. For example, the Slide deck template Create a {{slideCount}}-slide {{deckType}} for {{audience}} about {{topic}}. ... renders with slideCount=10 (~2 ch in the textarea) under a slot input forced to 10 ch in the overlay — clicking right after the literal `-slide` placed the caret several characters into `pitch deck`. The Image / Video / Audio chips with their pre-filled subject, style, aspect values reproduced the same drift. Render the inline pills as read-only `<span>`s carrying the exact substring the textarea shows at that position, mark them `aria-hidden` so the textarea remains the single labelled control, and surface every plugin input field — including the ones referenced inline — in `PluginInputsForm` underneath. Editing flows through the form, the parent's `updateActiveInputs` already re-renders the prompt, and the pills stay aligned with the textarea on every keystroke. Also drop the now-unused inline helpers (updatePluginInput, getTemplateInputNames, shouldRenderSlotAsText, inlineFieldType, fileInputLabel, fileMetadata) and the dead `.home-hero__prompt-slot-control/input/select/toggle/file/text` CSS rules. Verified: - pnpm --filter @open-design/web typecheck - vitest run on HomeHero.plugin-picker, HomeHero.rail, and HomeView.prefill (29/29 pass; tests updated to reflect the new read-only span + always-on form contract) - Manual click-to-edit on Slide deck and Image chips in the pnpm tools-dev web runtime — caret now lands where the user clicked. * Implement home audio essential workflow * Fix Home media composer review issues * Guard stale Home media apply results --------- Co-authored-by: hahaplus <zmjdll@gmail.com>	2026-05-18 23:14:03 +08:00
kami	1523defad1	feat(web): add plugin registry detail drawer (#2087 ) Co-authored-by: multica-agent <github@multica.ai>	2026-05-18 22:12:16 +08:00
kami	8a629eb999	fix: discover codex models from cli (#2082 )	2026-05-18 22:11:51 +08:00
kami	ec53959601	feat(web): unify plugin trust badges (#2088 ) Normalizes plugin trust badge rendering across Home, plugin details, registry entries, sources, and plugin share/install surfaces while mapping bundled and official sources to the user-facing Official tier. Validation: CI and nix-check were green on PR head `1a68f20571`.	2026-05-18 19:20:01 +08:00
kami	ac748d53fc	Fix pointer HTML artifact previews (#2075 ) Resolves short pointer-only HTML artifacts to the existing project HTML target before persisting preview artifacts, with helper and ProjectView regression coverage for #536. Validation: CI and nix-check were green on PR head `8c7ee5f38d`.	2026-05-18 18:28:28 +08:00
kami	7b3b7c3b74	Fix finalize provider routing for Gemini BYOK (#1964 ) Routes Finish Design/finalize requests through the selected BYOK provider, including Gemini, while preserving the Anthropic fallback. Validation: CI and nix-check were green on PR head `6c334e08d1`.	2026-05-18 18:03:44 +08:00
kami	0101a09b10	fix(mcp): support no-auth local HTTP servers (#2008 )	2026-05-18 17:08:46 +08:00
Yuhao Chen	25708fbe0f	fix(web): keep connector errors out of cards (#1740 ) * fix(web): keep connector errors out of cards * fix(web): route panel-alert open-details through openConnectorDetails The new "open details" action on the panel-alert row called `setDetailConnectorId(alert.connectorId)` directly, bypassing the `toolPreviewFailedIds` clear that `openConnectorDetails()` does on every other entry point. Concrete regression: user opens a connector's drawer from the card, the tool-preview hydration fetch fails so `toolPreviewFailedIds[id]` gets stamped with the current `toolPreviewRetryToken`, user closes the drawer, and later the same connector throws a connect error and shows up as a panel alert. Clicking the alert's open-details button re-sets `detailConnectorId` but never clears the failed-fetch token, so the hydration effect at the previous bail-out check (`toolPreviewFailedIds[detailConnector.id] === toolPreviewRetryToken`) returns immediately and the drawer never retries loading tools. Routing the click through `openConnectorDetails(alert.connectorId)` restores the same clear-before-set behavior the card and search entry points already had, so the reopened drawer always gets a fresh hydration attempt. * test(web): cover connector panel alert tool retry * fix(web): clarify connector panel alert status text * fix(web): clear stale connector cancel alerts --------- Co-authored-by: lefarcen <935902669@qq.com>	2026-05-18 17:06:46 +08:00
Ethan Guo	bd1b41a9d4	feat(web): highlight captured Pod component on chip hover (#1982 ) * feat(web): highlight captured Pod component on chip hover Wires onPointerEnter/Leave on chip + onFocus/Blur on its × so the matching member overlay gets an outline-focused emphasis on canvas. * ci: trigger re-run for flaky palette dark-mode test * fix(web): skip non-mouse pointer events on pod chip hover Gate onPointerEnter/Leave to pointerType === 'mouse' so a touch tap on a chip no longer flickers the captured-member highlight; also add the CommentTargetOverlay class-wiring test, drop the redundant alpha on the hover outline, and document the non-member fallback path.	2026-05-18 16:34:19 +08:00
fyz3120	92d91101ae	Fix inspect ancestor selection notice (#2039 ) Co-authored-by: fuyizheng3120 <182291973+fuyizheng3120@users.noreply.github.com>	2026-05-18 16:33:42 +08:00
Jeongjin Shin	0026dfa344	fix: support CODEX_API_KEY for Codex CLI (#1880 ) Some checks failed ci / Packaged mac smoke (push) Blocked by required conditions Details ci / Packaged windows smoke (push) Blocked by required conditions Details ci / Detect PR change scopes (push) Failing after 1s Details ci / Validate workspace (push) Has been skipped Details landing-page-ci / Validate landing page (push) Failing after 1s Details landing-page-deploy / Deploy landing page (push) Has been skipped Details nix-check / build (push) Failing after 3s Details ci / Packaged linux headless smoke (push) Has been skipped Details * fix(daemon): allow Codex API key env * fix(web): expose Codex API key setting * fix(web): clarify Codex key labels	2026-05-18 14:29:23 +08:00
Yuhao Chen	156191d565	fix(web): show unavailable state for missing examples (#1983 )	2026-05-18 14:02:22 +08:00
Ghxst	332e982a07	fix(web): stabilize HTML preview URL on tab switch (#1959 ) Some checks failed ci / Packaged mac smoke (push) Blocked by required conditions Details ci / Packaged windows smoke (push) Blocked by required conditions Details ci / Detect PR change scopes (push) Failing after 1s Details ci / Validate workspace (push) Has been skipped Details nix-check / build (push) Failing after 2s Details ci / Packaged linux headless smoke (push) Has been skipped Details Co-authored-by: GHX5T-SOL <200635707+GHX5T-SOL@users.noreply.github.com>	2026-05-17 23:25:14 +08:00
kami	91be2696dd	Persist routine failure reasons (#1963 ) Co-authored-by: multica-agent <github@multica.ai>	2026-05-17 23:22:00 +08:00
hahaplus	89bf313782	fix(web): align Home prompt overlay with textarea so caret lands on click (#1958 ) Picking a chip such as Slide deck or Image loaded a default prompt into the Home textarea and rendered an overlay with `{{key}}` placeholders as interactive `<input>` / `<select>` controls. The overlay controls and the underlying textarea text were laid out independently: - Inputs declared `min-width: 8ch` and `Math.max(displayValue.length + 1, 10)ch` of width. - Selects added 18px of right-padding for the dropdown arrow. - The textarea kept the raw substituted string in a proportional font. The two layouts no longer matched column-for-column, so every slot shifted the textarea text to the left of where it appeared in the overlay, compounding across the line. Clicking on visible prose to position the caret hit a different character offset in the textarea and subsequent typing or deletion landed in the wrong place. For example, the Slide deck template Create a {{slideCount}}-slide {{deckType}} for {{audience}} about {{topic}}. ... renders with slideCount=10 (~2 ch in the textarea) under a slot input forced to 10 ch in the overlay — clicking right after the literal `-slide` placed the caret several characters into `pitch deck`. The Image / Video / Audio chips with their pre-filled subject, style, aspect values reproduced the same drift. Render the inline pills as read-only `<span>`s carrying the exact substring the textarea shows at that position, mark them `aria-hidden` so the textarea remains the single labelled control, and surface every plugin input field — including the ones referenced inline — in `PluginInputsForm` underneath. Editing flows through the form, the parent's `updateActiveInputs` already re-renders the prompt, and the pills stay aligned with the textarea on every keystroke. Also drop the now-unused inline helpers (updatePluginInput, getTemplateInputNames, shouldRenderSlotAsText, inlineFieldType, fileInputLabel, fileMetadata) and the dead `.home-hero__prompt-slot-control/input/select/toggle/file/text` CSS rules. Verified: - pnpm --filter @open-design/web typecheck - vitest run on HomeHero.plugin-picker, HomeHero.rail, and HomeView.prefill (29/29 pass; tests updated to reflect the new read-only span + always-on form contract) - Manual click-to-edit on Slide deck and Image chips in the pnpm tools-dev web runtime — caret now lands where the user clicked.	2026-05-17 23:18:28 +08:00
Ethan Guo	0d2c87242f	fix(web): snapshot pending before reload so expired-auth cancel actually fires (#1972 ) reloadConnectorStatuses commits its prune setState (and runs the connectorAuthorizationPendingRef mirror effect) between the two awaits in onFocus, so cancelStaleAuthorizations was reading a post-prune ref and never identified anyone as stuck. Capture the snapshot inside onFocus before reloadConnectorStatuses runs, and pass it explicitly so the expired-auth daemon cancel actually fires. Refs #1909 #1354	2026-05-17 23:13:53 +08:00
kami	d36ceacf3a	feat(web): surface saved Project instructions for review and retrieval (#1933 ) Saved Project instructions had no post-save read-back surface. The header control was a bare pencil icon that opened an editor and closed on Save, so after returning to the workspace a user could not confirm what was stored, revisit it, or tell whether it was still active (#1822). Make the saved state discoverable: once instructions exist, the header affordance becomes a persistent "Project instructions" chip. Opening it shows a read-only review panel with a preview of the saved text and an "Active - included in every message" status, plus a reopen-to-edit action. Saving lands back on the review panel so the stored value is read back immediately. The empty state keeps the pencil and opens the editor directly. Persistence and project-level system-prompt injection are unchanged. Closes #1822 Co-authored-by: multica-agent <github@multica.ai>	2026-05-17 23:07:25 +08:00
kami	647433ccef	Fix Inspect preview transport flash (#1967 ) Some checks failed ci / Packaged mac smoke (push) Blocked by required conditions Details ci / Packaged windows smoke (push) Blocked by required conditions Details ci / Detect PR change scopes (push) Failing after 2s Details ci / Validate workspace (push) Has been skipped Details nix-check / build (push) Failing after 1s Details ci / Packaged linux headless smoke (push) Has been skipped Details * Fix inspect preview transport flash Co-authored-by: multica-agent <github@multica.ai> * Keep inactive srcdoc transport lazy Co-authored-by: multica-agent <github@multica.ai> --------- Co-authored-by: multica-agent <github@multica.ai>	2026-05-17 22:30:37 +08:00
kami	30ad8b8ac3	improve privacy consent modal: policy link, clearer CTAs, mobile layout (#1921 ) The first-run consent banner had no link to a privacy policy, an affirmative button ("Help improve") that didn't read as a consent choice, and a fixed bottom-right card that crowded content on phones. - Add a "Read the privacy policy" link (external-link icon, accent, underlined) above the actions, plus a root PRIVACY.md it points to documenting the telemetry behaviour the modal discloses. - Rename the CTAs to "Share usage data" / "Don't share" so both name the action; they stay equal-prominence per the EDPB/GDPR comment. - Stretch the banner to a bottom-edge bar under 540px with a safe-area inset so it clears mobile browser chrome. - Add PrivacyConsentModal tests; sync the new i18n key to every locale and update the consent-label assertion in App.connectors. Refs #1756	2026-05-17 20:24:15 +08:00
Ethan Guo	9cc1fb28f3	fix(web): apply Tweaks palette shift to :root CSS custom properties (#1952 ) Walk document.styleSheets so chromatic custom properties on :root/html/body/:host (including @media-nested rules) are hue-shifted via inline setProperty, not just the values returned by getComputedStyle. Fixes #1393 References: PR #1643 (open, in review) binds the Tweaks toolbar toggle to artifact `.tw-panel` panels via a new `injectTweaksBridge` in `srcdoc.ts` — adjacent surface but different problem. Our fix extends `injectPaletteBridge` to also shift `:root` CSS custom-property colors, which #1643 does not touch.	2026-05-17 20:16:57 +08:00
Ethan Guo	b7fa1d9286	fix(web): recover stuck Composio connector when auth is cancelled (#1909 ) * fix(web): recover stuck Composio connector when auth is cancelled When a user closes the Composio system-browser auth window without completing the flow, the connector card stayed in its loading state forever because nothing told the daemon the request was abandoned. The window-focus refresh now awaits the status poll and, for every connector still in connectorAuthorizationPending whose freshly fetched status is not 'connected', issues cancelConnectorAuthorizationRequest and clears the matching local pending/error UI state. The connected guard is read from the just-returned statuses so a postMessage success that lands just before focus is not racy. Fixes #1354 * fix(web): gate Composio focus auto-cancel on TTL + cancel response Require expiresAt to have elapsed before auto-cancelling a pending Composio auth on focus, and preserve the spinner on a failed cancel (matching the manual Cancel handler) so the UI never disagrees with the daemon. Refs https://github.com/nexu-io/open-design/pull/1909#discussion_r3254152045 Refs https://github.com/nexu-io/open-design/pull/1909#discussion_r3254152046	2026-05-17 20:15:38 +08:00
Ethan Guo	324eca27ea	feat(web): add manual removal for captured Pod components (#1951 ) * feat(web): add manual removal for captured Pod components Adds `removePodMember` helper and a per-chip × in `BoardComposerPopover`; leaves `comments.ts` untouched (avoid-zone from #1127). Closes #802 Contract: runs/2026-05-16T08-08-52_open-design_issue-feat/contract.md * style(web): hide Pod chip × until chip hover Swaps the unicode × for the existing `Icon name="close"` SVG so the hit target stays centered, and fades the button in only on chip hover / keyboard focus for a quieter resting state. * fix(web): auto-close Pod composer when last chip is removed Removing the last chip leaves a stale anchor; close so Send cannot attach to elements no longer visible. * refactor(web): extract BoardComposerPopover and Pod-member reducer Moves the popover to its own module and lifts the chip-removal reducer into a pure `applyPodMemberRemoval` so unit tests exercise the real code path and the popover's export is no longer test-only. * fix(web): rebuild Pod anchor when a member is removed Without this, the popover keeps the original union bbox / selector / label after each chip removal, so a subsequent Send to chat anchors the comment to elements no longer in the Pod. * fix(web): render every captured chip and scroll on overflow The previous slice(0, 6) cap left chips beyond the sixth invisible and undeletable. Render the full list inside a 132px-tall scrollable strip.	2026-05-17 20:13:56 +08:00
Ethan Guo	10c62bf6e4	fix(web): warn before editing a built-in skill creates a shadow (#1850 ) Clicking Edit on a built-in skill silently issued PUT /api/skills/:id, which the daemon stores as a user-owned shadow and then hides the built-in entry from the Settings list. From the user's perspective the row they just edited disappears with no explanation. Arm an inline override-warning banner on Edit for source==='built-in' skills, mirroring the existing inline delete-confirm pattern. The edit form only opens after the user explicitly confirms. User skills bypass the warning and edit directly. No daemon or listSkills contract change. Fixes #1378	2026-05-17 18:30:45 +08:00
Ethan Guo	a6288cec3f	fix(web): allow editing existing routines from RoutinesSection (#1884 ) The Routines list only exposed Run now / Pause / History / Delete, so any change to a routine required deleting and recreating it. The daemon already serves PATCH /api/routines/:id; only the web entry point was missing. Add an Edit button on each routine row that pre-populates the create form from the stored schedule and target via a new formFromRoutine helper, and branch the submit handler on editingId so it PATCHes /api/routines/:id with the updated fields instead of POSTing a new routine. Fixes #1373	2026-05-17 15:02:54 +08:00
张东明	bac56415a2	fix(web): surface daemon error messages for invalid folder imports (#1923 ) * fix(web): surface daemon error messages for invalid folder imports importFolderProject() was swallowing non-2xx responses by returning null, so the UI could only show a generic "Open folder failed: <path>" message even though the daemon already returns specific errors like "cannot import the filesystem root" or "folder not found". Parse the daemon error body and throw so the panel displays the actual reason. Also show feedback for empty path input instead of silently returning. Fixes #1186 * test(web): update folder import test to match new error propagation The existing test expected a generic "Open folder failed: <path>" message from a boolean return. Update to match the new behavior where the daemon's error message is thrown and displayed directly.	2026-05-17 15:00:49 +08:00
kami	e64f1d8497	fix(desktop): export PDF saves to a file instead of the OS print dialog (#1920 ) * fix(desktop): export PDF saves to a file instead of the OS print dialog The `od:print-pdf` IPC handler called `webContents.print()`, which opens the printer-first macOS system print dialog. That handler is the destination of the renderer's `window.__odDesktop.printPdf()` bridge, so on desktop "Export PDF" felt like a print flow rather than a file export — from both the PreviewModal share menu (which always uses this path) and the FileViewer share menu (whose daemon-route fallback lands here too). Route the handler through a direct Save-as-PDF flow instead: a native Save dialog, then `webContents.printToPDF()` straight to the chosen file — the same shape `exportPdfFromHtml` already uses for the daemon-backed export path. The flow is extracted into `savePrintReadyDocumentAsPdf` behind a structural `PrintReadyPdfTarget` surface that has no `print()` method, so the regression cannot be reintroduced. Fixes #1774 Co-authored-by: multica-agent <github@multica.ai> * fix(desktop): preserve PDF page sizing in print bridge Co-authored-by: multica-agent <github@multica.ai> --------- Co-authored-by: multica-agent <github@multica.ai>	2026-05-17 11:26:08 +08:00
Yuhao Chen	2f553941d3	fix(runtime): auto-annotate imported HTML elements for Tweaks selection (#892 ) (#1169 ) * fix(runtime): auto-annotate imported HTML elements for Tweaks selection (#892) * fix(runtime): always annotate missing od-ids and broaden selector (#892) - Remove the conditional gate so annotateMissingOdIds runs for every srcdoc, not just comment/inspect bridges. This fixes the persistence regression where saved Inspect tweaks reference synthesized data-od-id selectors that vanish when the bridge is rebuilt without annotations. - Expand the selector to cover div-based imported HTML (div[class], div[id]), headings, buttons, and links so Picker/Pods work on common anonymous wrapper markup. Addresses review feedback from lefarcen and mrcfps. * fix(runtime): narrow div selector, skip iframe/object/embed, add tests (#892) - Replace broad div[class] with direct-child combinators only under semantic containers, body, and [id] elements to avoid layout noise - Add iframe, object, embed to the skip list alongside script/style - Add tests for: direct-child divs, nested div skip, always-on behavior, skip list with id attributes, div children of [id] elements - Format selector as array join and skip tags as Set for readability * fix(runtime): let annotateManualEditSourcePaths coexist with data-od-id (#892) annotateMissingOdIds now runs unconditionally, so elements like main/h1 get data-od-id before annotateManualEditSourcePaths runs. The old skip condition (has data-od-id) incorrectly prevented source-path marking on those elements. Change the guard to skip only when the element already has data-od-source-path, allowing both attributes to coexist.	2026-05-16 16:17:36 +08:00
Yuhao Chen	26502ea124	test(web): decouple memory preview icon assertion (#1863 )	2026-05-16 11:37:34 +08:00
mehmet turac	6b15236843	fix(web): distinguish expanded memory preview action (#1813 )	2026-05-15 23:08:30 +08:00
Yuhao Chen	19d65678a3	fix(web): keep filter pill hover labels readable (#1828 )	2026-05-15 23:07:43 +08:00
mehmet turac	9689dce1ad	fix(web): align comment marker numbering (#1826 )	2026-05-15 23:07:32 +08:00
mehmet turac	444f7d03eb	fix(web): reveal memory editor after edit click (#1827 )	2026-05-15 23:07:18 +08:00
mehmet turac	7d9488300e	fix(web): keep draw overlay scrollable (#1848 )	2026-05-15 23:05:49 +08:00
mehmet turac	c4a8e92916	fix(web): add breathing room to plugin publish footer (#1849 )	2026-05-15 23:02:36 +08:00
lefarcen	22a3b99a47	Merge origin/main into preview/v0.8.0 Sync 49 commits from main. Conflicts resolved: - .github/workflows/ci.yml: kept v0.8.0 granular per-area gating, added main's linux specs + release-stable.yml + release-preview.yml triggers - .github/workflows/release-preview.yml: kept v0.8.0's full workflow over main's placeholder - apps/web/src/components/AssistantMessage.tsx: combined v0.8.0 file-ops summary with main's stripTodoToolGroups + suppressAskUserQuestionFallbackText - apps/web/src/components/ChatPane.tsx: kept both new imports - apps/web/src/index.css: kept both .msg-plugin-chip and .user-copy-btn blocks - e2e/ui/*.test.ts: kept v0.8.0 openEntrySettingsDialog helper over main's inline dialog navigation (UI was redesigned in v0.8.0) - nix/package-{daemon,web}.nix: kept v0.8.0 pnpmDepsHash; rerun nix build to refresh	2026-05-15 18:23:33 +08:00
mehmet turac	15ebe8b266	fix(web): keep picker hint clear of comments panel (#1820 )	2026-05-15 17:51:12 +08:00
mehmet turac	fd80d7c997	fix(web): clear draw ink when exiting mode (#1821 )	2026-05-15 17:48:28 +08:00
Nicholas-Xiong	9cfb01e6b0	feat: add Italian (it) locale support (#1323 ) * feat: add Italian (it) locale support - Add complete Italian translation dictionary (apps/web/src/i18n/locales/it.ts) - Register 'it' locale in types.ts (Locale type, LOCALES array, LOCALE_LABEL) - Import and register Italian dictionary in index.tsx - All 1352 translation keys translated to Italian - Follows the same structure as French locale (PR #326) Closes #1245 * test: Update locale tests to include Italian (it) 1. Add 'it' to EXPECTED_LOCALES array 2. Add LOCALE_LABEL.it assertion to verify 'Italiano' label This fixes the CI test failure and completes the Italian locale integration. --------- Co-authored-by: lefarcen <935902669@qq.com>	2026-05-15 16:38:55 +08:00
leessju	4e19c3f4f3	Prevent imported Claude canvases from zooming on scroll (#1726 ) * Preserve HTML preview state across mode toggles HTML previews could rebuild their iframe when switching into Edit or Comment, which reset scroll/canvas state and caused visible churn for multi-file artifacts. The viewer now keeps URL-loaded previews mounted when the artifact owns the mode bridge, relays file-refreshes through frame navigation, and restores preview scroll/viewport state across bridge mode changes. Constraint: Generic srcdoc-only bridges are still required for unbridged artifacts, inspect mode, palette tweaks, decks, draw overlays, and forceInline. Rejected: Keep all Edit/Comment previews on srcdoc \| causes unnecessary iframe replacement for bridge-capable URL-loaded artifacts. Confidence: high Scope-risk: moderate Directive: Do not enable URL-load for bridge-dependent modes unless the artifact has an owned postMessage bridge. Tested: pnpm guard Tested: pnpm --filter @open-design/web typecheck Tested: pnpm --filter @open-design/web test Tested: Playwright verified Edit and Comment toggles preserve iframe src and DOM node while receiving comment targets. * Prevent preview wheel gestures from escaping into zoom Trackpad pinch-like wheel events arrive with ctrl/meta modifiers on some platforms, which can make a normal vertical scroll feel like the preview zoomed. The preview now consumes those modified wheel events inside the host preview shell and in injected srcdoc previews, then maps the delta back to scroll where a scroll target exists. Constraint: URL-loaded sandbox iframes cannot always be inspected by the host, so srcdoc previews need their own in-frame guard. Rejected: Add allow-same-origin to preview iframes \| weakens the sandbox boundary for generated artifacts. Confidence: medium Scope-risk: narrow Directive: Do not broaden iframe sandbox permissions to fix gesture handling without a security review. Tested: pnpm guard Tested: pnpm --filter @open-design/web typecheck Tested: pnpm --filter @open-design/web exec vitest run tests/components/FileViewer.test.tsx tests/runtime/srcdoc.test.ts Tested: playwright-cli verified ctrl-wheel in preview keeps app zoom at 100% and prevents default in the iframe context * Revert "Prevent preview wheel gestures from escaping into zoom" This reverts commit `976407ab4c`. * Prevent imported Claude canvases from zooming on scroll Claude Design exports can classify ordinary macOS two-finger vertical wheel events as mouse-wheel zoom clicks inside design-canvas.jsx. Normalize that imported canvas code so plain wheel input pans, while Cmd+wheel remains the explicit zoom gesture. Constraint: The offending canvas code lives inside imported user artifacts rather than a tracked runtime component, so the fix belongs in the Claude Design zip import normalization path.\nRejected: Host-side wheel interception \| wheel events inside the sandboxed iframe are handled by the artifact before the host can reliably classify them.\nRejected: Disable all wheel zoom \| users still need Cmd+wheel as an explicit zoom control.\nConfidence: high\nScope-risk: narrow\nDirective: Keep plain wheel as pan-only for imported design-canvas.jsx unless a future bridge provides an explicit wheel-mode handshake.\nTested: pnpm --filter @open-design/daemon exec vitest run tests/claude-design-import.test.ts\nTested: pnpm --filter @open-design/daemon typecheck\nTested: pnpm guard --------- Co-authored-by: nicejames <nicejames@gmail.com> Co-authored-by: lefarcen <935902669@qq.com>	2026-05-15 16:37:57 +08:00
Nicholas-Xiong	d16acf6462	fix: Add error feedback for manual folder path import (#1666 ) * fix: Add error feedback for manual folder path import Fixes #1408 When users manually enter a folder path and click 'Open folder' (non-Electron environment), the app now provides clear error feedback if the import fails. Before: - No error clearing before import - No error handling for failed imports - Silent failures left users confused After: - Clears previous errors before attempting import - Catches and displays import errors with clear messages - Success feedback is implicit (navigation to the opened project) Why implicit success feedback: The parent handler (Home.tsx) navigates to the newly opened project on success, which provides clear visual feedback by changing the entire view. An additional toast would be redundant. Error handling: - Catches all errors from onImportFolder - Displays user-friendly error messages - Preserves error details when available * fix: surface failed folder imports --------- Co-authored-by: Siri-Ray <2667192167@qq.com>	2026-05-15 16:36:24 +08:00
Zihan Zhao	cfcfbe0178	Inline attached file context for BYOK chats (#1730 ) BYOK/API-mode chats bypass the daemon run path, so attached project files were saved as message metadata but their readable contents were not sent to the provider. This adds a web-side attachment context step for API-mode requests, reusing raw text reads and existing document preview extraction. Constraint: Docker PDF previews require pdftotext in the runtime image Confidence: high Scope-risk: moderate Tested: corepack pnpm --filter @open-design/web test -- tests/api-attachment-context.test.ts tests/components/ProjectView.api-empty-response.test.tsx Tested: corepack pnpm --filter @open-design/web typecheck Tested: corepack pnpm --filter @open-design/web build Tested: corepack pnpm guard Tested: corepack pnpm typecheck	2026-05-15 15:52:15 +08:00
Yuhao Chen	b2d2635360	fix(web): hide resolved comments from preview overlays (#1762 )	2026-05-15 15:46:03 +08:00
Quang Do	88db51521d	feat(web): add custom select primitive (#1714 ) Some checks failed ci / Packaged mac smoke (push) Blocked by required conditions Details ci / Packaged windows smoke (push) Blocked by required conditions Details ci / Detect PR change scopes (push) Failing after 2s Details ci / Validate workspace (push) Has been skipped Details nix-check / build (push) Failing after 1s Details * feat(web): add custom select primitive * fix(web): harden custom select active option state	2026-05-15 14:43:18 +08:00
Tom Huang	c5d77a03bd	Garnet hemisphere (#1769 ) Some checks failed nix-check / build (push) Failing after 2s Details * feat(chat-composer): enhance mention handling and input overlay - Introduced a new overlay for inline mentions in the chat composer, improving user experience by visually indicating mentions as users type. - Updated the `ChatComposer` component to manage mention entities and integrate them into the input field, allowing for better context and interaction. - Enhanced the `AssistantMessage` component to support the display of plugin action panels based on the current project context, facilitating easier plugin management. - Refactored related components to ensure consistent handling of project files and mentions across the application. This update significantly improves the chat interaction model, making it more intuitive for users to engage with mentions and plugins. * feat(plugin-management): enhance plugin action panels and UI components - Updated the `AssistantMessage` component to include plugin action panels based on the latest project context, improving user interaction with generated plugins. - Refactored the `PluginsView` to support detailed views for available marketplace entries, allowing users to access more information and actions for each plugin. - Introduced new CSS styles for improved visual representation of plugin-related UI elements, enhancing overall user experience. - Enhanced the `listPlugins` function to include an option for fetching hidden plugins, providing more flexibility in plugin management. This update significantly improves the usability and functionality of the plugin management system, making it easier for users to interact with and manage their plugins. * fix(assistant-message): refine plugin folder candidate selection logic - Updated the `pluginFoldersTouchedThisTurn` function to improve the logic for selecting plugin folder candidates based on touched paths and message content. - Introduced a new helper function, `pathMatchesFolderFileBasename`, to enhance the matching criteria for folder candidates. - Added a check for explicit folder matches before falling back to a single candidate, improving accuracy in folder selection. - Modified the `shouldRenderSlotAsText` function in `HomeHero` to include the name parameter, refining the rendering logic for slot text. These changes enhance the functionality and reliability of the assistant message component in managing plugin folder candidates. * feat(plugin-folder-actions): implement agent-routed CLI actions for plugin management - Introduced a new `PluginFolderAgentAction` type to streamline actions related to plugin folders, including install, publish, and contribute. - Updated the `DesignFilesPanel`, `FileWorkspace`, and `AssistantMessage` components to utilize the new agent action handling, improving user interaction with generated plugins. - Refactored the action handling logic to send commands to the agent, enhancing the workflow for managing plugin folders. - Added corresponding tests to ensure the new functionality works as expected and integrates seamlessly with existing components. This update significantly enhances the plugin management experience by routing actions through the agent, allowing for a more cohesive and interactive user experience. * Fix PR 1702 CI blockers * Fix PR 1702 remaining CI checks * Prebuild AGUI adapter after install * Restore plugin project snapshot wiring * feat(marketplace): refactor marketplace URL handling and enhance fetching logic - Introduced new functions to normalize marketplace URLs and manage fetching of marketplace manifests, improving the reliability of marketplace integrations. - Updated the server and plugin logic to utilize the new fetching mechanisms, ensuring consistent handling of marketplace data. - Enhanced tests to cover new URL normalization and fetching scenarios, ensuring robustness in marketplace management. This update significantly improves the marketplace experience by streamlining URL handling and enhancing data fetching capabilities. * Fix project auto-send cleanup spec * Reconcile run messages on cancel * Use active design system as visual direction * Fix active design system prompt wording * feat(workspace-tabs): implement workspace tabs functionality and file attachment handling - Introduced a new `WorkspaceTabsBar` component to manage workspace tabs, allowing users to navigate between different views (projects, marketplace, etc.). - Enhanced file handling capabilities in the `HomeHero` and `EntryShell` components, enabling users to stage and attach files before project creation. - Updated the `App` component to support auto-sending attachments alongside the first message in a project. - Improved CSS styles for workspace tabs and attachment UI, ensuring a cohesive design and user experience. This update significantly enhances the workspace navigation and file management features, providing users with a more intuitive and efficient workflow. * refactor(workspace-tabs): streamline workspace tabs and UI components - Removed unused components and actions from the `WorkspaceTabsBar` and `AppChromeHeader`, simplifying the codebase. - Updated CSS styles for the workspace shell and tabs, enhancing visual consistency and reducing element sizes for a cleaner layout. - Introduced a new client type detection mechanism to dynamically adjust the workspace shell's class, improving responsiveness. - Added tests for the `WorkspaceTabsBar` to ensure proper navigation and tab management functionality. These changes improve the overall performance and user experience of the workspace navigation system. * Update critical e2e for entry modal flow * Stabilize entry critical e2e flows * fix(ui): adjust workspace tabs and header styles for improved layout - Updated the CSS for workspace tabs and the app header, reducing element sizes and padding for a cleaner appearance. - Introduced a new button in the `WorkspaceTabsBar` for quick access to the home tab, enhancing navigation. - Minor adjustments to the layout and styles to ensure consistency across components. These changes enhance the user interface and improve the overall user experience in the workspace navigation system. * feat(workspace-tabs): implement pinned home tab functionality - Added a new pinned home tab feature to the `WorkspaceTabsBar`, allowing the home tab to remain accessible during navigation. - Updated tab management logic to collapse duplicate home tabs into a single pinned instance when restoring from local storage. - Enhanced CSS styles for workspace tabs to accommodate the new pinned tab design. - Updated tests to verify the behavior of the pinned home tab and its interaction with other tabs. These changes improve navigation consistency and user experience within the workspace. * refactor(workspace-tabs): enhance tab management and styling - Updated CSS styles for workspace tabs, adjusting padding and flex properties for improved layout and consistency. - Refactored tab creation logic to ensure unique IDs for project and marketplace tabs, enhancing navigation clarity. - Removed deprecated functions related to pinned home tabs, streamlining the codebase. - Improved test cases to verify independent behavior of home tabs during navigation. These changes enhance the user experience by providing a more intuitive tab management system and a cleaner UI. * style(workspace-tabs): update CSS for improved layout and visibility - Adjusted CSS properties for workspace tabs, including overflow, position, and z-index to enhance layout and stacking context. - Ensured consistent styling across tab components for better visual hierarchy. These changes contribute to a more polished and user-friendly interface within the workspace. * style(entry-layout): update CSS variables for improved layout consistency - Replaced fixed width values with CSS variables for the entry rail to enhance flexibility. - Adjusted padding and height properties for better visual alignment and spacing. - Introduced a new background style for the entry main topbar to improve aesthetics. These changes contribute to a more responsive and visually appealing layout in the entry view. --------- Co-authored-by: qiongyu1999 <2694684348@qq.com> Co-authored-by: Eli <129168833+qiongyu1999@users.noreply.github.com>	2026-05-15 14:42:11 +08:00
ngoduybien	843b6fec4f	fix(web): fall back to srcDoc when HTML preview needs sandbox shim (#1306 ) * fix(web): fall back to srcDoc preview when HTML needs the sandbox shim The URL-load HTML preview iframe is sandboxed with `allow-scripts` only — no `allow-same-origin` — so any artifact that reads `localStorage`/`sessionStorage` at startup throws SecurityError, its React tree unmounts, and the preview goes blank. The srcDoc path already polyfills both via `injectSandboxShim` (apps/web/src/runtime/srcdoc.ts) before any user script runs, but URL-load served raw HTML untouched. Agent-emitted React prototypes that read Web Storage at mount went blank until the user toggled Tweaks (which forces the srcDoc path). Detect the two reliable signals — `<script type="text/babel">` (Babel-standalone XHR-fetches and evals sibling `.jsx` files at runtime; those routinely read Web Storage from `useState` initializers) and direct `localStorage` / `sessionStorage` references in the source — and set `forceInline` automatically so those artifacts route back through the srcDoc path. Plain static HTML keeps the URL-load benefits (real source maps, per-asset HTTP caching, isolated per-script failures). No new daemon endpoint, no new contract, no sandbox loosening. Pure content sniff in the existing render-mode helper; reuses the same `forceInline` seam the `parseForceInline` opt-out already uses. Tests cover the new helper across positive (Babel-standalone variants with attribute reordering, quoting, whitespace, case) and negative (plain script tags, module type, JSON type, substring lookalikes) cases. * fix(web): address review feedback on sandbox-shim fallback - Accept unquoted `<script type=text/babel>` per HTML5 attribute syntax (regex `["']` → `["']?`). Adds a focused test covering bare unquoted, unquoted with unquoted `src=`, mixed unquoted/quoted, and the negative case `<script type=text/babelish>` to confirm the trailing word-boundary still rejects look-alikes. - Memoize `htmlNeedsSandboxShim(source)` on `source`. HtmlViewer re-renders on board/inspect/edit/slide state changes; the scan only changes when the source itself does. Cheap micro-opt, free correctness win. - Narrow the helper docstring's scope claim and add an explicit known limitation: external scripts (`<script src="./app.js">`, `<script type="module" src="./main.js">`) that read Web Storage during module eval are not covered — the helper only sees the document, not the linked subresource. Workaround documented: `?forceInline=1` or Tweaks. Catching this case would require fetching every script reference before deciding load strategy, duplicating browser work; not worth the cost until a real report surfaces. * fix(web): correct inline comment on `\b` boundary behavior The comment claimed `\b` rejects `text/babel-other`, but `\b` matches between `l` and `-` (hyphen is a non-word char), so the regex actually does match that input. The test asserts `text/babelish` as the negative case, which `\b` does correctly reject (`i` is a word char). Comment now matches the regex's actual behavior, with a note that hyphenated variants are a harmless false positive (srcDoc fallback is the safe direction) and a pointer to the `(?=[\s>"'])` lookahead tightening if a real case ever surfaces. No behavior change; existing tests still pass. * fix(web): align test comment with helper docstring on hyphenated variants Same class of inconsistency the previous commit fixed in the helper: the test comment claimed `type=text/babel-other` "remains a non-match", but the assertion actually covers `type=text/babelish`, and the helper docstring explicitly documents hyphenated variants as a safe false-positive that does match. Comment now describes both shapes correctly and explains why the hyphenated variant isn't asserted (it's the documented safe direction, not a regression). No behavior change; test count unchanged. * chore: trigger CI	2026-05-15 14:41:23 +08:00
chaoxiaoche	bcc58af931	refactor(web): rename Execution mode and tighten settings dialog UI (#1568 ) * refactor(web): rename Execution mode and tighten settings dialog UI - Rename "Settings → Execution & model" to "Settings → Execution mode" across the web UI, i18n keys, docs, and e2e selectors. - Redesign SettingsDialog: kicker + title row in the modal head, a flatMap-driven agent grid that renders the inline test-result row beside the selected card, compact unavailable cards with right-aligned install/docs links, and an install guide that only shows when the user has no working agent picked. - Trim verbose subtitle / hint copy across chat model, CLI proxy, media providers, custom instructions, and memory sections. - Add an `info` Icon variant for the redesigned settings hints. - Update e2e selectors and docs that referenced the old menu label. Co-authored-by: Cursor <cursoragent@cursor.com> * refactor(web): polish Settings dialog — media providers, skills, MCP Media providers - Hide internal Stub fixture provider (settingsVisible: false) - Split provider list into Available (integrated, editable) and Coming Soon (collapsed <details> drawer with name/hint/Docs link only) - Drop right-side Integrated/Configured badges from every row; all rows in the main list are integrated by definition; inline grey "Saved" chip next to the provider name is the only status indicator now - "Saved" badge moves inline to the right of the provider name and uses a neutral grey treatment (was a standalone green pill below the name) - "Reload from daemon" button shows a 2s green "✓ Reloaded" flash on success instead of leaving a permanent paragraph under the header; errors remain sticky Skills - Replace three pill-row filter banks (Source, Type, Category) with a compact single-row toolbar: search + three inline <select> dropdowns side by side; active filter highlighted with a stronger border MCP server - Shorten section hint to one line - Move WHAT YOUR AGENT CAN DO capabilities above the client dropdown (motivate before asking to act) - Move "Build the daemon first" warning below the code block where it contextually explains why the command might fail, not as a top-level error before the user has done anything - Downgrade "Restart your client" left-border from accent orange to border-strong grey — it is a next step, not a warning External MCP - Shorten section hint to one line Misc CSS - Add .sr-only utility for accessible off-screen live regions - Add button.ghost.is-success-flash for transient success feedback - Add .library-filter-selects / .library-filter-select for dropdown filter rows - Add .media-provider-coming-soon-* for the roadmap drawer Co-authored-by: Cursor <cursoragent@cursor.com> * [codex] Add Cursor Agent auth diagnostics (#1538) * Add Cursor Agent auth diagnostics * Handle Cursor not logged in auth status * Address Cursor auth review feedback * Classify Cursor stdout auth failures * test: expand Memory and Routines coverage (#1521) * test: expand settings and packaged coverage * test: extend memory settings coverage * test: cover routine settings failure states * test: cover routine operation failures * test: fix daemon test typing on CI * test: decouple packaged smoke from orbit bug * test: avoid live memory LLM calls in route tests * test: fix daemon fetch typing in CI * fix: restore preview comment and inspect toggles * test: align manual edit flow with current inspector UX * test: align comment attachment flow with current preview comments UI * fix: probe resolved Codex launch path during detection * fix: remove duplicate board activation helper after rebase * test: update ghost cli detection mock * test: align FileViewer toolbar expectation * ci: move full app tests to extended lane * ci: run app tests by changed scope * ci: cover shared app inputs in test scopes * ci: avoid setup-node cache in windows packaged smoke * test: align extended settings and manual edit flows * refactor(web): rename Execution mode and tighten settings dialog UI - Rename "Settings → Execution & model" to "Settings → Execution mode" across the web UI, i18n keys, docs, and e2e selectors. - Redesign SettingsDialog: kicker + title row in the modal head, a flatMap-driven agent grid that renders the inline test-result row beside the selected card, compact unavailable cards with right-aligned install/docs links, and an install guide that only shows when the user has no working agent picked. - Trim verbose subtitle / hint copy across chat model, CLI proxy, media providers, custom instructions, and memory sections. - Add an `info` Icon variant for the redesigned settings hints. - Update e2e selectors and docs that referenced the old menu label. Co-authored-by: Cursor <cursoragent@cursor.com> * refactor(web): settings dialog UX polish — layout, dedup, and interactions - Remove duplicate section headers from all settings sections (Notifications, Appearance, Privacy, About, Design Systems, Skills, MCP server, Connectors, Media providers, Routines) - Restructure Notifications cards: title + toggle on same row, hint below - Restructure Skills toolbar: search + New skill button in row 1, filter dropdowns in row 2 with left-aligned labels - Restructure Pet section: tabs and Wake button on same row - MCP server: group capabilities and setup into separate cards, remove nested double border on client picker - Connectors: show connect errors as toast instead of inline card text, position toast inside panel, hide single-provider tab - Media providers: move Reload button to left-aligned small ghost button - Memory: info icon shows path on hover, Path copied badge inline; Extraction history and MEMORY.md as standalone collapsible cards; group header hidden when only one type visible - Pet grid cards: Adopt button hidden until hover, icon-only when adopted, description truncated to 2 lines, text fills full width via abs positioning - Agent cards: selected state uses accent border only, no background change - Add sun/moon icons to Appearance theme buttons (Light/Dark) - Shorten several hint strings for clarity Co-authored-by: Cursor <cursoragent@cursor.com> * fix(web): resolve i18n review comments from PR #1568 - Update settings.title and settings.envConfigure to localized "Execution mode" in all 17 non-English locale files - Add settings.memoryFlashPathCopied to all locales and use t() in MemorySection instead of hardcoded English "Path copied" - Add settings.agentModelHead to all locales and use t() in SettingsDialog for "Model for:" agent model row header Co-authored-by: Cursor <cursoragent@cursor.com> * fix(web): update tests to match settings dialog redesign - Add role prop to Toast (alert/status) so error toasts from ConnectorsBrowser are announced immediately by screen readers - Clear connectErrorToast on successful connector retry - Update SettingsDialog.execution tests: - Remove heading assertions for About and MCP server (headers were intentionally removed as duplicate nav labels) - Rewrite CLI env test to use codex-only fields (per-agent filtering means only selected agent's fields are shown) - Update Composio key hint text assertion to match shortened copy - Replace filter button click with select change for Type filter - Replace Configured/Unsupported/Integrated badge checks with updated assertions matching the new media provider UI - Replace disabled BFL row test with coming-soon section check - Update SettingsDialog.media test: remove Fal.ai input assertions (non-integrated providers no longer have editable fields) Co-authored-by: Cursor <cursoragent@cursor.com> * fix(web): unblock CI for #1568 Three small fixes to get Playwright back to green on the settings dialog redesign: 1. `en.ts`: revert `settings.envConfigure` to "Configure execution mode". This PR collapsed both `settings.title` (header gear) and `settings.envConfigure` (entry-side foot pill) to the same string "Execution mode", so `getByRole('button', { name: 'Execution mode' })` resolved to two elements and tripped Playwright strict mode in the three Composio-flow tests (entry-configuration-flows.test.ts:174, 228, 285). Restoring the distinct label also gives screen readers a clearer hint for the pill, which doubles as a status display. Non-English locales still alias the two keys; happy to follow up on those, but they don't gate the (English-only) Playwright suite. 2. entry-configuration-flows.test.ts:167 — `Connectors` heading is now rendered at `<h2>` in the modal-head (SettingsDialog.tsx:1545), with the inner `<h3>` removed by design (see comment around line 1448). Updated the assertion from `level: 3` to `level: 2`. 3. project-management-flows.test.ts:360 — same change for the `Pets` heading. Verified locally with `pnpm --filter @open-design/web typecheck` and `pnpm --filter @open-design/e2e typecheck`. The actual Playwright specs need the dev server up; I didn't rerun them here, but the locator changes are mechanical and match the new DOM. * fix(web): use exact match for Execution mode button locator Playwright's `getByRole({ name })` defaults to substring matching, so `{ name: 'Execution mode' }` still resolved to both the header gear (aria-label "Execution mode") and the entry-side foot pill (aria-label "Configure execution mode" — substring contains "Execution mode"). Strict mode tripped in the three composio-flow tests at lines 202, 257, and 319. Adding `exact: true` makes each call resolve to just the header gear, which opens the same dialog the foot pill does — the test outcomes are unchanged. --------- Co-authored-by: chaoxiaoche <chaoxiaoche@chaoxiaochedeMacBook-Pro.local> Co-authored-by: Cursor <cursoragent@cursor.com> Co-authored-by: Caprika <56862773+alchemistklk@users.noreply.github.com> Co-authored-by: shangxinyu1 <shangxinyu@refly.ai> Co-authored-by: lefarcen <935902669@qq.com>	2026-05-15 14:35:06 +08:00
Quang Do	a41d4f6126	fix(web): keep chat pinned during content growth (#1716 )	2026-05-15 14:12:00 +08:00
Prantik Medhi	01e54700a2	fix(web): make file grouping by kind work (#1551 ) * fix(web): group design files by kind * fix(web): unblock CI for #1551 - FileViewer test (line 434): add missing `projectKind="prototype"` to match every other instance; this was the source of the typecheck failure blocking workspace validation. - DesignFilesPanel "groups files by kind" test: assert against `.df-section-label` elements so the section header check is not ambiguous with the per-row kind cell text. - DesignFilesPanel batch-delete test: derive the expected file names from the rendered row testids and use `arrayContaining` so the assertion no longer depends on the (now kind-default) row order. * fix(web): satisfy strict-index typecheck in batch-delete test `onDeleteFiles.mock.calls[0][0]` tripped `noUncheckedIndexedAccess` ("Object is possibly 'undefined'"). Drop the separate length probe and assert the exact array instead — `selected` is a `Set`, `handleBatchDelete` spreads it with `[...selected]`, and the test clicks rows[0]/rows[1] in that order, so insertion order is deterministic and equals `[firstName, secondName]`. --------- Co-authored-by: lefarcen <935902669@qq.com>	2026-05-15 13:07:27 +08:00
Prantik Medhi	8aeedf368b	fix(web): localize accent controls in settings (#1565 ) * fix(web): localize accent controls * fix(web): localize accent default label * fix(web): unblock CI for #1565 Add missing `projectKind="prototype"` to the FileViewer deck-render test (line 434) so workspace typecheck stops failing on the `Property 'projectKind' is missing` error. This mirrors every other FileViewer render in the same file and is unrelated to the accent localization changes in this PR — it's drift from a recent change on main that made `projectKind` required. --------- Co-authored-by: lefarcen <935902669@qq.com>	2026-05-15 12:09:28 +08:00
Yuhao Chen	b0963fd874	fix(web): allow downloads from preview iframes (#1732 )	2026-05-15 11:55:29 +08:00
Tom Huang	76defffb93	Garnet hemisphere (#1702 ) * feat(chat-composer): enhance mention handling and input overlay - Introduced a new overlay for inline mentions in the chat composer, improving user experience by visually indicating mentions as users type. - Updated the `ChatComposer` component to manage mention entities and integrate them into the input field, allowing for better context and interaction. - Enhanced the `AssistantMessage` component to support the display of plugin action panels based on the current project context, facilitating easier plugin management. - Refactored related components to ensure consistent handling of project files and mentions across the application. This update significantly improves the chat interaction model, making it more intuitive for users to engage with mentions and plugins. * feat(plugin-management): enhance plugin action panels and UI components - Updated the `AssistantMessage` component to include plugin action panels based on the latest project context, improving user interaction with generated plugins. - Refactored the `PluginsView` to support detailed views for available marketplace entries, allowing users to access more information and actions for each plugin. - Introduced new CSS styles for improved visual representation of plugin-related UI elements, enhancing overall user experience. - Enhanced the `listPlugins` function to include an option for fetching hidden plugins, providing more flexibility in plugin management. This update significantly improves the usability and functionality of the plugin management system, making it easier for users to interact with and manage their plugins. * fix(assistant-message): refine plugin folder candidate selection logic - Updated the `pluginFoldersTouchedThisTurn` function to improve the logic for selecting plugin folder candidates based on touched paths and message content. - Introduced a new helper function, `pathMatchesFolderFileBasename`, to enhance the matching criteria for folder candidates. - Added a check for explicit folder matches before falling back to a single candidate, improving accuracy in folder selection. - Modified the `shouldRenderSlotAsText` function in `HomeHero` to include the name parameter, refining the rendering logic for slot text. These changes enhance the functionality and reliability of the assistant message component in managing plugin folder candidates. * feat(plugin-folder-actions): implement agent-routed CLI actions for plugin management - Introduced a new `PluginFolderAgentAction` type to streamline actions related to plugin folders, including install, publish, and contribute. - Updated the `DesignFilesPanel`, `FileWorkspace`, and `AssistantMessage` components to utilize the new agent action handling, improving user interaction with generated plugins. - Refactored the action handling logic to send commands to the agent, enhancing the workflow for managing plugin folders. - Added corresponding tests to ensure the new functionality works as expected and integrates seamlessly with existing components. This update significantly enhances the plugin management experience by routing actions through the agent, allowing for a more cohesive and interactive user experience. * Fix PR 1702 CI blockers * Fix PR 1702 remaining CI checks * Prebuild AGUI adapter after install * Restore plugin project snapshot wiring * feat(marketplace): refactor marketplace URL handling and enhance fetching logic - Introduced new functions to normalize marketplace URLs and manage fetching of marketplace manifests, improving the reliability of marketplace integrations. - Updated the server and plugin logic to utilize the new fetching mechanisms, ensuring consistent handling of marketplace data. - Enhanced tests to cover new URL normalization and fetching scenarios, ensuring robustness in marketplace management. This update significantly improves the marketplace experience by streamlining URL handling and enhancing data fetching capabilities. * Fix project auto-send cleanup spec	2026-05-14 21:12:50 +08:00
sakshyasinha	c4a67a7b3e	Fix Kimi CLI icon contrast in light mode (#1667 ) * fix(web): improve Kimi CLI icon contrast * fix(web): render Kimi icon via theme-aware CSS mask Move Kimi to the MONO_ICONS set so it renders through CSS mask with currentColor adaptation, making it legible in both light and dark themes instead of baking a single dark fill that fails on dark backgrounds. * fix(web): adjust Kimi icon secondary mark for dual-theme contrast Keep Kimi as a baked two-tone asset: blue accent (#1783ff) for brand identity, mid-tone gray (#666666) secondary mark for acceptable contrast on both light and dark card surfaces. Revert from mask path to preserve the blue branding. * fix(web): correct corrupted Kimi SVG and strengthen asset validation test Remove extraneous PR discussion text that was accidentally included in the SVG file. Strengthen the test to validate the bundled asset is valid SVG with the expected fills (blue accent + gray secondary mark), catching asset corruption that would otherwise go undetected.	2026-05-14 20:32:52 +08:00
lefarcen	640a332276	Merge remote-tracking branch 'origin/garnet-hemisphere' into reconcile/garnet-main-merge	2026-05-14 17:44:44 +08:00
lefarcen	3b7f87c7ae	Merge remote-tracking branch 'origin/main' into reconcile/garnet-main-merge	2026-05-14 17:44:26 +08:00
pftom	8e3af79dea	feat(github-installer): enhance GitHub content installation and error handling - Introduced new interfaces for GitHub content entries and budgets to streamline content fetching. - Enhanced the `installFromGithub` function to support installation from GitHub contents, including subpath handling. - Implemented robust error handling and retry logic for fetching GitHub content, improving installation resilience. - Updated tests to validate the new content fetching logic and ensure correct behavior across various scenarios. This update significantly improves the GitHub installation process, making it more flexible and user-friendly.	2026-05-14 17:31:29 +08:00
lefarcen	b268bbe169	Merge origin/garnet-hemisphere (post-9e196d34) — Use Plugin handoff fix Brings in 11 new garnet commits, most importantly: - `1a90aef4` feat(plugin-use): implement plugin use handoff functionality — fixes the bug QA reported where /plugins Use Plugin would 422 silently for template plugins; new flow hands off to HomeView with the plugin pre-bound + input form prompted there. - `2ac58544` feat(plugin-inputs): enhance plugin input handling with file upload support — extends PluginInputsForm for file uploads. - `3b167b69` feat(plugins): registry protocol — new @open-design/registry-protocol workspace package (needs build before daemon boot). - Plus enhancements to plugin metadata, GitHub installer, plugin detail view, login/whoami, static HTML preview paths. Conflicts resolved: - packages/contracts/src/api/projects.ts: HEAD's skipDiscoveryBrief field + garnet's contextPlugins (@-mention plugin context refs) both kept on ProjectMetadata. - apps/landing-page/* (3 files): accepted HEAD — garnet had the older single-page landing-page header; main has the multi-page layout (/skills/, /systems/, /templates/, /craft/) with dynamic counts. Not related to the Use Plugin core fix. New @open-design/registry-protocol package must be built before daemon boots; pnpm install does this via postinstall already.	2026-05-14 16:32:35 +08:00
pftom	6614b9bf09	refactor(plugin-authoring): streamline prompt and input handling - Removed the redundant `buildPluginAuthoringPrompt` function call in `startPluginAuthoring` for cleaner code. - Introduced new functions to build prompts and inputs based on user goals, enhancing the authoring experience. - Updated `HomeView` to manage authoring inputs and prompts more effectively, ensuring better state handling. - Adjusted the `PluginImportModal` to reflect changes in the import process, removing references to template creation. - Enhanced tests to cover new input handling and prompt generation logic, ensuring reliability in the authoring flow. This update improves the clarity and efficiency of the plugin authoring process, making it more intuitive for users.	2026-05-14 16:18:34 +08:00
pftom	fbcf50382a	feat(github-installer): enhance GitHub source parsing and error handling - Updated the GitHub source regex to support more flexible parsing of repository references, including subpaths. - Introduced a new `parseGithubSource` function to handle the extraction of owner, repo, and potential subpaths, improving the robustness of the installation process. - Enhanced the `installFromGithub` function to retry fetching from multiple candidates and provide detailed error messages when installation fails. - Added tests to validate the new parsing logic and ensure correct behavior when handling various GitHub source formats. This update significantly improves the handling of GitHub sources, making the installation process more resilient and user-friendly.	2026-05-14 16:09:37 +08:00
Nagendhra Madishetti	40766ef1ba	test(web): Critique Theater Phase 13 (reducer p99 bench + surface coverage walker) (#1318 ) * feat(web): pure reducer for Critique Theater states (Phase 7.1) Pure CritiqueState reducer driven by the contracts-level PanelEvent (the same shape both the live SSE stream and the recorded transcript emit), so a single reducer powers both the in-flight panel and the rerun replay. Lifecycle covers run_started → running → (shipped / degraded / interrupted / failed), with panelist_open / dim / must_fix / close / round_end events building per-round CritiquePanelistView entries as they arrive. Defensive behaviour that surfaced while writing the spec tests: - Terminal phases (shipped / degraded / interrupted / failed) are sticky against further lifecycle events for the same run, except for parser_warning which can land late and is recorded in a side channel without changing phase. - A new run_started for a different runId at any time discards the prior state and reboots, so the UI can launch consecutive runs without an explicit reset action. - Events whose runId does not match the active run return the same state reference, so React's useReducer doesn't re-render subscribers on stray traffic. - Round bookkeeping keys by round number rather than "always last", so an out-of-order panelist_dim for round 1 arriving after a round 2 dim does not corrupt the round 2 bucket. Test coverage: 18 cases covering each transition, the runId guard, sticky-terminal behaviour, the out-of-order round invariant, and the stable-identity guarantee. Sets up Phase 7.2 and 7.3 to wire SSE + replay into the same reducer. * feat(web): useCritiqueStream hook subscribes to SSE and feeds reducer (Phase 7.2) createCritiqueEventsConnection is a pure connection manager that mirrors apps/web/src/providers/project-events.ts: opens an EventSource at /api/projects/:id/events, listens for every name in CRITIQUE_SSE_EVENT_NAMES, decodes each frame back into a PanelEvent (stripping the critique. prefix and merging the data payload), and hands it to the caller's onEvent. Reconnect uses exponential backoff (1s → 30s) and resets on `ready`; malformed payloads drop with a dev-mode warning rather than tearing the stream. useCritiqueStream wraps the manager in a useReducer that owns the CritiqueState. enabled=false or a null projectId tears down the connection cleanly; switching projectId closes the old connection and opens a fresh one. The returned dispatch lets local UI synthesise actions (e.g. an Esc keypress firing a synthetic interrupted while a kill request is in flight); production traffic comes from the SSE stream. Test coverage: - sse.test.ts (10 cases, node env): subscription set covers every CRITIQUE_SSE_EVENT_NAMES channel; payload decoding lifts the wire shape back to PanelEvent; malformed JSON is swallowed and does not stop the stream; exponential backoff schedule and ready-reset semantics are pinned with a setTimeout seam; close() cancels pending reconnects and shuts the live source; no-op fallback when EventSource is unavailable. - useCritiqueStream.test.tsx (6 cases, jsdom env): idle pre-event, reducer driven by synthetic actions, no connection when disabled or projectId is null, clean close on unmount, projectId change reopens cleanly. * feat(web): useCritiqueReplay hook drives reducer from transcript file (Phase 7.3) Fetches the per-run NDJSON transcript (one PanelEvent per line), parses every line via the shared isPanelEvent predicate, and dispatches into the same CritiqueState reducer the live SSE stream uses. A single reducer means the UI rendering a replay can be identical to the live panel, and a UI mounting both useCritiqueStream and useCritiqueReplay in parallel does not have to reconcile two state shapes. speed knob is `paused \| instant \| live \| { intervalMs: N }`. - instant flushes every event synchronously, useful for opening a finished run already at its terminal state. - intervalMs paces dispatches at a fixed cadence so the reviewer can watch the run unfold. - paused parses the transcript but holds events back until the caller advances speed (consumers can drive a scrubber later). - live is reserved for the future "playback at original cadence" feature, currently treated as instant; replay timestamps are not yet persisted with each event so honest pacing requires a follow-up Phase 7+ task. gunzip seam handles `.ndjson.gz` transcripts via DecompressionStream when present; the production fetch path picks between text and arrayBuffer based on the URL extension. Both seams are injectable so the unit tests don't need to spin up a real network or a real gzip pipeline. Test coverage (8 cases, jsdom env): - Idle status before any URL is provided. - speed=instant flushes the full transcript synchronously to shipped state. - speed={intervalMs:N} paces with the setTimeout seam, reaching done after the last tick. - speed=paused leaves status=playing with no dispatches. - Empty transcript reports done with state still idle. - Fetch rejection surfaces an error status with the message. - Malformed NDJSON lines are skipped; valid events around them still land. - .gz transcripts route through the gunzip seam. Closes the Phase 7 plan tasks 7.1 / 7.2 / 7.3 (reducer + stream + replay), all on one branch ready for review. Phases 8+ (Theater components) consume these from this PR. * fix(web): close payload-override gap + paused-resume bug in Critique Theater hooks (Phase 7 review) Two P1 fixes from lefarcen's review on PR #1307: SSE payload override `sseToPanelEvent` previously spread `data` after the channel-derived `type`, so a payload-provided `type` could override the channel and route a `critique.run_started` frame into the reducer as a `ship` action. Reversed the spread so the channel-derived `type` is authoritative, and revalidated the resulting object through the contracts-level `isPanelEvent` predicate before returning. Frames that fail validation (missing runId, empty runId, unknown type) are dropped, so a malformed or compromised SSE frame can no longer dispatch a wrong-shape action into the reducer. Three new sse.test.ts cases pin the regression: hostile `type:'ship'` in the payload still resolves to `run_started`, missing runId is dropped, empty runId is dropped. Replay pause/resume `useCritiqueReplay` had one big effect keyed on `transcriptUrl` only, so flipping `speed` from `paused` to `instant` never re-fired and the held events sat undispatched. Split into a parse effect (depends on URL, fetches and stores events in state) and a pace effect (depends on parsed-events + speed, owns the cursor + timers). The playback cursor lives in a ref that survives pause/resume cycles, so flipping `paused` -> `instant` flushes from the current position rather than restarting (which would double-dispatch `run_started` and reset the reducer). Two new useCritiqueReplay.test.tsx cases: - paused-then-instant transitions from `playing` to `done` and reaches the shipped terminal phase - intervalMs paced playback dispatches one event, pauses to drain the next scheduled timer, flips to instant, and confirms the remaining transcript drains exactly once (cursor was preserved) Doc consistency The earlier source comment in useCritiqueReplay.ts claimed `live` "paces by recorded timestamps" while the impl used zero-delay timers and the PR body said it behaves like `instant`. Aligned to reality: `live` currently behaves like `{ intervalMs: 0 }` (events drain on successive microtasks via setTimeoutFn) because transcripts do not yet carry per-event timestamps. Honest timestamp-driven pacing is queued as a Phase 7+ follow-up. Validated: pnpm guard, pnpm --filter @open-design/web typecheck, Theater suite 47/47 (up from 42, +3 sse + 2 replay), full web suite 96 files / 888 tests. * feat(i18n): seed Critique Theater key block (en + zh-CN; other locales fall back via spread) * feat(web): Theater PanelistLane component (Phase 8.1) * feat(web): Theater ScoreTicker component (Phase 8.2) * feat(web): Theater RoundDivider component (Phase 8.3) * feat(web): Theater InterruptButton component with Escape keybind (Phase 8.4) * feat(web): Theater TheaterDegraded chip (Phase 8.5) * feat(web): Theater TheaterCollapsed post-run summary (Phase 8.6) * feat(web): Theater TheaterTranscript replay surface (Phase 8.7) * feat(web): Theater TheaterStage top-level container (Phase 8.8) * feat(web): Theater CSS using existing semantic tokens (no hex literals) * feat(web): Theater public exports barrel * fix(web): resolve P2 + P3 review feedback on Phase 8 (PR #1314) Addresses all 4 P2 + 3 P3 items from codex, Siri-Ray, and lefarcen. State-lifecycle fixes (3 x P2) 1. Reducer learns a synthetic `__reset__` action (`CritiqueResetAction`). Host hooks dispatch it when their gating prop changes so a stale run from a prior project / transcript cannot bleed into the next context. Reset is idempotent on idle (returns the same reference). 2. `useCritiqueStream` dispatches `__reset__` at the top of its connection effect, so a workspace switch from project A (which streamed a critique) to project B clears the reducer before the new EventSource opens. enabled=false also clears. 3. `useCritiqueReplay` dispatches `__reset__` at the top of its parse effect, so transcriptUrl swaps (including swap-to-null after a replay reached `shipped`) lift the reducer back to idle before the new fetch starts. SSE validation (1 x P2) 4. `sseToPanelEvent` now runs a per-variant `hasValidVariantShape` check after the cheap `isPanelEvent` predicate. A `critique.ship` frame missing `composite` / `round` / `status` / `artifactRef` is rejected before reaching the reducer, so TheaterCollapsed can no longer crash on `undefined.toFixed(1)`. Every variant's required fields are validated: run_started (protocolVersion, non-empty cast, maxRounds, threshold, scale), panelist_* (round, role, plus variant-specific shape), round_end (round, composite, mustFix, decision in {continue,ship}, reason), ship (round, composite, status, artifactRef.{projectId,artifactId}, summary), degraded (reason, adapter), interrupted (bestRound, composite), failed (cause), parser_warning (kind, position). Reducer correctness (1 x P2) 5. `panelist_open` now materializes the round + an empty panelist view (`{dims: [], mustFixes: []}`) so TheaterStage can highlight the in-progress lane the instant the tag opens. Before this, a stream that emitted only `panelist_open` after `run_started` left `rounds = []` and the UI rendered no current round until a later `panelist_dim` arrived. Polish (3 x P3) 6. Brand role tint swaps from `var(--magenta, var(--accent))` to `var(--purple, var(--accent))`. `--purple` is actually defined across the design systems; `--magenta` is not, so Brand was silently falling through to `--accent` and looking identical to Designer. 7. New i18n key `critiqueTheater.interruptedSummary` for the interrupted-collapse copy ("Interrupted at round N, best composite X.X"). Previously the interrupted branch reused `shippedSummary` and the UI read "Shipped at round..." for a run that specifically did not ship. Native value in en + zh-CN; other locales fall back via `...en` spread. 8. `TheaterDegraded` heading id comes from `useId()` instead of a hardcoded `theater-degraded-heading`, so two chips rendered on the same page (chat history with multiple completed runs) keep their aria-labelledby references unambiguous. Tests (15 new cases) - reducer.test.ts (+5): __reset__ on running/terminal/idle, panelist_open materializes round, panelist_open does not stomp prior panelist data. - sse.test.ts (+6): variant-level rejection for ship without required fields, degraded without adapter, run_started with empty cast, panelist_dim with non-numeric score, round_end with unknown decision, plus a positive fully-formed ship. - useCritiqueStream.test.tsx (+2): state reset on projectId change, state reset on enabled flip false. - useCritiqueReplay.test.tsx (+1): state reset on transcriptUrl swap to null after a replay reached shipped. - TheaterCollapsed.test.tsx (text-pinning update): asserts the interrupted branch reads "Interrupted at round 1" + "best composite 7.9", and explicitly NOT "Shipped at round...". - TheaterDegraded.test.tsx (+1): two chips on the same page get unique aria-labelledby ids that each resolve to an `<h3>`. Validated - pnpm guard clean - pnpm --filter @open-design/web typecheck clean - Theater suite: 13 files, 101 tests (was 86 on the first Phase 8 push, +15 new) - tests/i18n/locales.test.ts 5 of 5 across 18 locales * feat(web): CritiqueTheaterMount wires SSE + reducer into a single drop-in (Phase 9.1) * feat(i18n): Critique Theater strings for de + ja + ko + zh-TW (Phase 9.2) * fix(web): resolve P1 + P2 review feedback on Phase 9 (PR #1315) Addresses every blocker from codex, Siri-Ray, and lefarcen. The three state-lifecycle and SSE-validation issues they also flagged inherit fixes from PR #1314's review pass that this branch now sits on top of after rebase. Real daemon kill on Interrupt (P1) - CritiqueTheaterMount now POSTs to /api/projects/:id/critique/:runId/interrupt alongside the optimistic local dispatch. Before this fix, clicking Interrupt only flipped the React state to interrupted while the daemon job kept running. The fetch is best-effort: a 404 (endpoint not wired yet, lands in Phase 15) is swallowed with a dev-mode console.warn so the UI still moves to the collapsed badge. - New fetchInterrupt test seam lets RTL assert on the URL / method and simulate the "daemon not ready yet" path. Two tests pin both: the happy URL proj-42/critique/run-abc/interrupt POSTs, and a rejected fetch still flips the UI. interruptPending reset on new run (P2) - A ref-backed effect compares the current runId against the last one we saw; when it changes, interruptPending is cleared. A user who interrupts run-1 and then triggers run-2 from the same mount now gets a fresh, enabled kill button instead of one stuck in "Interrupting…". Pinned by a new mount test. Escape keybind scope (P2) - InterruptButton now checks the keydown target. Escape inside an input, textarea, select, or contenteditable element is ignored (and any ancestor of those via closest() is treated the same way). Body-level focus still fires the keybind so the Theater area's affordance keeps working. Four new tests cover textarea, input, contenteditable, and the body-focus positive case. userFacingName i18n key (P2) - The spec at specs/current/critique-theater.md:6 mandates a single critiqueTheater.userFacingName key so the "Design Jury" label can be renamed without touching code. Phase 8 introduced critiqueTheater.title by mistake; renamed across types.ts, en.ts, zh-CN.ts, de.ts, ja.ts, ko.ts, zh-TW.ts, and the lone consumer TheaterStage.tsx. The locale alignment test stays green. Validated - pnpm guard clean - pnpm --filter @open-design/web typecheck clean - Theater suite: 14 files, 112 tests (was 101 before, +11 new for the Phase 9 review pass: 3 mount + 4 InterruptButton focus scope; the rest were already in #1314's review fix). - tests/i18n/locales.test.ts 5 of 5 across 18 locales. * feat(daemon): adapter-degraded registry with TTL (Phase 10.1) In-memory registry recording adapters that produced malformed or oversize transcripts so the orchestrator can skip them for a TTL window (default 24h) instead of cycling through known-bad providers on every run. Records carry reason (malformed_block \| oversize_block \| missing_artifact), source label, and expiresAt. The test-only clock seam lets the suite advance time deterministically and prove that an expired entry stops counting as degraded without anyone calling clearDegraded. 7/7 vitest cases green. * feat(daemon): synthetic good + bad adapter fixtures (Phase 10.2) Two test-only adapters that read the existing v1 transcript fixtures (happy-3-rounds and malformed-unbalanced) and replay them as either a full string or a 512-byte chunked stream. The chunked form is what the conformance harness uses to prove the parser holds together when the transcript arrives in arbitrary network slices, not as one buffered blob. * feat(daemon): adapter conformance harness (Phase 10.3) runAdapterConformance pulls a transcript through the same parseCritiqueStream pipeline the orchestrator uses and classifies the outcome as shipped, degraded, or failed. On a degraded outcome it forwards the matched reason to the adapter-degraded registry, so a single nightly conformance run is what populates the skip list rather than the orchestrator learning each adapter is broken at request time. 5/5 vitest cases green covering shipped, malformed degraded, oversize degraded, no-ship failure, and the harness-thrown failure path. * test(e2e): Critique Theater Playwright suite (Phase 11) Six tests, one viewport per visual case, deterministic SSE fixtures stubbed via page.route(). Adds the suite to test:ui:extended so the existing extended-UI lane picks it up. Coverage: 1. Happy path: a single mounted theater plays the full fixture (1 run_started, 5 panelists open / dim / must_fix / close, 1 round_end, 1 ship) and ends on the score badge. 2. Interrupt mid-run: the panelist that is open at the time the interrupt button is clicked closes with an interrupted marker and the transcript freezes there. 3. Visual regression at 375x720 mobile. 4. Visual regression at 768x1024 tablet. 5. Visual regression at 1280x800 desktop. 6. A11y role tree: the theater region exposes a labelled landmark, each panelist lane is a group with an accessible name, the score is a status live region. All SSE traffic is stubbed by page.route so the suite runs in CI without a daemon. The toggle is seeded via localStorage by bootAppWithCritiqueEnabled so the gate behaves as if Settings flipped it on. typecheck clean; playwright --list reports 6. * test(web): reducer p99 bench at 10k iterations (Phase 13.1) Locks the documented 2ms budget for the Critique Theater reducer on a representative SSE script (27 actions, one full happy run) behind a regression gate. Asserts p99 stays under 4ms (2x the documented budget) so CI runners with a noisy neighbour do not flake while a real regression to 20ms or 200ms still trips. The bench is a vitest case rather than a bare microbenchmark so it runs in the same CI lane as every other web test and does not need a parallel runner. * test(web): critique surface coverage walker (Phase 13.2) Walks the public critique surface (11 SSE event names, 5 panelist roles, 6 lifecycle phases, 9 named i18n keys) and asserts each named symbol appears in both the src corpus and the test corpus. The walker is the gate that catches a rename in one half of the codebase without a matching update in the other half: a future PR that drops 'panelist_must_fix' from the reducer without also removing its test reference fails this suite. 62 assertions, one per symbol per corpus. * fix(web): tighten Phase 13 gates from lefarcen review (PR #1318) Address the actionable items from lefarcen's review of the two Phase 13 CI gates. The two questions about longer-term DX (pre- commit hook to auto-update the symbol table, AST-walker swap) are documented as deferred follow-ups rather than landed here. reducer-bench: - Describe renamed to 'reducer p99 regression gate (Phase 13.1)' so it reads as a gate, not a comparative benchmark. - Failure message now carries the full distribution (p50 / p90 / p99 / max + ceiling), so triage on a tripped gate can distinguish a real 20ms regression from a 4.001ms CI hiccup without re-running locally (lefarcen Q3). - Captured a baseline (p50=0.011ms p90=0.013ms p99=0.018ms max=0.244ms on a local Node 24 / Win11 run, 2026-05-11) inside the docblock so reviewers can see the actual reading sits ~222x below the 4ms ceiling (lefarcen Q1). - Replaced 'role as any' casts with PanelistRole-typed casts so the fixture is typecheck-strict. - Phase numbering corrected (13.2 → 13.1 to match the PR body). critique-coverage: - Symbols now grouped under four describe blocks (SSE events / panelist roles / lifecycle phases / i18n keys) so a failure points at the category that drifted at a glance (lefarcen nit). - Docblock now explains the grep-over-AST trade-off (the bug class is structural at the string level, not at the AST level) and points at the future AST-walker work as a deferred follow- up (lefarcen Q2). - Docblock now walks a contributor through the four-step maintenance flow (add to contract → add caller → add test → add literal here), so the next person to add an SSE event or i18n key knows the gate exists and what to update (lefarcen Q4). - Phase strings switched from 'phase: <name>' to bare-quoted literals so the walker is robust against single vs double quotes and ':' vs '===' source-shape changes. - Dead try/catch around 'stack = [root]' removed (cannot throw). - Per-symbol failure messages name the symbol AND which corpus is missing it, so the gate is self-describing on the next CI red. - Phase numbering corrected (13.4 → 13.2 to match the PR body). 63 / 63 vitest cases green (1 bench + 62 coverage). Web typecheck clean. * fix(web): tighten coverage walker semantics from lefarcen P2/P3 (PR #1318) Two follow-on findings on commit `338a185`: P2 — coverage gate weakened. The previous revision used one helper `corpusReferences` for both SRC and TEST corpora, and that helper accepted the unprefixed PanelEvent type form (`type: 'panelist_must_fix'`) as a substitute for the prefixed SSE wire name (`critique.panelist_must_fix`). The fallback is correct on the TEST side (reducer tests dispatch PanelEvent literals) but it weakened the SRC side: production code could drop the SSE channel name silently and the PanelEvent type alias would keep the walker green. Split into two helpers: `srcReferences` is strict (exact substring match only, no fallback) and `testReferences` keeps the lenient fallback for SSE events. The production-side assertions now route through `srcReferences` so the wire name is load-bearing again. P3 — maintenance doc overclaimed. The previous revision said 'CI red if you forget step 4' but the symbol arrays are partially hand- maintained, so a contributor adding a NEW phase string or i18n key without updating the array leaves CI green (the walker never knew to look). Rewrote the failure-mode section to distinguish the two cases: - Renaming an EXISTING symbol without updating the walker → CI red (existing assertion fails because the old name is gone). - Adding a NEW hand-maintained symbol without updating the walker → CI stays green (walker does not know to look for it). Also clarified that `SSE_EVENTS` and `PANELIST_ROLE_STRINGS` are auto-built from contracts so step 4 is one-line for `PHASE_STRINGS` and `I18N_KEYS` only. 63 / 63 vitest cases still green. * fix(web): close two P2 findings on PR #1318 (Siri-Ray + lefarcen) P2 (coverage walker counted self as evidence). The walker walked apps/web/tests, which contains apps/web/tests/components/Theater/ critique-coverage.test.ts itself. The hand-maintained PHASE_STRINGS and I18N_KEYS literals inside that file would satisfy the test-side coverage assertion against themselves, so a real Theater test that covers a symbol could be deleted and the gate would still pass. Excluded the walker file from TEST_FILES via path.resolve(__filename) filter so the test corpus only contains independent evidence. Once the walker stopped seeing itself, the gate correctly red-flagged nine i18n keys that no INDEPENDENT test exercises: critiqueTheater.userFacingName, roundLabel, composite, threshold, interrupt, interrupted, degradedHeading, shippedSummary, interruptedSummary. Component tests like TheaterCollapsed.test.tsx exercise the rendered text but never mention the key STRING, so the walker couldn't see them. Closed that gap by adding apps/web/tests/components/Theater/critique-i18n-keys.test.ts: 9 cases, one per watched key, asserting the dictionary entry exists as a non-empty string. That's both real coverage (catches a stale dict) and the independent evidence the walker requires. P2 (interruptedSummary missing from de/ja/ko/zh-TW). The native locale overrides were missing the key, so an interrupted run on a German / Japanese / Korean / Traditional Chinese UI silently fell back to the English string via the ...en spread. Added the key with {round} and {composite} placeholders preserved, using PerishCode's suggested copy from the earlier review thread. Verified: - pnpm --filter @open-design/web typecheck clean. - pnpm exec vitest run tests/components/Theater tests/i18n: 20 files / 190 tests green (critique-coverage 62 / 62, critique-i18n-keys 9 / 9 new, reducer-bench 1 / 1, locales 5 / 5). * fix(web): drop the Dict cast in i18n key coverage test (lefarcen P1 / Siri-Ray on PR #1318) The previous revision used `(en as Record<string, string>)[key]` to read each watched key. Dict has no string index signature, so CI's strict typecheck rejected the broad cast with TS2352 even though the runtime assertion was fine. Replaced with the typed pattern lefarcen suggested: type WATCHED_KEYS as `readonly (keyof typeof en)[]` and read `en[key]` directly. That removes the cast and also strengthens the test, because a renamed or removed key now fails the type check immediately rather than at runtime. Verified: - pnpm --filter @open-design/web typecheck clean. - pnpm --filter @open-design/web exec vitest run tests/components/Theater/critique-i18n-keys.test.ts: 9 / 9 green. * fix(web): tighten isPanelEvent in contracts so enum + numeric fields are checked end-to-end (Siri-Ray round-3 P1 on PR #1314) The variant validator on the web SSE path previously accepted any `typeof === 'string'` for closed-enum fields (ship.status, panelist_.role, degraded.reason, failed.cause, parser_warning.kind, run_started.cast[]) and any `typeof === 'number'` for numeric fields, which let NaN / Infinity through. Downstream components index i18n tables by enum value, so an unknown status or role would land `SHIP_BADGE_KEY[final.status]` on undefined and crash the translator. The replay parser had a separate gap: `useCritiqueReplay.parseTranscript` called the cheap `isPanelEvent` header check directly, so a recorded line like `{"type":"ship","runId":"r"}` reached the reducer with composite, status, round, artifactRef, summary all undefined and TheaterCollapsed then called `final.composite.toFixed(1)` on undefined. Resolution: move all wire-side validation into the contract guard. - Export const arrays for the closed enums: SHIP_STATUSES, DEGRADED_REASONS, FAILED_CAUSES, PARSER_WARNING_KINDS, ROUND_DECISIONS (PANELIST_ROLES already existed). - Rewrite `isPanelEvent` in packages/contracts/src/critique.ts to be the single deep validator: header (known type + non-empty runId) plus every variant-specific required field plus closed-enum membership plus Number.isFinite on every numeric field. Documented as the wire source of truth. - Drop the local `hasValidVariantShape` from web/sse.ts; sseToPanelEvent now relies entirely on the contract guard, and parseTranscript in useCritiqueReplay (which already uses isPanelEvent) gets the deeper validation for free. Tests (TDD, red-first): - packages/contracts/tests/critique.test.ts: 13 new cases pinning the strict guard directly (well-formed across every variant, every rejection path: unknown type, empty/non-string runId, unknown enum, non-finite numeric, missing variant field). - apps/web/tests/components/Theater/state/sse.test.ts: 9 new cases for each closed-enum rejection on the wire path plus a positive sweep across every legal enum value across every variant. - apps/web/tests/components/Theater/hooks/useCritiqueReplay.test.tsx: 2 new cases for incomplete and unknown-enum transcript lines. Verified: - pnpm --filter @open-design/contracts test 4 files / 30 tests green. - pnpm --filter @open-design/contracts build clean. - pnpm --filter @open-design/web typecheck clean. - pnpm --filter @open-design/web test 107 files / 976 tests green. fix(contracts): enforce numeric domains in isPanelEvent (lefarcen P2 on PR #1314 round 4) The strict guard from PR #1314 round 3 enforced enum membership and Number.isFinite, but accepted any finite number where the contract intends a specific domain: scale: 0 (ScoreTicker divides by it), negative thresholds, fractional rounds, negative mustFix, etc. ScoreTicker.tsx writes `var(--scale, ${state.scale})` into inline CSS and divides by it for tick width, so a guard-passing scale: 0 shipped Infinity into the rendered style. Negative composite / score values reached downstream code that assumes >= 0. Resolution: mirror the daemon-side Zod domain constraints in the runtime guard. Three new helpers in packages/contracts/src/critique.ts: - isPositiveInt(v): integer with v > 0. Used for round, maxRounds, scale, protocolVersion (all 1-indexed in the orchestrator). - isNonNegativeInt(v): integer with v >= 0. Used for mustFix, position, bestRound. bestRound: 0 is the valid sentinel for 'interrupted before any round closed'. - isNonNegativeFinite(v): finite number with v >= 0. Used for composite, score, dimScore, threshold. Threshold may be fractional (e.g. 8.5 on a scale of 10). Cross-field check inside run_started: threshold <= scale (the daemon Zod schema enforces this with an epsilon refine, the wire guard matches the same intent). Tests (TDD, red-first) added in packages/contracts/tests/critique.test.ts: - 22 new rejection cases across every numeric field that previously slipped through: scale: 0, negative scale, fractional scale, maxRounds: 0, fractional maxRounds, protocolVersion: 0, fractional protocolVersion, negative threshold, threshold > scale, round: 0, fractional round, negative dimScore / score, negative / fractional mustFix, negative composite, ship round: 0, negative / fractional bestRound, negative interrupted composite, negative / fractional parser_warning position. - 3 positive boundary cases that must still pass: threshold == scale, fractional threshold within [0, scale], interrupted with bestRound: 0 (no round completed before interrupt), parser_warning with position: 0 (start of stream). Verified: - pnpm --filter @open-design/contracts build clean. - pnpm --filter @open-design/contracts test: 4 files / 59 tests green (was 37 before the new domain cases). - pnpm --filter @open-design/web typecheck clean. - pnpm --filter @open-design/web test: 110 files / 1004 tests green; no regression on Theater suite, sse validator, replay parser, or assistant-feedback widget tests. * fix(web): restore wait-for-daemon-ack pattern on Theater interrupt Same regression as flagged on PR #1316 post-main-merge: the optimistic local dispatch fired before the POST resolved, so a daemon 404 / 409 still terminalized the UI and the real SSE terminal event got ignored by the sticky interrupted phase. Snapshot runId / bestRound / composite at click time, dispatch interrupted only on res.ok, clear interruptPending on rejection or non-2xx so the user can retry. Tests cover rejection + 404 leaving the run on the live stage; the 204 path waits for the ack. * test(e2e): move critique-coverage walker from apps/web/tests to e2e/tests (Siri-Ray P2) The walker is by definition a cross-app consistency check: it reads the web reducer, the daemon critique module, the contracts package, and the e2e UI suite. Hosting it under apps/web/tests/ violated the repo boundary rule (root AGENTS.md): app packages must not import another app's private src/ or tests/ as a shared helper, and cross-app consistency checks belong in e2e/tests/. The web test lane was effectively coupled to daemon and e2e file layout, so a daemon-only refactor could break the web lane. Moved the file to e2e/tests/critique-coverage.test.ts and switched the contracts import to the import.meta.glob shape the e2e package already uses (see localized-content.test.ts), so the e2e package does not have to add @open-design/contracts as a workspace dep just to load two const arrays. REPO_ROOT and SELF_PATH recalculated for the new location. Web test lane no longer depends on daemon, contracts, or e2e layout. The e2e walker covers the same 62 assertions as before: e2e/tests/critique-coverage.test.ts 62 / 62 green Web typecheck clean, e2e typecheck clean. * fix(test): add projectKind prop to FileViewer deck render after v0.7.0 merge --------- Co-authored-by: Nagendhra <nagendhra405@gmail.com>	2026-05-14 15:55:36 +08:00
pftom	2ac5854432	feat(plugin-inputs): enhance plugin input handling with file upload support - Added support for file input fields in the PluginInputsForm, allowing users to upload files with serializable metadata. - Updated the HomeHero component to improve the layout and interaction of input fields, enhancing user experience. - Adjusted CSS styles for better visual representation of input fields and their states. - Modified HomeView to reflect changes in authoring chip IDs for better clarity in plugin actions. - Enhanced tests to cover new file input functionality and ensure correct behavior in various scenarios. This update significantly improves the plugin input handling, enabling users to upload files seamlessly and enhancing the overall interaction model.	2026-05-14 15:52:21 +08:00
pftom	1a90aef4a2	feat(plugin-use): implement plugin use handoff functionality - Added support for using installed plugins directly from the PluginsView, allowing users to initiate plugin actions seamlessly. - Enhanced the HomeView to handle plugin use handoffs, managing state and user interactions effectively. - Introduced new types and functions to facilitate the creation and processing of plugin use handoffs, improving the overall user experience. - Updated tests to cover the new plugin use functionality, ensuring reliability and correctness in the application flow. This update significantly enhances the interaction model for plugins, enabling users to utilize plugins more intuitively within the application.	2026-05-14 15:40:42 +08:00
pftom	9ea33e076b	feat(context-plugins): add support for context plugins in project metadata and UI - Introduced a new `contextPlugins` field in the `ProjectMetadata` type to accommodate plugins selected via `@` mentions, allowing for additive context in project creation. - Updated the `HomeHero` and `EntryShell` components to handle and display context plugins, enhancing user interaction with selected plugins. - Implemented rendering logic for context plugins in the metadata block, providing clear visibility of selected plugins and their descriptions. - Enhanced the UI to support the removal of context plugins and display additional details on hover, improving the overall user experience. This update significantly enriches the project creation process by allowing users to incorporate multiple context plugins seamlessly.	2026-05-14 15:29:49 +08:00
lefarcen	6c16283850	Merge origin/main (post-7c8305f4) into reconcile branch Brings in 10 new main commits: routine deep-link to specific conversations (#1508), Windows resource cache fix for Orbit templates, collapsible comment side panel (#1607), routines project radio polish, Copilot logo swap, and minor UI fixes. Conflicts resolved: - router.ts: garnet's home/view + marketplace routes + main's per-project conversationId deep-link field coexist on Route union - ProjectView.tsx: garnet's isPhantomDaemonRunMessage helper + main's isStoppableAssistantMessage helper both kept - ProjectView.run-cleanup.test.tsx: accepted HEAD (garnet's phantom-row regression test); main's three new tests for finalizeActiveAssistantMessagesOnStop / clearStreamingConversationMarker / shouldClearActiveRunRefs are queued as a follow-up TODO inline.	2026-05-14 15:13:38 +08:00
shangxinyu1	2976c76fc3	test: expand Memory and Routines coverage (#1521 ) * test: expand settings and packaged coverage * test: extend memory settings coverage * test: cover routine settings failure states * test: cover routine operation failures * test: fix daemon test typing on CI * test: decouple packaged smoke from orbit bug * test: avoid live memory LLM calls in route tests * test: fix daemon fetch typing in CI * fix: restore preview comment and inspect toggles * test: align manual edit flow with current inspector UX * test: align comment attachment flow with current preview comments UI * fix: probe resolved Codex launch path during detection * fix: remove duplicate board activation helper after rebase * test: update ghost cli detection mock * test: align FileViewer toolbar expectation * ci: move full app tests to extended lane * ci: run app tests by changed scope * ci: cover shared app inputs in test scopes * ci: avoid setup-node cache in windows packaged smoke * test: align extended settings and manual edit flows	2026-05-14 14:48:40 +08:00
Nagendhra Madishetti	5cb0508790	fix(web): deep-link Routines history rows to their specific conversation (Fixes #1505 ) (#1508 )	2026-05-14 14:27:34 +08:00
soulme	2a8ebff11a	feat(web): add collapsible comment side panel (#1607 )	2026-05-14 14:27:09 +08:00

1 2 3 4 5 ...

439 commits