open-design/apps/daemon/tests/mcp-tokens.test.ts
Tom Huang d592f6087f
feat(mcp): external MCP client with daemon-managed OAuth and 39 design-focused templates (#898)
* feat(mcp): add external MCP client with daemon-managed OAuth and 17 design-focused templates

Open Design now acts as an MCP CLIENT and surfaces tools from third-party
MCP servers to the underlying agent (Claude Code, Hermes, Kimi).

Daemon
- New mcp-config / mcp-oauth / mcp-tokens modules: persist server entries
  to .od/mcp-config.json, run the OAuth dance for HTTP/SSE servers
  end-to-end on the daemon (so cloud deployments work and tokens
  survive across turns), and inject Authorization: Bearer headers into
  the per-spawn .mcp.json the daemon writes for Claude Code (or the
  ACP mcpServers map for Hermes/Kimi).
- /api/mcp/servers and /api/mcp/oauth/{start,status,disconnect}
  endpoints, plus spawn-time wiring in agents that hands the configured
  servers to the active agent CLI.
- System-prompt directive for connected external MCPs so the model
  does not chase Claude Code's synthetic *_authenticate /
  *_complete_authentication tools when the Bearer is already pinned.

Web
- Settings -> External MCP servers panel with per-row OAuth Connect /
  Disconnect / Refresh affordances and per-row template hints.
- New "Add server" picker categorized into 7 groups
  (image-generation, image-editing, web-capture, ui-components,
  data-viz, publishing, utilities) with a search box, sticky close
  button, collapsible <details> sections (auto-expand on search),
  60vh capped scroll region, and a pinned Custom-server footer.
- ChatComposer /mcp slash and MCP picker button forward to the new
  Settings tab; AssistantMessage renders MCP tool calls inline;
  markdown autolinker handles bare http(s) URLs (incl. OAuth links)
  before italic markers so OAuth callback URLs do not get
  italic-fragmented mid-token.

Contracts
- packages/contracts/src/api/mcp.ts owns the wire shapes
  (McpServerConfig, McpTemplate with stable McpTemplateCategory
  enum, McpServersResponse, OAuth start/status/disconnect bodies, the
  postMessage payload from the OAuth callback).

Templates (17 built-in)
- image-generation: Higgsfield (OpenClaw, OAuth HTTP), Pollinations,
  Allyson (animated SVG), AWS Bedrock Image (uvx).
- image-editing: Imagician, ImageSorcery.
- web-capture: just-every screenshot-website-fast, ScreenshotOne.
- ui-components: 21st.dev Magic, shadcn/ui, FlyonUI.
- data-viz: AntV Chart, Mermaid.
- publishing: EdgeOne Pages.
- utilities: Filesystem, GitHub, Fetch.

Tests
- apps/daemon/tests/mcp-{config,oauth,tokens,spawn}.test.ts cover
  storage round-trip, OAuth helpers, token persistence, spawn-time
  wiring, every template's transport / command / args / env-field
  invariants, and the canonical category enum.
- apps/web/tests/runtime/markdown.test.tsx covers the new autolinker
  ordering rules.

Co-authored-by: Cursor <cursoragent@cursor.com>

* feat(mcp): add 21 more design-focused templates and a `design-systems` category

Expands the built-in MCP picker from 17 to 38 templates so users can compose
the full Open Design craft loop (design-system intake → generate → edit →
audit → publish) without leaving the Settings dialog. Every install spec is
verified live against the upstream README; templates that needed Go binaries,
multi-step `init` ceremonies, or massive runtime stacks (PostgreSQL + Redis
+ Ollama) are intentionally deferred so picking a template still resolves to
a working server in one click.

New `design-systems` category between `web-capture` and `ui-components`
(reflects the upstream-of-components position in the workflow). Mirrored in
`McpTemplateCategory` on both contracts and daemon, and `CATEGORY_ORDER` on
the web side.

New templates by category:

- image-generation (+4): prompt-to-asset (icons / favicons / OG / logos with
  free-tier routing across Cloudflare AI / NVIDIA NIM / HF / Stable Horde),
  Nano Banana (hosted streamable HTTP, virtual try-on + product placement),
  Seedream (hosted streamable HTTP, ByteDance Seedream v3-v5 + SeedEdit),
  fal.ai (uvx, 600+ models incl. FLUX / Kling / Hunyuan / MusicGen).
- image-editing (+3): Photopea (34 layered-editor tools — closes the PSD
  gap), Topaz Labs (AI upscale / denoise / sharpen), Transloadit (86+ media
  pipeline robots).
- web-capture (+1): Pagecast (browser → demo GIF / MP4 with auto-zoom).
- design-systems (+4, NEW category): Figma-Context (Framelink, designs →
  code), Design Token Bridge (Tailwind ⇄ CSS ⇄ Figma ⇄ M3 / SwiftUI / W3C
  DTCG + WCAG contrast), Design System Extractor (Storybook scrape),
  Aesthetics Wiki (cottagecore / dark-academia / y2k / … moodboards).
- data-viz (+2): MCP Dashboards (45+ chart types + KPI dashboards),
  Excalidraw Architect (hand-drawn architecture diagrams).
- publishing (+6): PageDrop, PDFSpark, OGForge, QRMint, Slideshot
  (HTML → PDF / PPTX / PNG with 7 themes), Deckrun (Markdown → PDF / video,
  hosted free tier with no key required).
- utilities (+1): A11y axe-core (WCAG 2.0/2.1/2.2 + color-contrast + ARIA).

Tests cover every new template's wiring (command, args, env / header
required-vs-optional, secret flag), the category enum invariant, and
in-category declaration order for image-generation, design-systems and
publishing buckets where the order is what users see in the picker. 21 new
test cases pass; full mcp-config suite is green.

Templates intentionally deferred (documented in PR body): figma-use
(needs Figma desktop with --remote-debugging-port=9222), m-moire (multi-step
`memi suite init` + daemon ceremony), gemini-media-mcp + trident-mcp (Go
binaries — no npx / uvx path), Pixelle-MCP (full app with web UI + ComfyUI
backend), storybook-addon-mcp (lives inside user's Storybook, not standalone),
primitiv (multi-step init / build / serve), ReftrixMCP (PostgreSQL + Redis +
Ollama + DINOv2), narasimhaponnada/mermaid (overlap with peng-shawn).

Co-authored-by: Cursor <cursoragent@cursor.com>

* feat(mcp): add figma-use template (write designs from chat) under design-systems

figma-use is the natural counterpart to Figma-Context already in this PR:
where Framelink reads Figma designs into the model, figma-use writes back
into the canvas (90+ tools — create frames / text / components / variants,
render JSX into Figma, export PNG/SVG, query nodes via XPath, lint for
WCAG / auto-layout / hardcoded colors, analyze design systems).

Wired as an HTTP MCP template (`http://localhost:38451/mcp`) because
`figma-use mcp serve` only exposes HTTP — there's no stdio mode in the
upstream `serve.ts`. No API key. Two prerequisites the user owns are
spelled out in the description so picking the template still resolves to
a working server: (1) start Figma with `--remote-debugging-port=9222`
(or `figma-use daemon start --pipe` on Figma 126+), and (2) leave
`npx figma-use mcp serve` running in a terminal.

Inserted between `design-system-extractor` and `aesthetics-wiki` so the
design-systems category reads as a workflow: read existing design (Figma
Context) → translate tokens (Token Bridge) → extract from Storybook
(Extractor) → write back to Figma (figma-use) → break creative block
(Aesthetics Wiki).

Tests cover the new template's transport (`http`), endpoint URL, the
empty header-fields invariant (no auth required), and bump the
design-systems group order to include it.

Co-authored-by: Cursor <cursoragent@cursor.com>

* feat(settings): i18n the External MCP / MCP server / Connectors sidebar entries and make the dialog header track the active section

The External MCP sidebar entry this PR introduces was hardcoded English
("External MCP / Add MCP tools (Higgsfield, GitHub…)"). Same for the
adjacent Connectors and MCP server entries. The dialog header was also
pinned to "Execution & model" copy, so opening Settings → External MCP
showed a header that lied about which section the user was on.

Adds six translation keys — `settings.connectorsTitle/Hint`,
`settings.mcpServerTitle/Hint`, `settings.externalMcpTitle/Hint` — and
translates them across all 17 locales (ar, de, en, es-ES, fa, fr, hu, id,
ja, ko, pl, pt-BR, ru, tr, uk, zh-CN, zh-TW).

`SettingsDialog` now derives the header title/subtitle from the active
section (11 sections total) instead of a single hardcoded pair, so each
section renders an honest header.

Co-authored-by: Cursor <cursoragent@cursor.com>

* test(e2e): pin level: 3 on dialog heading lookups for Pets and Connectors

CI's Validate workspace job (#1479) failed two Playwright cases with the
strict-mode violation:

  getByRole('dialog').getByRole('heading', { name: 'Pets' })
  resolved to 2 elements:
    1) <h2>Pets</h2>
    2) <h3>Pets</h3>

Same root cause as the unit-test fix already in this PR: the dynamic
dialog `<h2>` now echoes the section's own `<h3>` because the dialog
header tracks the active section. Disambiguate to `level: 3` so each
assertion still pins the section heading specifically (which is what
the test intends to verify).

Audit of the rest of e2e/ for `dialog.getByRole('heading', ...)` —
settings-api-protocol.test.ts looks for "OpenAI API" / "Anthropic API"
section h3s which never appear in the dialog `<h2>` (always
"Execution & model"), so those stay safe.

Co-authored-by: Cursor <cursoragent@cursor.com>

* fix(mcp): bind OAuth refresh to the issuing client and skip stale tokens

Persist the OAuth client context (token endpoint, client_id, client_secret,
issuer, redirect_uri, resource) alongside the bearer token so refresh hits
the same client the refresh_token was bound to (RFC 6749 §6). The previous
refresh path re-ran beginAuth with a dummy OOB redirect URI, which kept
getOrRegisterClient from finding the original DCR client and made
providers reject the refresh on the next chat turn. Refreshes now reuse
the persisted endpoint/client pair directly.

Also stop injecting expired access tokens at spawn time when refresh is
unavailable or fails. Pinning a stale Bearer made every Claude MCP call
401 while the prompt still treated the server as connected; on that path
we now skip the entry and let the UI surface a reconnect.

Generated-By: looper 0.6.1 (runner=fixer, agent=claude-code)

---------

Co-authored-by: Cursor <cursoragent@cursor.com>
2026-05-08 17:59:20 +08:00

212 lines
6.8 KiB
TypeScript

import { mkdtemp, readFile, rm, stat, writeFile } from 'node:fs/promises';
import { tmpdir } from 'node:os';
import path from 'node:path';
import { afterEach, beforeEach, describe, expect, it } from 'vitest';
import {
clearToken,
getToken,
isTokenExpired,
readAllTokens,
readTokensFile,
sanitizeTokensFile,
setToken,
type StoredMcpToken,
} from '../src/mcp-tokens.js';
describe('mcp-tokens storage', () => {
let dataDir: string;
beforeEach(async () => {
dataDir = await mkdtemp(path.join(tmpdir(), 'od-mcptokens-'));
});
afterEach(async () => {
await rm(dataDir, { recursive: true, force: true });
});
it('returns no servers when the file is missing', async () => {
expect(await readTokensFile(dataDir)).toEqual({ servers: {} });
expect(await getToken(dataDir, 'higgsfield')).toBeNull();
});
it('persists, re-reads, and overwrites a token', async () => {
const tok: StoredMcpToken = {
accessToken: 'access-1',
refreshToken: 'refresh-1',
tokenType: 'Bearer',
scope: 'openid email',
expiresAt: Date.now() + 60_000,
savedAt: Date.now(),
};
await setToken(dataDir, 'higgsfield', tok);
const got = await getToken(dataDir, 'higgsfield');
expect(got?.accessToken).toBe('access-1');
expect(got?.refreshToken).toBe('refresh-1');
expect(got?.scope).toBe('openid email');
// Overwrite with a rotated token.
await setToken(dataDir, 'higgsfield', {
...tok,
accessToken: 'access-2',
refreshToken: 'refresh-2',
});
const got2 = await getToken(dataDir, 'higgsfield');
expect(got2?.accessToken).toBe('access-2');
expect(got2?.refreshToken).toBe('refresh-2');
});
it('isolates tokens per server', async () => {
await setToken(dataDir, 'github', {
accessToken: 'gh-token',
tokenType: 'Bearer',
savedAt: Date.now(),
});
await setToken(dataDir, 'higgsfield', {
accessToken: 'hg-token',
tokenType: 'Bearer',
savedAt: Date.now(),
});
const all = await readAllTokens(dataDir);
expect(Object.keys(all).sort()).toEqual(['github', 'higgsfield']);
expect(all.github?.accessToken).toBe('gh-token');
expect(all.higgsfield?.accessToken).toBe('hg-token');
});
it('clearToken removes only the requested entry', async () => {
await setToken(dataDir, 'github', {
accessToken: 'gh-token',
tokenType: 'Bearer',
savedAt: Date.now(),
});
await setToken(dataDir, 'higgsfield', {
accessToken: 'hg-token',
tokenType: 'Bearer',
savedAt: Date.now(),
});
await clearToken(dataDir, 'github');
expect(await getToken(dataDir, 'github')).toBeNull();
expect(await getToken(dataDir, 'higgsfield')).not.toBeNull();
});
it('clearToken on an absent server is a no-op', async () => {
await expect(clearToken(dataDir, 'never')).resolves.toBeUndefined();
});
it('writes the tokens file with mode 0600 on POSIX', async () => {
if (process.platform === 'win32') return; // mode bits are advisory on win32
await setToken(dataDir, 'higgsfield', {
accessToken: 'tok',
tokenType: 'Bearer',
savedAt: Date.now(),
});
const s = await stat(path.join(dataDir, 'mcp-tokens.json'));
// Mask off file-type bits — only the permission bits are interesting.
expect(s.mode & 0o777).toBe(0o600);
});
it('survives a corrupt tokens file by returning empty', async () => {
await writeFile(path.join(dataDir, 'mcp-tokens.json'), '{not valid');
const all = await readAllTokens(dataDir);
expect(all).toEqual({});
});
it('sanitizes incoming JSON, dropping malformed entries', () => {
const out = sanitizeTokensFile({
servers: {
good: {
accessToken: ' abc ',
tokenType: 'Bearer',
savedAt: 123,
scope: 'a b c',
},
bad_no_token: { tokenType: 'Bearer', savedAt: 123 },
bad_shape: 'not an object',
__proto__: { accessToken: 'evil', tokenType: 'Bearer', savedAt: 1 },
},
});
expect(out.servers.good?.accessToken).toBe('abc');
expect(out.servers.bad_no_token).toBeUndefined();
expect(out.servers.bad_shape).toBeUndefined();
// __proto__ is reserved — sanitizer must NOT have set it as a real
// own property (which would be a prototype-pollution vector). The
// implicit access via `.__proto__` returns Object.prototype, so we
// check Object.hasOwn instead of toBeUndefined.
expect(Object.hasOwn(out.servers, '__proto__')).toBe(false);
expect(Object.keys(out.servers)).toEqual(['good']);
});
it('round-trips JSON disk format', async () => {
await setToken(dataDir, 'higgsfield', {
accessToken: 'tok',
tokenType: 'Bearer',
savedAt: 1234,
});
const raw = await readFile(path.join(dataDir, 'mcp-tokens.json'), 'utf8');
const parsed = JSON.parse(raw);
expect(parsed.servers.higgsfield.accessToken).toBe('tok');
});
it('persists OAuth client context fields with the token', async () => {
const tok: StoredMcpToken = {
accessToken: 'access-1',
refreshToken: 'refresh-1',
tokenType: 'Bearer',
savedAt: Date.now(),
tokenEndpoint: 'https://auth.example.com/token',
clientId: 'client-xyz',
clientSecret: 'secret-xyz',
authServerIssuer: 'https://auth.example.com',
redirectUri: 'https://app.example.com/api/mcp/oauth/callback',
resourceUrl: 'https://mcp.example.com/mcp',
};
await setToken(dataDir, 'higgsfield', tok);
const got = await getToken(dataDir, 'higgsfield');
expect(got).toMatchObject({
tokenEndpoint: 'https://auth.example.com/token',
clientId: 'client-xyz',
clientSecret: 'secret-xyz',
authServerIssuer: 'https://auth.example.com',
redirectUri: 'https://app.example.com/api/mcp/oauth/callback',
resourceUrl: 'https://mcp.example.com/mcp',
});
});
});
describe('isTokenExpired', () => {
const now = 1_700_000_000_000;
it('returns false for a token with no expiresAt', () => {
expect(
isTokenExpired({ accessToken: 'a', tokenType: 'Bearer', savedAt: now }, now),
).toBe(false);
});
it('returns false when expiresAt is comfortably in the future', () => {
expect(
isTokenExpired(
{ accessToken: 'a', tokenType: 'Bearer', savedAt: now, expiresAt: now + 60_000 },
now,
),
).toBe(false);
});
it('returns true when expiresAt is in the past', () => {
expect(
isTokenExpired(
{ accessToken: 'a', tokenType: 'Bearer', savedAt: now, expiresAt: now - 5 },
now,
),
).toBe(true);
});
it('honors the skew window (default 30s) so we do not ship a token about to expire', () => {
expect(
isTokenExpired(
{ accessToken: 'a', tokenType: 'Bearer', savedAt: now, expiresAt: now + 5_000 },
now,
),
).toBe(true);
});
});