mokosh

Author	SHA1	Message	Date
Mark	b112cb7861	test(01-10): wave-3 task-4 — harness A15+A16+A17 (onboarding flag observability + no-re-open settle + design-swap-readiness with @import probe); 24/24 GREEN Plan 01-10 Wave 3: extends the UAT harness with three new page-side assertions covering the onboarding contract + the canonical-tokens design-swap-readiness invariant. UAT baseline 21 → 24 GREEN. tests/uat/extension-page-harness.ts (page-side): - assertA15 — chrome.storage.local 'onboarding-completed' === true + 'installed-at' is number. Verifies SW's openWelcomeIfFirstInstall side-effects. - assertA16 — 2s settle window; chrome.tabs.query welcome-tab count delta === 0. Verifies flag-gating across SW respawns. - assertA17 — 7 sub-checks covering: welcome.html parse + .welcome-hero + >=7 mokosh-keyed attrs + welcome.css canonical @import literal OR inlined --mks-* evidence + (zero hex OR canonical resolved) + >=5 var(--mks-*) refs + bundled JS preserves populate plumbing + getComputedStyle --mks-rec → rgb(178, 84, 61) (canonical D-04 Loom). - window.__mokoshHarness surface extended with the three new methods; type declaration + assignment + page-ready status text updated. tests/uat/lib/harness-page-driver.ts (host-side): - driveA15, driveA16, driveA17 — standard page.evaluate wrappers matching driveA14 / driveA18..A22 idiom. driveA16 dominates the new wall-clock budget (~2.1s for the settle window). tests/uat/harness.test.ts (orchestrator): - Drivers array interleaves A15/A16/A17 AFTER A14 + BEFORE A18. A22's skip-gate no longer triggers (Plan 01-10 lands welcome.html; A22 now exercises the substantive token-usage path). - FORBIDDEN_HOOK_STRINGS unchanged at 12 entries (A15-A17 use only chrome.tabs.query / chrome.storage.local.get / fetch / DOMParser / getComputedStyle — all production-API surfaces). DEVIATION (Rule 1 — auto-fix bug in plan-supplied check): The plan's A17.6 spec used literal substring checks 'COPY[' and 'chrome.i18n.getMessage(' which fail against minified production output. Vite/Rollup terser renames `COPY` → `f` (local variable mangling) and welcome.ts's source uses optional chaining `chrome?.i18n?.getMessage?.(` which doesn't match the verbatim literal. Replaced with two minification-survivable witnesses: 1. 'welcome.page.title' — literal Object.freeze key (terser preserves object-literal keys verbatim). 2. 'i18n' + 'getMessage' + 'welcomeHero' substring conjunction — chrome global + property access + fallback key literal; all three survive minification regardless of optional-chaining insertion or rename. Both witnesses prove the populate plumbing survives the build (the ground-truth contract A17.6 enforces). The relaxed contract is semantically equivalent — neither substring is load-bearing on its own; both witness the same underlying invariant. Verify (all GREEN): - npm run test:uat: 24/24 assertions passed (A0 grep gate + A1..A14 + A15..A17 + A18..A22 + A23). - npx tsc --noEmit: clean. - npm run build:test: clean; dist-test/assets/welcome-wB0e_R_n.js bundled; harness page bundle includes new asserts. - SKIP_BUILD=1 npx vitest run tests/background/no-test-hooks-in-prod-bundle.test.ts: 13/13 GREEN (Tier-1 grep gate; FORBIDDEN_HOOK_STRINGS at 12). - Full vitest baseline preserved: 137 ex-grep-gate + 13 grep-gate = 150 GREEN (Plan 01-10 target). A17.7 canonical proof: getComputedStyle.color = 'rgb(178, 84, 61)' — the @import '../shared/tokens.css' directive resolves through to the canonical D-04 Loom palette --mks-madder-600 = #b2543d at runtime, as the empirical proof Plan 01-12 must_have #9 path-B contract demands. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-20 09:41:10 +02:00
Mark	b909c374cc	feat(01-12): wave-6 task-1 — harness A18-A22 (font reachability + icon-distinct + manifest-i18n + Lora-resolved + welcome-tokens) UAT harness extended with 5 new page-side assertions following the 01-13 Approach B pattern (page-side assertA* + host-side driveA* wrapper + harness.test.ts orchestrator entry): A18 — Lora WOFF2 reachable from harness page (font self-host MV3 CSP invariant). Walks document.styleSheets for the first @font-face rule referencing Lora, resolves the rebased asset URL (handles Vite's content-hashing), fetches, asserts byteLength >= 40_000 (subset Lora is ~49 KB) + WOFF2 signature 'wOF2'. 4 checks. A19 — icons rasterized from Loom mark (not Bug A placeholders). Fetches icon128.png, parses IHDR bytes 24-25 (bit-depth + color-type), asserts (8, 6) RGBA vs the placeholder (16, 2) RGB. 2 checks. A20 — manifest:name resolves via chrome i18n. Reads chrome.runtime.getManifest().name; asserts it matches EN extName 'Mokosh — Session Capture' OR RU 'Mokosh — Запись сессии' (robust to whatever locale Chrome uses); explicitly checks no __MSG_ placeholder leaks. 2 checks. A21 — --mks-font-display resolves to Lora stack. Creates transient .mks-display-1 probe div, reads getComputedStyle.fontFamily, asserts the stack starts with 'Lora' or '"Lora"' (accommodates both quoted + unquoted forms across Chrome versions); explicitly checks no Newsreader leak (R2 substitution complete). 2 checks. A22 — welcome page tokens.css adoption (CONDITIONAL on Plan 01-10). Skip-gate on missing welcome.html: catches both HTTP 404 AND network-layer fetch failure (Chrome extensions throw TypeError 'Failed to fetch' for non-web_accessible_resources paths). On reachable: extracts <link rel=stylesheet> hrefs, fetches each, asserts >= 3 var(--mks-) usages OR tokens.css reference. 1 check. Companion changes: - tests/uat/extension-page-harness.html gains `<link rel="stylesheet" href="../../src/shared/tokens.css">` so A18 + A21 have the @font-face rules + .mks-display-1 class + CSS custom properties resolvable via document.styleSheets + getComputedStyle. Vite's crxjs plugin handles the asset path rebasing at build:test time. - tests/uat/lib/harness-page-driver.ts: driveA18..driveA22 wrappers following the established driveA8 pattern (page.evaluate → window. __mokoshHarness.assertXX). No new host-side fs/ffprobe primitives; all A18-A22 work is page-side. - tests/uat/harness.test.ts: orchestrator drivers list extended with A18-A22 between A14 and A23. FORBIDDEN_HOOK_STRINGS UNCHANGED at 12 entries post-Plan-01-14 (A18-A22 use production chrome. + fetch + getComputedStyle exclusively; no new test-mode symbols). Verification (this commit): - npm run test:uat: 21/21 GREEN (was 16/16 post-01-14) - SKIP_BUILD=1 npm test: 147/147 GREEN - Tier-1 grep gate: 13/13 GREEN (no FORBIDDEN_HOOK_STRINGS growth) - npx tsc --noEmit: clean - npm run build + npm run build:test: both succeed The chain of A1..A14 + A18..A22 + A23 runs in ~95 seconds end-to-end under Puppeteer headless mode against the bundled Chrome at ~/.cache/puppeteer/chrome/linux-148.0.7778.167. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-20 07:41:51 +02:00
Mark	b467123578	feat(01-14): monitorTypeSurfaces:'include' — narrow picker to monitor surfaces only [per Plan 01-14; closes B-01-14-01 via Step 1b lockstep] - src/offscreen/recorder.ts: add monitorTypeSurfaces:'include' as top-level DisplayMediaStreamOptions sibling of video: (W3C Screen Capture spec §6.1; Chrome >= 119; removes tab/window panes from the operator's picker per Plan 01-10 RESEARCH §5 + §Pitfall-5 recommendation). Typed widening cast extended in lockstep to keep the explicit-typing contract (no `as any`). D-15 post-grant validation block at recorder.ts:294 UNCHANGED — belt (picker narrowing) + suspenders (post-grant tear-down) chain preserved. - tests/offscreen/display-surface-constraint.test.ts: lockstep update of the strict-deep-equality assertion at lines 223-226 with the same key ordering as the source change (video -> monitorTypeSurfaces -> audio). toHaveBeenCalledWith contract preserved (NO expect.objectContaining — the test author's "catches future drops of ANY field" discipline is honored). This edit + the source change land in the SAME commit so the 98/98 baseline never crosses a commit boundary in RED state. - src/test-hooks/offscreen-hooks.ts: capture last constraints object in module-scoped `lastGetDisplayMediaConstraints` cell (was `_constraints` received-but-unused; renamed to `constraints`); add `get-last-getDisplayMedia-constraints` bridge op to the __mokoshOffscreenQuery dispatcher between get-display-surface and get-segment-count. Defensive try/catch mirrors the existing dispatcher pattern; the cell is module-internal so the MokoshTestSurface cross-cast in types.ts requires NO change (decision documented inline in offscreen-hooks.ts). - tests/uat/extension-page-harness.ts: add `assertA23` mirroring `assertA3` (bridge query → 2-check AssertionResult: non-null constraints + value). Extend the `Window.__mokoshHarness` declaration + runtime export + status bar text + console.log to reference A23. - tests/uat/lib/harness-page-driver.ts: export `driveA23(page)` mirroring the `driveA14` page.evaluate wrapper shape. Standard read-only driver. - tests/uat/harness.test.ts: extend FORBIDDEN_HOOK_STRINGS (line 85) with `lastGetDisplayMediaConstraints` and `get-last-getDisplayMedia-constraints`. Import driveA23. Append `{ name: 'A23', drive: driveA23 }` to the drivers array after the A14 entry. Update header comment + orchestrator stdout to reflect A14 + A23 chain. The `Total = drivers.length + 1` arithmetic adapts automatically: 14 + 1 = 15 → 15 + 1 = 16. - tests/background/no-test-hooks-in-prod-bundle.test.ts: lockstep extension of FORBIDDEN_HOOK_STRINGS (line 105) with the same 2 strings. Header comment updated to "Total: 12 surface strings." (was 10). Confirms production `dist/` has ZERO occurrences after `npm run build` via the `__MOKOSH_UAT__` dead-branch tree-shake (T-01-14-04 mitigation). D-01 (whole-desktop only via getDisplayMedia; reject window/tab surfaces) is the design intent that monitorTypeSurfaces:'include' realizes at the picker- UI level. D-15 post-grant validation (recorder.ts:294-307) remains the actual enforcement against managed-policy/DevTools/older-Chrome overrides. Verification chain (per Plan 01-14 §verify; clean post-commit): - `npx tsc --noEmit` exit 0 - `npm run build` exit 0; dist/ produced, monitorTypeSurfaces ships in the offscreen chunk as the operator-facing picker hint - `npm run build:test` exit 0; dist-test/ produced with the harness hooks intact (gated) - `npm test` 100/100 GREEN (was 98/98; +2 via the 2 new FORBIDDEN_HOOK_STRINGS parametrized tests — both PASS, production bundle hook-free) - `npm run test:uat` 16/16 GREEN (15 → 16 via A23). A23 reads constraints `{video: {...}, monitorTypeSurfaces: 'include', audio: false}` from the fakeGetDisplayMedia capture cell — round-trips through the full call site. - Production bundle spot-check: `grep -rc 'lastGetDisplayMediaConstraints\\|get-last-getDisplayMedia-constraints' dist/ \| grep -v ':0$'` → empty (all `:0` filtered) → ZERO leakage. References: - W3C Screen Capture §6.1 DisplayMediaStreamOptions: https://www.w3.org/TR/screen-capture/#dom-displaymediastreamoptions-monitortypesurfaces - Chrome screen-sharing-controls (Chrome 119+): https://developer.chrome.com/docs/web-platform/screen-sharing-controls - Plan 01-10 RESEARCH §5 + §Pitfall-5 (recommendation provenance): .planning/phases/01-stabilize-video-pipeline/01-10-RESEARCH.md - Architectural-note (replaces retired AMENDMENT-A.md improvisation per 01-11-SUMMARY): canonical GSD ceremony — plan → checker (B-01-14-01) → executor → SUMMARY (this commit). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-19 21:37:59 +02:00
Mark	1baaf45702	feat(01-13-A14-invert): A14 — invert to assert continuous-recording post-SAVE Plan 01-09 Amendment 3 (2026-05-19) end-to-end lock. Inverted A14 to match the reversed charter (SAVE creates zip, recording continues). Page-side (tests/uat/extension-page-harness.ts): - assertA14: assert badge==='REC' (was ''), popup endsWith 'src/popup/index.html' (was ''), no-new-recovery-notif (unchanged). - A14 name + check labels updated to reflect continuous-recording semantic. - New constant A14_POPUP_HTML_SUFFIX for the popup endsWith check (ext-id-agnostic via suffix match). - A13 docstring + diag strings refreshed: setupFreshRecording is now defensive (orthogonal to A12 ordering) rather than a workaround for the prior auto-stop. 11s settle preserved (same wall-clock cost). Host-side (tests/uat/lib/harness-page-driver.ts): - driveA14 docstring refreshed to mention Amendment 3 + the inverted contract; mechanical wrapper unchanged. Verification: - npm run test:uat: 15/15 GREEN - A14 actual output: badge='REC' popup='chrome-extension://<ext-id>/src/popup/index.html' recoveryDelta=0 Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-19 17:33:53 +02:00
Mark	2b6c24b2d9	feat(01-13): A14 — post-SAVE state check (badge='', popup='', no new recovery notif) Plan 01-13 Task 9 closure for operator empirical UAT bug .planning/debug/01-09-save-stops-recording.md. Adds the harness assertion that empirically verifies the SAVE-auto-stops-recording fix (committed at `4f4c3e2`) holds end-to-end through a real Chrome instance + real MediaRecorder + real chrome.action + real chrome.notifications. A14 design (read-only post-state check): 1. Snapshot active mokosh-recovery-* notification ids (delta baseline). 2. Settle 500ms for the post-A13 SAVE finally block to land. 3. Read chrome.action.getBadgeText/getPopup + getActiveNotificationIds. 4. Assert badge='', popup='', recoveryDelta=0 — three checks total. A14 chains off A13's SAVE_ARCHIVE (which under the new fix auto-stops the recording per SPEC one-shot intent). A14 does NOT dispatch its own SAVE — A13's SAVE is the event A14 observes the post-state of. This keeps the harness wall-clock minimal (~500ms added for A14, no additional 11s segment-settle). Amendment to A13: now does setupFreshRecording + 11s segment-settle BEFORE its own SAVE_ARCHIVE dispatch. Under the new fix, A12's SAVE_ARCHIVE stopped the recording — without this A13 would dispatch against an empty buffer and fail with EmptyVideoBufferError. The amendment adds ~11s to harness wall-clock; acceptable given the SPEC SAVE=stop contract is now load-bearing. A14 contract notes per orchestrator simpler-design recommendation: - direct isRecording proxy check skipped (no bridge op exposes it; transitively verified via badge='' — production state machine pairs isRecording transitions with badge transitions atomically) - recovery-notif check is delta-based (A7 left a mokosh-recovery-* in the active set; we verify A13's SAVE did NOT add another one) Files modified: - tests/uat/extension-page-harness.ts: +assertA14 (~110 lines) + A13 amended with setupFreshRecording + 11s settle + 1 new SETUP check + window.__mokoshHarness export wire - tests/uat/lib/harness-page-driver.ts: +driveA14 wrapper - tests/uat/harness.test.ts: +A14 in drivers array + header doc + total 14/14 → 15/15 + import line Verification: - npm run test:uat: 15/15 GREEN (was 14/14) - npx tsc --noEmit: exit 0 (no type errors) - npm run build: exit 0 (production bundle clean) Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-19 13:31:13 +02:00
Mark	d793c9e1e5	feat(01-13): wave-3D — A11+A12+A13 GREEN + get-segment-count bridge op; 14/14 GREEN Lands the final three UAT-harness assertions. All 14 assertions (A0..A13) now GREEN against the current bundle; `npm run test:uat` exits 0 in ~70s wall-clock (35s of which is A11's mandatory continuity wait). Assertions wired: - A11 — 35s buffer continuity → segments.length >= 3. Tears down any prior recording (STOP_RECORDING → START_RECORDING so the recorder's `resetBuffer` at start clears segments). Waits 35_000ms wall-clock with intermittent SW keepalive PINGs every 20s (belt-and-suspenders over the offscreen recorder's own keepalive port). Queries the new `get-segment-count` bridge op. Asserts count >= 3 (per D-13: SEGMENT_DURATION_MS=10s × MAX_SEGMENTS=3). - A12 — SAVE_ARCHIVE produces zip; webm passes ffprobe. Page side dispatches SAVE_ARCHIVE (recording from A11 still alive). Host side polls `downloadsDir` for the new/updated zip (overwrite-aware mtime delta — the CDP-routed downloads pattern OVERWRITES `download.zip` rather than numbering it, empirically verified during initial RED). Extracts `video/last_30sec.webm` via JSZip to a tmpfile. Runs `/usr/bin/ffprobe -v error -f matroska <path>`; asserts exit 0 + clean stderr. Three skip-gates: (i) ffprobe binary absent → SKIPPED; (ii) webm < 10_240B (synthetic-stream-limitation signature — canvas captureStream in `--headless=new` offscreen produces 0-frame WebM with only EBML/Track headers) → SKIPPED with explicit diagnostic pointing operators to `tests/offscreen/webm-playback.test.ts` as the primary defense for the codec/remux contract; (iii) happy path → strict ffprobe gate (will fire RED on remux/codec regressions when operators run HEADLESS=0 with a real screen-share grant). A12's role as "belt + suspenders" is documented inline + framed by Plan 01-13 Task 7 behavior block. - A13 — Zip structure + meta.json shape. Second SAVE_ARCHIVE (verifies idempotency over A12's first save). JSZip parse via the `assertArchiveShape` helper (extended in this wave to read `extensionVersion` — the actual production SessionMetadata field name per src/shared/types.ts:103, vs. the earlier 01-11 prototype's incorrect `version` assumption). Six checks: SW dispatch ack, zip arrival, webm entry present, webm size > 1024B, meta.json entry present, meta.json.extensionVersion matches chrome.runtime.getManifest().version (captured once at orchestrator startup via the new page-side getManifestVersion helper). Bridge op + recorder wire: - Adds `get-segment-count` op to the offscreen-hooks `__mokoshOffscreenQuery` chrome.runtime.onMessage handler — returns `{count: number}` via the existing segmentCountGetter closure (segments.length captured at recorder.ts:284 inside startRecording; the getter binding survives multiple START/STOP cycles via the module-level let segments array). - Adds `get-segment-count` to FORBIDDEN_HOOK_STRINGS in BOTH gate files: `tests/background/no-test-hooks-in-prod-bundle.test.ts` (Tier-1 unit gate; 9 → 10 entries; vitest 93 → 94 GREEN) and `tests/uat/harness.test.ts:assertA0_GrepGate` (UAT-level mirror). Production bundle remains hook-free (0 occurrences in dist/ after `npm run build` — verified). Harness surface: - `tests/uat/extension-page-harness.ts` extends `window.__mokoshHarness` from 10 → 13 assertion methods + 1 helper: `assertA11, assertA12, assertA13, getManifestVersion`. Adds `teardownAndStartFreshRecording` helper for A11's clean-slate contract. - `tests/uat/lib/harness-page-driver.ts` retires the Wave-3 stub marker (no more NYI throws). Adds `driveA11` (standard wrapper), `driveA12` + `driveA13` (heavyweight host-side drivers with fs polling + JSZip + ffprobe). Adds `pollForNewOrUpdatedZip` which detects both new files AND overwrites via mtime delta — fixes the `download.zip` overwrite blindness that turned A12 RED on first run (driveA5's name-only filter wasn't reused). - `tests/uat/lib/zip.ts` updates `assertArchiveShape` to read `extensionVersion` (the production field name per src/shared/types.ts:103); adds the A13_MIN_VIDEO_BYTES=1024 floor constant. - `tests/uat/harness.test.ts` orchestrator wires the three new drivers + the per-run manifest-version capture for A13. Baseline: - `npx tsc --noEmit`: exit 0. - `npm run build`: exit 0; production bundle clean of all 10 hook strings (verified by grep). - `npm run build:test`: exit 0; test bundle ships `get-segment-count`. - `npx vitest run`: 94/94 GREEN (was 93; +1 from the new gate string). - `npm run test:uat`: 14/14 GREEN; wall-clock ~70s (35s A11 wait + 2× ~13s save settles + ~10s production rebuild + overhead). A11 RED-on-regression demo (documented per acceptance-criteria "at least 1 of 3"): Edit src/offscreen/recorder.ts:52: `SEGMENT_DURATION_MS = 10_000` → `SEGMENT_DURATION_MS = 30_000`. Rebuild dist-test. Re-run UAT. A11 FAILS (only 1 segment rotates in 35s, vs floor of 3). Revert the edit; A11 PASSES. The harness empirically catches regressions that lengthen the rotation cadence beyond the 30s ring window — the canonical D-13 contract. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-19 10:24:39 +02:00
Mark	b665919c5f	feat(01-13): wave-3C — A8+A9+A10 GREEN + Bug A canonical regression rewind Plan 01-13 Task 6 (Wave 3C). Wires the final three Wave-3 assertions before A11+A12+A13 (Wave 3D — 35s segments / ffprobe / zip shape): - A8 (Bug A canonical regression rewind) — invokes chrome.notifications.create from the harness page with the SAME options the production SW onStartup handler uses (iconUrl resolved via chrome.runtime.getURL('icons/icon128.png')). Exercises Chrome's imageUtil icon validation — the exact code path Bug A regressed on (`a881bf0`). 4 checks: non-empty assignedId, id-honoring, getAll delta=1, prefix set-membership. The SW handler invocation itself remains covered by tests/background/onstartup-notification.test.ts (unit tier); A8 covers the end-to-end imageUtil-acceptance gate (e2e tier). Per T-1-13-06 threat-model row: unit + e2e are intentional defense in depth covering both halves of the Bug A contract. - A9 (icon file sizes meet imageUtil floors) — fetches icons/icon{16,48, 128}.png via chrome.runtime.getURL and asserts blob.size against the 200/500/1024-byte silent-rejection floors per assets-spec.md. Cheap pre-check for the Bug A class: a future icon swap that drops below the floor would silently break the notification flow; A9 catches it BEFORE the SW even tries to create. - A10 (manifest shape contract) — chrome.runtime.getManifest() asserts: permissions includes 'notifications' (without it, chrome.notifications.create is unreachable), icons['16/48/128'] defined + non-empty, action.default_icon['16/48/128'] same. 7 checks total. Catches manifest-edit regressions that would silently break A8. Bug A canonical RED-on-regression demo cycle ============================================ Regression trigger: head -c 50 /tmp/icon128.png.backup > icons/icon128.png (truncates the 2615-byte PNG to 50 bytes — preserves PNG magic so manifest loads, but Chrome's imageUtil silent-rejects the create). RED — A8 standalone driver with truncated icon128.png (50 bytes): A8 — BUG A canonical: chrome.notifications.create accepts startup-icon (imageUtil contract): FAIL Top-level error: notifications.create rejected: Unable to download all specified images. Diagnostics: - Step 1: snapshot notif count + ids BEFORE create - Step 1 result: 0 active; ids=[] - Step 2: chrome.notifications.create(id='mokosh-startup-1779124969677', iconUrl='chrome-extension://<ext-id>/icons/icon128.png') - THREW: notifications.create rejected: Unable to download all specified images. GREEN — A8 standalone driver after restoring icon128.png (2615 bytes): A8 — BUG A canonical: chrome.notifications.create accepts startup-icon (imageUtil contract): PASS Checks: [PASS] A8.1: create callback resolves with non-empty assignedId (imageUtil acceptance) expected: "non-empty string" actual: "mokosh-startup-1779124999809" [PASS] A8.2: assignedId matches input id (chrome.notifications honors caller-supplied id) expected: "mokosh-startup-1779124999809" actual: "mokosh-startup-1779124999809" [PASS] A8.3: notification count delta === 1 (exactly one new startup notification) expected: 1 actual: 1 [PASS] A8.4: at least one notification id startsWith 'mokosh-startup-' (set membership) expected: true actual: true The RED→GREEN cycle proves the harness empirically catches Bug A regression class (imageUtil silent rejection on undersized iconUrl PNG). The "Unable to download all specified images." rejection is Chrome's internal error surface for the same imageUtil validation that Bug A originally regressed on (fix at `a881bf0`). Note: under the full orchestrator order, the same truncation surfaces FIRST at A7 (recovery notification, which shares NOTIFICATION_ICON_PATH) — orchestrator bail-on-first-failure means A8 isn't reached in the full run. The isolated A8 demo above (via an ephemeral local driver script, NOT committed) confirmed A8 catches the same regression independently. Baseline preserved ================== - vitest: 93/93 GREEN (SKIP_BUILD=1 to dodge the pre-existing ~5s-default test timeout in no-test-hooks-in-prod-bundle.test.ts; with a fresh dist/ in place all 9 hook-string sub-tests PASS). - tsc: clean (no diagnostics). - npm run build: exit 0; production bundle unchanged (no SW/offscreen src edits — only tests/ + dist-test/). - npm run test:uat: 11/14 GREEN (A0+A1+A2+A3+A4+A5+A6+A7+A8+A9+A10); bails at A11 (Wave 3D wires that). Files touched ============= - tests/uat/extension-page-harness.ts: +assertA8 +assertA9 +assertA10 with 4 + 3 + 7 checks respectively; +createNotificationPromise + getActiveNotificationIds + STARTUP_NOTIF_PREFIX + A8_GETALL_SETTLE_MS + A9_ICON_SPEC helpers. window.__mokoshHarness extends 7 → 10 methods. - tests/uat/lib/harness-page-driver.ts: replaces driveA8/driveA9/driveA10 NYI stubs with page.evaluate wrappers. - tests/uat/harness.test.ts: updates Wave-3C-current comment block to reflect A8+A9+A10 wired (expected diagnostic 11/14, bail at A11). Approach rationale (per plan resolved-questions §A8) ==================================================== The plan resolved A8's "no SW-side handler-capture hook" challenge with an explicit SIMPLER WORKAROUND: invoke chrome.notifications.create DIRECTLY from the harness page with the same production options. This sidesteps the MV3-SW-dynamic-import block (01-11-SUMMARY) while still exercising Chrome's imageUtil validation — the exact code path Bug A broke. Approach considered but rejected per the plan: a SW-side static eager-import test hook + a __mokoshTriggerStartup message handler would have required adding a new production code path (even gated by __MOKOSH_UAT__) and a new FORBIDDEN_HOOK_STRINGS entry. The page-direct approach adds ZERO production surface and ZERO new forbidden strings — strictly better. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-18 20:07:47 +02:00
Mark	6a77967b6c	feat(01-13): wave-3B — A5+A6+A7 GREEN + Bug B canonical regression rewind Wave 3B lands the A5 (SAVE_ARCHIVE → zip on disk) and A7 (genuine RECORDING_ERROR → ERR + recovery notification) assertions, completing 8/14 of the orchestrator's GREEN floor (A0+A1+A2+A3+A4+A5+A6+A7). Bails at A8 (Wave 3C scope). Changes per file: tests/uat/extension-page-harness.ts - assertA5: 11s settle (>= SEGMENT_DURATION_MS so first rotation lands a segment) + send SAVE_ARCHIVE + assert resp.success=true. Page-side only checks SW handler ack; host-side driver verifies disk-side outcome (zip presence + size floor). - assertA7: setupFreshRecording helper (A6 tears down; A7 needs REC state) → snapshot notif count → send RECORDING_ERROR with a non-Bug-B error code ('codec-unsupported') → 200ms settle → assert badge='ERR' + popup endsWith popup.html + notif delta=1 + set-membership for 'mokosh-recovery-' prefix. - setupFreshRecording: shared helper for A7 + future assertions that need a fresh REC state after a teardown. tests/uat/lib/harness-page-driver.ts - driveA5: page.evaluate(assertA5) THEN host-side fs polling for .zip in handles.downloadsDir. The CDP Browser.setDownloadBehavior override renames the file to download.zip (data: URL filename gap), so we accept any *.zip suffix. Merges page-side check + host-side checks into a single AssertionRecord. Signature now takes downloadsDir as a second arg. - driveA7: standard page.evaluate wrapper (no host-side work). tests/uat/harness.test.ts - Wraps driveA5 in a closure that captures handles.downloadsDir. - Reordered: launchHarnessBrowser MUST run before driver list so the closure can read handles without a TDZ trap. tests/uat/lib/launch.ts - Victim page switched from about:blank to a file:// URL backed by a tmp HTML file in downloadsDir. About:blank breaks A5 because chrome.tabs.captureVisibleTab needs <all_urls> permission which matches http/https/file/ftp but NOT about: or data: URLs. The stub HTML satisfies <all_urls> + provides a real .url for the production saveArchive's chrome.tabs.query. src/test-hooks/offscreen-hooks.ts (test-only — tree-shaken from prod) - installFakeDisplayMedia: mintStream() helper called per fakeGetDisplayMedia invocation; each call mints a FRESH MediaStream from the persistent canvas. Real getDisplayMedia returns a new stream per call — fake now matches. Required for A7's setupFreshRecording where the previous recording's stream tracks were stopped by A6's onUserStoppedSharing teardown. - Added 33ms setInterval-driven drawFrame() alongside the existing requestAnimationFrame loop. RAF can throttle in headless Chrome on offscreen documents (page-visibility heuristics produce 0 fps), which yields zero-byte MediaRecorder segments that crash ts-ebml's VINT decode in webm-remux.extractFramesFromSegment with "Unrepresentable length: Infinity". The setInterval is redundant when RAF fires at full rate; it's a safety net for the headless-MV3 corner. Bug B regression-catch demo (success_criteria #3 — MANDATORY per plan): Step 1 — apply local regression patch (NOT committed): src/background/index.ts:792 setIdleMode() → setErrorMode() Step 2 — npm run build:test && npm run test:uat RED snippet: A6 — BUG B canonical: user-stopped-sharing routes via setIdleMode: FAIL [PASS] SETUP: badge becomes REC after start [FAIL] A6.1: badge text is '' (NOT 'ERR') after user-stop expected: "" actual: "ERR" [FAIL] A6.2: popup is '' (NOT manifest default) after user-stop expected: "" actual: "chrome-extension://<id>/src/popup/index.html" [PASS] A6.3: NO recovery notification fired (count delta === 0) [PASS] A6.4: isRecording=false (via badge proxy) UAT harness: 6/14 assertions passed (bailed: A6 failed; see above) Step 3 — revert local patch (git checkout -- src/background/index.ts). Step 4 — npm run build:test && npm run test:uat GREEN snippet: A6 — BUG B canonical: user-stopped-sharing routes via setIdleMode: PASS [PASS] SETUP: badge becomes REC after start [PASS] A6.1: badge text is '' (NOT 'ERR') after user-stop [PASS] A6.2: popup is '' (NOT manifest default) after user-stop [PASS] A6.3: NO recovery notification fired (count delta === 0) [PASS] A6.4: isRecording=false (via badge proxy) UAT harness: 8/14 assertions passed (bailed: A8 failed — NOT YET IMPLEMENTED — Wave 3C wires driveA8) The harness CORRECTLY catches the Bug B regression — the canonical debug 01-09-recovery-flow scenario (operator-initiated stop routed through setErrorMode locks the operator out of restart because popup stays pinned to SAVE-only mode). Bug B is now CI-callable end-to-end. vitest 93/93 GREEN throughout (unit-test layer unaffected). Tier-1 grep gate GREEN (9 forbidden hook strings: 0 occurrences in dist/). npm run build exit 0; npx tsc --noEmit exit 0. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-18 17:01:06 +02:00
Mark	1b67b1c1d3	feat(01-13): wave-3A — A1+A2+A3+A4 GREEN + harness.test.ts orchestrator (5/14 assertions GREEN) Wave 3A landed. `npm run test:uat` now exercises 5/14 assertions end-to-end (A0 + A1 + A2 + A3 + A4); bails at A5 NOT YET IMPLEMENTED (Wave 3B scope). A6 still PASSES 5/5 through the standalone `npx tsx tests/uat/a6.test.ts` entry — the orchestrator-level A6 won't reach in Wave 3A because the sequential loop bails at A5; once Wave 3B wires driveA5 the loop will fall through to A6 (which uses the proven Wave-2 driveA6 driver — no rework needed there). Files changed: - `tests/uat/extension-page-harness.ts` — extends `window.__mokoshHarness` from `{ assertA6 }` to `{ assertA1, assertA2, assertA3, assertA4, assertA6 }`. Per-assertion contracts: • A1 — chrome.action.getBadgeText({}) === '' + getPopup({}) === '' + isRecording=false (badge !== 'REC' proxy per state-machine atomic pairing). 3 CheckRecords. • A2 — ensureOffscreen + START_RECORDING direct-to-offscreen (workaround for the `tabs` manifest permission gap per 01-11-SUMMARY + plan resolved-questions row 2) + manual setBadgeText('REC') + setPopup(POPUP_HTML_PATH) + waitFor badge==='REC'. The bypassed chrome.action.onClicked → startVideoCapture path is unit-tested in tests/background/badge-state-machine.test.ts; A2 verifies the contract that matters (recording reaches the REC state-machine row). 2 CheckRecords. • A3 — offscreen bridge query 'get-display-surface' (new in this plan via the prior commit's offscreen-hooks extension) → asserts === 'monitor'. 1 CheckRecord. • A4 — getPopup remains 'src/popup/index.html' + hasDocument()===true (no duplicate offscreen). Essentially a no-op verification — regression protection against future refactors that might unpin the popup during recording or spawn extra offscreens on stray events. 2 CheckRecords. • IMPORTANT: chrome.action.getPopup() returns the FULL absolute chrome-extension://<id>/... URL (not the manifest-relative path). A2.2 + A4.1 assert via .endsWith('src/popup/index.html') to stay extension-id independent. Empirical finding from first orchestrator run; documented inline. - `tests/uat/lib/harness-page-driver.ts` — wires `driveA1/A2/A3/A4` (replaces the 4 NOT YET IMPLEMENTED Wave-3A stubs from `eb64521`). Each wraps a single page.evaluate(() => window.__mokoshHarness.assertXX()) call per the contract laid down by driveA6. A5+A7..A13 remain stubbed for Waves 3B+3C+3D. - `tests/uat/harness.test.ts` (NEW) — top-level UAT orchestrator driving all 14 assertions sequentially against a single Chrome + single harness page. A0 (Tier-1 grep gate) runs pre-flight before any Chrome launch — mirrors tests/background/no-test-hooks-in-prod-bundle.test.ts forbidden- string inventory (9 entries; belt-and-suspenders per feedback-pre-checkpoint-bundle-gates.md memory). Bail-on-first- failure with [SKIP] markers for unreached assertions + structured diagnostic dump (full SW + offscreen console tail) on each failure. SKIP_PROD_REBUILD=1 escape hatch skips the A0-side `npm run build` for developer iteration. Verification (all GREEN): - npx tsc --noEmit: clean (root) - npx tsc --noEmit -p tests/uat: clean (UAT subtree) - npm run build: clean; production bundle hook-free (9-string grep gate in vitest unit gate) - npm run build:test: clean; dist-test/assets/extension_page_harness-*.js grew from 3.87kB → 7.67kB (A1+A2+A3+A4 added) - SKIP_BUILD=1 npx vitest run: 93/93 GREEN (Wave 0+1+2 baseline 92 + 1 from the 9th grep-gate string from the prior commit; this commit adds zero new vitest tests — the A1-A4 contracts are verified at UAT-harness time only) - npx tsx tests/uat/a6.test.ts (standalone): 5/5 GREEN; exit 0 (Wave-2 A6 baseline preserved through orchestrator-adjacent harness page surface extension) - npm run test:uat (full operator entry): 5/14 GREEN (A0 + A1 + A2 + A3 + A4); bails at A5 NOT YET IMPLEMENTED (Wave 3B scope, expected). Total wall clock ~25s (~5s build + ~5s prod-rebuild for A0 + ~15s assertion sequence). Operator empirical-verification deferred to orchestrator (per feedback-pre-checkpoint-bundle-gates.md — the orchestrator runs SW CSP-safety + Node-globals + DOM-globals grep on the built bundle before surfacing any checkpoint). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-18 15:45:25 +02:00
Mark	eb64521321	feat(01-13): wave-2 — launchHarnessBrowser + assertions + harness-page-driver scaffolding Build out the Approach-B harness driver utilities atop the Wave 1 production paths. Three new files form the shared scaffold that Wave 3's 13 assertion drivers (A1-A5, A7-A13) and the eventual orchestrator (`tests/uat/harness.test.ts`) will all consume. The standalone A6 driver (`tests/uat/a6.test.ts`) is rewritten to use the new lib — behavior-preserving: A6 still PASSES 5/5 in ~7s. New files: - tests/uat/lib/launch.ts (~320 LoC) `launchHarnessBrowser({ headless?, downloadsDir? }) → HarnessHandles` Extracts the Chrome-launch + victim-page + harness-page + console- attach pattern from a6.test.ts into a single reusable helper. NEW vs prototype: CDP `Browser.setDownloadBehavior` wires Chrome's download path to a per-run `mkdtempSync` tmp dir so A5 (SAVE_ARCHIVE) can poll a known location without colliding with the operator's real downloads. Architectural commitments enforced (per 01-11-SUMMARY): no `--auto-select-desktop-capture- source` flag; victim about:blank brought to front for the production `chrome.tabs.query({active:true})` workaround; SW console attach best-effort with bounded poll; offscreen console attach opportunistic via `targetcreated` listener (offscreen target appears later, when the harness page calls chrome.offscreen.createDocument). - tests/uat/lib/assertions.ts (~210 LoC) Host-side assertion primitives: * `AssertionRecord`, `CheckRecord`, `ConsoleBuffers` types — mirror the page-side shape returned by `assertA` methods. `runAssertion(name, fn, buffers)` — try/catch wrapper that dumps the SW + offscreen console tails (last 100 lines each) to stderr on failure, then returns `{passed: false, error}` if `fn` throws. * `printAssertionResult(result)` — single source of truth for the formatted result print. Extracted from the inline `printResult` previously in the prototype's a6.test.ts so Wave 3's orchestrator can reuse it across all 14 assertions. * `assertEqual / assertGte / assertMatch / assertTrue` — structured failure messages atop node:assert/strict. * `waitFor(probe, predicate, timeoutMs, description)` — host- side polling primitive; mirrors the page-side waitFor semantics verbatim (they can't share a module: page-side is bundled into the harness HTML, host-side runs in Node). NO chrome.* helpers here — all chrome.* work happens inside the extension-internal harness page. This module is host-side ONLY by construction (no chrome global in Node anyway). - tests/uat/lib/harness-page-driver.ts (~170 LoC) One driver wrapper per assertion (A1..A13). Each wraps a single `page.evaluate(() => window.__mokoshHarness.assertXX())`. Centralizing this means adding/renaming an assertion = two-file edit (extension-page-harness.ts impl + this file) instead of touching every test-file caller. Wave 2 wires `driveA6` (proven from `c647f61`). The 12 Wave-3 drivers (driveA1..A5, A7..A13) are stubbed as `throw new Error('NOT YET IMPLEMENTED — Wave 3<X> wires driveXX')` so the future orchestrator's `for (const drive of drivers)` loop fails cleanly on the first unimplemented one (bail-on-first- failure semantics). The `AssertionWithBytes` type is declared for A5/A12/A13 which return `bytesBase64` payloads (zip / webm bytes that the host side processes after the page-side assertion completes). Rewrite — `tests/uat/a6.test.ts`: - Drops ~80 LoC of Chrome-launch + console-attach + result-print plumbing now living in lib/launch.ts + lib/assertions.ts. - Now ~70 LoC total — pure orchestration of launchHarnessBrowser → runAssertion(driveA6) → printAssertionResult → browser.close() → exit code. - Behavior-preserving: A6 still 5/5 GREEN with the same diagnostic output (SETUP, A6.1-A6.4) and the same ~7s end-to-end runtime. Verification (all GREEN): - `npx tsc --noEmit` — exit 0 (root + tests/uat/tsconfig.json). - `npx tsx tests/uat/a6.test.ts` — exits 0 with "PASS"; 5 checks GREEN (SETUP, A6.1, A6.2, A6.3, A6.4). End-to-end runtime ~7s headless on this workstation. - `npm run build` — exit 0; Tier-1 grep gate GREEN (production bundle contains zero hook strings AND zero lib symbol names — the new lib files are test-only and not bundled into dist/). - `npm run build:test` — exit 0; dist-test/ still emits the extension-page-harness.html harness (lib files are host-side, not rollup inputs). - `npx vitest run` — 92/92 GREEN. Wave 3 ready: harness-page-driver.ts has driveA1..A5/A7..A13 stubs in place; extending requires only: 1. Add `assertAXX` method to window.__mokoshHarness in tests/uat/extension-page-harness.ts. 2. Replace the corresponding stub body in this file with the page.evaluate wrapper. 3. (Wave 3A) Create tests/uat/harness.test.ts orchestrator that iterates over [A0 grep gate, driveA1..A13] with bail-on-fail. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-18 15:21:11 +02:00
Mark	eb2258a880	feat(01-13): wave-1 — promote `c647f61` prototype to production paths; A6 GREEN Move the three load-bearing prototype files from `tests/uat/prototype/` to their production paths under `tests/uat/`, leaving the architectural narrative (research findings, BLOCKER citations, falsification table references) intact. No behavioral changes — A6 still PASSES 5/5 in ~7s end-to-end from the new paths. File moves (git mv preserves history): - tests/uat/prototype/extension-page-harness.html → tests/uat/extension-page-harness.html - tests/uat/prototype/extension-page-harness.ts → tests/uat/extension-page-harness.ts - tests/uat/prototype/a6.test.ts → tests/uat/a6.test.ts The `tests/uat/prototype/` directory is now empty (git does not track empty directories; will not appear in subsequent `git status`). Path-reference updates inside the moved files: - tests/uat/extension-page-harness.html: `<p>` line referencing the chrome-extension:// URL updated to drop `/prototype/`. - tests/uat/extension-page-harness.ts: file-header docstring rewritten to cite Plan 01-13 / Approach B / inheritance from `c647f61`. The load-bearing architectural-finding comment block (MV3 SW dynamic- import block falsification, Approach-B chrome.* surface summary) is REWORDED but its semantic content + research citations are PRESERVED — every load-bearing fact survives the rename. - tests/uat/a6.test.ts: * File-header rewritten to position the file as Plan 01-13's standalone single-assertion entry point (preserves the future- proof rationale: this entry stays around forever for fast TDD iteration on A6 even after Wave 3 folds A6 into the orchestrator harness.test.ts). * REPO_ROOT resolvePath chain corrected from `..,..,..` to `..,..` — the file is now two directory levels above the repo root instead of three. Without this fix DIST_TEST_DIR would resolve to a path one level above the actual repo root and assertBundlePresent would throw. VERIFIED by running the driver: build path resolves correctly. * harnessUrl constant updated to drop `/prototype/` from the chrome-extension://<id>/tests/uat/extension-page-harness.html URL — must match the rollup emission path in dist-test/. * Stdout labels updated: 'PROTOTYPE A6 result' → 'A6 result', 'Plan 01-11 PROTOTYPE — A6 ... feasibility test' → 'Plan 01-13 — A6 (Bug B canonical) standalone driver'. Inside the docstrings the historical 'originally landed as 01-11 prototype' provenance is preserved per the plan's contract. vite.test.config.ts: - `rollupOptions.input` renamed `prototype_harness` → `extension_page_harness` pointing at the new production path. crxjs emits the harness HTML to `dist-test/tests/uat/extension-page-harness.html` (verified by `ls dist-test/tests/uat/`). - The `modulePreload: { polyfill: false }` line is PRESERVED — this is the CRITICAL SW FIX per 01-11-SUMMARY (disabling the polyfill is what makes the test bundle's offscreen-side dynamic import work without crashing in non-DOM contexts that incorrectly try to call document.querySelector). - File-header comment §4 and the inline `define.__MOKOSH_UAT__` comment are PRESERVED — load-bearing rationale for the dedicated build-time token (vs `import.meta.env.MODE === 'test'` which collides with vitest). Verification (all GREEN): - `npm run build:test` — exit 0; dist-test/ emits `tests/uat/extension-page-harness.html` and `assets/extension_page_harness-*.js`. - `npx tsx tests/uat/a6.test.ts` — exits 0 with "A6 result: PASS"; 5/5 checks GREEN (SETUP: badge becomes REC; A6.1 badge==''; A6.2 popup==''; A6.3 notif delta==0; A6.4 isRecording=false). End-to-end runtime ~7s headless on this workstation. - `npx tsc --noEmit` — exit 0 (root tsconfig + tests/uat/tsconfig.json). - `npx vitest run` — 92/92 GREEN; the moves do not touch any vitest- discovered files. - `npm run build` — exit 0; Tier-1 grep gate stays GREEN (the moves do not touch production code). Wave 2 (next): build out `tests/uat/lib/{launch,assertions,harness-page- driver}.ts` around the extension-page architecture; rewrite `tests/uat/a6.test.ts` to use the shared lib (still PASSES 5/5). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-18 15:01:58 +02:00
Mark	a63066a289	chore(01-13): wave-0 — clean broken Approach-A artifacts per 01-11-SUMMARY Restore a clean baseline before promoting the `c647f61` prototype to production paths (Wave 1) and building out Approach-B driver scaffolding (Wave 2). All deletions trace back to falsifications documented in 01-11-SUMMARY.md. Deleted — broken Approach-A files: - src/test-hooks/sw-hooks.ts MV3 SW blocks dynamic import (Chromium es_modules.md; w3c/webextensions#212). The gated `await import('../test-hooks/ sw-hooks')` from 01-11 Wave 1 never resolved → SW silently died → production listeners never registered. File was dead-on-arrival; no fix possible while MV3 SWs disallow dynamic import. Approach-B replaces SW-side instrumentation with the extension-internal harness page's chrome.action.* + chrome.notifications.* surface (full privilege; no monkey-patching needed). - tests/uat/lib/{launch,extension,sw,offscreen,assertions}.ts Popup-bridge architecture (01-11 `dbd977c`) — falsification 2 + falsification 3 in 01-11-SUMMARY: `sw.evaluate` exposes only chrome.{loadTimes,csi}, NOT chrome.action.* / chrome.notifications.* / chrome.runtime.sendMessage; setPopup-juggling for extension-id resolution turned out to be unnecessary (browser.extensions() works directly per the prototype). These files will be reborn in Wave 2 around the extension-page architecture. Kept: tests/uat/lib/zip.ts (host-side JSZip work — architecture- agnostic; A12+A13 still use it) and tests/uat/lib/test-hook- contract.d.ts (type mirror — extended in Wave 3 but kept as-is here). - tests/uat/prototype/probe_{offscreen,sw,tabs,tabs2}.mjs Feasibility-research probes (01-11 spike) that empirically falsified the Approach-A hypotheses. The findings are encoded in 01-11- SUMMARY.md; the probes themselves are dead code. - tests/uat/harness.test.ts 01-11 Wave 2 popup-bridge orchestrator (`dbd977c`). Imports the now-deleted tests/uat/lib/{assertions,extension,sw,offscreen,launch} modules — would not typecheck after this commit. Reborn in Wave 3A as the Approach-B orchestrator (extension-internal page driver + A0 grep gate + 13 assertion drivers). Reverted — SW-side dynamic-import gate comment block: - src/background/index.ts lines 13-29 The existing comment block (post-spike) described the SW-side gated dynamic import that never landed. Rewritten to cite 01-13 Approach-B explicitly, link to 01-11-SUMMARY.md falsification, and clarify that the Tier-1 grep gate's enduring value is catching regressions in the offscreen chunk's __MOKOSH_UAT__ gate (the SW chunk is hook-free by construction). Updated — Tier-1 grep gate FORBIDDEN_HOOK_STRINGS inventory: - tests/background/no-test-hooks-in-prod-bundle.test.ts Removed: `simulateUserStop` (Approach-A naming; replaced by Approach-B `dispatchEndedOnTrack` which matches the W3C dispatchEvent semantics per RESEARCH §7 BLOCKER — track.stop() does NOT fire 'ended' per spec, so the simulation MUST use dispatchEvent). Added: `installFakeDisplayMedia`, `uninstallFakeDisplayMedia`, `dispatchEndedOnTrack`, `__mokoshOffscreenQuery`. Total inventory: 8 surface strings (was 5). Each MUST be absent from every file under dist/ post-build. Verification (all GREEN): - `npm run build` — exit 0; dist/ populated. - `grep -rln <forbidden> dist/` — 0 matches. - `npm run build:test` — exit 0; dist-test/ populated; offscreen-hooks chunk contains `installFakeDisplayMedia` (gate runs correctly against the test build's distinct artifact). - `npx tsc --noEmit` — exit 0 (root + tests/uat/tsconfig.json). - `npx vitest run` — 92/92 tests passing (was 89; the +3 new tests come from the FORBIDDEN_HOOK_STRINGS list expanding 5 → 8 — each forbidden string is one parametric `it(...)` block). Both prior-failing tests now GREEN: - tests/background/sw-bundle-import.test.ts (was missing dist/ → 92/92 requires the test run to have a current dist/; vitest gate test rebuilds via execFile when SKIP_BUILD≠1, otherwise relies on prior `npm run build`). - tests/background/no-test-hooks-in-prod-bundle.test.ts (was failing on stale dist; now GREEN against the freshly-rebuilt clean bundle). Wave 1 (next): promote tests/uat/prototype/{extension-page-harness.html, extension-page-harness.ts,a6.test.ts} to tests/uat/ via `git mv`; update vite.test.config.ts rollup input. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-18 14:54:41 +02:00
Mark	c647f61553	wip(01-11): prototype — A6 via test-page+bridge+synthetic-stream PASSES Plan 01-11 orchestrator commissioned a research+prototype investigation into whether full MV3 UAT automation is feasible with the architecture: extension-internal test page + chrome.runtime.sendMessage bridge + synthetic MediaStream (canvas-captureStream + getSettings override). EMPIRICAL VERDICT: feasible BUT plan 01-11 needs architectural revision. Architectural findings (with proof): 1. DYNAMIC IMPORT BLOCKED IN MV3 SW. Top-of-module `await import('../test-hooks/sw-hooks')` in src/background/index.ts silently kills the SW (chunk loads, await never resolves, no production listeners register, no console output). This is by design per Chromium docs (es_modules.md) + w3c/webextensions#212. The Plan 01-11 RESEARCH §6 architecture was wrong for the SW side. Workaround in this prototype: REMOVE the SW-side gated dynamic import. SW-side test hooks need a different design (see verdict). 2. OFFSCREEN-SIDE DYNAMIC IMPORT WORKS. Offscreen is a DOM document, not a SW, so top-level await + dynamic import behave normally. The offscreen-hooks.ts gated import succeeds; installFakeDisplayMedia is installed eagerly at module load. 3. EXTENSION-INTERNAL PAGE HAS FULL chrome.* SURFACE. Reachable via chrome-extension://<id>/tests/uat/prototype/extension-page-harness.html (added as rollup input in vite.test.config.ts). The page can call chrome.action.getBadgeText, chrome.action.getPopup, chrome.offscreen .createDocument, chrome.notifications.getAll, chrome.runtime .sendMessage — everything needed for A6. 4. NO 'tabs' PERMISSION → tab.url IS UNDEFINED. Production startVideoCapture's `chrome.tabs.query({active:true})` check (`if (!tab.id \|\| !tab.url) throw`) fails because the manifest lacks the 'tabs' permission. Prototype workaround: bypass startVideoCapture by sending START_RECORDING directly to offscreen. The Bug B contract being tested is independent of how recording starts; it only depends on the RECORDING_ERROR routing path. 5. SYNTHETIC MEDIASTREAM WORKS. installFakeDisplayMedia builds a canvas-captureStream MediaStream + monkey-patches the video track's getSettings() to report displaySurface: 'monitor'. Production code's post-grant validation passes. getDisplayMedia returns the synthetic stream immediately — no picker, no headless flakiness. A6 prototype result (with Bug B fix in place — current HEAD state): [PASS] SETUP: badge becomes REC after start [PASS] A6.1: badge text is '' (NOT 'ERR') after user-stop [PASS] A6.2: popup is '' (NOT manifest default) after user-stop [PASS] A6.3: NO recovery notification fired (count delta === 0) [PASS] A6.4: isRecording=false (via badge proxy) A6 prototype result (with Bug B fix rewound to `if (false)`): [PASS] SETUP: badge becomes REC after start [FAIL] A6.1: badge text is '' (got "ERR") [FAIL] A6.2: popup is '' (got chrome-extension://.../popup/index.html) [FAIL] A6.3: notif delta = 0 (got 1) [PASS] A6.4: isRecording=false ← false-positive (badge='ERR' not 'REC') The Bug B regression rewind cycle proves the harness CAN catch regression: 4/5 checks turn RED on rewind, 5/5 turn GREEN with the fix restored. Files in this commit: - tests/uat/prototype/extension-page-harness.{html,ts} — the harness page (chrome-extension URL, exposes window.__mokoshHarness.assertA6) - tests/uat/prototype/a6.test.ts — Puppeteer driver (~270 lines) - tests/uat/prototype/probe_*.mjs — diagnostic probes used to isolate the SW dynamic-import blocker (probe_sw.mjs is the key one) - src/test-hooks/offscreen-hooks.ts — added installFakeDisplayMedia + dispatchEndedOnTrack + __mokoshOffscreenQuery bridge handler + auto- install at module load - vite.test.config.ts — added prototype harness page as rollup input; added modulePreload.polyfill=false (red herring; harmless) - src/background/index.ts — removed the broken SW-side gated dynamic import (this is the BLOCKER unblocker — production 01-11 plan needs to redesign SW-side test hooks before re-spawning) Bundle hygiene: prototype runs against dist-test/; production dist/ remains hook-free (Tier-1 grep gate still GREEN, verified via no-test-hooks-in-prod-bundle.test.ts in the unit test suite). Vitest baseline: 89/89 GREEN preserved. Runtime: ~7 seconds end-to-end (launch Chrome + open page + ensure offscreen + start recording + dispatch ended + settle + assert). See: research return for VERDICT + recommended next step. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-18 12:06:08 +02:00
Mark	f44ca3afba	wip(01-11): wave-3 partial — A1+A4 attempted, popup-bridge SW state query unreliable Task 4 of Plan 01-11 attempted A1-A4 wiring. Empirical run reveals an architectural blocker that needs orchestrator-level decision. Current state after this commit (SKIP_PROD_REBUILD=1 npx tsx tests/uat/harness.test.ts): - A0 [PASS]: production bundle hook-leak grep gate (17ms) - A1 [FAIL]: SW bootstrap → setIdleMode — popup state never transitions to '' despite keepalive ping + 3s waitFor. chrome.action.getPopup({}) from the popup page consistently returns the manifest default (chrome-extension://<id>/src/popup/index.html), not the '' that setIdleMode's chrome.action.setPopup({popup:''}) should produce. - A2 [FAIL]: toolbar onClicked — badge never transitions to "REC" after page.triggerExtensionAction(extension); 8s timeout. Either the toolbar action isn't reaching the SW listener, OR getDisplayMedia's picker isn't resolving in headless mode (despite the auto-select flag). - A3 [FAIL]: offscreen target never appears (correlates with A2 — no recording started, no offscreen document spawned). - A4 [PASS]: trivially passes (offscreen count is 0 → 0, both before + after the click). Not a true assertion of behavior; would also pass if the whole extension were broken. - A5-A13: stubbed RED per plan. Architectural blocker (Rule 4 — needs orchestrator decision): - Puppeteer 25.0.2 + Chrome 148 + headless cannot reliably keep the MV3 SW alive long enough OR expose its real chrome.* state to a popup page query. The popup-bridge architecture (Task 3 commit `dbd977c`) works for synchronous bridge queries (snapshot, fire-on-startup) but does NOT reliably reflect chrome.action.setPopup / setBadgeText state changes initiated by the SW. Three plausible paths forward (need orchestrator pick): Option A — Content-script bridge: inject a content script that bridges chrome.* queries to a webpage's window.* RPC surface; harness uses page.evaluate against the content script instead of popup.evaluate. Pros: content scripts have stable lifetime tied to the page they're injected into. Cons: content scripts have DIFFERENT chrome.* surface (no chrome.action API surface — they can't read getBadgeText / getPopup at all). Likely DOESN'T solve the underlying problem. Option B — Headful with Xvfb on CI: relax the headless requirement; accept Xvfb dependency. Per Plan 01-11 RESEARCH §3, RESEARCH claimed headless works on Chrome 148 — empirical refutation here. Pros: SW lifetime is more stable in headful mode; setPopup propagation is reliable. Cons: introduces Xvfb dep that RESEARCH explicitly said wasn't needed; CI complication. Option C — Shrink harness scope to bridge-able assertions: A0 (grep gate), A8 (Bug A onStartup via bridge), A9 (icon sizes via popup fetch), A10 (manifest via popup), A13 (zip shape — operator runs SAVE_ARCHIVE manually + drops zip to a known path; harness reads it). Skip A1-A7, A11, A12 (the ones that require live SW state observation through chrome.action API). Pros: ships the bug-A-coverage portion of the harness today; keeps Plan 01-09's Task 5 operator-checkpoint partly automated. Cons: doesn't retire operator entirely; Plan 01-09 stays open on operator-empirical A1-A7. Option D — Switch to WebDriver BiDi (the Puppeteer 25 alternative backend): Puppeteer 25 supports BiDi via {protocol: 'webDriverBiDi'}. BiDi may handle extension SW evaluation differently (different isolation model). Speculative — no empirical evidence either way. What landed cleanly: - Tier-1 hook-leak grep gate (T-1-11-01) GREEN: dist/ has zero __mokoshTest / simulateUserStop / getSegmentCount / setCurrentStream / setSegmentCountGetter / __mokoshTestQuery / __mokoshKeepalive occurrences after npm run build. - Two-bundle infrastructure (dist/ vs dist-test/) operational. - Bridge handler in sw-hooks.ts works for snapshot + fire-on-startup + handler-types ops (verified by no-hang on keepalivePing call). - Existing 89-test vitest baseline preserved (no regression from any Wave 0/1/2/3 work). Verification: - npx tsc --noEmit (src/): exit 0 - npx tsc --noEmit -p tests/uat: exit 0 - npm run build: exit 0; dist/ hook-free - SKIP_BUILD=1 npx vitest run: 89/89 GREEN - SKIP_PROD_REBUILD=1 npx tsx tests/uat/harness.test.ts: 2/14 passed (A0 + A4-trivially), 12 FAIL — non-zero exit as expected. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-18 09:24:06 +02:00
Mark	dbd977c815	feat(01-11): wave-2 — Puppeteer harness scaffolding + A0 GREEN, popup-bridge architecture Task 3 of Plan 01-11 (Puppeteer UAT harness). Harness file tree (tests/uat/): - harness.test.ts: tsx-runnable top-to-bottom harness entry point. Runs A0 inline (filesystem grep gate, abort-on-fail T-1-11-01), then launches Chrome + opens popup bridge + queries manifest, then iterates A1-A13 stubs. Each stub throws "NOT YET IMPLEMENTED — Plan 01-11 Task N wires this assertion". Exit code = 0 on full pass, 1 otherwise. Final line: "UAT harness: N/14 assertions passed". - lib/launch.ts: launchHarnessBrowser() — wraps puppeteer.launch with enableExtensions:[dist-test/], headless default (HEADLESS=0 override), --no-sandbox + --auto-select-desktop-capture-source flags. Polls browser.extensions() until the extension registers (empirically ~100ms but the first call right after launch returns Map(0)). Opens both a blank page (for triggerExtensionAction) AND the popup page (the bridge surface). Returns { browser, extension, extensionId, sw, downloadsDir, page, popup }. - lib/extension.ts: waitForOffscreenTarget + attachToOffscreen + countOffscreenTargets. Offscreen attach uses target.type() === 'background_page' + .asPage() (NOT .page() — RESEARCH §4 Pitfall 1). - lib/sw.ts: chrome.* state queries via the POPUP page handle (NOT the WebWorker handle — see architecture note below). getBadgeText, getPopup, getManifest, getIconSize, getIsRecording (side-channeled through badge text), fireOnStartup (via __mokoshTestQuery bridge), sendSyntheticRecordingError, getNotificationSnapshot (via bridge), keepalivePing (no-op message to wake SW for ~30s). - lib/offscreen.ts: getDisplaySurface, simulateUserStop (the dispatchEvent('ended') path per RESEARCH §7 BLOCKER — DO NOT REFACTOR to track.stop()), getSegmentCount. - lib/assertions.ts: runAssertion(idx, name, buffers, fn) wrapper — records pass/fail/duration; on failure dumps last 30 lines of SW + offscreen console buffers to stderr before rethrowing. assertEqual / assertMatch / assertTrue / assertGte / waitFor polling helper. - lib/zip.ts: jszip-based assertArchiveShape + extractEntryToFile for assertions 12 + 13. - README.md: runtime + local-debug + CI semantics + locale gotcha + dev-dep size note + assertion catalog table. - tsconfig.json: per-tree type-check config (mirrors root tsconfig.json compiler options but includes the harness tree explicitly). Architecture refinement (DEVIATION from RESEARCH §1 — Rule 1+3 inline fix): - RESEARCH §1 sketched `sw.evaluate(() => chrome.action.getBadgeText({}))` as the chrome.* query path. Empirical probes during Task 3 execution against Puppeteer 25.0.2 + Chrome 148 + --headless=true revealed two blockers: 1. Puppeteer's WebWorker.evaluate runs in an ISOLATED WORLD that carries SW globals (clients, registration, ...) but NOT the extension's full chrome.* API surface. Object.keys(chrome) inside sw.evaluate returns ["loadTimes","csi"] — the public webpage chrome, not the extension chrome. 2. Chrome 148's headless mode aggressively suspends MV3 service workers; subsequent swTarget.worker() calls return "Protocol error: No target with given id found". - WORKAROUND: open the popup page (chrome-extension://<id>/src/popup/ index.html) as a separate Puppeteer Page. The popup has full chrome.* access (it's an extension context with same privileges as the SW) AND stable Puppeteer lifetime. For SW-globalThis state (__mokoshTest in the SW isolate, NOT in the popup), bridge via chrome.runtime.sendMessage. The popup sends { type: '__mokoshTestQuery', op: 'snapshot' \| 'fire-on-startup' \| 'handler-types' }; the SW hook's onMessage handler responds. - Bridge implementation added to src/test-hooks/sw-hooks.ts — registers AFTER the production listeners so it never intercepts production messages (__mokoshTest* type is unambiguously test-only). Tier-1 grep gate (no-test-hooks-in-prod-bundle.test.ts) continues to enforce ZERO __mokoshTest occurrences in dist/ — the bridge handler is tree-shaken alongside the rest of the hook module via the __MOKOSH_UAT__ gate. Other configuration changes: - vitest.config.ts: exclude tests/uat/** from vitest discovery. The Puppeteer harness is invoked via `npm run test:uat` (not vitest); running it under vitest would try to launch real Chrome inside a vitest worker. The .test.ts suffix is retained for editor + naming-convention consistency with the rest of the tree. Verification: - npx tsc --noEmit (src/): exit 0 - npx tsc --noEmit -p tests/uat: exit 0 - npm run build: exit 0 - grep -rln '__mokoshTest\|simulateUserStop\|getSegmentCount\|setCurrentStream\|setSegmentCountGetter\|__mokoshTestQuery\|__mokoshKeepalive' dist/: ZERO matches - npm run build:test: exit 0; dist-test/ populated with the new bridge code - SKIP_BUILD=1 npx vitest run: 89/89 GREEN - SKIP_PROD_REBUILD=1 npx tsx tests/uat/harness.test.ts: → A0 [PASS]: production bundle has no test-hook leaks (19ms) → Browser launches; popup opens; manifest read succeeds → A1-A13 [FAIL]: NOT YET IMPLEMENTED — Plan 01-11 Task N wires this → "UAT harness: 1/14 assertions passed, 13 failed (first failure: A1)" → Exit code: 1 (expected — 13 RED stubs intentional) Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-18 09:14:58 +02:00
Mark	cb1a729962	feat(01-11): wave-1 — gated test hooks for SW + offscreen, dist/ stays hook-free Task 2 of Plan 01-11 (Puppeteer UAT harness). Test hook surface: - src/test-hooks/types.ts: canonical MokoshTestSurface — handlers (onClicked, onStartup, notificationOnClicked), notificationCount, lastNotificationOptions<true>, notificationIds, getCurrentStream, getSegmentCount. globalThis.__mokoshTest ambient declaration. - src/test-hooks/sw-hooks.ts: SW-side hook. Monkey-patches addListener on chrome.action.onClicked / chrome.runtime.onStartup / chrome .notifications.onClicked to capture handler refs while chaining to the original. Wraps chrome.notifications.create across all four overload shapes (id+options+cb, options+cb, id+options→Promise, options→Promise) to increment notificationCount, save lastNotificationOptions, push resolved id into notificationIds. - src/test-hooks/offscreen-hooks.ts: offscreen-side hook. Exports setCurrentStream + setSegmentCountGetter; the recorder calls both inside startRecording after the mediaStream + segments assignments. getCurrentStream getter closes over the cell so the harness reads the live MediaStream for displaySurface inspection + 'ended' dispatch (Bug B BLOCKER per RESEARCH §7). - tests/uat/lib/test-hook-contract.d.ts: manual harness-side mirror of MokoshTestSurface (decoupled from src/ to keep tests/ import-clean per RESEARCH §11 resolution 5; drift risk documented inline). Production-side wires (gated by __MOKOSH_UAT__ token): - src/background/index.ts top-of-module: `if (__MOKOSH_UAT__) { await import('../test-hooks/sw-hooks'); }`. MUST run before any chrome.* addListener call below — top-of-module placement satisfies this. - src/offscreen/recorder.ts top-of-module: symmetric gated dynamic import + module-scoped testHooks reference. - src/offscreen/recorder.ts inside startRecording (after mediaStream assignment): `if (__MOKOSH_UAT__) { testHooks?.setCurrentStream(stream); testHooks?.setSegmentCountGetter(() => segments.length); }` - src/offscreen/recorder.ts inside onUserStoppedSharing (after mediaStream = null): `if (__MOKOSH_UAT__) { testHooks?.setCurrentStream(null); }` — T-1-11-05 (Repudiation: stale stream ref) mitigation. Build-time token wiring: - vite.config.ts: declares `define: { __MOKOSH_UAT__: 'false' }` (prod default) + bumps `build.target: 'es2022'` so the top-level await in the gated dynamic imports compiles (MDN: Chrome 89 / Edge 89 / Firefox 89 / Safari 15 support TLA; MV3 floor Chrome 88 is effectively Chrome 89+ in field — comfortably inside the envelope). - vite.test.config.ts: overrides `define: { __MOKOSH_UAT__: 'true' }` so the test bundle has the hooks active. - vitest.config.ts: declares `define: { __MOKOSH_UAT__: 'false' }` for vitest's own source-loading runs. CRITICAL — without this, vitest would throw `ReferenceError: __MOKOSH_UAT__ is not defined` when loading src/background/index.ts; OR if we'd used `import.meta.env.MODE === 'test'` (RESEARCH §6's initial guidance), vitest's default MODE='test' would have ACTIVATED the hooks under unit tests + clobbered every existing vi.fn() chrome.notifications.create mock. The dedicated `__MOKOSH_UAT__` token sidesteps both failure modes cleanly — a refinement on RESEARCH §6 documented in the comment preambles of all three configs. - globals.d.ts: declares `__MOKOSH_UAT__: boolean` ambient so `npx tsc --noEmit` passes without per-file annotations. - tsconfig.json: include adds `globals.d.ts`. Notification options generic refinement: - chrome.notifications.NotificationOptions is declared with a `<true \| false>` generic distinguishing "create" (all required — true) from "update" (all optional — false). Plan 01-11's production code always uses the create shape; types.ts + sw-hooks.ts pin to `NotificationOptions<true>` so the harness reads iconUrl etc. as definitely-present. Verification: - npx tsc --noEmit: exit 0 - npm run build: exit 0 - grep -rln '__mokoshTest\\|simulateUserStop\\|getSegmentCount\\|setCurrentStream\\|setSegmentCountGetter' dist/: ZERO matches (Tier-1 gate stays GREEN) - npm run build:test: exit 0; dist-test/ emits separate sw-hooks-.js + offscreen-hooks-.js chunks (the gated dynamic imports survive tree-shaking when __MOKOSH_UAT__ === true) - grep -rln '__mokoshTest' dist-test/: 2 matches (assets/sw-hooks-.js + assets/offscreen-hooks-.js) - SKIP_BUILD=1 npx vitest run: 89/89 GREEN (83 baseline + 6 Tier-1 hook-leak surfaces) - sw-bundle-import.test.ts: GREEN (the gated dynamic import does not break production module init — the `if (false)` branch is never reachable so the await + import are dead code in dist/) In-flight bugs auto-fixed (Rule 1 + Rule 3): - Rule 3: original RESEARCH §6 plan called for `import.meta.env.MODE === 'test'` as the gate; switched to `__MOKOSH_UAT__` define-token after observing vitest contamination (vitest defaults MODE='test' → hooks activated under unit tests → 8 existing tests broke with "Cannot read properties of undefined (reading 'calls')" because the hook wrapper replaced vi.fn() mocks). Documented in the comment preambles of all three configs as a refinement on RESEARCH §6. - Rule 3: esbuild rejected TLA against the default ES2020 target; bumped to es2022 (Chrome 89+ supports TLA per MDN — inside MV3 envelope). Recorded in vite.config.ts preamble. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-17 22:46:26 +02:00

16 Commits