mokosh

Author	SHA1	Message	Date
Mark	a63066a289	chore(01-13): wave-0 — clean broken Approach-A artifacts per 01-11-SUMMARY Restore a clean baseline before promoting the `c647f61` prototype to production paths (Wave 1) and building out Approach-B driver scaffolding (Wave 2). All deletions trace back to falsifications documented in 01-11-SUMMARY.md. Deleted — broken Approach-A files: - src/test-hooks/sw-hooks.ts MV3 SW blocks dynamic import (Chromium es_modules.md; w3c/webextensions#212). The gated `await import('../test-hooks/ sw-hooks')` from 01-11 Wave 1 never resolved → SW silently died → production listeners never registered. File was dead-on-arrival; no fix possible while MV3 SWs disallow dynamic import. Approach-B replaces SW-side instrumentation with the extension-internal harness page's chrome.action.* + chrome.notifications.* surface (full privilege; no monkey-patching needed). - tests/uat/lib/{launch,extension,sw,offscreen,assertions}.ts Popup-bridge architecture (01-11 `dbd977c`) — falsification 2 + falsification 3 in 01-11-SUMMARY: `sw.evaluate` exposes only chrome.{loadTimes,csi}, NOT chrome.action.* / chrome.notifications.* / chrome.runtime.sendMessage; setPopup-juggling for extension-id resolution turned out to be unnecessary (browser.extensions() works directly per the prototype). These files will be reborn in Wave 2 around the extension-page architecture. Kept: tests/uat/lib/zip.ts (host-side JSZip work — architecture- agnostic; A12+A13 still use it) and tests/uat/lib/test-hook- contract.d.ts (type mirror — extended in Wave 3 but kept as-is here). - tests/uat/prototype/probe_{offscreen,sw,tabs,tabs2}.mjs Feasibility-research probes (01-11 spike) that empirically falsified the Approach-A hypotheses. The findings are encoded in 01-11- SUMMARY.md; the probes themselves are dead code. - tests/uat/harness.test.ts 01-11 Wave 2 popup-bridge orchestrator (`dbd977c`). Imports the now-deleted tests/uat/lib/{assertions,extension,sw,offscreen,launch} modules — would not typecheck after this commit. Reborn in Wave 3A as the Approach-B orchestrator (extension-internal page driver + A0 grep gate + 13 assertion drivers). Reverted — SW-side dynamic-import gate comment block: - src/background/index.ts lines 13-29 The existing comment block (post-spike) described the SW-side gated dynamic import that never landed. Rewritten to cite 01-13 Approach-B explicitly, link to 01-11-SUMMARY.md falsification, and clarify that the Tier-1 grep gate's enduring value is catching regressions in the offscreen chunk's __MOKOSH_UAT__ gate (the SW chunk is hook-free by construction). Updated — Tier-1 grep gate FORBIDDEN_HOOK_STRINGS inventory: - tests/background/no-test-hooks-in-prod-bundle.test.ts Removed: `simulateUserStop` (Approach-A naming; replaced by Approach-B `dispatchEndedOnTrack` which matches the W3C dispatchEvent semantics per RESEARCH §7 BLOCKER — track.stop() does NOT fire 'ended' per spec, so the simulation MUST use dispatchEvent). Added: `installFakeDisplayMedia`, `uninstallFakeDisplayMedia`, `dispatchEndedOnTrack`, `__mokoshOffscreenQuery`. Total inventory: 8 surface strings (was 5). Each MUST be absent from every file under dist/ post-build. Verification (all GREEN): - `npm run build` — exit 0; dist/ populated. - `grep -rln <forbidden> dist/` — 0 matches. - `npm run build:test` — exit 0; dist-test/ populated; offscreen-hooks chunk contains `installFakeDisplayMedia` (gate runs correctly against the test build's distinct artifact). - `npx tsc --noEmit` — exit 0 (root + tests/uat/tsconfig.json). - `npx vitest run` — 92/92 tests passing (was 89; the +3 new tests come from the FORBIDDEN_HOOK_STRINGS list expanding 5 → 8 — each forbidden string is one parametric `it(...)` block). Both prior-failing tests now GREEN: - tests/background/sw-bundle-import.test.ts (was missing dist/ → 92/92 requires the test run to have a current dist/; vitest gate test rebuilds via execFile when SKIP_BUILD≠1, otherwise relies on prior `npm run build`). - tests/background/no-test-hooks-in-prod-bundle.test.ts (was failing on stale dist; now GREEN against the freshly-rebuilt clean bundle). Wave 1 (next): promote tests/uat/prototype/{extension-page-harness.html, extension-page-harness.ts,a6.test.ts} to tests/uat/ via `git mv`; update vite.test.config.ts rollup input. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-18 14:54:41 +02:00
Mark	96fa8e8e11	chore(01-11): wave-0 — install puppeteer + tsx, add vite.test.config + Tier-1 hook-leak grep gate Task 1 of Plan 01-11 (Puppeteer UAT harness). - npm install --save-dev puppeteer@^25.0.2 tsx@^4 @types/node resolved: puppeteer@25.x, tsx@4.22.1, @types/node@25.8.0 pulls ~150MB Chromium binary at install time (T-1-11-03 — accepted, package-lock pins resolved hashes via @puppeteer/browsers). - package.json scripts: add build:test + test:uat (per RESEARCH §10 two-bundle orchestration); existing dev/build/preview/test untouched. - vite.test.config.ts: extends ./vite.config.ts via mergeConfig with mode:'test' + build.outDir:'dist-test' + emptyOutDir:true. Verified npm run build:test produces dist-test/ in 7.93s; npm run build keeps producing dist/ in 7.67s (no clobber). - tsconfig.json `include: ["src"]` already covers src/test-hooks/*/ via wildcard — no edit needed. - tests/background/no-test-hooks-in-prod-bundle.test.ts: Tier-1 gate mirroring sw-bundle-import.test.ts's execFile pattern. Greps the BUILT dist/ tree for 5 forbidden hook surfaces (one `it` per surface for granular failure isolation): __mokoshTest, simulateUserStop, getSegmentCount, setCurrentStream, setSegmentCountGetter. All 5 surfaces absent today (RED-then-GREEN polarity inverted — the gate is GREEN now and MUST stay GREEN after Task 2 lands the hooks). SKIP_BUILD=1 escape hatch for developer iteration. - .gitignore: add dist-test/ (no point versioning generated test bundle). Verification: - npx tsc --noEmit: exit 0 - npm run build: exit 0; dist/ populated (375.37 kB SW chunk) - npm run build:test: exit 0; dist-test/ populated (identical chunk sizes — the gated dynamic imports do not land until Task 2; this commit only proves the two-bundle plumbing) - SKIP_BUILD=1 npx vitest run tests/background/no-test-hooks-in-prod-bundle.test.ts: 6/6 GREEN (1 build-sanity + 5 forbidden-surface) - SKIP_BUILD=1 npx vitest run (full suite): 89/89 GREEN (83 baseline + 6 new Tier-1 surfaces = 89) Working-tree cleanup: a stale 5.4 MB tests/fixtures/last_30sec.webm (unrelated operator smoke regen present at session spawn) was stashed before running the baseline — it caused the webm-playback test to time out at 5s. After stashing back to HEAD's 1.9 MB fixture, baseline passes cleanly. Not committing the fixture restoration here (pre-existing working-tree state, not part of Task 1). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-17 21:42:46 +02:00
Mark	91b4475ea1	test(01-09): RED — Bug B route user-stopped-sharing → IDLE; other codes → ERROR Adds Tests E + F to tests/background/badge-state-machine.test.ts pinning the conditional-routing contract for RECORDING_ERROR onMessage: E (RED today): RECORDING_ERROR{error:'user-stopped-sharing'} must route through setIdleMode — badge OFF (text '', red #D32F2F), popup ''. The current handler routes ALL codes through setErrorMode, locking the operator out of restart (popup wins toolbar.onClicked forever). F (GREEN today, preserved after fix): RECORDING_ERROR with any other error code (representative: 'codec-unsupported') continues to route through setErrorMode — badge ERR + yellow #F9A825 + popup html. This is the defensive-fallback regression pin guarding against the patch over-rotating to IDLE for all codes. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-17 16:35:27 +02:00
Mark	2d7ff7d4e3	test(01-09): RED — toolbar-onClicked + badge state machine + onStartup notification + popup SAVE-only Plan 01-09 Task 3 RED — 13 tests across 3 new files: tests/background/toolbar-action.test.ts (5 tests): A: chrome.action.onClicked.addListener registered at SW init B: onClicked while not recording triggers startVideoCapture C: onClicked while isRecording does NOT double-start D: setPopup('') in OFF mode, popup html path in REC mode E: popup init does NOT send REQUEST_PERMISSIONS + saveButton enabled (W-02 fix — without jsdom, uses node-env document stub) tests/background/badge-state-machine.test.ts (4 tests): A: REC state = text 'REC' + #00C853 green + Recording title B: OFF state = text '' + #D32F2F red + Not recording title (fired at SW init via initialize → setIdleMode) C: ERROR state = text 'ERR' + #F9A825 yellow + error title D: RECORDING_ERROR onMessage triggers setBadgeText('ERR') within microtask tests/background/onstartup-notification.test.ts (4 tests): A: chrome.runtime.onStartup.addListener registered at SW load B: onStartup fires exactly one mokosh-startup- notification with basic type + 'Mokosh ready' title + Click-instructed message C: notifications.onClicked with mokosh- id clears + triggers START_RECORDING D: RECORDING_ERROR onMessage triggers mokosh-recovery- notification Task 4 will flip all 13 to GREEN by adding the listeners + state machine + helpers in src/background/index.ts, popup SAVE-only, manifest update. Deviation Rule 3: jsdom not in node_modules; refactored Test E to use a node-env document stub instead of @vitest-environment jsdom pragma.	2026-05-17 15:27:41 +02:00
Mark	e40949d1d2	test(01-08): regenerate last_30sec.webm fixture + split remux input/output fixtures Plan 01-08 Task 5 closeout. The post-B+ smoke run produced a working single-EBML WebM (28.76s, 676 frames, 1.89 MB, monotonic 0→28.76s timestamps). Operator-confirmed empirically (timer overlay in smoke HTML showed the latest frames matched expectations). Two-fixture split resolves a test-design conflict surfaced when last_30sec.webm flipped from pre-remux input shape to post-remux output shape: - tests/fixtures/last_30sec.webm — POST-REMUX output (single EBML, 41 ffmpeg dry-run lines). Validates webm-playback.test.ts' playable-duration + structural assertions. - tests/fixtures/raw-3ebml-concat.webm — PRE-REMUX input (3-EBML concat, 299 ffmpeg dry-run lines = 3 segment boundaries). Preserved from the original 2026-05-15 Phase 1 closure fixture. Used by webm-remux.test.ts to test that remuxSegments correctly transforms 3-EBML input → single-EBML output. tests/background/webm-remux.test.ts FIXTURE_PATH updated to point at raw-3ebml-concat.webm; the hardcoded EBML byte offsets [0, 509038, 970967] and frame bounds [905, 912] remain valid against that preserved input. Result: 64/64 vitest GREEN (was 61/64). tsc clean. Build exit 0. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-17 12:44:50 +02:00
Mark	761dfc0388	test(debug-01-08): extend Tier-1 gate to Layer 2 (exercises remuxSegments) The original Layer 1 gate (`74400ae`) verified module-init under SW-simulated globals. It did not exercise remuxSegments — the actual runtime code path the SW reaches on SAVE_ARCHIVE. Layer 2 imports webm-remux.ts as SOURCE in a spawned Node child under SW-simulated globals, invokes remuxSegments with a synthetic single-segment EBML payload, and classifies the outcome: - `ok` (returned a Blob) or `domain_error` (e.g. invalid EBML header — proves runtime path is structurally reachable) → PASS - `sw_incompat` (ReferenceError for Node globals, EvalError / unsafe-eval for CSP) → FAIL with the specific error surfaced This is the gate that empirically caught the ts-ebml Buffer issue addressed by the preceding polyfill commit; it closes the loop between "bundle loads" (Layer 1) and "bundle works at runtime" (Layer 2). Polyfill-aware design: Layer 2 leaves `Buffer` AVAILABLE in the child env (split strip list: SW_SOURCE_STRIP_GLOBALS omits 'Buffer'). The vite-plugin-node-polyfills rewrite is BUNDLER-LEVEL (Buffer → imported polyfill chunk) and does not apply when source is loaded outside Vite. Leaving Buffer available faithfully models what the polyfilled bundle provides at SW runtime, while keeping the classifier ready to flag Buffer regressions if the polyfill ever gets removed. `process`/`window`/`document` remain stripped (polyfill is configured globals.process: false; SW genuinely lacks DOM). Node 24 native TS transform (`--experimental-transform-types`) is used for source loading; a tiny inline resolution hook appends `.ts` to extensionless relative specifiers, mimicking vite/rollup's extension policy. Hook is base64-encoded as a data: URL so the test stays self-contained (no on-disk hook file). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-17 12:18:04 +02:00
Mark	74400ae6ac	test(debug-01-08): complete SW-bundle-import gate — mock chrome.* surface The Tier-1 SW-bundle-loadability gate (`c75854c`) stripped Buffer/process/window/document from the spawned Node isolate but did not mock chrome.. A correctly-bundled SW that reaches addListener calls at module init would (correctly) progress to chrome.runtime.onMessage.addListener(...) and throw ReferenceError because chrome was undefined — a false-positive RED. This commit adds a minimal Proxy-based chrome. stub that no-ops any chrome.<api>.<method>(...) chain. The gate now verifies what its file-header comment claims: "bundled artifact reaches module-init completion under SW-simulated globals." RED->GREEN: the gate now correctly passes against the post-fix bundle and would catch any future regression in SW bundle-loadability. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-17 11:16:05 +02:00
Mark	c75854cbef	test(debug-01-08): RED Tier-1 SW-bundle-loadability gate + corrected hypothesis Adds tests/background/sw-bundle-import.test.ts that loads the built SW chunk under SW-simulated globals (Buffer/process/window/document stripped) via a spawned Node child process. Pins the orchestrator-side gap that caused Plan 01-08's SW init crash: the prior deps test only checked SOURCE packages under default Node globals, never the bundled output, so Vite/Rollup's CJS-interop bug (tree-shaking the `ebml` package while leaving a dangling `{tools:f}=Pc` destructure against an empty Pc) went undetected until operator empirical smoke. RED against HEAD `aabbd0c` — failure surfaces the exact production error ("Cannot read properties of undefined (reading 'readVint')"), proving the test is a true regression gate, not a tautology. Also rewrites .planning/debug/01-08-sw-incompatibility.md to reflect the actual root cause (Vite/Rollup CJS interop) rather than the orchestrator's initial falsified hypothesis (new Function + Buffer globals — disproven by Node simulation showing the throw fires at module-init line 12:33809 before any CSP-eval or Buffer-ref code path executes). Full vitest: 60 passing + 3 RED (this gate + the 2 pre-existing Task 5 fixture-dependent duration tests). No regressions. Per feedback-pre-checkpoint-bundle-gates.md (auto-loaded memory): any future plan executor whose work surfaces a SW must run this test before any operator-empirical checkpoint. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-17 09:52:45 +02:00
Mark	407e683e9b	test(01-08): RED unit tests for remuxSegments — single-EBML + monotonic + frame-count + size + empty - 5 RED tests pinning the contract for src/background/webm-remux.ts (created in Task 3). All fail with "module missing" today — the Task 3 GREEN gate. - Test 1: exactly 1 EBML header + 1 Segment magic in output. - Test 2: output size within [0.7x, 1.3x] of input sum. - Test 3: ffprobe format=duration >= 25_000 ms (skip-if-no-ffprobe). - Test 4: ffprobe -count_frames in [905, 912] (per-seg sum 912 ± 3 boundary partial-frame drops, I-01 tightening). - Test 5: empty input -> empty Blob (defense-in-depth). - Fixture sliced at d13-confirmed byte offsets (0 / 509038 / 970967); verified against committed last_30sec.webm at Task 2 land time. - Baseline counts: 13 files / 62 tests / 7 failed (2 webm-playback + 5 new webm-remux) \| 55 passed. tsc exit 0.	2026-05-17 09:23:53 +02:00
Mark	503531485c	feat(01-08): install ts-ebml + webm-muxer; pin SW-compat via deps test - Add ts-ebml ^3.0.2 (parse half) and webm-muxer ^5.1.4 (write half) per CONTEXT.md amendment D-14-remux; both MIT, both verified SW-compatible in the d13 debug-session library survey. - tests/background/webm-remux-deps.test.ts pins two contracts: (a) named exports surface (Muxer + ArrayBufferTarget + Decoder). (b) both libraries import cleanly when window/document are absent on globalThis — guards the published dist against accidentally acquiring DOM globals on the hot path that would crash the Chrome service-worker runtime. - Note: webm-muxer 5.1.4 upstream-deprecated in favor of Mediabunny; the pinned version still meets the d13 architectural requirement (single-EBML output via addVideoChunkRaw). Migration to Mediabunny is out of scope for Plan 01-08 and would require a new ADR. - Baseline 53 GREEN + 2 new GREEN; tsc clean; 2 webm-playback duration RED still pending (drive to GREEN in Tasks 3-5).	2026-05-17 09:22:46 +02:00
Mark	246eadb2ef	test(option-c): continuous 600 s port lifecycle pinning contract Implements Option C step 3 per .planning/debug/empty-archive-port-race.md: "Continuous end-to-end vitest covering 600 s of port lifecycle (2 reconnects + simulated REQUEST_BUFFER round-trips). Becomes the new pinning contract for the port lifecycle." The UAT Test 3 BLOCKER surfaced because no test exercised the full operator timeline — 5+ minute recording with port-replacement windows crossing real SAVE_ARCHIVE round-trips. This file pins that contract end-to-end at the unit-test level. What's exercised: - Both SW (src/background/index.ts) and offscreen recorder (src/offscreen/recorder.ts) loaded into the SAME chrome stub, with paired port-pair factory (one connect() yields offPort + swPort that talk to each other through captured listeners). - 12 ping/pong cycles (~300 s simulated wall-clock). - 3 SAVE_ARCHIVE round-trips (one before reconnect, two after each of the two forced reconnects). - 2 EXTERNAL port disconnects (port._disconnected=true) — simulates the SW eviction / port glitch path that the H1.b test pins. - JSZip mocked at file scope (vi.mock) because Node 22+ JSZip can't read native Blobs — preserves integration shape (size accounting) without depending on JSZip's Node compatibility. Final assertions: 1. All 3 saveArchive calls return success:true. 2. EVERY BUFFER message that crossed the wire carried segments (no silent-loss path was reachable). 3. PONGs round-tripped (proves health-probe loop closes). Suite: 53 GREEN / 53 tests. tsc --noEmit exit 0; type-safety grep clean; npm run build exit 0. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-16 14:53:47 +02:00
Mark	4306d59dfd	test(option-c): RED gate for request-id'd port protocol + health probe + error surface Per .planning/debug/empty-archive-port-race.md "Fix Strategy: Option C (Architectural)", land RED tests that pin the 4 sub-behaviours the refactor must satisfy at the unit level. These complement the operator- facing contract already pinned by port-reconnect-race.test.ts (H1+H2). Offscreen side (tests/offscreen/port-health-probe.test.ts): A. Bootstrap installs no 290_000 ms pre-emptive reconnect timer (the timing-based race window from `b064a21` is gone). B. Missed PONGs (5 PINGs without echo) trigger a clean reconnect via the same path the onDisconnect handler uses. C. PONG echoes within timeout keep the port alive indefinitely (counter-test for over-eager probe — already GREEN today). D. REQUEST_BUFFER with requestId → BUFFER response echoes the same id (the architectural mechanism that retires cross-talk). SW side (tests/background/request-id-protocol.test.ts): 1. getVideoBufferFromOffscreen sends REQUEST_BUFFER with a generated uuid requestId on the live videoPort. 2. Stale BUFFER (mismatched requestId) is ignored — no resolution. 3. Port replacement mid-request → SW re-issues REQUEST_BUFFER on the new port with the SAME requestId. Retires the H2 silent-drop class. 4. Empty video segments → saveArchive returns {success:false, error} (operator-visible) instead of {success:true} with no-video archive. 5. SW echoes PONG on PING, closing the health-probe loop. Suite status: 10 files / 52 tests (42 GREEN, 10 RED). - 40 baseline + 2 new GREEN (port-health-probe C; request-id 2 & 4 accidentally pass due to test-stub side effects — they will continue to pass after fix for the right reasons). - 3 RED in port-reconnect-race + 3 RED in port-health-probe + 4 RED in request-id-protocol. Quality gates: tsc --noEmit exit 0; type-safety grep clean. No production code touched in this commit — fix lands in subsequent commits. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-16 14:27:17 +02:00

12 Commits