WI: Implement visual testing primitives (screenshot + assertion + input injection) #184
Closed
opened 2026-05-24 05:13:25 +00:00 by toasterson
·
0 comments
No Branch/Tag specified
main
claude/wi-019dd344-push-diff-re-dispatch-d1-019db1b9-c06f-a
claude/wi-019dda52-pre-dispatch-decision-completeness-check
claude/wi-019ddfe9-audit-cleanup-orphaned-wis-without-plans
claude/wi-019ded3d-backend-extend-listprojectsresponse-with
claude/wi-019dff17-anima-agent-invalidate-stale-wip-snapsho
claude/wi-019dff20-promote-loop-push-diff-race-gate-promote
claude/wi-019e93df-mcp-ambient-get-work-item-detail-full-wi
claude/wi-019e0d32-replace-dockerfile-cargo-build-with-sols
claude/wi-019dd4db-mcp-expose-teamservice-issuepat-as-an-ad
claude/wi-019dd4db-mcp-add-ambient-post-message-variant-wor
claude/wi-019ddfd9-mcp-tool-time-bounded-out-of-hours-sessi
claude/wi-019ddfed-bug-ai-sessions-accumulate-as-pending-in
claude/wi-019de021-bug-team-view-has-no-ui-to-promote-demot
claude/wi-019de052-bug-open-prs-placeholder-pr-s-change-row
claude/wi-019dff16-anima-agent-raise-agent-max-concurrent-t
claude/wi-019dff20-wi-lifecycle-validate-phase-strings-serv
claude/wi-019e0d25-dockerfile-add-cache-mounts-for-cargo-re
claude/wi-019e0d25-dockerfile-add-apt-cache-mount-to-builde
claude/wi-019e57a6-p0-5-render-diff-harness-extend-snapshot
claude/wi-019e8873-native-solstice-ci-dispatch-send-explici
claude/wi-019e8f7d-merge-engine-commits-unresolved-conflict
claude/wi-019e8f7e-needs-rebase-re-dispatch-is-dead-on-arri
claude/wi-019e8f7e-post-merge-gap-analysis-runs-with-empty
claude/wi-019e8f7e-anima-agent-restore-daily-spawn-cap-on-c
claude/wi-019e505a-akh-medu-assert-batch-is-non-functional
claude/wi-019e533a-research-deployments-system-architecture
claude/wi-019e57a6-p2-screen-inference-all-5-pages-shared-c
claude/wi-019e57a6-p1-hybrid-refinement-loop-converge-to-2
claude/wi-019e590b-build-qa-runner-cli-for-ci-integration
claude/wi-019e796e-detect-anthropic-max-tokens-truncation-m
claude/wi-019e89ec-solstice-git-fetch-auth-parked-only-need
claude/wi-019e533b-phase-2-helm-provider-for-k8s-self-deplo
claude/wi-019e533b-phase-3-docker-compose-macos-native-prov
claude/wi-019e533b-phase-4-deployment-history-audit-trail-c
claude/wi-019e534c-add-create-project-mcp-tool-to-anima
claude/wi-019e57a6-p4-production-hardening-regression-safet
claude/wi-019e590b-create-anima-qa-crate-in-anima-workspace
claude/wi-019e590b-build-wayland-dev-vm-base-image-debian-b
claude/wi-019e590b-implement-yaml-test-spec-parser-and-exec
claude/wi-019e590b-integrate-qa-system-with-solstice-ci-and
claude/wi-019e590b-neocde-shell-startup-smoke-test-suite
claude/wi-019e796e-spawn-companion-startrun-dispatch-as-det
claude/wi-019e6070-add-mcp-tool-for-createproject-enable-gr
claude/wi-019e590b-wayray-compositor-smoke-test-suite
claude/wi-019e796e-make-review-wi-creation-failure-fatal-to
claude/wi-019e79d9-auto-run-gap-analysis-as-a-post-merge-pi
claude/wi-019e590b-anima-desktop-ui-smoke-test-suite
claude/wi-019e846d-catch-unwired-dead-code-pub-in-pub-mod-h
claude/wi-019e846d-in-server-merge-engine-hardening-anima-g
claude/wi-019e89ec-native-solstice-dispatch-inject-git-cred
claude/wi-019e57a6-p0-3-build-deterministic-css-to-slint-te
claude/wi-019e796d-implement-schedule-gate-working-hours-ev
claude/wi-019e590b-implement-visual-testing-primitives-scre
claude/wi-019db1bb-doc-system-a1-workspace-provisioner-proj
claude/wi-019db1bb-doc-system-a4-to-graph-queue-api-rest-mc
claude/wi-019dc9e9-anima-agent-re-adopt-orphan-orch-contain
claude/wi-019d9caa-dashboard-add-progress-indicators-to-act
claude/wi-019d9cab-work-item-add-sender-avatar-uniform-time
claude/wi-019db1bc-doc-system-a7-kg-explorer-sessions-tab-i
claude/wi-019db1bc-doc-system-i3-admin-ui-model-registry-ro
claude/wi-019dc57c-p1-5-seshat-ingest-hook-transcripts-libr
claude/wi-019d9caa-dashboard-add-recent-activity-right-side
claude/wi-019d9caa-projects-switch-from-vertical-list-to-3
claude/wi-019d9caa-projects-show-last-updated-x-ago-per-car
claude/wi-019d9cab-project-detail-add-right-sidebar-decisio
claude/wi-019d9cab-project-detail-extend-breadcrumb-to-incl
claude/wi-019d9cab-work-item-add-meta-row-assigned-to-risk
claude/wi-019d9cab-work-item-replace-send-with-split-button
claude/wi-019d9cab-cross-cutting-align-sidebar-brand-label
claude/wi-019d9cab-design-gap-5-slint-scenarios-have-no-pen
claude/wi-019dabaa-sign-agent-authored-commits-with-a-gpg-k
claude/wi-019db1bb-doc-system-a6-retrieval-scope-cross-work
claude/wi-019dc0e8-companionrow-scope-deserialize-surface-e
claude/wi-019dc0e8-companionrepo-set-next-fire-document-or
claude/wi-019dc57c-p2-host-diagnostics-heartbeat-ps-snapsho
claude/wi-019dcaad-list-work-items-returns-inconsistent-cou
claude/wi-019db055-binary-non-markdown-asset-support-in-doc
claude/wi-019dc0e8-remove-or-fix-stale-projectrepo-set-git
claude/wi-019dc0e8-backfill-unit-tests-for-projectremoterep
claude/wi-019dc0e8-git-remote-url-normalization-ssh-vs-git
claude/wi-019dc0e8-companionrepo-validate-cron-expr-on-crea
claude/wi-019e57a6-p3-ci-integration-design-to-slint-pipeli
research/deployments-system
hotfix/migration-version-bump
claude/wi-019dff17-merge-loop-investigate-outcome-error-on
claude/wi-019ddfac-emit-otlp-traces-from-seshat-seshd-to-te
claude/wi-019dc57a-plan-container-diagnostics-sessions-feat
claude/wi-019dca69-gemini-2-5-pro-code-review-via-openroute
claude/wi-019dd0b7-wipsnapshot-retention-per-wi-cap-age-pru
claude/wi-019dbb8e-auth-for-akh-medu-seshat-envoy-gateway-s
claude/wi-019dbb9c-agent-annotation-pathway-broken-pat-lack
claude/wi-019db055-document-ingest-api-seshat-backed
claude/wi-019db1bb-doc-system-a2-anima-branded-knowledge-mc
claude/wi-019db1bb-doc-system-a3-session-transcript-write-p
claude/wi-019d9caa-project-detail-add-phased-structure-insi
claude/wi-019db20e-anima-agent-listener-add-auth-tls-before
claude/wi-019db01f-desktop-ui-companions-settings-page-dash
claude/wi-019dac79-drop-agent-diff-column-setagentdiff-rpc
claude/wi-019db01f-mcp-tools-list-create-update-toggle-comp
claude/wi-019db1ba-doc-system-d5-anima-import-cli-bulk-mark
claude/wi-019db1ba-doc-system-d4-cross-project-dependencies
claude/wi-019db1ba-doc-system-d6-slint-library-viewer-basic
claude/wi-0-merge-commit-sha-null-fix
claude/wi-019dff17-merge-loop-error-taxonomy
claude/wi-019d9cab-project-detail-fix-empty-vertical-space
claude/wi-019db1bc-doc-system-i0-anima-inference-crate-open
claude/wi-019db1bc-doc-system-i2-tool-plumbing-across-provi
claude/wi-019db1bc-doc-system-i1-anthropic-direct-ollama-ad
claude/wi-019db030-plan-management-mcp-tools-list-get-creat
claude/wi-019db031-project-management-mcp-tools-create-upda
claude/wi-019dfe46-pipeline-stall-agent-dispatcher-offline
integration/wi-019df3aa-df3ab
claude/wi-019df3ab-anima-triage-wip-prefix
claude/wi-019df3aa-ai-module-diff-ready-gate
claude/wi-019dedb2-open-prs-marker-idempotency
claude/wi-019dedd5-triage-plan-idempotency
claude/wi-019daba7-cluster-deploy-anima-agent-retire-archib
claude/wi-019ded22-anima-native-dispatch-bundle
claude/wi-019de043-amqp-result-binding
claude/wi-019dc402-hide-wip-snapshot-changes
claude/wi-019ded24-ui-filter-empty-projects
claude/wi-019ded23-snapshot-loop-rate-limit
claude/wi-019ded23-akh-medu-ssh-push-remote
claude/wi-019ded24-restore-ci-docker-job
claude/wi-019ddaa2-ci-workflow-s-docker-job-omits-anima-tri
claude/wi-019de043-disable-forgejo-actions-ci
claude/wi-019ddfe2-open-prs-idempotent-marker-commit
claude/wi-019db1b9-c06f-d1-seshat-mcp-client-v2
claude/wi-019dda29-add-pr-ready-phase
claude/wi-019db1b9-doc-system-d2-createdocument-updatedocum
claude/wi-019db1b9-doc-system-d1-seshat-mcp-client-in-anima
claude/wi-019db054-phase-24-sensory-grounding
claude/wi-019db054-phase-23-affective-system
claude/wi-019db054-phase-22-akhipedia
claude/wi-019db054-phase-35-t5-training-infrastructure
claude/wi-019db054-phase-34-communicative-intelligence
claude/wi-019db054-phase-33-grounded-verbalization
claude/wi-019db054-phase-29-capability-exposure
claude/wi-019db054-phase-27-live-kg-llm-training
claude/wi-019db054-phase-26-llm-vsa-deep-integration
claude/wi-019db054-phase-21-game-theoretic-social-reasoning
claude/wi-019db054-phase-20-active-inference
claude/wi-019db054-phase-19-epistemic-logic-theory-of-mind
claude/wi-019db01d-companion-schema-domain-repos-companions
claude/wi-019daf97-projects-support-multiple-remotes-per-pr
claude/priceless-turing-45892e
No results found.
Labels
Clear labels
Compat/Breaking
Breaking change that won't be backward compatible
Kind/Bug
Something is not working
Kind/Documentation
Documentation changes
Kind/Enhancement
Improve existing functionality
Kind/Feature
New functionality
Kind/Security
This is security issue
Kind/Testing
Issue or pull request related to testing
Priority
Critical
The priority is critical
Priority
High
The priority is high
Priority
Low
The priority is low
Priority
Medium
The priority is medium
Reviewed
Confirmed
Issue has been confirmed
Reviewed
Duplicate
This issue or pull request already exists
Reviewed
Invalid
Invalid issue
Reviewed
Won't Fix
This issue won't be fixed
Status
Abandoned
Somebody has started to work on this but abandoned work
Status
Blocked
Something is blocking this issue or pull request
Status
Need More Info
Feedback is required to reproduce issue or to continue work
No labels
Compat/Breaking
Kind/Bug
Kind/Documentation
Kind/Enhancement
Kind/Feature
Kind/Security
Kind/Testing
Priority
Critical
Priority
High
Priority
Low
Priority
Medium
Reviewed
Confirmed
Reviewed
Duplicate
Reviewed
Invalid
Reviewed
Won't Fix
Status
Abandoned
Status
Blocked
Status
Need More Info
Milestone
Clear milestone
No items
No milestone
Projects
Clear projects
No items
No project
Assignees
Clear assignees
No assignees
1 participant
Notifications
Due date
The due date is invalid or out of range. Please use the format "yyyy-mm-dd".
No due date set.
Dependencies
No dependencies set.
Reference
toasterson/Anima#184
Loading…
Add table
Add a link
Reference in a new issue
No description provided.
Delete branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Goal
Implement screenshot capture, vision model verification via Ollama cloud, and input injection (keyboard + mouse) for GUI testing.
Background
Once VMs boot with Sway, we need to capture screenshots, assert visual correctness via a vision-capable LLM, and inject keyboard/mouse events. This WI delivers the three core visual testing primitives.
🔗 Remote Sources (all on code.aopc.cloud)
vm-manager:
Anima workspace:
crates/anima-qa/src/screenshot.rsOllama cloud API:
https://ollama.cloud/v1/chat/completionsminicpm-v(multimodal, supports image+text)OLLAMA_CLOUD_API_KEYenv variableimage_urlcontent type🏗️ Implementation Guide
Step 1: Screenshot capture
SSH into VM, set WAYLAND_DISPLAY=wayland-0, run
grim -g "x,y wxh" -→ read PNG from stdout → base64 encodeStep 2: Vision assertion
POST to Ollama cloud with multimodal request. Prompt: "Does this screenshot contain: {prompt}? Answer ONLY with YES or NO followed by a brief explanation." Parse YES/NO, extract confidence.
Step 3: Keyboard input
SSH execute
WAYLAND_DISPLAY=wayland-0 wtype '{escaped_text}'for text,wtype -k Tabfor keysStep 4: Mouse input
SSH execute
WAYLAND_DISPLAY=wayland-0 ydotool mousemove --absolute {x} {y} && ydotool click 0x00Step 5: Wait for text
Poll SSH command output for expected text. Loop: sleep 500ms, check, timeout after N seconds.
Step 6: Register MCP tools
qa_screenshot, qa_assert_visible, qa_type, qa_click_at, qa_key_combo, qa_wait_for_text
Environment:
OLLAMA_CLOUD_API_KEYandQA_VISION_MODEL(default: minicpm-v)✅ Acceptance Criteria
Related WIs