t-645 - omni

t-645 · WorkTask · skills/coder.md

Created 1 month ago · Updated 1 week ago

Description

Before moving a task to Review, coder agents should produce a mini evidence artifact: a short structured doc showing what was actually built, the commands run, and key outputs or screenshots. It is modeled on Showboat (https://github.com/simonw/showboat) but baked into the coder skill workflow. This addresses the trust gap where agents claim success at max iterations without actually completing the work. The artifact should be committed alongside the code or attached as a task comment.
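As a rough illustration of what such an evidence artifact could look like, here is a minimal Python sketch. It is not Showboat's API and not the actual coder skill implementation; the `Evidence` class, its fields, and the Markdown layout are all hypothetical names chosen for this example. The idea it shows is that the artifact records commands that were really executed, with their real exit codes and output, rather than a claim of success.

```python
import subprocess
import sys
from dataclasses import dataclass, field


@dataclass
class Evidence:
    """Hypothetical evidence artifact: a summary plus commands and their real output."""
    task_id: str
    summary: str
    # Each step is (command string, exit code, combined stdout/stderr).
    steps: list = field(default_factory=list)

    def run(self, command: list) -> int:
        """Execute a command and record its actual exit code and output."""
        proc = subprocess.run(command, capture_output=True, text=True)
        output = (proc.stdout + proc.stderr).strip()
        self.steps.append((" ".join(command), proc.returncode, output))
        return proc.returncode

    def all_passed(self) -> bool:
        """True only if at least one command ran and every command exited 0."""
        return bool(self.steps) and all(code == 0 for _, code, _ in self.steps)

    def to_markdown(self) -> str:
        """Render the artifact as a short Markdown document."""
        fence = "`" * 3  # built at runtime to avoid nesting literal fences
        lines = [f"# Evidence for {self.task_id}", "", self.summary, ""]
        for cmd, code, out in self.steps:
            lines += [f"## `{cmd}` (exit {code})", "", fence, out, fence, ""]
        return "\n".join(lines)


if __name__ == "__main__":
    ev = Evidence("t-645", "Demo run of the evidence recorder.")
    ev.run([sys.executable, "-c", "print('hello from the build')"])
    print(ev.to_markdown())
    print("ready for review:", ev.all_passed())
```

Under this sketch, an agent would only move the task to Review when `all_passed()` is true, and would commit the rendered Markdown alongside the code or post it as a task comment.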

Git Commits

919fb092 · skills/coder: require showboat-style proof-of-work evidence

Coder Agent · 8 weeks ago · 1 file

Timeline (16)

🔄 [system] Open → InProgress · 1 month ago

💬 [system] · 1 month ago

Pipeline: dev completed (run=dev-t-645-1771537476, cost=0.0c)

🔄 [system] InProgress → Open · 1 month ago

💬 [system] · 1 month ago

Pipeline: verification failed: Build failed for skills/coder.md (exit 1): fail: bild: nothing to build

🔄 [system] Open → InProgress · 1 month ago

💬 [human] · 1 month ago

Pipeline scheduler: started run=pipeline-skills-coder-md-t-645-1771562441 domain=skills/coder.md

🔄 [human] InProgress → Review · 1 month ago

💬 [human] · 1 month ago

Pipeline scheduler: run=pipeline-skills-coder-md-t-645-1771562441 domain=skills/coder.md status=done cost=34c (fund-spend=failed)

💬 [human] · 1 week ago

Ava triage: pipeline auto-run reached status=done but the agent made NO git commits and reported blockers (missing files, path mismatches, or need clarification). This task is not actually in review — there's nothing to review. Resetting status to Open so it can be re-scoped.

🔄 [human] Review → Open · 1 week ago

🔄 [human] Open → Verified · 1 week ago

Adopt Showboat proof-of-work pattern in Coder agent skill
