Audit and verify Engine testing coverage

t-141.7·WorkTask·
·
·
·Omni/Agent.hs
Parent:t-141·Created3 months ago·Updated3 months ago

Dependencies

Description

Edit

Before closing the epic, audit the testing situation for the new Engine:

Testing Checklist

Unit Tests (Engine.hs)

  • [ ] JSON roundtrip for Message, ToolCall, FunctionCall
  • [ ] Tool schema validation
  • [ ] LLM config defaults
  • [ ] Error handling for malformed responses

Unit Tests (Tools.hs)

  • [ ] Each tool has valid JSON schema
  • [ ] readFileTool handles missing files
  • [ ] writeFileTool creates parent directories
  • [ ] editFileTool handles no-match case
  • [ ] runBashTool captures exit codes
  • [ ] searchCodebaseTool returns structured results

Integration Tests

  • [ ] Engine can complete a trivial task (e.g., create a file)
  • [ ] Tool execution works end-to-end
  • [ ] Cost/token callbacks fire correctly
  • [ ] Activity callbacks fire at right times
  • [ ] Max iterations limit is respected

Comparison Tests (Optional)

  • [ ] Same task via amp vs Engine produces similar quality
  • [ ] Cost per task is comparable or better

Manual Testing

  • [ ] jr work t-xxx --use-engine completes successfully
  • [ ] Activity logging appears in web UI
  • [ ] Cost metrics recorded correctly
  • [ ] Retry handling works with Engine

Run all tests

bild --test Omni/Agent/Engine.hs bild --test Omni/Agent/Tools.hs

Success Criteria

  • All unit tests pass
  • At least one integration test proving end-to-end flow
  • Manual verification of a real task completion

Timeline (0)

No activity yet.