2 Commits (0a3c276a4610685bb5b1a0a97d3d0e2e94a49a46)

Author SHA1 Message Date
Priyanka Punukollu ff6eceb6dc test: add latency bounds test for tool execution — documents that tools run in <5s, LLM synthesis latency is separate and documented 1 month ago
Priyanka Punukollu 443818bacd test: expand eval dataset to 56 new cases — 20 happy path, 12 edge, 12 adversarial, 12 multi-step 2 months ago