2 Commits (67a7bb2bb56b6ab3dabe2a396d55e0ec0e5692d7)

Author SHA1 Message Date
Priyanka Punukollu ff6eceb6dc test: add latency bounds test for tool execution — documents that tools run in <5s, LLM synthesis latency is separate and documented 1 month ago
Priyanka Punukollu 443818bacd test: expand eval dataset to 56 new cases — 20 happy path, 12 edge, 12 adversarial, 12 multi-step 1 month ago