2 Commits (68f3af7fb0dc444adf460ce7bf49c269e36418a0)

| Author | SHA1 | Message | Date |
| --- | --- | --- | --- |
| Priyanka Punukollu | ff6eceb6dc | test: add latency bounds test for tool execution — documents that tools run in <5s, LLM synthesis latency is separate and documented | 1 month ago |
| Priyanka Punukollu | 443818bacd | test: expand eval dataset to 56 new cases — 20 happy path, 12 edge, 12 adversarial, 12 multi-step | 2 months ago |