2 Commits (dbed673d92191b82fa5629915f6becfb07ed2376)

Author SHA1 Message Date
Priyanka Punukollu ff6eceb6dc test: add latency bounds test for tool execution — documents that tools run in <5s, LLM synthesis latency is separate and documented 1 month ago
Priyanka Punukollu 443818bacd test: expand eval dataset to 56 new cases — 20 happy path, 12 edge, 12 adversarial, 12 multi-step 1 month ago