2 Commits (5a5416bf2b17f4e01933c2cc4251e613808ad547)

Author SHA1 Message Date
Priyanka Punukollu ff6eceb6dc test: add latency bounds test for tool execution — documents that tools run in <5s, LLM synthesis latency is separate and documented 1 month ago
Priyanka Punukollu 443818bacd test: expand eval dataset to 56 new cases — 20 happy path, 12 edge, 12 adversarial, 12 multi-step 1 month ago