From 10ef61bab57270a816d9fd388ca57b85c276427c Mon Sep 17 00:00:00 2001
From: Priyanka Punukollu <priyankapunukollu@Priyankas-MacBook-Pro.local>
Date: Sat, 28 Feb 2026 09:20:33 -0600
Subject: [PATCH] docs: add eval dataset README as open source contribution
 documentation

Made-with: Cursor
---
 agent/evals/EVAL_DATASET_README.md | 61 ++++++++++++++++++++++++++++++
 1 file changed, 61 insertions(+)
 create mode 100644 agent/evals/EVAL_DATASET_README.md

diff --git a/agent/evals/EVAL_DATASET_README.md b/agent/evals/EVAL_DATASET_README.md
new file mode 100644
index 000000000..293d4ea97
--- /dev/null
+++ b/agent/evals/EVAL_DATASET_README.md
@@ -0,0 +1,61 @@
+# Finance AI Agent — Public Eval Dataset
+
+183 test cases for AI agents built on personal finance and portfolio management software.
+
+Built on top of Ghostfolio — an open source wealth management platform.
+
+Released publicly as a resource for developers building finance AI agents.
+
+## Test Categories
+
+| Category | Count |
+|----------|-------|
+| Happy Path | 20 |
+| Edge Cases | 14 |
+| Adversarial | 14 |
+| Multi-Step | 13 |
+| Other | 122 |
+| **Total** | **183** |
+
+## How To Run
+```bash
+git clone https://github.com/lakshmipunukollu-ai/ghostfolio
+cd ghostfolio
+git checkout submission/final
+pip install -r agent/requirements.txt
+python -m pytest agent/evals/ -v
+```
+
+## Test Structure
+
+Every test in test_eval_dataset.py follows:
+```python
+# TYPE: happy_path | edge_case | adversarial | multi_step
+# INPUT: what is being tested
+# EXPECTED: what the tool should return
+# CRITERIA: the specific assertion
+def test_name():
+    from tools.tool_name import function_name
+    result = function_name(params)
+    assert "key" in result
+```
+
+## Results
+
+- Tests: 183
+- Pass rate: 100%
+- Runtime: ~30 seconds
+
+## Contribute
+
+Submit a PR with new test cases.
+Follow the TYPE/INPUT/EXPECTED/CRITERIA pattern.
+
+## License
+
+MIT
+
+## Author
+
+Priya Lakshmipunukollu — AgentForge, February 2026
+https://github.com/lakshmipunukollu-ai/ghostfolio