docs: add eval dataset README as open source contribution documentation

Made-with: Cursor
4 months ago · 10ef61bab5
1 changed files with 61 additions and 0 deletions
--- a/agent/evals/EVAL_DATASET_README.md
+++ b/agent/evals/EVAL_DATASET_README.md
@@ -0,0 +1,61 @@
 # Finance AI Agent — Public Eval Dataset
 183 test cases for AI agents built on personal finance and portfolio management software.
 Built on top of Ghostfolio — an open source wealth management platform.
 Released publicly as a resource for developers building finance AI agents.
 ## Test Categories
 | Category | Count |
 |----------|-------|
 | Happy Path | 20 |
 | Edge Cases | 14 |
 | Adversarial | 14 |
 | Multi-Step | 13 |
 | Other | 122 |
 | **Total** | **183** |
 ## How To Run
 ```bash
 git clone https://github.com/lakshmipunukollu-ai/ghostfolio
 cd ghostfolio
 git checkout submission/final
 pip install -r agent/requirements.txt
 python -m pytest agent/evals/ -v
 ```
 ## Test Structure
 Every test in `test_eval_dataset.py` follows this template, with a four-line comment header above the test function:
 ```python
 # TYPE: happy_path | edge_case | adversarial | multi_step
 # INPUT: what is being tested
 # EXPECTED: what the tool should return
 # CRITERIA: the specific assertion
 def test_name():
    from tools.tool_name import function_name
    result = function_name(params)
    assert "key" in result
 ```
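As a concrete illustration of the template above, here is a minimal self-contained sketch. `portfolio_summary` is a hypothetical stand-in defined inline so the example runs on its own; it is not one of the repository's tools, and a real test would import its function from the `tools` package instead.

```python
# Hypothetical stand-in for a tool function (an assumption for illustration);
# real tests import from tools.<tool_name> as shown in the template.
def portfolio_summary(holdings):
    """Return total value and position count for a list of holdings."""
    total = sum(h["quantity"] * h["price"] for h in holdings)
    return {"total_value": total, "positions": len(holdings)}


# TYPE: edge_case
# INPUT: an empty holdings list
# EXPECTED: a summary dict with zero total value and zero positions
# CRITERIA: "total_value" key is present and equals 0
def test_portfolio_summary_empty_holdings():
    result = portfolio_summary([])
    assert "total_value" in result
    assert result["total_value"] == 0
    assert result["positions"] == 0
```

The header comments state the intent in plain language, while the assertions in the body are what pytest actually checks.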
 ## Results
 - Tests: 183
 - Pass rate: 100%
 - Runtime: ~30 seconds
 ## Contribute
 Submit a PR with new test cases.
 Follow the TYPE/INPUT/EXPECTED/CRITERIA pattern.
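Before opening a PR, it may help to check that every new test carries the four-line header. The following is a rough sketch of such a check, not a script shipped with the repository: `check_test_headers` is a hypothetical helper, and the set of allowed TYPE values is assumed from the template in the Test Structure section.

```python
import re

REQUIRED_TAGS = ["TYPE", "INPUT", "EXPECTED", "CRITERIA"]
# Assumption: the allowed values are the four listed in the README template.
ALLOWED_TYPES = {"happy_path", "edge_case", "adversarial", "multi_step"}


def check_test_headers(source):
    """Return a list of problems found in the comment headers of test functions."""
    problems = []
    lines = source.splitlines()
    for i, line in enumerate(lines):
        # Only top-level test functions are considered.
        match = re.match(r"def (test_\w+)\(", line)
        if not match:
            continue
        name = match.group(1)
        # Collect the contiguous comment lines directly above the def.
        header = []
        j = i - 1
        while j >= 0 and lines[j].lstrip().startswith("#"):
            header.insert(0, lines[j].lstrip("# ").rstrip())
            j -= 1
        pairs = [h.split(":", 1) for h in header if ":" in h]
        tags = [tag.strip() for tag, _ in pairs]
        if tags != REQUIRED_TAGS:
            problems.append(f"{name}: expected tags {REQUIRED_TAGS}, found {tags}")
            continue
        type_value = pairs[0][1].strip()
        if type_value not in ALLOWED_TYPES:
            problems.append(f"{name}: unknown TYPE {type_value!r}")
    return problems
```

Passing the text of a test file to `check_test_headers` returns an empty list when every test function has a well-formed header, and one message per offending function otherwise.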
 ## License
 MIT
 ## Author
 Priya Lakshmipunukollu — AgentForge, February 2026
 https://github.com/lakshmipunukollu-ai/ghostfolio