# Finance AI Agent — Public Eval Dataset

183 test cases for AI agents built on personal finance and portfolio management software.

Built on top of Ghostfolio — an open source wealth management platform.

Released publicly as a resource for developers building finance AI agents.

## Test Categories

| Category | Count |
|----------|-------|
| Happy Path | 20 |
| Edge Cases | 14 |
| Adversarial | 14 |
| Multi-Step | 13 |
| Other | 122 |
| **Total** | **183** |

## How To Run
```bash
git clone https://github.com/lakshmipunukollu-ai/ghostfolio
cd ghostfolio
git checkout submission/final
pip install -r agent/requirements.txt
python -m pytest agent/evals/ -v
```

## Test Structure

Every test in test_eval_dataset.py follows:
```python
# TYPE: happy_path | edge_case | adversarial | multi_step
# INPUT: what is being tested
# EXPECTED: what the tool should return
# CRITERIA: the specific assertion
def test_name():
    from tools.tool_name import function_name
    result = function_name(params)
    assert "key" in result
```

## Results

- Tests: 183
- Pass rate: 100%
- Runtime: ~30 seconds

## Contribute

Submit a PR with new test cases.
Follow the TYPE/INPUT/EXPECTED/CRITERIA pattern.

## License

MIT

## Author

Priya Lakshmipunukollu — AgentForge, February 2026
https://github.com/lakshmipunukollu-ai/ghostfolio