mirror of https://github.com/ghostfolio/ghostfolio
1 changed files with 61 additions and 0 deletions
@ -0,0 +1,61 @@ |
|||||
|
# Finance AI Agent — Public Eval Dataset |
||||
|
|
||||
|
183 test cases for AI agents built on personal finance and portfolio management software. |
||||
|
|
||||
|
Built on top of Ghostfolio — an open source wealth management platform. |
||||
|
|
||||
|
Released publicly as a resource for developers building finance AI agents. |
||||
|
|
||||
|
## Test Categories |
||||
|
|
||||
|
| Category | Count | |
||||
|
|----------|-------| |
||||
|
| Happy Path | 20 | |
||||
|
| Edge Cases | 14 | |
||||
|
| Adversarial | 14 | |
||||
|
| Multi-Step | 13 | |
||||
|
| Other | 122 | |
||||
|
| **Total** | **183** | |
||||
|
|
||||
|
## How To Run |
||||
|
```bash |
||||
|
git clone https://github.com/lakshmipunukollu-ai/ghostfolio |
||||
|
cd ghostfolio |
||||
|
git checkout submission/final |
||||
|
pip install -r agent/requirements.txt |
||||
|
python -m pytest agent/evals/ -v |
||||
|
``` |
||||
|
|
||||
|
## Test Structure |
||||
|
|
||||
|
Every test in test_eval_dataset.py follows: |
||||
|
```python |
||||
|
# TYPE: happy_path | edge_case | adversarial | multi_step |
||||
|
# INPUT: what is being tested |
||||
|
# EXPECTED: what the tool should return |
||||
|
# CRITERIA: the specific assertion |
||||
|
def test_name(): |
||||
|
from tools.tool_name import function_name |
||||
|
result = function_name(params) |
||||
|
assert "key" in result |
||||
|
``` |
||||
|
|
||||
|
## Results |
||||
|
|
||||
|
- Tests: 183 |
||||
|
- Pass rate: 100% |
||||
|
- Runtime: ~30 seconds |
||||
|
|
||||
|
## Contribute |
||||
|
|
||||
|
Submit a PR with new test cases. |
||||
|
Follow the TYPE/INPUT/EXPECTED/CRITERIA pattern. |
||||
|
|
||||
|
## License |
||||
|
|
||||
|
MIT |
||||
|
|
||||
|
## Author |
||||
|
|
||||
|
Priya Lakshmipunukollu — AgentForge, February 2026 |
||||
|
https://github.com/lakshmipunukollu-ai/ghostfolio |
||||
Loading…
Reference in new issue