Priyanka Punukollu 443818bacd test: expand eval dataset to 56 new cases — 20 happy path, 12 edge, 12 adversarial, 12 multi-step 1 month ago
__init__.py feat(agent): add login page, live thinking steps, and UI polish 1 month ago
conftest.py fix: restore 126 tests — add conftest mock for teleport API, fix async config 1 month ago
coverage_matrix.py feat(agent): add login page, live thinking steps, and UI polish 1 month ago
golden_results.json chore(evals): update golden results from latest run 1 month ago
golden_sets.yaml fix: achieve 25/25 evals — robust criteria + health check routing 1 month ago
labeled_scenarios.yaml fix: achieve 25/25 evals — robust criteria + health check routing 1 month ago
run_evals.py feat(agent): add login page, live thinking steps, and UI polish 1 month ago
run_golden_sets.py fix: achieve 25/25 evals — robust criteria + health check routing 1 month ago
test_cases.json feat(agent): add login page, live thinking steps, and UI polish 1 month ago
test_equity_advisor.py feat: add equity unlock advisor to property tracker 1 month ago
test_eval_dataset.py test: expand eval dataset to 56 new cases — 20 happy path, 12 edge, 12 adversarial, 12 multi-step 1 month ago
test_family_planner.py feat: add family financial planner with global childcare data 1 month ago
test_life_decision_advisor.py feat: add life decision advisor with safe tool orchestration 1 month ago
test_portfolio.py feat(agent): complete showcase — real ACTRIS data, property tracker, 27 UI features 1 month ago
test_property_onboarding.py test: add property onboarding and strategy assumption tests 1 month ago
test_property_tracker.py feat(agent): complete showcase — real ACTRIS data, property tracker, 27 UI features 1 month ago
test_real_estate.py test(real-estate): add bedroom/price filter + structured error tests (8 total) 1 month ago
test_realestate_strategy.py fix: strategy simulator uses user assumptions not hardcoded predictions 1 month ago
test_relocation_runway.py feat: add relocation runway calculator 1 month ago
test_wealth_bridge.py feat: complete property_tracker CRUD with SQLite + add 8 wealth bridge tests 1 month ago
test_wealth_visualizer.py feat: add wealth gap visualizer with Fed Reserve benchmarks 1 month ago