autogen/python/packages/agbench/benchmarks/GAIA/Scripts
afourney af5dcc7fdf
Significant updates to agbench. (#5313)
- Updated HumanEval template to use AgentChat
- Update templates to use config.yaml for model and other configuration
- Read environment from ENV.yaml (ENV.json still supported but
deprecated)
- Temporarily removed WebArena and AssistantBench. Neither had viable
Templates after `autogen_magentic_one` was removed. Templates need to be
updated to AgentChat (in a future PR, but this PR is getting big enough
already)
2025-02-07 18:01:44 +00:00
custom_tabulate.py Significant updates to agbench. (#5313) 2025-02-07 18:01:44 +00:00
init_tasks.py Adding Benchmarks to agbench (#3803) 2024-10-18 06:33:33 +02:00