History

afourney 088a50faa5 Remove old autogen_magentic_one package. (#5305 ) This PR removes the older `autogen_magentic_one` package, and directs people to use the new AgentChat implementation. Hopefully this eases confusion. --------- Co-authored-by: Jack Gerrits <jack@jackgerrits.com> Co-authored-by: Eric Zhu <ekzhu@users.noreply.github.com>		2025-01-31 15:14:40 -08:00
..
AssistantBench	Remove old autogen_magentic_one package. (#5305 )	2025-01-31 15:14:40 -08:00
GAIA	Remove old autogen_magentic_one package. (#5305 )	2025-01-31 15:14:40 -08:00
HumanEval	Remove old autogen_magentic_one package. (#5305 )	2025-01-31 15:14:40 -08:00
WebArena	Remove old autogen_magentic_one package. (#5305 )	2025-01-31 15:14:40 -08:00
.gitignore	Adding Benchmarks to agbench (#3803 )	2024-10-18 06:33:33 +02:00
README.md	Adding Benchmarks to agbench (#3803 )	2024-10-18 06:33:33 +02:00
process_logs.py	Adding Benchmarks to agbench (#3803 )	2024-10-18 06:33:33 +02:00

README.md

Benchmarking Agents

This directory provides ability to benchmarks agents (e.g., built using Autogen) using AgBench. Use the instructions below to prepare your environment for benchmarking. Once done, proceed to relevant benchmarks directory (e.g., benchmarks/GAIA) for further scenario-specific instructions.

Setup on WSL

Install Docker Desktop. After installation, restart is needed, then open Docker Desktop, in Settings, Ressources, WSL Integration, Enable integration with additional distros – Ubuntu
Clone autogen and export AUTOGEN_REPO_BASE. This environment variable enables the Docker containers to use the correct version agents.
```
git clone git@github.com:microsoft/autogen.git
export AUTOGEN_REPO_BASE=<path_to_autogen>
```
Install agbench. AgBench is currently a tool in the Autogen repo.
```
cd autogen/python/packages/agbench
pip install -e .
```

README.md Unescape Escape

Benchmarking Agents

Setup on WSL

README.md