# GAIA Validation - GPT5

MiroFlow now supports GPT-5 with MCP tool invocation, providing a unified workflow for multi-step reasoning, information integration, and scalable tool coordination.

!!! info "Prerequisites"
    Before proceeding, please review the [GAIA Validation Prerequisites](gaia_validation_prerequisites.md) document, which covers common setup requirements, dataset preparation, and API key configuration.

---

## Running the Evaluation

### Step 1: Dataset Preparation

Follow the [dataset preparation instructions](gaia_validation_prerequisites.md#dataset-preparation) in the prerequisites document.

### Step 2: API Keys Configuration

Configure the following API keys in your `.env` file:

```env title="GPT-5 .env Configuration"
# Search and web scraping capabilities
SERPER_API_KEY="your-serper-api-key"
JINA_API_KEY="your-jina-api-key"

# Code execution environment
E2B_API_KEY="your-e2b-api-key"

# Vision understanding capabilities
ANTHROPIC_API_KEY="your-anthropic-api-key"
GEMINI_API_KEY="your-gemini-api-key"

# Primary LLM provider, LLM judge, reasoning, and hint generation
OPENAI_API_KEY="your-openai-api-key"
OPENAI_BASE_URL="https://api.openai.com/v1"
```
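
Before starting a long run, it can help to confirm that every key listed above is present in `.env`. The following is a minimal sketch, not part of MiroFlow: `check_env_keys` is a hypothetical helper name, and it only checks that each key has a non-empty value, so placeholder values such as `your-serper-api-key` still pass.

```bash
#!/usr/bin/env bash
# Sketch only: check_env_keys is a hypothetical helper, not a MiroFlow command.
# It prints each key from the configuration above that is missing or empty
# in the given .env file, and returns non-zero if any key is missing.
check_env_keys() {
  local env_file="$1"
  local missing=0
  local key
  for key in SERPER_API_KEY JINA_API_KEY E2B_API_KEY ANTHROPIC_API_KEY \
             GEMINI_API_KEY OPENAI_API_KEY OPENAI_BASE_URL; do
    # Match KEY=<non-empty value>, with or without surrounding double quotes.
    if ! grep -Eq "^${key}=\"?[^\"[:space:]]+" "$env_file"; then
      echo "missing: ${key}"
      missing=1
    fi
  done
  return "$missing"
}
```

After sourcing or pasting the function into a shell, `check_env_keys .env` prints one `missing: ...` line per absent key and exits with status 0 only when all seven keys are set.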

### Step 3: Run the Evaluation

Execute the evaluation using the GPT-5 configuration:

```bash title="Run GAIA Validation with GPT-5"
uv run main.py common-benchmark \
  --config_file_name=agent_gaia-validation-gpt5 \
  output_dir="logs/gaia-validation-gpt5/$(date +"%Y%m%d_%H%M")"
```
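
The `$(date +"%Y%m%d_%H%M")` substitution in the command above expands to a minute-resolution timestamp, so each run gets its own log directory unless two runs start within the same minute. As a small sketch, the path can be computed once and stored in a shell variable (`OUTPUT_DIR` is a name chosen here for illustration) so that it can be reused later, for example when inspecting logs:

```bash
# Sketch: build the timestamped output directory once and keep it in a variable.
# OUTPUT_DIR is an illustrative variable name, not something MiroFlow reads itself.
OUTPUT_DIR="logs/gaia-validation-gpt5/$(date +"%Y%m%d_%H%M")"
echo "Results will be written to: ${OUTPUT_DIR}"
```

The variable can then be passed to the same command as `output_dir="${OUTPUT_DIR}"`.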

### Step 4: Monitor Progress

Follow the [progress monitoring instructions](gaia_validation_prerequisites.md#progress-monitoring-and-resume) in the prerequisites document.

---

!!! info "Documentation Info"
    **Last Updated:** October 2025 · **Doc Contributor:** Team @ MiroMind AI