@@ -581,20 +581,20 @@ We conducted a comprehensive benchmark evaluation using the [SimpleQA](https://h
581581
582582** Benchmark Configuration:**
583583
584- | Component | Parameter | Value |
585- | ----------------- | ---------------- | ---------------------- |
586- | ** Search Engine** | Provider | Tavily Basic Search |
587- | | Scraping Enabled | Yes |
588- | | Max Pages | 5 |
589- | | Content Limit | 33,000 characters |
590- | ** Agent** | Name | sgr_tool_calling_agent |
591- | | Max Steps | 20 |
592- | ** LLM (Agent)** | Model | gpt-4o -mini |
593- | | Max Tokens | 12,000 |
594- | | Temperature | 0.2 |
595- | ** LLM (Judge)** | Model | gpt-4o |
596- | | Max Tokens | Default |
597- | | Temperature | Default |
584+ | Component | Parameter | Value |
585+ | ----------------- | ---------------- | ----------------------- |
586+ | ** Search Engine** | Provider | Tavily Basic Search |
587+ | | Scraping Enabled | Yes |
588+ | | Max Pages | 5 |
589+ | | Content Limit | 33,000 characters |
590+ | ** Agent** | Name | sgr_tool_calling_agent |
591+ | | Max Steps | 20 |
592+ | ** LLM (Agent)** | Model | gpt-4.1 -mini |
593+ | | Max Tokens | 12,000 |
594+ | | Temperature | 0.2 |
595+ | ** LLM (Judge)** | Model | gpt-4o |
596+ | | Max Tokens | Default |
597+ | | Temperature | Default |
598598
599599Detailed benchmark results are available in [ this spreadsheet] ( assets/simpleqa_result.xlsx ) .
600600
0 commit comments