Commit a72bf6d

Author: maksimov maksim
Merge branch 'benchmark_simpleqa' of https://github.com/vamplabAI/sgr-deep-research into benchmark_simpleqa
2 parents 480e26d + af3f230 commit a72bf6d

1 file changed: +14 -14 lines changed

README.md

Lines changed: 14 additions & 14 deletions
@@ -581,20 +581,20 @@ We conducted a comprehensive benchmark evaluation using the [SimpleQA](https://h
 
 **Benchmark Configuration:**
 
-| Component         | Parameter        | Value                  |
-| ----------------- | ---------------- | ---------------------- |
-| **Search Engine** | Provider         | Tavily Basic Search    |
-|                   | Scraping Enabled | Yes                    |
-|                   | Max Pages        | 5                      |
-|                   | Content Limit    | 33,000 characters      |
-| **Agent**         | Name             | sgr_tool_calling_agent |
-|                   | Max Steps        | 20                     |
-| **LLM (Agent)**   | Model            | gpt-4o-mini            |
-|                   | Max Tokens       | 12,000                 |
-|                   | Temperature      | 0.2                    |
-| **LLM (Judge)**   | Model            | gpt-4o                 |
-|                   | Max Tokens       | Default                |
-|                   | Temperature      | Default                |
+| Component         | Parameter        | Value                   |
+| ----------------- | ---------------- | ----------------------- |
+| **Search Engine** | Provider         | Tavily Basic Search     |
+|                   | Scraping Enabled | Yes                     |
+|                   | Max Pages        | 5                       |
+|                   | Content Limit    | 33,000 characters       |
+| **Agent**         | Name             | sgr_tool_calling_agent  |
+|                   | Max Steps        | 20                      |
+| **LLM (Agent)**   | Model            | gpt-4.1-mini            |
+|                   | Max Tokens       | 12,000                  |
+|                   | Temperature      | 0.2                     |
+| **LLM (Judge)**   | Model            | gpt-4o                  |
+|                   | Max Tokens       | Default                 |
+|                   | Temperature      | Default                 |
 
 Detailed benchmark results are available in [this spreadsheet](assets/simpleqa_result.xlsx).
 
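For anyone reproducing the run, the post-merge table can be restated in machine-readable form. The sketch below is only an illustration: the class and field names (`SearchConfig`, `LLMConfig`, `BenchmarkConfig`, `content_limit_chars`, and so on) are hypothetical and are not taken from the repository's actual configuration schema, and `None` stands in for the table's "Default".

```python
from dataclasses import dataclass, replace
from typing import Optional


@dataclass(frozen=True)
class SearchConfig:
    # Values copied from the "Search Engine" rows; field names are assumptions.
    provider: str = "Tavily Basic Search"
    scraping_enabled: bool = True
    max_pages: int = 5
    content_limit_chars: int = 33_000


@dataclass(frozen=True)
class LLMConfig:
    model: str
    max_tokens: Optional[int] = None      # None means "Default" in the table
    temperature: Optional[float] = None   # None means "Default" in the table


@dataclass(frozen=True)
class BenchmarkConfig:
    search: SearchConfig
    agent_name: str
    max_steps: int
    agent_llm: LLMConfig
    judge_llm: LLMConfig


def updated_config() -> BenchmarkConfig:
    """Return the configuration as listed on the '+' side of the diff."""
    return BenchmarkConfig(
        search=SearchConfig(),
        agent_name="sgr_tool_calling_agent",
        max_steps=20,
        agent_llm=LLMConfig(model="gpt-4.1-mini", max_tokens=12_000, temperature=0.2),
        judge_llm=LLMConfig(model="gpt-4o"),
    )


def previous_config() -> BenchmarkConfig:
    """Return the '-' side: identical except for the agent model name."""
    new = updated_config()
    return replace(new, agent_llm=replace(new.agent_llm, model="gpt-4o-mini"))
```

Comparing `previous_config()` with `updated_config()` shows that the agent model is the only value that differs between the two sides of the diff; the remaining changed lines come from the widened Value column.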