feat(tool): incorporate open-source tools from MiroThinker #60

JubSteven · 2025-10-02T01:38:30Z

Describe this PR

Adapted open-source tools from Mirothinker and add relevant docs on deploying open-source models.

Checklist for PR

Must Do

Write a good PR title and description, i.e. feat(agent): add pdf tool via mcp, perf: make llm client async and fix(utils): load custom config via importlib etc. CI job check-pr-title enforces Angular commit message format to PR title.
Run make precommit locally. CI job lint enforce ruff default format/lint rules on all new codes.
Run make pytest. Check test summary (located at report.html) and coverage report (located at htmlcov/index.html) on new codes.

Nice To Have

(Optional) Write/update tests under /tests for feat and test PR.
(Optional) Write/update docs under /docs for docs and ci PR.

- Resolved formatting conflicts in utils/extract_futurex_results.py - Resolved formatting conflicts in utils/prepare_benchmark/gen_futurex.py - Resolved formatting conflicts in utils/progress_check/check_futurex_progress.py All conflicts were due to code formatting differences (whitespace, line breaks, trailing commas). Functionality remains identical between branches.

…ress file to exclude T1.

… greater china respectively.

… china region.

Copilot

Pull Request Overview

This PR adapts and incorporates open-source tools from MiroThinker, adding three new MCP servers that provide vision, reasoning, and audio processing capabilities using open-source models.

Added three new open-source MCP servers (vision, reasoning, and audio) with robust error handling
Created comprehensive documentation for deploying and using the open-source models
Added YAML configuration files to integrate the new tools into the existing tool system

Reviewed Changes

Copilot reviewed 10 out of 10 changed files in this pull request and generated 3 comments.

Show a summary per file

File	Description
`src/tool/mcp_servers/vision_mcp_server_os.py`	New vision MCP server for VQA using open-source models like Qwen2.5-VL
`src/tool/mcp_servers/reasoning_mcp_server_os.py`	New reasoning MCP server with retry logic for complex problem solving
`src/tool/mcp_servers/audio_mcp_server_os.py`	New audio transcription server using open-source Whisper models
`docs/mkdocs/mkdocs.yml`	Updated navigation to include documentation for new open-source tools
`docs/mkdocs/docs/tool_vqa_os.md`	Documentation for open-source vision tool deployment and usage
`docs/mkdocs/docs/tool_reasoning_os.md`	Documentation for open-source reasoning tool deployment and usage
`docs/mkdocs/docs/tool_audio_os.md`	Documentation for open-source audio tool deployment and usage
`config/tool/tool-reasoning-os.yaml`	Configuration file for reasoning tool integration
`config/tool/tool-image-video-os.yaml`	Configuration file for vision tool integration
`config/tool/tool-audio-os.yaml`	Configuration file for audio tool integration

_{Tip: Customize your code reviews with copilot-instructions.md. Create the file or learn how to get started.}

Copilot · 2025-10-02T01:49:35Z

src/tool/mcp_servers/vision_mcp_server_os.py

+
+        payload = {"model": VISION_MODEL_NAME, "messages": messages_for_llm}
+
+        response = requests.post(VISION_BASE_URL, json=payload, headers=headers)


Using synchronous requests.post in an async function can block the event loop. Consider using aiohttp.ClientSession().post() instead since you're already importing and using aiohttp elsewhere in the function.

Copilot · 2025-10-02T01:49:36Z

src/tool/mcp_servers/audio_mcp_server_os.py

+            if duration > 0:
+                return duration
+    except Exception as e:
+        return f"[ERROR]: Failed to get audio duration: {e}"


The function _get_audio_duration should return a float according to its type hint and usage context, but this exception handler returns a string. This could cause type errors when the returned value is used in calculations.

Suggested change

return f"[ERROR]: Failed to get audio duration: {e}"

return 0.0

Copilot · 2025-10-02T01:49:36Z

src/tool/mcp_servers/reasoning_mcp_server_os.py

+
+@mcp.tool()
+async def reasoning(question: str) -> str:
+    """You can use this tool use solve hard math problem, puzzle, riddle and IQ test question that requires a lot of chain of thought efforts.


Grammar error: 'use solve' should be 'to solve'. The sentence should read: 'You can use this tool to solve hard math problem...'

Suggested change

"""You can use this tool use solve hard math problem, puzzle, riddle and IQ test question that requires a lot of chain of thought efforts.

"""You can use this tool to solve hard math problem, puzzle, riddle and IQ test question that requires a lot of chain of thought efforts.

BinWang28 · 2025-10-02T02:03:39Z

docs/mkdocs/docs/tool_audio_os.md

+---
+
+!!! info "Documentation Info"
+    **Last Updated:** January 2025 · **Doc Contributor:** Team @ MiroMind AI


should be "October 2025"

JubSteven added 19 commits September 18, 2025 10:35

upd: add futurex evaluation support.

56b235d

upd: support multiple eval for futurex and add relavent doc.

287a7bc

upd: fix bugs with doc for futurex.

bf43b37

debug: fix wrong calling path.

d1e1637

add preparation for finsearchcomp.

eb6f302

update a premature version of finsearchcomp benchmark.

4dabaee

clean redundent code in merging.

c086e41

upd: modify yaml to use Mirothinker as the main agent, add check prog…

d6a8715

…ress file to exclude T1.

upd: check_progress function for finsearchcomp now consider globe and…

e7163d3

… greater china respectively.

Merge remote-tracking branch 'upstream/miroflow-v0.3' into explorations

b0e494f

upd: add docs and shell script for multiple runs.

256ba2c

fix: check_finsearchcomp_progress not displaying results from greater…

835e590

… china region.

Merge remote-tracking branch 'upstream/miroflow-v0.3' into explorations

5ffc269

Merge branch 'miroflow-v0.3' into explorations

4918ee2

fix: catch ContextLimitError in more observed cases.

72e9bb6

initialize open source tools for audio, vision and reasoning.

e589468

Merge remote-tracking branch 'upstream/miroflow-v0.3' into explorations

948d856

upd: docs for open-source tools.

15a7ef9

BinWang28 requested a review from Copilot October 2, 2025 01:48

Copilot AI reviewed Oct 2, 2025

View reviewed changes

BinWang28 reviewed Oct 2, 2025

View reviewed changes

fix wrong date.

bf786ca

BinWang28 approved these changes Oct 2, 2025

View reviewed changes

BinWang28 merged commit 0b20ff3 into MiroMindAI:miroflow-v0.3 Oct 2, 2025
1 check passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat(tool): incorporate open-source tools from MiroThinker #60

feat(tool): incorporate open-source tools from MiroThinker #60

Uh oh!

JubSteven commented Oct 2, 2025

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Oct 2, 2025

Uh oh!

Copilot AI Oct 2, 2025

Uh oh!

Copilot AI Oct 2, 2025

Uh oh!

BinWang28 Oct 2, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants


		payload = {"model": VISION_MODEL_NAME, "messages": messages_for_llm}

		response = requests.post(VISION_BASE_URL, json=payload, headers=headers)

	return f"[ERROR]: Failed to get audio duration: {e}"
	return 0.0

	"""You can use this tool use solve hard math problem, puzzle, riddle and IQ test question that requires a lot of chain of thought efforts.
	"""You can use this tool to solve hard math problem, puzzle, riddle and IQ test question that requires a lot of chain of thought efforts.

feat(tool): incorporate open-source tools from MiroThinker #60

feat(tool): incorporate open-source tools from MiroThinker #60

Uh oh!

Conversation

JubSteven commented Oct 2, 2025

Describe this PR

Checklist for PR

Must Do

Nice To Have

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Reviewed Changes

Uh oh!

Copilot AI Oct 2, 2025

Choose a reason for hiding this comment

Uh oh!

Copilot AI Oct 2, 2025

Choose a reason for hiding this comment

Uh oh!

Copilot AI Oct 2, 2025

Choose a reason for hiding this comment

Uh oh!

BinWang28 Oct 2, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants