Can I set the embedding model in CPU memory and the LLM model in GPU with Ollama? #8334
Limited by GPU memory, I have to move the embedding model to CPU memory. How can I deploy it with Ollama? Many thanks for your help!
Please check https://ragflow.io/docs/dev/deploy_local_llm @ShiShuyang
With the help of ChatGPT, this problem has been solved: define the model in a Modelfile, create it in Ollama, and then run it. This forces the model to run on CPU only, which is useful if you don't have a GPU or want to test performance in a CPU-only environment.
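The exact commands did not survive in this thread, so below is a minimal sketch of that workflow. The base model `nomic-embed-text`, the `nomic-embed-text-cpu` tag, and the sample prompt are placeholders; the key piece is the Modelfile parameter `num_gpu`, which sets how many layers Ollama offloads to the GPU, so `num_gpu 0` keeps the whole model in CPU memory:

```shell
# Write a Modelfile that pins the embedding model to the CPU.
# The base model name is a placeholder; substitute the one you use.
cat > Modelfile <<'EOF'
FROM nomic-embed-text
# num_gpu = number of layers offloaded to the GPU; 0 keeps all layers on the CPU
PARAMETER num_gpu 0
EOF

# Register the CPU-only variant with Ollama (the -cpu tag is illustrative).
ollama create nomic-embed-text-cpu -f Modelfile

# Query it through the embeddings API; every layer stays in CPU memory.
curl http://localhost:11434/api/embeddings \
  -d '{"model": "nomic-embed-text-cpu", "prompt": "hello world"}'
```

The LLM itself needs no matching change: when a GPU is available, Ollama offloads its layers there by default, so only the embedding model ends up pinned to CPU memory.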