Skip to content

Commit 3de81aa

Browse files
committed
[Doc] add qwen3 reranker
Signed-off-by: TingW09 <[email protected]>
1 parent 8df52e2 commit 3de81aa

File tree

2 files changed

+2
-2
lines changed

2 files changed

+2
-2
lines changed

docs/source/tutorials/Qwen3_embedding.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -24,7 +24,6 @@ if you don't want to use the docker image as above, you can also build all from
2424
- Install `vllm-ascend` from source, refer to [installation](../installation.md).
2525

2626
## Deployment
27-
*only support single node single npu deployment*
2827

2928
Using the Qwen3-Embedding-8B model as an example, first run the docker container with the following command:
3029

@@ -92,6 +91,7 @@ Processed prompts: 100%|██████████████████
9291
```
9392

9493
## Performance
94+
9595
```bash
9696
vllm bench serve --model Qwen3-embedding --backend openai-embeddings --dataset-name random --tokenizer /data/Qwen3-reembedding --host 127.0.0.1 --port 8888 --endpoint /v1/embeddings
9797
```

docs/source/tutorials/Qwen3_reranker.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -25,7 +25,6 @@ if you don't want to use the docker image as above, you can also build all from
2525
- Install `vllm-ascend` from source, refer to [installation](../installation.md).
2626

2727
## Deployment
28-
*only support single node single npu deployment*
2928

3029
Using the Qwen3-Reranker-8B model as an example, first run the docker container with the following command:
3130

@@ -162,6 +161,7 @@ If you run this script successfully, you will see a list of scores printed to th
162161
```
163162

164163
## Performance
164+
165165
```bash
166166
vllm bench serve --model Qwen3-reranker --backend vllm-rerank --dataset-name random-rerank --tokenizer /data/Qwen3-reranker --host 127.0.0.1 --port 8888 --endpoint /v1/rerank
167167
```

0 commit comments

Comments
 (0)