Skip to content

Commit f9cb21c

Browse files
committed
update vLLM wheel source to GitHub Releases and remove commit SHA argument
1 parent 0c8aacc commit f9cb21c

File tree

3 files changed

+9
-30
lines changed

3 files changed

+9
-30
lines changed

.github/workflows/release.yml

Lines changed: 0 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -24,11 +24,6 @@ on:
2424
required: false
2525
type: string
2626
default: "0.11.0"
27-
vllmCommitSha:
28-
description: 'vLLM commit SHA (from git rev-list -n 1 v{version})'
29-
required: false
30-
type: string
31-
default: "b8b302cde434df8c9289a2b465406b47ebab1c2d"
3227

3328
jobs:
3429
test:
@@ -135,7 +130,6 @@ jobs:
135130
"LLAMA_SERVER_VARIANT=cuda"
136131
"BASE_IMAGE=nvidia/cuda:12.9.0-runtime-ubuntu24.04"
137132
"VLLM_VERSION=${{ inputs.vllmVersion }}"
138-
"VLLM_COMMIT_SHA=${{ inputs.vllmCommitSha }}"
139133
"VLLM_CUDA_VERSION=cu129"
140134
"VLLM_PYTHON_TAG=cp38-abi3"
141135
push: true

Dockerfile

Lines changed: 1 addition & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -80,7 +80,6 @@ ENTRYPOINT ["/app/model-runner"]
8080
FROM llamacpp AS vllm
8181

8282
ARG VLLM_VERSION=0.11.0
83-
ARG VLLM_COMMIT_SHA=b8b302cde434df8c9289a2b465406b47ebab1c2d
8483
ARG VLLM_CUDA_VERSION=cu129
8584
ARG VLLM_PYTHON_TAG=cp38-abi3
8685
ARG TARGETARCH
@@ -101,7 +100,7 @@ RUN curl -LsSf https://astral.sh/uv/install.sh | sh \
101100
else \
102101
WHEEL_ARCH="manylinux2014_aarch64"; \
103102
fi \
104-
&& WHEEL_URL="https://wheels.vllm.ai/${VLLM_COMMIT_SHA}/vllm-${VLLM_VERSION}%2B${VLLM_CUDA_VERSION}-${VLLM_PYTHON_TAG}-${WHEEL_ARCH}.whl" \
103+
&& WHEEL_URL="https://github.com/vllm-project/vllm/releases/download/v${VLLM_VERSION}/vllm-${VLLM_VERSION}%2B${VLLM_CUDA_VERSION}-${VLLM_PYTHON_TAG}-${WHEEL_ARCH}.whl" \
105104
&& ~/.local/bin/uv pip install --python /opt/vllm-env/bin/python "$WHEEL_URL"
106105

107106
RUN /opt/vllm-env/bin/python -c "import vllm; print(vllm.__version__)" > /opt/vllm-env/version

README.md

Lines changed: 8 additions & 22 deletions
Original file line numberDiff line numberDiff line change
@@ -247,7 +247,6 @@ docker buildx build \
247247
--build-arg BASE_IMAGE=nvidia/cuda:12.9.0-runtime-ubuntu24.04 \
248248
--build-arg LLAMA_SERVER_VARIANT=cuda \
249249
--build-arg VLLM_VERSION=0.11.0 \
250-
--build-arg VLLM_COMMIT_SHA=b8b302cde434df8c9289a2b465406b47ebab1c2d \
251250
-t docker/model-runner:vllm .
252251
```
253252

@@ -256,7 +255,6 @@ docker buildx build \
256255
The vLLM variant supports the following build arguments:
257256

258257
- **VLLM_VERSION**: The vLLM version to install (default: `0.11.0`)
259-
- **VLLM_COMMIT_SHA**: The git commit SHA corresponding to the vLLM version (default: `b8b302cde434df8c9289a2b465406b47ebab1c2d` for v0.11.0)
260258
- **VLLM_CUDA_VERSION**: The CUDA version suffix for the wheel (default: `cu129`)
261259
- **VLLM_PYTHON_TAG**: The Python compatibility tag (default: `cp38-abi3`, compatible with Python 3.8+)
262260

@@ -280,28 +278,16 @@ docker buildx build \
280278

281279
#### Updating to a New vLLM Version
282280

283-
To update to a new vLLM version, you need to:
281+
To update to a new vLLM version:
284282

285-
1. **Find the commit SHA for the version:**
286-
```sh
287-
# Clone the vLLM repository (if not already cloned)
288-
git clone https://github.com/vllm-project/vllm.git
289-
cd vllm
290-
291-
# Get the commit SHA for a specific version
292-
git rev-list -n 1 v0.11.1
293-
```
294-
295-
2. **Build with the new version:**
296-
```sh
297-
docker buildx build \
298-
--target final-vllm \
299-
--build-arg VLLM_VERSION=0.11.1 \
300-
--build-arg VLLM_COMMIT_SHA=<commit-sha-from-step-1> \
301-
-t docker/model-runner:vllm-0.11.1 .
302-
```
283+
```sh
284+
docker buildx build \
285+
--target final-vllm \
286+
--build-arg VLLM_VERSION=0.11.1 \
287+
-t docker/model-runner:vllm-0.11.1 .
288+
```
303289

304-
The vLLM wheels are sourced from the official vLLM wheel repository at `https://wheels.vllm.ai/{commit_sha}/vllm/`, which provides prebuilt wheels for every commit.
290+
The vLLM wheels are sourced from the official vLLM GitHub Releases at `https://github.com/vllm-project/vllm/releases`, which provides prebuilt wheels for each release version.
305291

306292
## API Examples
307293

0 commit comments

Comments
 (0)