GitHub - janhq/vllm: A high-throughput and memory-efficient inference and serving engine for LLMs

janhq / vllm Public

forked from vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

docs.vllm.ai

Commit history: 1 commit

No releases published

No packages published