Skip to content

b6975

Choose a tag to compare

@github-actions github-actions released this 07 Nov 21:13
16bcc12
kv-cache : pad the cache size to 256 for performance (#17046)

* kv-cache : pad the size of the small SWA cache for performance

* context : pad the total context to 256

* cont : future-proof the swa pad

* server : adjust test params to new logic