Commit 80a376f

ci: update CHANGELOG (#344)
Also reduce binary size by limiting the maximum number of registers for `x_frag` and `o_frag` to 200.
1 parent 0d333ff commit 80a376f

2 files changed: +3 -1 lines changed
CHANGELOG.md

Lines changed: 2 additions & 0 deletions
@@ -2,6 +2,8 @@

 ## [0.0.7](https://github.com/flashinfer-ai/flashinfer/compare/v0.0.6...v0.0.7) (2024-06-28)

+### Breaking Changes
+* `batch_decode_with_padded_kv_cache` was removed, we encourage user to use `BatchDecodeWithPagedKVCacheWrapper` instead. ([#343](https://github.com/flashinfer-ai/flashinfer/pull/343))

 ### Bugfix

include/flashinfer/attention/prefill.cuh

Lines changed: 1 addition & 1 deletion
@@ -53,7 +53,7 @@ constexpr bool is_invalid_configuration(uint32_t num_frags_x, uint32_t num_frags
                                                uint32_t num_warps_z) {
   return ((num_frags_y < 4) || (num_frags_y == 4 && num_frags_z % 2 == 1) ||
           (num_frags_y > 4 && num_frags_y % (2 * num_warps_x) != 0) ||
-          (num_frags_x * (8 * num_frags_y + 2 * sizeof(DTypeQKAccum) * num_frags_z) >= 256));
+          (num_frags_x * (8 * num_frags_y + 2 * sizeof(DTypeQKAccum) * num_frags_z) >= 200));
 }

 /*!
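
To illustrate what lowering the threshold from 256 to 200 does, here is a minimal standalone sketch. It is not the library's code: `is_invalid_configuration_sketch`, `frag_register_count`, and the `reg_limit` parameter are hypothetical names introduced here so both thresholds can be compared, and treating `DTypeQKAccum` as a template parameter is an assumption based on its use inside `sizeof`. The fragment counts in the checks are illustrative values, not configurations taken from the repository.

```cpp
#include <cstddef>
#include <cstdint>

// Hypothetical helper: the quantity the diff compares against the threshold
// (the commit message describes it as the register count for x_frag and o_frag).
template <typename DTypeQKAccum>
constexpr std::size_t frag_register_count(std::uint32_t num_frags_x, std::uint32_t num_frags_y,
                                          std::uint32_t num_frags_z) {
  return num_frags_x * (8 * num_frags_y + 2 * sizeof(DTypeQKAccum) * num_frags_z);
}

// Hypothetical variant of the check in the diff, with the limit passed in
// explicitly (assumption: the real function hard-codes it, as the diff shows).
template <typename DTypeQKAccum>
constexpr bool is_invalid_configuration_sketch(std::uint32_t num_frags_x,
                                               std::uint32_t num_frags_y,
                                               std::uint32_t num_frags_z,
                                               std::uint32_t num_warps_x,
                                               std::uint32_t reg_limit) {
  return ((num_frags_y < 4) || (num_frags_y == 4 && num_frags_z % 2 == 1) ||
          (num_frags_y > 4 && num_frags_y % (2 * num_warps_x) != 0) ||
          (frag_register_count<DTypeQKAccum>(num_frags_x, num_frags_y, num_frags_z) >= reg_limit));
}

// Illustrative values: 2 * (8 * 8 + 2 * 4 * 5) = 208 with a 4-byte accumulator.
static_assert(frag_register_count<float>(2, 8, 5) == 208, "arithmetic check");
// Accepted under the old limit of 256 ...
static_assert(!is_invalid_configuration_sketch<float>(2, 8, 5, 4, 256), "valid at 256");
// ... but rejected under the new limit of 200.
static_assert(is_invalid_configuration_sketch<float>(2, 8, 5, 4, 200), "invalid at 200");
// A smaller configuration (2 * (64 + 32) = 192) passes under both limits.
static_assert(!is_invalid_configuration_sketch<float>(2, 8, 4, 4, 200), "valid at 200");
```

Configurations whose count falls in the range 200 to 255 are the ones newly rejected by this commit. Presumably fewer accepted configurations means fewer kernel template instantiations, which would account for the smaller binary mentioned in the commit message.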