Skip to content

Releases: lucidrains/MEGABYTE-pytorch

0.3.6

27 Dec 16:35

Choose a tag to compare

Full Changelog: 0.3.5...0.3.6

0.3.5

16 Sep 12:33

Choose a tag to compare

Full Changelog: 0.3.4...0.3.5

0.3.4

16 Sep 11:40

Choose a tag to compare

fix regression and some dimension conditional

0.3.3

09 Sep 23:08

Choose a tag to compare

What's Changed

  • chore: update flash attention config by @eegli in #18

New Contributors

  • @eegli made their first contribution in #18

Full Changelog: 0.3.2...0.3.3

0.3.2

07 Sep 12:42

Choose a tag to compare

Full Changelog: 0.3.1...0.3.2

0.3.1

07 Sep 12:09

Choose a tag to compare

Full Changelog: 0.3.0...0.3.1

0.3.0

03 May 02:13

Choose a tag to compare

Full Changelog: 0.2.1...0.3.0

0.2.1

15 Jun 20:12

Choose a tag to compare

make sure it supports greater than 2 hierarchies

0.2.0

15 Jun 19:38

Choose a tag to compare

move closer to what the paper did, with local and global token embedd…

0.1.7

15 Jun 18:13

Choose a tag to compare

switch to rotary embeddings, as they did in the paper