Skip to content

Releases: lucidrains/gateloop-transformer

0.0.18

10 Nov 03:15

Choose a tag to compare

additional swish gate for gateloop module

0.0.16

10 Nov 02:13

Choose a tag to compare

state transition should act on per gate loop head

0.0.15

10 Nov 01:13

Choose a tag to compare

increase default frac gradient for state transition projection

0.0.14

10 Nov 01:08

Choose a tag to compare

add an assert and encourage researchers to play around with heads

0.0.12

09 Nov 20:13

Choose a tag to compare

fix a misunderstanding, thanks to main author @tobiaskatsch for the d…

0.0.11

09 Nov 18:06

Choose a tag to compare

able to ablate state transitions

0.0.10

09 Nov 17:31

Choose a tag to compare

need to see something before deciding whether to invest time in cuda …

0.0.8

09 Nov 16:26

Choose a tag to compare

allow for training full attention with rotary + data dependent xpos s…

0.0.7

09 Nov 15:12

Choose a tag to compare

misunderstood how activation functions were applied

0.0.6

09 Nov 02:25

Choose a tag to compare

0.0.6