Skip to content

granitemoehybrid forward(): lots of logits upcast to float32, eating masive VRAM for minimal gain #24991

granitemoehybrid forward(): lots of logits upcast to float32, eating masive VRAM for minimal gain

granitemoehybrid forward(): lots of logits upcast to float32, eating masive VRAM for minimal gain #24991