Skip to content

cuPyLMA Release 0.1

Latest

Choose a tag to compare

@wang-xianghao wang-xianghao released this 31 Jul 01:34
· 15 commits to release-0.1 since this release
63fe6e7

This is a beta release of cuPyLMA.

Changelog

  • First release.

Known issues and Further Work

  • The multi-GPU acceleration is restricted by kernel calls' overheads: we will explore CUDA graph to minimize the overheads.
  • The optimizer does not inherit torch.optim.Optimizer which brings extra work on migrating the existing code: we will reconstruct our optimizer to make it follow PyTorch optimizer's interface.