Skip to content

Releases: google/gemma.cpp

v0.1.4

25 Mar 18:04

Choose a tag to compare

What's Changed

Full Changelog: v0.1.3...v0.1.4

v0.1.3

14 Mar 16:15

Choose a tag to compare

  • Support for PaliGemma 2 and Gemma 3.
  • Major update to MatMul and MatMul-using operations; significant performance increases in multiple parts of the codebase.
  • Codebase simplifications and refactors in many areas.
  • Bugfixes

What's Changed

Read more

v0.1.2

05 Apr 02:52

Choose a tag to compare

  • MQA implementation
  • Ops refactorings and optimizations
  • Bugfixes
  • Model exporting script (util/convert_weights.py)

Important Note: With the MQA implementation, older 2B model artifacts need to be updated. Please re-download weights from Kaggle and ensure you have the latest version (-mqa or version 3).

What's Changed

New Contributors

Full Changelog: v0.1.1...v0.1.2

v0.1.1

15 Mar 18:18

Choose a tag to compare

  • Refactor library interfaces
  • Fixes to enable android and windows builds + general improvements to builds
  • Bazel builds
  • CI automation
  • Allow either HF or Kaggle (vs Kaggle only) for artifact downloads
  • Many small fixes and quality-of-life improvements from initial 0.1.0 release

What's Changed

New Contributors

Full Changelog: v0.1.0...v0.1.1

v0.1.0

22 Feb 19:11

Choose a tag to compare

Initial release.