Skip to content

Latest commit

 

History

History
11 lines (9 loc) · 294 Bytes

File metadata and controls

11 lines (9 loc) · 294 Bytes

Changelog

[0.1.3] - 2025-06-06

Added

  • Configurable quantization and device offloading when loading local models.
  • Past-key-value caching in provider to reuse model states across tokens.

[0.1.2] - 2025-06-05

Added

  • Multitoken Generation
  • Structure to recursively add probabilities