Releases: PygmalionAI/aphrodite-engine

v0.4

03 Nov 12:53

What's Changed

New Contributors

Full Changelog: v0.3.7...v0.4

v0.3.7

24 Oct 08:23

What's Changed

  • fix: prompt processing overhead introduced by #66 by @AlpinDale in #71
  • fix: launch AWQ kernels on the current CUDAStream by @AlpinDale in #75
  • Added min_tokens and reimplemented ignore_eos using a new logit processor by @50h100a in #70
  • feat: add PagedAttention V2 kernels by @AlpinDale in #76
  • feat: Enable banning tokens by @StefanGliga in #80
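
The min_tokens, ignore_eos, and token-banning entries above all describe adjusting logits before sampling. The sketch below illustrates that general idea in plain Python. It is not the engine's actual code or API: the function names (`apply_min_tokens`, `apply_banned_tokens`, `greedy_pick`) and their parameters are hypothetical, and a real implementation would operate on tensors rather than lists.

```python
import math
from typing import List, Sequence

NEG_INF = -math.inf


def apply_min_tokens(
    logits: Sequence[float],
    num_generated: int,
    min_tokens: int,
    eos_token_id: int,
) -> List[float]:
    """Hypothetical helper: mask EOS until min_tokens have been generated."""
    out = list(logits)  # copy so the caller's logits are left untouched
    if num_generated < min_tokens:
        out[eos_token_id] = NEG_INF
    return out


def apply_banned_tokens(
    logits: Sequence[float],
    banned_token_ids: Sequence[int],
) -> List[float]:
    """Hypothetical helper: make banned token ids impossible to select."""
    out = list(logits)
    for token_id in banned_token_ids:
        out[token_id] = NEG_INF
    return out


def greedy_pick(logits: Sequence[float]) -> int:
    """Return the index of the largest logit (greedy decoding)."""
    return max(range(len(logits)), key=lambda i: logits[i])


if __name__ == "__main__":
    # Toy vocabulary of four tokens; token 3 plays the role of EOS (assumption).
    logits = [0.1, 2.0, 0.5, 3.0]
    early = apply_min_tokens(logits, num_generated=0, min_tokens=2, eos_token_id=3)
    print(greedy_pick(early))   # EOS is masked, so another token is chosen
    banned = apply_banned_tokens(logits, banned_token_ids=[1, 3])
    print(greedy_pick(banned))  # neither banned token can be chosen
```

Under this framing, ignore_eos can be seen as the limiting case where the EOS mask is applied for the whole generation, which may be why the two options could share one processor.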

Full Changelog: v0.3.6...v0.3.7

v0.3.6

13 Oct 07:22

What's Changed

New Contributors

Full Changelog: v0.3.5...v0.3.6

v0.3.5

09 Oct 09:56
2e70a6d

What's Changed

  • fix: add kcpp /generate/check stub by @g4rg in #47
  • fix: more KAI parameter adaptations by @g4rg in #45
  • Allow CORS connections from anywhere by @thesentinel2615 in #51
  • fix: attention kernel attribute by @AlpinDale in #52
  • feat: AWQ support for Turing GPUs by @AlpinDale in #53
  • Micromamba Runtime by @henk717 in #54
  • Make NVCC work for different versions by @official-elinas in #55
  • chore: allow the user to specify install method by @AlpinDale in #56

New Contributors

Full Changelog: v0.3.4...v0.3.5

v0.3.4

06 Oct 08:29

What's Changed

New Contributors

Full Changelog: v0.3.3...v0.3.4

v0.3.3

02 Oct 06:18

What's Changed

New Contributors

Full Changelog: v0.3.2...v0.3.3

v0.3.2

30 Sep 16:01

What's Changed

Full Changelog: v0.3.1...v0.3.2

v0.3.1

29 Sep 05:39
69a4c32

What's Changed

Full Changelog: v0.3...v0.3.1

v0.3

28 Sep 19:10

What's Changed

New Contributors

Full Changelog: https://github.com/PygmalionAI/aphrodite-engine/commits/v0.3