Skip to content

Awesome-LLM-Inference v0.6

Compare
Choose a tag to compare
@DefTruth DefTruth released this 14 Apr 06:28
· 173 commits to main since this release
25cbc41

What's Changed

  • Add an ICLR paper for KV cache compression by @Janghyun1230 in #8
  • Add github link for paper FP8-Quantization[2208.09225] by @Mr-Philo in #9

New Contributors

Full Changelog: v0.5...v0.6