Awesome-LLM-Inference v0.6

DefTruth released this 14 Apr 06:28

· 173 commits to main since this release

What's Changed

Add an ICLR paper for KV cache compression by @Janghyun1230 in #8
Add github link for paper FP8-Quantization[2208.09225] by @Mr-Philo in #9

New Contributors

@Janghyun1230 made their first contribution in #8
@Mr-Philo made their first contribution in #9

Full Changelog: v0.5...v0.6

Contributors

Janghyun1230 and Mr-Philo

Assets 2