diff --git a/README.md b/README.md index 9679b9f..88682a5 100644 --- a/README.md +++ b/README.md @@ -18,7 +18,7 @@ Awesome-LLM-Inference: A small collection for Awesome LLM Inference **[Papers|Bl - [Awesome-LLM-Inference-v0.3.pdf](https://github.com/DefTruth/Awesome-LLM-Inference/releases/download/v0.3/Awesome-LLM-Inference-v0.3.zip): LLMs inference papers only, 500 pages PDF, contains ByteTransformer, FastServe, FlashAttention 1/2, FlexGen, FP8, LLM.int8(), Tensor Cores, PagedAttention, RoPE, SmoothQuant, SpecInfer, WINT8/4, Continuous Batching, ZeroQuant and more!