Skip to content

Commit

Permalink
Update README_GAUDI.md
Browse files Browse the repository at this point in the history
  • Loading branch information
michalkuligowski authored Sep 17, 2024
1 parent 556c321 commit 1a9e112
Showing 1 changed file with 2 additions and 1 deletion.
3 changes: 2 additions & 1 deletion README_GAUDI.md
Original file line number Diff line number Diff line change
Expand Up @@ -81,14 +81,15 @@ Supported Features
- Inference with [HPU
Graphs](https://docs.habana.ai/en/latest/PyTorch/Inference_on_PyTorch/Inference_Using_HPU_Graphs.html)
for accelerating low-batch latency and throughput
- INC quantization

Unsupported Features
====================

- Beam search
- LoRA adapters
- Attention with Linear Biases (ALiBi)
- Quantization (AWQ)
- AWQ quantization
- Prefill chunking (mixed-batch inferencing)

Supported Configurations
Expand Down

0 comments on commit 1a9e112

Please sign in to comment.