Skip to content

Commit

Permalink
update readme_en
Browse files Browse the repository at this point in the history
  • Loading branch information
billlight committed Mar 29, 2024
1 parent 4432d97 commit bb460fc
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion README_EN.md
Original file line number Diff line number Diff line change
Expand Up @@ -79,7 +79,7 @@ Notes:
## Model Quantization


XiaoMi SD adopts Post-Training Quantization (PTQ) to convert the Float32 model into an INT8 quantized model. PTQ process reduces the model size to approximately $/frac{1}{4}$ of its original size, making the model size about 1.1GB.
XiaoMi SD adopts Post-Training Quantization (PTQ) to convert the Float32 model into an INT8 quantized model. PTQ process reduces the model size to approximately $`\frac{1}{4}`$ of its original size, making the model size about 1.1GB.
The model is quantized by collecting high-quality calibration data and optimizing quantization methods:
* Calibration Data Collection: Collect high-quality prompt datasets from various domains as calibration datasets.
* Quantization Calibration Method: Use the AdaRound quantization algorithm and high-precision per-channel quantization methods to pick the optimal 8-bit fixed-point values for floating-point weight parameters.
Expand Down

0 comments on commit bb460fc

Please sign in to comment.