This repository has been archived by the owner on Sep 18, 2024. It is now read-only.
Hello everyone! I have just begun to use NNI in my projects, and it seems awesome for compressing trained models out of the box. However, after following the QAT tutorials, I still can't figure out how to save the quantized weights in a PyTorch state dict that uses fewer than 32 bits per weight.
My goal is to entropy-code the quantized weights to reduce the size of the original model. I found that ModelSpeedup is supposed to do this for pruned models (https://nni.readthedocs.io/zh/latest/tutorials/pruning_speedup.html), but I cannot find an analogous object for quantized weight tensors, or any similar reference in the documentation.
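For context on what "fewer than 32 bits per weight" would look like in practice, here is a minimal sketch of per-tensor affine quantization, the kind of transform QAT simulates. This is an assumption-laden illustration (symmetric 8-bit quantization written in plain Python, not NNI's actual export format or API): the float weights collapse to small integers plus one scale factor, which is the representation you would then store in an int8 tensor (e.g. `tensor.to(torch.int8)`) and entropy-code.

```python
# Sketch of symmetric per-tensor affine quantization (an assumption,
# not NNI's export format): each float weight is mapped to an integer
# in [-2^(b-1), 2^(b-1) - 1] plus a single shared scale.

def quantize(weights, num_bits=8):
    qmax = 2 ** (num_bits - 1) - 1              # 127 for 8-bit
    scale = max(abs(w) for w in weights) / qmax  # per-tensor scale
    q = [max(-qmax - 1, min(qmax, round(w / scale))) for w in weights]
    return q, scale

def dequantize(q, scale):
    # Reconstruct approximate float weights from integers + scale.
    return [qi * scale for qi in q]

w = [0.5, -1.27, 0.031, 1.0]
q, scale = quantize(w)
w_hat = dequantize(q, scale)
```

Storing `q` as int8 plus one float scale uses roughly a quarter of the space of float32 weights, before any entropy coding on top; the integer values are also exactly what an entropy coder needs as its symbol stream.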