
[low-bit optim] Add COAT optimizer #1190

Open
gau-nernst opened this issue Oct 29, 2024 · 5 comments

@gau-nernst
Collaborator

gau-nernst commented Oct 29, 2024

Paper: https://arxiv.org/abs/2410.19313
Code: https://github.com/NVlabs/COAT (not available yet)

It seems like we already have most of the building blocks. The only new piece of logic is "dynamic range expansion".

We can start implementing it first, then wait for the official code release for numeric checks.
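
For whoever picks this up, here is my rough reading of what "dynamic range expansion" does (a sketch only; the exact formula for the per-group exponent `k` should be checked against the official code once it is released):

```python
import math
import torch

# Sketch of dynamic range expansion as I read the paper: remap each
# quantization group with f(x) = sign(x) * |x|^k so that the group's
# dynamic range better fills E4M3's representable range.
E4M3_MAX = torch.finfo(torch.float8_e4m3fn).max  # 448
E4M3_MIN_NORMAL = 2.0 ** -6                      # smallest normal E4M3 value
E4M3_DYN_RANGE = E4M3_MAX / E4M3_MIN_NORMAL

def expand(x: torch.Tensor, eps: float = 1e-12):
    absx = x.abs().clamp_min(eps)
    group_range = max((absx.max() / absx.min()).item(), 1.0 + 1e-6)
    # exponent that maps the group's range onto E4M3's range (assumed >= 1)
    k = max(math.log(E4M3_DYN_RANGE) / math.log(group_range), 1.0)
    return x.sign() * absx.pow(k), k

def contract(x_expanded: torch.Tensor, k: float):
    # inverse transform, applied after dequantizing the optimizer state
    return x_expanded.sign() * x_expanded.abs().pow(1.0 / k)
```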

@MirMustafaAli

I would like to work on this @gau-nernst!!

@gau-nernst
Collaborator Author

@MirMustafaAli Go ahead and submit a PR 😄. Let me know if you face any problems.

@MirMustafaAli

@gau-nernst Which part of the repo should I look at to implement "dynamic range expansion"? From my understanding of the repo, it should live in the float8 folder, since COAT targets the Hopper architecture and uses the float8 dtype. Any pointers, PRs, or reference methods I can follow would be very helpful.

@gau-nernst
Collaborator Author

You can park it under torchao/prototype/low_bit_optim. The float8/ folder is more for training stuff (fp8 matmul).

You can extend our current OptimStateFp8. See https://github.com/pytorch/ao/blob/000a49026459dd1dadf5ca34322d98e7b1680250/torchao/prototype/low_bit_optim/subclass_fp8.py. I think we only need to change the quantize_fp8 function.
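
Roughly, the change would be to apply the expansion per block before the usual absmax scaling, and to store the per-block exponent alongside the scale for dequantization. An illustrative sketch (the actual `quantize_fp8` signature and block handling in `subclass_fp8.py` may differ, and the expansion formula is my reading of the paper):

```python
import math
import torch

DTYPE = torch.float8_e4m3fn

def quantize_fp8_with_expansion(input: torch.Tensor, block_size: int):
    # illustrative per-block FP8 quantizer: expand each block's dynamic range
    # (sign(x) * |x|^k) before the usual absmax scaling. Assumes numel is
    # divisible by block_size.
    shape = input.shape
    input = input.view(-1, block_size)

    absx = input.abs().clamp_min(1e-12)
    e4m3_range = torch.finfo(DTYPE).max / 2.0 ** -6  # max / smallest normal
    block_range = (absx.amax(-1) / absx.amin(-1)).clamp_min(1.0 + 1e-6)
    k = (math.log(e4m3_range) / torch.log(block_range)).clamp_min(1.0)
    expanded = input.sign() * absx.pow(k.view(-1, 1))

    scale = expanded.abs().amax(-1).clamp_min(1e-12) / torch.finfo(DTYPE).max
    codes = (expanded / scale.view(-1, 1)).to(DTYPE)
    # k (or 1/k) has to be stored next to scale so dequantize can invert the expansion
    return codes.view(shape), scale, k
```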

Another option is to create a separate optimizer. See https://github.com/pytorch/ao/blob/000a49026459dd1dadf5ca34322d98e7b1680250/torchao/prototype/low_bit_optim/adam.py. You can wrap all of the logic in a functional way (see single_param_adam() in the link above), and add the boilerplate code for the optimizer (e.g. init optim states, call the functional optim step with torch.compile...)
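
A very rough skeleton of what the separate-optimizer route could look like (names like `CoatAdam` / `single_param_coat_adam` are placeholders, not existing torchao APIs, and the real boilerplate in `adam.py` is more involved, e.g. it keeps `step` as a tensor to avoid recompiles):

```python
import torch
from torch.optim import Optimizer

def single_param_coat_adam(p, grad, step, exp_avg, exp_avg_sq, lr, beta1, beta2, eps):
    # functional update for one parameter; in the real thing exp_avg / exp_avg_sq
    # would be FP8 subclasses that requantize (with dynamic range expansion) on write
    exp_avg.lerp_(grad, 1 - beta1)
    exp_avg_sq.lerp_(grad.square(), 1 - beta2)
    denom = (exp_avg_sq / (1 - beta2 ** step)).sqrt().add_(eps)
    p.add_(exp_avg / (1 - beta1 ** step) / denom, alpha=-lr)

_compiled_step = torch.compile(single_param_coat_adam, fullgraph=True)

class CoatAdam(Optimizer):
    def __init__(self, params, lr=1e-3, betas=(0.9, 0.999), eps=1e-8):
        super().__init__(params, dict(lr=lr, betas=betas, eps=eps))

    @torch.no_grad()
    def step(self, closure=None):
        for group in self.param_groups:
            for p in group["params"]:
                if p.grad is None:
                    continue
                state = self.state[p]
                if not state:  # lazy init of the (quantized) optim states
                    state["step"] = 0
                    state["exp_avg"] = torch.zeros_like(p)     # would be the FP8 subclass
                    state["exp_avg_sq"] = torch.zeros_like(p)  # with range expansion
                state["step"] += 1
                _compiled_step(
                    p, p.grad, state["step"],
                    state["exp_avg"], state["exp_avg_sq"],
                    group["lr"], *group["betas"], group["eps"],
                )
```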

@MirMustafaAli

MirMustafaAli commented Nov 5, 2024

Thanks!! I'll work on it following your advice.
