
Remove unused FSDP components #2016

Merged: 7 commits merged into pytorch:main on Nov 16, 2024

Conversation

ebsmothers (Contributor)

I was attempting to patch the changes from #1933 so we can merge them, but I'm unable to push to the remote branch (likely because the PR was opened off of the fork's main branch, which has not yet been synced with the latest changes from upstream). So I've just patched everything into this new PR instead.

Tagging @krammnic as the author of the original PR. The only changes I've made are merging the latest changes from main and updating test_validate_missing_and_unexpected_for_lora to get it to pass (and adding back some of the test cases from validate_state_dict_for_lora that were deleted).

pytorch-bot (bot) commented Nov 16, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/torchtune/2016

Note: Links to docs will display an error until the docs builds have been completed.

❗ 1 Active SEV

There is 1 currently active SEV. If your PR is affected, please view it below:

✅ No Failures

As of commit f772fbd with merge base ac14e96:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@facebook-github-bot added the CLA Signed label on Nov 16, 2024
codecov-commenter commented Nov 16, 2024

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 64.47%. Comparing base (f15ba77) to head (ba9c60e).
Report is 2 commits behind head on main.

Additional details and impacted files
@@             Coverage Diff             @@
##             main    #2016       +/-   ##
===========================================
+ Coverage   24.74%   64.47%   +39.72%     
===========================================
  Files         317      317               
  Lines       17669    17535      -134     
===========================================
+ Hits         4373    11305     +6932     
+ Misses      13296     6230     -7066     

☔ View full report in Codecov by Sentry.

SalmanMohammadi (Collaborator) commented Nov 16, 2024

One small thing: is it worth deprecating the public APIs now and deleting them next release? I'm not sure if people are really using these, so your call.

@@ -77,7 +77,6 @@ PEFT Components
 peft.set_trainable_params
 peft.get_adapter_state_dict
 peft.validate_missing_and_unexpected_for_lora
-peft.validate_state_dict_for_lora
SalmanMohammadi (Collaborator) commented on Nov 16, 2024

Line 209 in the LoRA finetune tutorial:

.. note::
    Whenever loading weights with :code:`strict=False`, you should verify that any missing or extra keys in
    the loaded :code:`state_dict` are as expected. torchtune's LoRA recipes do this by default via e.g.
    :func:`validate_state_dict_for_lora() <torchtune.modules.peft.validate_state_dict_for_lora>` or
    :func:`validate_missing_and_unexpected_for_lora() <torchtune.modules.peft.validate_missing_and_unexpected_for_lora>`.

Needs to be updated
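
For context, the pattern that note describes looks roughly like the following. A minimal sketch, assuming the validate_missing_and_unexpected_for_lora signature as of this PR; the lora_llama2_7b builder and the LoRA config here are chosen purely for illustration:

    from torchtune.models.llama2 import lora_llama2_7b
    from torchtune.modules.peft import validate_missing_and_unexpected_for_lora

    model = lora_llama2_7b(lora_attn_modules=["q_proj", "v_proj"])

    # Simulate a base-model checkpoint by dropping the LoRA params; in a real
    # recipe this state dict would come from the checkpointer.
    base_sd = {k: v for k, v in model.state_dict().items() if "lora" not in k}

    # strict=False tolerates the (still uninitialized) LoRA params...
    missing, unexpected = model.load_state_dict(base_sd, strict=False)

    # ...so we verify the gaps are exactly the LoRA params and nothing else.
    validate_missing_and_unexpected_for_lora(
        lora_attn_modules=["q_proj", "v_proj"],
        apply_lora_to_mlp=False,
        apply_lora_to_output=False,
        base_missing=missing,
        base_unexpected=unexpected,
    )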

init_distributed
is_distributed
get_world_size_and_rank
get_full_finetune_fsdp_wrap_policy
SalmanMohammadi (Collaborator) commented on Nov 16, 2024

There's a reference to this in the QAT tutorial too.

get_full_optimizer_state_dict,
get_shard_conditions,
get_world_size_and_rank,
init_distributed,
is_distributed,
load_from_full_model_state_dict,
load_from_full_optimizer_state_dict,
lora_fsdp_wrap_policy,
prepare_model_for_fsdp_with_meta_device,
SalmanMohammadi (Collaborator) commented on Nov 16, 2024

There's a reference to this in a comment here

SalmanMohammadi (Collaborator)

Some big ol' nits due to some floating references; thanks for picking this up :)

ebsmothers (Contributor, Author)

> One small thing: is it worth deprecating the public APIs now and deleting them next release? I'm not sure if people are really using these, so your call.

Yeah, this is a fair question. In this case I'm gonna yolo a bit, because we haven't supported FSDP1 for multiple releases at this point. And I think most of these APIs should never really have been public to begin with; we only made them public out of necessity, because many were used directly in our recipes.

@ebsmothers merged commit 0c31907 into pytorch:main on Nov 16, 2024 (17 checks passed).
@ebsmothers mentioned this pull request on Nov 26, 2024.