build(deps): set transformers below 4.46, waiting on fixes #384

Merged
merged 1 commit into from
Nov 1, 2024

Conversation

Collaborator

@anhuong anhuong commented Nov 1, 2024

Description of the change

The Triton fast_kernels are broken in transformers v4.46 because of changes in that release combined with the way patching is done in fms-acceleration, as described here. In addition, gradient_accumulation regressed in v4.46; the fix is in a transformers PR that has been merged but not yet released.

Upgrading transformers also broke unit tests; this is being discussed and resolved in #383.

Thus we will set transformers below v4.46 while we wait for these fixes to go in.
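
To illustrate the constraint, here is a minimal sketch of a check that a given transformers version falls below v4.46. The helper names (`release_tuple`, `is_below`, `installed_version_ok`) are hypothetical and not part of this repository, and the requirement string in the comment is an assumption about how the bound is written, since the actual dependency diff is not shown here. The comparison only looks at the leading numeric release components, so it is a simplification rather than a full PEP 440 implementation.

```python
import re
from importlib import metadata


def release_tuple(version: str) -> tuple[int, ...]:
    """Return the leading numeric release components, e.g. '4.45.2' -> (4, 45, 2)."""
    match = re.match(r"\d+(?:\.\d+)*", version)
    if match is None:
        raise ValueError(f"cannot parse version: {version!r}")
    return tuple(int(part) for part in match.group().split("."))


def is_below(version: str, bound: str = "4.46") -> bool:
    """True if `version` is strictly below `bound`.

    Mirrors the intent of a requirement such as `transformers<4.46`
    (assumed spelling; the real constraint may also carry a lower bound).
    """
    return release_tuple(version) < release_tuple(bound)


def installed_version_ok(package: str = "transformers", bound: str = "4.46"):
    """Check the installed package against the bound; None if it is not installed."""
    try:
        installed = metadata.version(package)
    except metadata.PackageNotFoundError:
        return None
    return is_below(installed, bound)


if __name__ == "__main__":
    for candidate in ("4.45.2", "4.46.0", "4.46.1"):
        print(candidate, is_below(candidate))
```

A check like this could run at startup or in CI to fail early if an environment resolves to v4.46 or later before the upstream fixes are released.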

Related issue number

How to verify the PR

Was the PR tested

  • I have added >=1 unit test(s) for every new method I have added.
  • I have ensured all unit tests pass


github-actions bot commented Nov 1, 2024

Thanks for making a pull request! 😃
One of the maintainers will review and advise on the next steps.

@github-actions github-actions bot added the build label Nov 1, 2024
Collaborator

@aluu317 aluu317 left a comment


LGTM!

@anhuong anhuong merged commit 0e664f2 into foundation-model-stack:main Nov 1, 2024
8 checks passed

2 participants