Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix model OOM issue in llama-405 and mixtral - 2nd attempt #644

Open
wants to merge 3 commits into
base: habana_main
Choose a base branch
from

Conversation

afierka-intel
Copy link

Another approach to fix the OOM issue in loading model. This time instead change specific models code, I updated model weights iterator. Hope this fix will easier to upstream.

@afierka-intel afierka-intel added the habana Issues or PRs submitted by Habana Labs label Dec 18, 2024
@afierka-intel afierka-intel self-assigned this Dec 18, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
habana Issues or PRs submitted by Habana Labs
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant