-
Notifications
You must be signed in to change notification settings - Fork 327
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Support CUDA Graph for MoE models #1233
base: main
Are you sure you want to change the base?
Commits on Oct 9, 2024
-
Align RNG tracker with megatron
Signed-off-by: Robin Zhang <[email protected]> Co-authored-by: Yifei Song <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for a95c7e3 - Browse repository at this point
Copy the full SHA a95c7e3View commit details -
Fix module_params order and warmup bug in cudagraph
Signed-off-by: Robin Zhang <[email protected]> Co-authored-by: Yifei Song <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 1c9e35b - Browse repository at this point
Copy the full SHA 1c9e35bView commit details -
Add fp8_group argument and fix fp8 accuracy issue for cudagraph
Signed-off-by: Robin Zhang <[email protected]> Co-authored-by: Yifei Song <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for cef94e4 - Browse repository at this point
Copy the full SHA cef94e4View commit details -
Add TE modules and weights filters to support MoE models
Signed-off-by: Robin Zhang <[email protected]> Co-authored-by: Yifei Song <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 34967b6 - Browse repository at this point
Copy the full SHA 34967b6View commit details
Commits on Oct 10, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 076414c - Browse repository at this point
Copy the full SHA 076414cView commit details
Commits on Oct 11, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 88b614c - Browse repository at this point
Copy the full SHA 88b614cView commit details -
Use hooks to filter module params
Signed-off-by: Robin Zhang <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 10abcde - Browse repository at this point
Copy the full SHA 10abcdeView commit details -
Filter all TE modules in hooks
Signed-off-by: Robin Zhang <[email protected]> Co-authored-by: Yifei Song <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for f920486 - Browse repository at this point
Copy the full SHA f920486View commit details -
Configuration menu - View commit details
-
Copy full SHA for 0c378e5 - Browse repository at this point
Copy the full SHA 0c378e5View commit details -
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Configuration menu - View commit details
-
Copy full SHA for d639cfe - Browse repository at this point
Copy the full SHA d639cfeView commit details -
Configuration menu - View commit details
-
Copy full SHA for 13208f3 - Browse repository at this point
Copy the full SHA 13208f3View commit details
Commits on Nov 1, 2024
-
Signed-off-by: Robin Zhang <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for f92d901 - Browse repository at this point
Copy the full SHA f92d901View commit details -
Configuration menu - View commit details
-
Copy full SHA for e6e1eeb - Browse repository at this point
Copy the full SHA e6e1eebView commit details