Detect AMD Zen 5 microarchitecture #266

Merged on Nov 14, 2024 (1 commit)

Conversation

fbarchard
Contributor

Detect AMD Family 26 (0x1A), with any model, as cpuinfo_uarch_zen5.
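For readers unfamiliar with how the family number is obtained, here is a minimal sketch of the decoding step, assuming the standard CPUID leaf 1 EAX layout. The `example_*` names are hypothetical stand-ins for illustration and are not cpuinfo's actual enum or functions; the real change in this PR may be structured differently.

```c
#include <stdint.h>

/* Hypothetical stand-in for illustration; not cpuinfo's real enum. */
enum example_uarch {
    example_uarch_unknown = 0,
    example_uarch_zen5 = 1,
};

/* Display family from CPUID leaf 1 EAX: the base family (bits 11:8),
 * plus the extended family (bits 27:20) when the base family is 0xF. */
static uint32_t example_decode_family(uint32_t eax) {
    const uint32_t base_family = (eax >> 8) & 0xF;
    const uint32_t ext_family = (eax >> 20) & 0xFF;
    return base_family == 0xF ? base_family + ext_family : base_family;
}

/* Assumes the caller has already established that the vendor is AMD.
 * The model field is deliberately ignored: any family 26 (0x1A) part
 * is reported as Zen 5, mirroring the rule described in this PR. */
static enum example_uarch example_decode_amd_uarch(uint32_t eax) {
    if (example_decode_family(eax) == 0x1A) {
        return example_uarch_zen5;
    }
    return example_uarch_unknown;
}
```

As a usage example, an EAX value of `0x00B40F40` has base family 0xF and extended family 0x0B, so it decodes to family 26 and maps to the Zen 5 value regardless of its model bits.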

@digantdesai
Contributor

From @fbarchard

The reason for wanting Zen 5 detection is that AVX-512 behaves differently on AMD and Intel:
AMD prefers short, wide tile sizes such as 6x64.
Intel prefers tall, narrow tile sizes such as 13x32.
GFNI is present on both, but it is slow on AMD Zen 4.
int32 multiply is very slow on Intel but fast on AMD.
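To illustrate why a consumer of this library would care about the distinction, here is a rough sketch of vendor-dependent tile selection. Everything in it is an assumption made for illustration: the `example_*` names are hypothetical, the tile sizes are simply the two examples quoted above, and reading "6x64" as rows by columns is my interpretation of "short/wide". It is not taken from any real kernel-selection code.

```c
#include <stddef.h>

/* Hypothetical vendor tag; a real caller would derive this from the
 * detected microarchitecture. */
enum example_vendor {
    example_vendor_intel,
    example_vendor_amd,
};

/* Tile shape as rows (mr) by columns (nr). */
struct example_tile {
    size_t mr;
    size_t nr;
};

/* Picks an AVX-512 tile shape using only the example sizes from the
 * comment above: short and wide on AMD, tall and narrow on Intel. */
static struct example_tile example_select_avx512_tile(enum example_vendor vendor) {
    struct example_tile tile;
    if (vendor == example_vendor_amd) {
        tile.mr = 6;
        tile.nr = 64;
    } else {
        tile.mr = 13;
        tile.nr = 32;
    }
    return tile;
}
```

The same kind of switch could gate GFNI or int32-multiply code paths, which is why the microarchitecture needs to be distinguishable and not just the feature flags.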

@digantdesai
Contributor

The RISC-V check failure seems unrelated.

@digantdesai digantdesai merged commit cebb093 into pytorch:main Nov 14, 2024
11 of 12 checks passed