
Unsupported Scalar Type 5? -- Portable/optimized ops don't consistently support half/bfloat16 #7748

Open
bluejack opened this issue Jan 17, 2025 · 3 comments
@bluejack

🐛 Describe the bug

After exporting a model to pte form and running it through executor_runner, I get:

E 00:00:02.220756 executorch:inputs_portable.cpp:45] Unsupported scalar type 5

I believe this is the "Half" type, i.e. float16.

Does that simply mean executor_runner does not support float16? Or does the whole framework not support float16?
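For what it's worth, ExecuTorch reuses PyTorch's c10 `ScalarType` numbering, so the "5" in the error can be decoded with a small lookup table. The snippet below is an illustrative sketch (the table mirrors a subset of the c10 enum; `decode_scalar_type` is just a helper name, not an ExecuTorch API):

```python
# Subset of the c10::ScalarType enum that PyTorch/ExecuTorch share.
SCALAR_TYPES = {
    0: "Byte (uint8)",
    1: "Char (int8)",
    2: "Short (int16)",
    3: "Int (int32)",
    4: "Long (int64)",
    5: "Half (float16)",
    6: "Float (float32)",
    7: "Double (float64)",
    11: "Bool",
    15: "BFloat16",
}

def decode_scalar_type(code: int) -> str:
    """Return a human-readable name for a ScalarType code."""
    return SCALAR_TYPES.get(code, f"unknown ({code})")

print(decode_scalar_type(5))  # -> Half (float16)
```

So "Unsupported scalar type 5" is indeed complaining about float16 inputs.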

Note that when I investigate the file using a Python script, I get as far as sending it my float16 tensors, but it still fails to execute with a similar error:

[op_native_layer_norm.cpp:169] In function operator()(), assert failed (false): Unhandled dtype Half for native_layer_norm.out

I'm including the versions below, but note that this is executorch built from head rather than the last release. Should I expect the framework to support float16, and look to my own code for the error?

Versions

PyTorch version: 2.6.0.dev20250104
Is debug build: False
CUDA used to build PyTorch: None
ROCM used to build PyTorch: N/A

OS: macOS 14.6.1 (arm64)
GCC version: Could not collect
Clang version: 16.0.0 (clang-1600.0.26.4)
CMake version: version 3.31.4
Libc version: N/A

Python version: 3.12.7 | packaged by Anaconda, Inc. | (main, Oct 4 2024, 08:22:19) [Clang 14.0.6 ] (64-bit runtime)
Python platform: macOS-14.6.1-arm64-arm-64bit
Is CUDA available: False
CUDA runtime version: No CUDA
CUDA_MODULE_LOADING set to: N/A
GPU models and configuration: No CUDA
Nvidia driver version: No CUDA
cuDNN version: No CUDA
HIP runtime version: N/A
MIOpen runtime version: N/A
Is XNNPACK available: True

CPU:
Apple M3 Pro

Versions of relevant libraries:
[pip3] executorch==0.6.0a0+cd0e584
[pip3] flake8==7.0.0
[pip3] mypy==1.11.2
[pip3] mypy-extensions==1.0.0
[pip3] numpy==2.0.0
[pip3] numpydoc==1.7.0
[pip3] torch==2.6.0.dev20250104
[pip3] torchao==0.8.0+git2e032c6b
[pip3] torchaudio==2.6.0.dev20250104
[pip3] torchsr==1.0.4
[pip3] torchvision==0.22.0.dev20250104
[conda] executorch 0.6.0a0+cd0e584 pypi_0 pypi
[conda] numpy 2.0.0 pypi_0 pypi
[conda] numpydoc 1.7.0 py312hca03da5_0
[conda] torch 2.6.0.dev20250104 pypi_0 pypi
[conda] torchao 0.8.0+git2e032c6b pypi_0 pypi
[conda] torchaudio 2.6.0.dev20250104 pypi_0 pypi
[conda] torchsr 1.0.4 pypi_0 pypi
[conda] torchvision 0.22.0.dev20250104 pypi_0 pypi

swolchok added a commit that referenced this issue Jan 17, 2025
Partial fix for #7748.

ghstack-source-id: 0c7e0a5712cba6829fdf5461ea50a8cc4afd39f0
ghstack-comment-id: 2599375147
Pull Request resolved: #7750
@swolchok (Contributor)

> Does that simply mean executor_runner does not support float16?

It looks like this particular function does not support float16. I've just sent #7750 to fix it.

> Or does the whole framework not support float16?

We are capable of supporting it, but it looks like portable ops coverage is spotty. I'll send a fix for native_layer_norm and as many other places as I can find.

swolchok added a commit that referenced this issue Jan 18, 2025
Partial fix for #7748.

ghstack-source-id: 9f183dddcd87edb2493af0f97d7ad4e40d9be434
ghstack-comment-id: 2599398274
Pull Request resolved: #7758
@swolchok swolchok self-assigned this Jan 18, 2025
@swolchok swolchok changed the title Unsupported Scalar Type 5? Unsupported Scalar Type 5? -- Portable/optimized ops don't consistently support half/bfloat16 Jan 18, 2025
swolchok added a commit that referenced this issue Jan 18, 2025
Partial fix for #7748.

ghstack-source-id: a72e5e33f005abc47cc1143f7b282f8050374955
ghstack-comment-id: 2599413770
Pull Request resolved: #7760
@swolchok (Contributor) commented Jan 18, 2025

By the way, if you're running on your Mac, you might want to enable the XNNPACK delegate when exporting; there's a good chance you will get both better performance and a workaround for the remaining instance of this issue I haven't got PRs out for yet (though I don't know whether XNNPACK has layer norm off the top of my head).
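For readers following along, an export flow with the XNNPACK delegate enabled might look roughly like the untested sketch below. `MyModel` and the input shape are placeholders, and the calls shown assume the `to_edge_transform_and_lower` API available in ExecuTorch around the 0.5/0.6 releases; check the current export docs before relying on it:

```python
# Untested sketch: export a model to .pte with the XNNPACK delegate.
# Requires the executorch package; MyModel is a placeholder nn.Module.
import torch
from executorch.backends.xnnpack.partition.xnnpack_partitioner import XnnpackPartitioner
from executorch.exir import to_edge_transform_and_lower

model = MyModel().eval()
example_inputs = (torch.randn(1, 16, dtype=torch.float16),)

# Capture the graph, then lower XNNPACK-supported subgraphs to the delegate.
exported = torch.export.export(model, example_inputs)
et_program = to_edge_transform_and_lower(
    exported,
    partitioner=[XnnpackPartitioner()],
).to_executorch()

with open("model.pte", "wb") as f:
    f.write(et_program.buffer)
```

Ops the partitioner claims run through XNNPACK's own kernels, which may sidestep gaps in the portable-op dtype coverage; anything left unpartitioned still falls back to the portable ops discussed above.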

swolchok added a commit that referenced this issue Jan 18, 2025
Partial fix for #7748.

ghstack-source-id: 02bfc58615997b27f0ecb99f8efcf7fce0694b8c
ghstack-comment-id: 2599413770
Pull Request resolved: #7760
swolchok added a commit that referenced this issue Jan 18, 2025
Partial fix for #7748.

ghstack-source-id: b7b33809ec99537c0f44c7abb5880c6502d30698
ghstack-comment-id: 2599481711
Pull Request resolved: #7767
swolchok added a commit that referenced this issue Jan 18, 2025
Partial fix for #7748.

ghstack-source-id: 02a1dc797b933f836efe17aa659b6a0c27ecc460
ghstack-comment-id: 2599483099
Pull Request resolved: #7769
@bluejack (Author)

> By the way, if you're running on your Mac, you might want to enable the XNNPACK delegate when exporting; there's a good chance you will get both better performance and a workaround for the remaining instance of this issue I haven't got PRs out for yet (though I don't know whether XNNPACK has layer norm off the top of my head).

OK, I will look into this option. Thanks for the tip.
