https://github.com/halide/Halide
Revision 6569a831f2ee7f1a2e3d809c36fb92514d710c44 authored by Andrew Adams on 11 September 2023, 17:40:29 UTC, committed by GitHub on 11 September 2023, 17:40:29 UTC
* Enable emission of float16/32 casts on x86

Fixes #7836
Fixes #4166

* Add support for zen4

* Add avx512_Zen4 target flag

It's a superset of Cannon Lake and a subset of Sapphire Rapids

* Fix runtime detection, sapphire rapids CPUID bits

* Fix comment

* Don't catch bfloat casts

* Fix Zen4 model number

* Use llvm BFloat type for bfloat intrinsics

* Give up on native bfloat16 conversion for now

* Don't use llvm's bfloat type at all

* Add missing enum

* Fix constant in comment

* clang-format
1 parent b704abd
Tip revision: 6569a831f2ee7f1a2e3d809c36fb92514d710c44 authored by Andrew Adams on 11 September 2023, 17:40:29 UTC
Zen4 support (#7840)
File Mode Size
CMakeLists.txt -rw-r--r-- 2.6 KB
GenGen.cpp -rw-r--r-- 120 bytes
RunGen.h -rw-r--r-- 57.2 KB
RunGenMain.cpp -rw-r--r-- 22.9 KB
binary2cpp.cpp -rw-r--r-- 1.7 KB
build_halide_h.cpp -rw-r--r-- 2.6 KB
find_inverse.cpp -rw-r--r-- 11.8 KB
halide_benchmark.h -rw-r--r-- 9.4 KB
halide_config.make.tpl -rw-r--r-- 157 bytes
halide_image.h -rw-r--r-- 743 bytes
halide_image_info.h -rw-r--r-- 10.4 KB
halide_image_io.h -rw-r--r-- 76.0 KB
halide_malloc_trace.h -rw-r--r-- 3.3 KB
halide_thread_pool.h -rw-r--r-- 4.8 KB
halide_trace_config.h -rw-r--r-- 19.1 KB
launch_wasm_test.js -rw-r--r-- 936 bytes
makelib.sh -rwxr-xr-x 2.2 KB
regexp_replace.cpp -rw-r--r-- 860 bytes
