https://github.com/halide/Halide
Revision 1b22dfe6c1d95332430cf45bfe1488af7e7dffe3 authored by John Lawson on 05 February 2021, 20:06:34 UTC, committed by GitHub on 05 February 2021, 20:06:34 UTC
The vcvtne2ps2bf16 instruction combines two f32x16 vectors and converts
them to one bf16x32 vector. We can use this to support converting an
f32x32 vector to a bf16x32 vector by splitting the input vector into two.
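
The splitting idea described above can be sketched in portable scalar C++. This is an illustration only, not the Halide implementation: the names `f32_to_bf16_rne`, `cvt2_f32x16_to_bf16x32` and `cvt_f32x32_to_bf16x32` are hypothetical, the two-source helper merely models the shape of a "two f32x16 in, one bf16x32 out" operation, and which source fills which half is an assumption made here for readability. Rounding is round-to-nearest-even; denormal handling is not modelled, and NaN handling is simplified.

```cpp
#include <algorithm>
#include <array>
#include <cstdint>
#include <cstring>

using F32x16 = std::array<float, 16>;
using F32x32 = std::array<float, 32>;
using BF16x32 = std::array<uint16_t, 32>;

// Scalar stand-in for one lane: f32 -> bf16 with round-to-nearest-even.
// Assumption: NaNs are simply quieted; denormals get no special treatment.
inline uint16_t f32_to_bf16_rne(float f) {
    uint32_t bits;
    std::memcpy(&bits, &f, sizeof(bits));
    if ((bits & 0x7fffffffu) > 0x7f800000u) {
        // NaN: keep the sign and upper payload bits, force the quiet bit.
        return static_cast<uint16_t>((bits >> 16) | 0x0040u);
    }
    // Add 0x7fff plus the lowest kept bit so that exact ties round to even.
    const uint32_t rounding_bias = 0x7fffu + ((bits >> 16) & 1u);
    return static_cast<uint16_t>((bits + rounding_bias) >> 16);
}

// Hypothetical model of a two-source convert: `lo` fills lanes 0..15 of the
// result and `hi` fills lanes 16..31 (the half ordering is an assumption).
inline BF16x32 cvt2_f32x16_to_bf16x32(const F32x16 &lo, const F32x16 &hi) {
    BF16x32 out{};
    for (int i = 0; i < 16; ++i) {
        out[i] = f32_to_bf16_rne(lo[i]);
        out[i + 16] = f32_to_bf16_rne(hi[i]);
    }
    return out;
}

// f32x32 -> bf16x32: split the input into two f32x16 halves and feed both
// halves to a single two-source convert.
inline BF16x32 cvt_f32x32_to_bf16x32(const F32x32 &in) {
    F32x16 lo{}, hi{};
    std::copy(in.begin(), in.begin() + 16, lo.begin());
    std::copy(in.begin() + 16, in.end(), hi.begin());
    return cvt2_f32x16_to_bf16x32(lo, hi);
}
```

Calling `cvt_f32x32_to_bf16x32` on a 32-element `std::array<float, 32>` returns the 32 bf16 bit patterns in input order. On real hardware the per-lane loop would be replaced by one vector instruction; the scalar form is used here only so the sketch runs anywhere.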
1 parent 8ee7f4c
Add support for AVX512 f32x32 to bf16x32 conversion (#5711)
File Mode Size
.github
apps
cmake
dependencies
doc
packaging
python_bindings
src
test
tools
tutorial
util
.clang-format -rw-r--r-- 1.4 KB
.clang-format-ignore -rw-r--r-- 265 bytes
.clang-tidy -rw-r--r-- 1.8 KB
.gitattributes -rw-r--r-- 342 bytes
.gitignore -rw-r--r-- 1.1 KB
.gitmodules -rw-r--r-- 0 bytes
CMakeLists.txt -rw-r--r-- 4.2 KB
CMakePresets.json -rw-r--r-- 2.4 KB
CODE_OF_CONDUCT.md -rw-r--r-- 3.5 KB
LICENSE.txt -rw-r--r-- 3.2 KB
Makefile -rw-r--r-- 100.0 KB
README.md -rw-r--r-- 14.8 KB
README_cmake.md -rw-r--r-- 69.0 KB
README_rungen.md -rw-r--r-- 12.1 KB
README_webassembly.md -rw-r--r-- 7.5 KB
run-clang-format.sh -rwxr-xr-x 1.4 KB
run-clang-tidy.sh -rwxr-xr-x 3.0 KB

README.md
