https://github.com/halide/Halide
Revision ba478195bcb6c7ed52976ce2801ac186452f3473 authored by Andrew Adams on 03 May 2021, 23:51:26 UTC, committed by Andrew Adams on 03 May 2021, 23:51:26 UTC
This version lowers rounding shifts without needing to widen, which is a large win on
x86 for 16- and 32-bit types (3.8x faster and 2.8x faster respectively).
It's a very slight slowdown for 8-bit because x86 doesn't have 8-bit
shift instructions.

Also drive-by typo fix.
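
For readers unfamiliar with the term, a minimal sketch of what "non-widening" can mean for a rounding right shift. This is one well-known identity, not necessarily the lowering this commit implements; the function names are hypothetical, the shift amount is assumed to satisfy 1 <= s <= 15, and right shift of a negative value is assumed to be arithmetic (guaranteed only from C++20). Scalar C++ promotes int16_t to int anyway, so the distinction only matters per vector lane: the first form needs a wider intermediate to hold the sum, the second never produces a value outside the 16-bit range.

```cpp
#include <cstdint>

// Widening reference (hypothetical name): add the rounding term in 32 bits,
// because x + (1 << (s - 1)) can overflow a 16-bit lane.
int16_t rounding_shift_right_widening(int16_t x, int s) {
    int32_t wide = static_cast<int32_t>(x);
    return static_cast<int16_t>((wide + (int32_t(1) << (s - 1))) >> s);
}

// Non-widening sketch (hypothetical name): floor(x / 2^s) plus the bit just
// below the shift point. Every intermediate fits in 16 bits.
int16_t rounding_shift_right_narrow(int16_t x, int s) {
    int16_t floor_part = static_cast<int16_t>(x >> s);
    int16_t round_bit = static_cast<int16_t>((x >> (s - 1)) & 1);
    return static_cast<int16_t>(floor_part + round_bit);
}
```

The two forms agree because adding 2^(s-1) before shifting carries into the quotient exactly when bit s-1 of x is set.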
1 parent 5a0d1e5
Non-widening lowering of rounding shifts
File Mode Size
.github
apps
cmake
dependencies
doc
packaging
python_bindings
src
test
tools
tutorial
util
.clang-format -rw-r--r-- 1.4 KB
.clang-format-ignore -rw-r--r-- 265 bytes
.clang-tidy -rw-r--r-- 1.7 KB
.gitattributes -rw-r--r-- 342 bytes
.gitignore -rw-r--r-- 1.1 KB
.gitmodules -rw-r--r-- 0 bytes
CMakeLists.txt -rw-r--r-- 5.4 KB
CMakePresets.json -rw-r--r-- 2.2 KB
CODE_OF_CONDUCT.md -rw-r--r-- 3.5 KB
LICENSE.txt -rw-r--r-- 3.2 KB
Makefile -rw-r--r-- 99.9 KB
README.md -rw-r--r-- 15.1 KB
README_cmake.md -rw-r--r-- 69.0 KB
README_rungen.md -rw-r--r-- 12.1 KB
README_webassembly.md -rw-r--r-- 8.6 KB
run-clang-format.sh -rwxr-xr-x 1.4 KB
run-clang-tidy.sh -rwxr-xr-x 3.1 KB

