https://github.com/halide/Halide
Revision 00211656fd208c5e6eb28f943dbbe8c65b45622f authored by Andrew Adams on 04 January 2022, 16:40:23 UTC, committed by GitHub on 04 January 2022, 16:40:23 UTC
* Make random 2x faster by putting the innermost var last

* Improve period of low bits of random noise

* Add new rewrite rules for quadratics

By pulling constant additions outside of quadratics, we can shave off a
few add instructions in the inner loop for random number generation,
which uses a quadratic modulo 2^32

I also removed the !overflows predicates, because rules already fail to
match if a fold overflows.

New rules formally verified.

* Make expensive_zero actually always zero
1 parent f11d820
History
Tip revision: 00211656fd208c5e6eb28f943dbbe8c65b45622f authored by Andrew Adams on 04 January 2022, 16:40:23 UTC
Make random faster by putting the innermost var last (#6504)
Tip revision: 0021165
File Mode Size
.github
apps
cmake
dependencies
doc
packaging
python_bindings
src
test
tools
tutorial
util
.clang-format -rw-r--r-- 1.4 KB
.clang-format-ignore -rw-r--r-- 265 bytes
.clang-tidy -rw-r--r-- 1.8 KB
.gitattributes -rw-r--r-- 342 bytes
.gitignore -rw-r--r-- 1.1 KB
.gitmodules -rw-r--r-- 0 bytes
CMakeLists.txt -rw-r--r-- 5.5 KB
CMakePresets.json -rw-r--r-- 5.2 KB
CODE_OF_CONDUCT.md -rw-r--r-- 3.5 KB
LICENSE.txt -rw-r--r-- 3.2 KB
Makefile -rw-r--r-- 101.8 KB
README.md -rw-r--r-- 16.4 KB
README_cmake.md -rw-r--r-- 69.2 KB
README_rungen.md -rw-r--r-- 12.1 KB
README_webassembly.md -rw-r--r-- 10.5 KB
run-clang-format.sh -rwxr-xr-x 1.4 KB
run-clang-tidy.sh -rwxr-xr-x 3.2 KB

README.md

back to top