Revision d7760f582f7d7ef21383edce142c1e64e1535922 authored by Derek Gerstmann on 15 September 2023, 01:05:12 UTC, committed by GitHub on 15 September 2023, 01:05:12 UTC
* Add tutorial on JIT compile/execute performance

* Addressing comments from review. Fix punctuation and comment nits.
Add timing estimates as comments.
Add std::function example.
Enable advanced scheduling directives.

* Addressing comments from review.

Added cases that match real usage patterns:

1. Defining and compiling the whole pipeline every time you want to run it (i.e. in the benchmarking loop)
2. Defining the pipeline outside the benchmarking loop, and realizing it repeatedly.
3. (optional) Same as 2), but calling compile_jit() outside the loop, saying what it does, and saying why the time isn't actually different to case 2 (benchmark() runs multiple times and takes a min, and realize only compiiles on the first run)
4. Compiling to a callable outside the benchmarking loop and showing that it has lower overhead than case 3 (if indeed it does. If not we may need to change the example so that it does, e.g. by adding a real input buffer.)

* Addressing comments from review for style nits, and typos in comments.

---------

Co-authored-by: Derek Gerstmann <dgerstmann@adobe.com>
Co-authored-by: Steven Johnson <srj@google.com>
1 parent 8797287
History
File Mode Size
BundleStatic.cmake -rw-r--r-- 7.8 KB
CheckFilesExist.cmake -rw-r--r-- 455 bytes
FindHalide.cmake -rw-r--r-- 292 bytes
FindHalide_WebGPU.cmake -rw-r--r-- 744 bytes
FindHexagonSDK.cmake -rw-r--r-- 2.5 KB
HalideGeneratorHelpers.cmake -rw-r--r-- 32.5 KB
HalideTargetHelpers.cmake -rw-r--r-- 2.3 KB
HalideTestHelpers.cmake -rw-r--r-- 4.6 KB
TargetExportScript.cmake -rw-r--r-- 2.3 KB
WipeStandardFlags.cmake -rw-r--r-- 702 bytes
toolchain.linux-aarch64.cmake -rw-r--r-- 689 bytes
toolchain.linux-arm32.cmake -rw-r--r-- 1.3 KB
toolchain.linux-i386.cmake -rw-r--r-- 612 bytes
toolchain.linux-x64-asan.cmake -rw-r--r-- 2.0 KB
toolchain.linux-x64-fuzzer.cmake -rw-r--r-- 1.4 KB

back to top