https://github.com/halide/Halide

sort by:
Revision Author Date Message Commit Date
e3606cc Fix GPU barrier deadlocks Partition loops shouldn't mess with serial loops containing thread barriers, potentially causing warp divergence and deadlock (seen in some obscure lens blur schedules). Also we were generating too many thread barriers in a branch where the base mutator class was accidentally always mutating something, so there's a change to FuseGPUThreadLoops to make it more bug-resistant. Without these additional barriers I have been unable to come up with a case where a barrier ends up somewhere that would deadlock, so no test. 13 August 2020, 18:41:13 UTC
98a116a Clean up is-jit-compiled checks in Pipeline (#5172) * Clean up is-jit-compiled checks in Pipeline Because WebAssembly is a special beast, the way it is 'jitted' is special, and the checks to avoid redundant jitting needed extra logic in compile_jit(). Unfortunately there was another place in Pipeline that also needed this special casing. This PR adds a `get_compiled_jit_target()` bottleneck to consolidate this. * defined() -> has_unknowns() 13 August 2020, 16:31:11 UTC
fa1abba Add infer_input_bounds(vector<int>) (#5174) * Add infer_input_bounds(vector<int>) Add a variant of infer_input_bounds() that takes an explicit vector of int, rather than the up-to-4-int version that is a holdover from the buffer_t days; deprecate the old version; convert all existing code to use the new one. Note that I'm using a new overload (with an initializer-list) as a way to subvert the mis-binding of `{}` and `{1}` to the deprecated function; this adds a trivial amount of overhead but (I think) allows us to ensure that converted code probably avoids the deprecated method. * Update Func.cpp 13 August 2020, 16:30:46 UTC
f6dcdde Merge pull request #5177 from halide/abadams/fix_stencil_chain_gpu_schedule Schedule last stage of stencil chain on GPU too 13 August 2020, 02:08:43 UTC
f7528c2 Merge pull request #5176 from halide/srj-hvx-codegen Remove unnecessary call to halide.hexagon.pack.vh in CG_HVX 12 August 2020, 21:06:03 UTC
d1592d1 Merge pull request #5155 from halide/abadams/add_missing_boundary_condition_overload Add missing overload for boundary conditions on a buffer 12 August 2020, 17:08:02 UTC
2e2649f Merge remote-tracking branch 'origin/master' into abadams/add_missing_boundary_condition_overload 11 August 2020, 19:54:08 UTC
b8ad19f Schedule last stage of stencil chain on GPU too 11 August 2020, 19:12:57 UTC
528b46b Remove unnecessary call to halide.hexagon.pack.vh in CG_HVX In the degenerate case of shuffle_vector() calling vlut() to shuffle a vector that is wider than 256 elements, the code was incorrectly using halide.hexagon.pack.vh on a vector-of-bool; this used to be necessary, but hasn't been for a while, so clearly this code path wasn't being exercised. Remove the halide.hexagon.pack.vh and added a test case to exercise that path. Also, drive-by removal of #include "EliminateBoolVectors.h" from CGHVX since it is no longer used there. 11 August 2020, 16:59:31 UTC
26b5be4 Update fft app to use new boundary condition syntax 09 August 2020, 16:05:40 UTC
52da814 Merge pull request #5165 from halide/abadams/rungen_set_host_dirty Make sure to set_host_dirty in rungen 09 August 2020, 00:46:32 UTC
aa92f5c Make sure to set_host_dirty in rungen Otherwise synthetic inputs like 'random' end up being 'zero' 08 August 2020, 18:36:47 UTC
41e10e0 Merge branch 'abadams/add_missing_boundary_condition_overload' of https://github.com/halide/Halide into abadams/add_missing_boundary_condition_overload 07 August 2020, 19:07:06 UTC
46ae5ff Fix boundary condition in blur app 07 August 2020, 19:06:57 UTC
49d0476 Merge pull request #5162 from halide/srj-wasm-shell-version Update WASM_SHELL_VERSION 07 August 2020, 04:31:25 UTC
640c324 Merge pull request #5161 from halide/srj-infer-input-bounds Add a Target to the args of infer_input_bounds() 07 August 2020, 04:31:10 UTC
4e2c25f Merge pull request #5163 from halide/srj-blur-fix Fix apps/blur on Hexagon 07 August 2020, 01:08:47 UTC
8127ba9 Merge branch 'master' into abadams/add_missing_boundary_condition_overload 06 August 2020, 23:22:07 UTC
32c6fdb Merge pull request #5158 from halide/abadams/fix_nl_means_estimates Fix incorrect estimates for nl_means autoscheduler 06 August 2020, 23:21:30 UTC
fabd9b7 Fix apps/blur on Hexagon Don't rely on HALIDE_RUNTIME_HEXAGON; examine the metadata to see how we were compiled 06 August 2020, 22:21:54 UTC
ada32bc Merge pull request #5157 from halide/rename-to-nvptx Rename WITH_PTX to WITH_NVPTX so all WITH_* macros match LLVM component names 05 August 2020, 23:28:16 UTC
8b8a9ea Update WASM_SHELL_VERSION 8.6.325 is the minimum version that fixes some wasm-simd bugs that affect us. 05 August 2020, 23:04:18 UTC
3355a33 Add a Target to the args of infer_input_bounds() Currently, if the code isn't already jitted (or is jitted with NoBoundsQuery enabled), infer_input_bounds() falls back to whatever is in HL_JIT_TARGET. This is suboptimal, because it hides the fact that infer_input_bounds() relies on the value of that env var. It can also cause unintentional incorrect re-jits; e.g. if simd_op_check is run for HL_TARGET=wasm-32-wasmrt (and HL_JIT_TARGET not set at all), the infer_input_bounds() call can end up re-jitting for native code instead of WebAssembly. 05 August 2020, 22:30:50 UTC
09518b4 Merge branch 'master' into rename-to-nvptx 05 August 2020, 17:22:16 UTC
e3a573b Update TARGET_PTX to TARGET_NVPTX internally. 05 August 2020, 17:21:43 UTC
2e7b5ac Merge pull request #5159 from halide/vksnk/ucon-no-asserts Fix 'unused var' warning for user_context generated by CodeGen_C with no_asserts flag 04 August 2020, 22:53:21 UTC
d65846b Fix 'unused var' warning for user_context generated by CodeGen_C with no_asserts flag 04 August 2020, 21:08:20 UTC
fe2e78e Fix incorrect estimates for nl_means autoscheduler We run it on the parrot image, which is 5 megapixels. 04 August 2020, 20:57:01 UTC
74829ea Make Makefile use WITH_NVPTX in interface. 04 August 2020, 20:18:38 UTC
89f9a92 Rename WITH_PTX to WITH_NVPTX so all WITH_* macros match LLVM component names 04 August 2020, 20:10:53 UTC
6da8008 clang-format 04 August 2020, 17:22:05 UTC
3433d49 Add missing overload for boundary conditions on a buffer The following syntax did not work due to a lack of appropriate overload: repeat_edge(buffer, {{0, W}, {0, H}}) The version that worked was: repeat_edge(buffer, 0, W, 0, H) but that syntax has fallen out of favor This PR adds an overload to make the former syntax work, and explicitly deprecates the latter. Also removed inline qualifiers on some templated functions, because they're already inline, and changed an arg to be by const ref instead of pointlessly by value. 04 August 2020, 17:07:13 UTC
682d73f Merge pull request #5153 from halide/srj-tidy2 Fix clang-tidy presubmit 03 August 2020, 21:28:07 UTC
34f105d Fix clang-tidy presubmit Need to run `sudo apt-get update` before `sudo apt-get install` to avoid possibly stale install info. 03 August 2020, 21:21:33 UTC
0d6d7e0 Merge pull request #5150 from halide/srj-roundingmode Remove RoundingMode.h 30 July 2020, 20:11:35 UTC
5a4d098 Remove RoundingMode.h It is never included anywhere. 30 July 2020, 17:00:06 UTC
d63a800 Merge pull request #5148 from halide/abadams/fix_autoschedule_feature_transposition Fix transposed variable names 30 July 2020, 16:36:57 UTC
7dc9790 Merge pull request #5149 from halide/vksnk/c-no-asserts Fix 'unused variable' warning in the code generated by CodeGen_C with 'no_asserts' feature 30 July 2020, 06:36:38 UTC
f5efe01 Updated test 30 July 2020, 00:53:32 UTC
b481633 Remove create_assertion(string, string) 29 July 2020, 22:38:27 UTC
dcc56b7 Fix 'unused variable' warning in the code generated by CodeGen_C with 'no_asserts' feature 29 July 2020, 21:33:43 UTC
0e361d4 Fix transposed variable names 29 July 2020, 18:19:45 UTC
83b452f Merge pull request #5097 from halide/srj-wabt WebAssembly Updates 29 July 2020, 17:24:31 UTC
419c2dd Merge branch 'master' into srj-wabt 28 July 2020, 22:12:36 UTC
cc281d9 Merge pull request #5142 from halide/vksnk/frees-are-free-after-all-2 Handle Allocate without matching Free in the CodeGen_C 28 July 2020, 21:42:59 UTC
aa3041f Merge branch 'master' into srj-wabt 28 July 2020, 17:21:13 UTC
3d7854d fix option name in dependent option 28 July 2020, 17:12:00 UTC
0bb6172 Merge pull request #5137 from halide/tests-env Set HL_{JIT_}TARGET from CMake Halide_TARGET variable. 28 July 2020, 17:07:47 UTC
9db0d79 Fix case when LLVM reports WebAssembly capability, but liblld is not installed 28 July 2020, 16:57:46 UTC
f923ef7 Merge pull request #5143 from halide/vksnk/update-alloc-doc Update comment description for the Allocate node 28 July 2020, 16:40:47 UTC
fae67fc Update comment 28 July 2020, 16:16:27 UTC
1cd8bde make format 28 July 2020, 15:00:51 UTC
43c8f8d Update comment description for the Allocate node 28 July 2020, 06:20:14 UTC
3c977d6 Handle Allocate without matching Free in the CodeGen_C 28 July 2020, 06:06:38 UTC
f3b2abd Merge pull request #5139 from halide/abadams/no_asserts Don't codegen asserts in blocks when noasserts is on 26 July 2020, 22:08:24 UTC
3d4f56a Don't codegen asserts in blocks when noasserts is on Fixes #5138 25 July 2020, 19:00:56 UTC
baa5e04 Set HL_{JIT_}TARGET from Halide_TARGET in tests. 24 July 2020, 19:14:13 UTC
30f2f3c Drop APPS_INTERNAL because it is confusing. 24 July 2020, 19:13:15 UTC
4232fe2 fixes 24 July 2020, 17:48:30 UTC
ef3724b Merge branch 'master' into srj-wabt 24 July 2020, 17:47:10 UTC
e9084ae Merge pull request #5124 from halide/srj-halide-target Rename the CMake var HL_TARGET -> Halide_TARGET, to disambiguate from the env var of the same name 24 July 2020, 17:33:29 UTC
a51673e Tweaks 23 July 2020, 23:57:15 UTC
e61d821 Update README_webassembly.md 23 July 2020, 23:52:42 UTC
18001d2 HALIDE_ -> Halide_ 23 July 2020, 23:50:51 UTC
bdaf9b8 Update CMakeLists.txt 23 July 2020, 21:58:07 UTC
33852a6 Update presubmit.yml 23 July 2020, 21:28:26 UTC
a617746 Update README_webassembly.md 23 July 2020, 21:26:56 UTC
02e89f7 Revamp WebAssembly Support 23 July 2020, 21:20:33 UTC
4978fdc Merge branch 'master' into srj-halide-target 23 July 2020, 21:15:37 UTC
4bd2b37 HL -> HALIDE_ in CMake files 23 July 2020, 21:15:27 UTC
d6a35d6 Merge pull request #5136 from halide/fix-doxygen Fix Doxygen warnings in runtime 23 July 2020, 00:04:46 UTC
c9eec41 Make Doxygen comments consistent (prefer ///, /** to //!, /*!) 23 July 2020, 00:00:30 UTC
b1015b3 Fix descriptions 23 July 2020, 00:00:29 UTC
995c3a4 Fix Doxygen comments 23 July 2020, 00:00:29 UTC
3a84528 Merge pull request #5131 from halide/srj-hvx-check Improve HVX codegen error reporting 22 July 2020, 17:23:44 UTC
8918446 Merge pull request #5133 from halide/alexreinking-patch-1 Get rid of stale Travis CI build info from README. 21 July 2020, 23:16:16 UTC
daf7aa7 Get rid of stale Travis CI build info from README. 21 July 2020, 22:38:25 UTC
54f854e Update CodeGen_Hexagon.cpp 21 July 2020, 18:44:09 UTC
0fb2489 Update CodeGen_Hexagon.cpp 21 July 2020, 18:41:56 UTC
26bfbb4 Update CodeGen_Hexagon.cpp 21 July 2020, 18:06:00 UTC
3237682 Update CodeGen_Hexagon.cpp 21 July 2020, 17:34:16 UTC
4fcae20 Improve HVX codegen error reporting If you try to compile HVX standalone code with HL_TARGET=hexagon-32-noos, you will die because necessary glue functions are defined in hvx_64 or hvx_128 but not 'baseline' hvx. Add an assertion check with a helpful error meesage to avoid just segfaulting deep inside LLVM. 20 July 2020, 23:47:49 UTC
41a756e Merge pull request #5129 from halide/srj-mkdir Add a couple of missing 'mkdir' usages in Makefile 20 July 2020, 21:55:21 UTC
96ca7f3 Merge branch 'master' into srj-mkdir 20 July 2020, 21:55:08 UTC
a30f220 Merge pull request #5128 from halide/srj-llvm Fix for trunk LLVM 20 July 2020, 21:54:54 UTC
937f797 Add a couple of missing 'mkdir' usages in Makefile 20 July 2020, 21:09:28 UTC
df33922 Fix for trunk LLVM PrintMachineCode has been removed in LLVM 12/trunk 20 July 2020, 20:25:13 UTC
554e1dd Merge pull request #5125 from halide/abadams/rungenmain_error Add an error message if you forget to compile RunGenMain with a registration file 20 July 2020, 18:10:59 UTC
eb99441 Merge pull request #5126 from halide/shoaibkamil/llvm_clone_tag Update README to suggest cloning a release of LLVM, not a branch 17 July 2020, 21:58:04 UTC
50c947b Update README to suggest cloning a release of LLVM, not a branch 17 July 2020, 20:48:39 UTC
43f94b3 Add an error message if you forget to compile RunGenMain with a registration file 17 July 2020, 20:38:21 UTC
c7393ad Merge pull request #5122 from halide/srj-clangfmt Upgrade clang-format to v10 16 July 2020, 17:45:20 UTC
33ecc3f Upgrade clang-format to v10 Upgrade the clang-format checks to clang-format-10, and reformat code accordingly. Also add a way tp specify the clang-format version for `make format`; it defaults to the version of Clang for the current LLVM, but since clang-format doesn't provide stable formatting across versions, this might be wrong. 16 July 2020, 17:44:49 UTC
fb4b536 Merge pull request #5117 from Infinoid/python-vector-tile Add vector tile methods to the Python bindings 16 July 2020, 17:35:34 UTC
3eac5c6 Rename the CMake var HL_TARGET -> HALIDE_TARGET, to disambiguate from the env var of the same name 15 July 2020, 22:25:13 UTC
0ddd4e1 Merge pull request #5123 from halide/srj-clangtidy Update clang-tidy checks to use clang-tidy-10 15 July 2020, 21:40:09 UTC
f678add Update clang-tidy checks to use clang-tidy-10 15 July 2020, 20:12:38 UTC
4bd8b09 Merge pull request #5120 from halide/srj-llvm12 Allow LLVM v12 15 July 2020, 18:16:06 UTC
45ac38a Update Makefile 15 July 2020, 18:07:32 UTC
8eda1fd Also update GHA scripts 15 July 2020, 18:07:08 UTC
back to top