e3606cc | Andrew Adams | 13 August 2020, 17:37:30 UTC | Fix GPU barrier deadlocks Partition loops shouldn't mess with serial loops containing thread barriers, potentially causing warp divergence and deadlock (seen in some obscure lens blur schedules). Also we were generating too many thread barriers in a branch where the base mutator class was accidentally always mutating something, so there's a change to FuseGPUThreadLoops to make it more bug-resistant. Without these additional barriers I have been unable to come up with a case where a barrier ends up somewhere that would deadlock, so no test. | 13 August 2020, 18:41:13 UTC |
98a116a | Steven Johnson | 13 August 2020, 16:31:11 UTC | Clean up is-jit-compiled checks in Pipeline (#5172) * Clean up is-jit-compiled checks in Pipeline Because WebAssembly is a special beast, the way it is 'jitted' is special, and the checks to avoid redundant jitting needed extra logic in compile_jit(). Unfortunately there was another place in Pipeline that also needed this special casing. This PR adds a `get_compiled_jit_target()` bottleneck to consolidate this. * defined() -> has_unknowns() | 13 August 2020, 16:31:11 UTC |
fa1abba | Steven Johnson | 13 August 2020, 16:30:46 UTC | Add infer_input_bounds(vector<int>) (#5174) * Add infer_input_bounds(vector<int>) Add a variant of infer_input_bounds() that takes an explicit vector of int, rather than the up-to-4-int version that is a holdover from the buffer_t days; deprecate the old version; convert all existing code to use the new one. Note that I'm using a new overload (with an initializer-list) as a way to subvert the mis-binding of `{}` and `{1}` to the deprecated function; this adds a trivial amount of overhead but (I think) allows us to ensure that converted code probably avoids the deprecated method. * Update Func.cpp | 13 August 2020, 16:30:46 UTC |
f6dcdde | Andrew Adams | 13 August 2020, 02:08:43 UTC | Merge pull request #5177 from halide/abadams/fix_stencil_chain_gpu_schedule Schedule last stage of stencil chain on GPU too | 13 August 2020, 02:08:43 UTC |
f7528c2 | Steven Johnson | 12 August 2020, 21:06:03 UTC | Merge pull request #5176 from halide/srj-hvx-codegen Remove unnecessary call to halide.hexagon.pack.vh in CG_HVX | 12 August 2020, 21:06:03 UTC |
d1592d1 | Steven Johnson | 12 August 2020, 17:08:02 UTC | Merge pull request #5155 from halide/abadams/add_missing_boundary_condition_overload Add missing overload for boundary conditions on a buffer | 12 August 2020, 17:08:02 UTC |
2e2649f | Andrew Adams | 11 August 2020, 19:54:08 UTC | Merge remote-tracking branch 'origin/master' into abadams/add_missing_boundary_condition_overload | 11 August 2020, 19:54:08 UTC |
b8ad19f | Andrew Adams | 11 August 2020, 19:12:57 UTC | Schedule last stage of stencil chain on GPU too | 11 August 2020, 19:12:57 UTC |
528b46b | Steven Johnson | 11 August 2020, 01:15:17 UTC | Remove unnecessary call to halide.hexagon.pack.vh in CG_HVX In the degenerate case of shuffle_vector() calling vlut() to shuffle a vector that is wider than 256 elements, the code was incorrectly using halide.hexagon.pack.vh on a vector-of-bool; this used to be necessary, but hasn't been for a while, so clearly this code path wasn't being exercised. Remove the halide.hexagon.pack.vh call and add a test case to exercise that path. Also, drive-by removal of #include "EliminateBoolVectors.h" from CGHVX since it is no longer used there. | 11 August 2020, 16:59:31 UTC |
26b5be4 | Andrew Adams | 09 August 2020, 16:05:40 UTC | Update fft app to use new boundary condition syntax | 09 August 2020, 16:05:40 UTC |
52da814 | Andrew Adams | 09 August 2020, 00:46:32 UTC | Merge pull request #5165 from halide/abadams/rungen_set_host_dirty Make sure to set_host_dirty in rungen | 09 August 2020, 00:46:32 UTC |
aa92f5c | Andrew Adams | 08 August 2020, 18:36:47 UTC | Make sure to set_host_dirty in rungen Otherwise synthetic inputs like 'random' end up being 'zero' | 08 August 2020, 18:36:47 UTC |
41e10e0 | Andrew Adams | 07 August 2020, 19:07:06 UTC | Merge branch 'abadams/add_missing_boundary_condition_overload' of https://github.com/halide/Halide into abadams/add_missing_boundary_condition_overload | 07 August 2020, 19:07:06 UTC |
46ae5ff | Andrew Adams | 07 August 2020, 19:06:57 UTC | Fix boundary condition in blur app | 07 August 2020, 19:06:57 UTC |
49d0476 | Steven Johnson | 07 August 2020, 04:31:25 UTC | Merge pull request #5162 from halide/srj-wasm-shell-version Update WASM_SHELL_VERSION | 07 August 2020, 04:31:25 UTC |
640c324 | Steven Johnson | 07 August 2020, 04:31:10 UTC | Merge pull request #5161 from halide/srj-infer-input-bounds Add a Target to the args of infer_input_bounds() | 07 August 2020, 04:31:10 UTC |
4e2c25f | Alex Reinking | 07 August 2020, 01:08:47 UTC | Merge pull request #5163 from halide/srj-blur-fix Fix apps/blur on Hexagon | 07 August 2020, 01:08:47 UTC |
8127ba9 | Steven Johnson | 06 August 2020, 23:22:07 UTC | Merge branch 'master' into abadams/add_missing_boundary_condition_overload | 06 August 2020, 23:22:07 UTC |
32c6fdb | Steven Johnson | 06 August 2020, 23:21:30 UTC | Merge pull request #5158 from halide/abadams/fix_nl_means_estimates Fix incorrect estimates for nl_means autoscheduler | 06 August 2020, 23:21:30 UTC |
fabd9b7 | Steven Johnson | 06 August 2020, 22:21:54 UTC | Fix apps/blur on Hexagon Don't rely on HALIDE_RUNTIME_HEXAGON; examine the metadata to see how we were compiled | 06 August 2020, 22:21:54 UTC |
ada32bc | Alex Reinking | 05 August 2020, 23:28:16 UTC | Merge pull request #5157 from halide/rename-to-nvptx Rename WITH_PTX to WITH_NVPTX so all WITH_* macros match LLVM component names | 05 August 2020, 23:28:16 UTC |
8b8a9ea | Steven Johnson | 05 August 2020, 23:04:18 UTC | Update WASM_SHELL_VERSION 8.6.325 is the minimum version that fixes some wasm-simd bugs that affect us. | 05 August 2020, 23:04:18 UTC |
3355a33 | Steven Johnson | 05 August 2020, 22:30:50 UTC | Add a Target to the args of infer_input_bounds() Currently, if the code isn't already jitted (or is jitted with NoBoundsQuery enabled), infer_input_bounds() falls back to whatever is in HL_JIT_TARGET. This is suboptimal, because it hides the fact that infer_input_bounds() relies on the value of that env var. It can also cause unintentional incorrect re-jits; e.g. if simd_op_check is run for HL_TARGET=wasm-32-wasmrt (and HL_JIT_TARGET not set at all), the infer_input_bounds() call can end up re-jitting for native code instead of WebAssembly. | 05 August 2020, 22:30:50 UTC |
09518b4 | Alex Reinking | 05 August 2020, 17:22:16 UTC | Merge branch 'master' into rename-to-nvptx | 05 August 2020, 17:22:16 UTC |
e3a573b | Alex Reinking | 05 August 2020, 17:21:43 UTC | Update TARGET_PTX to TARGET_NVPTX internally. | 05 August 2020, 17:21:43 UTC |
2e7b5ac | Volodymyr Kysenko | 04 August 2020, 22:53:21 UTC | Merge pull request #5159 from halide/vksnk/ucon-no-asserts Fix 'unused var' warning for user_context generated by CodeGen_C with no_asserts flag | 04 August 2020, 22:53:21 UTC |
d65846b | Volodymyr Kysenko | 04 August 2020, 21:07:19 UTC | Fix 'unused var' warning for user_context generated by CodeGen_C with no_asserts flag | 04 August 2020, 21:08:20 UTC |
fe2e78e | Andrew Adams | 04 August 2020, 20:57:01 UTC | Fix incorrect estimates for nl_means autoscheduler We run it on the parrot image, which is 5 megapixels. | 04 August 2020, 20:57:01 UTC |
74829ea | Alex Reinking | 04 August 2020, 20:18:38 UTC | Make Makefile use WITH_NVPTX in interface. | 04 August 2020, 20:18:38 UTC |
89f9a92 | Alex Reinking | 04 August 2020, 20:10:53 UTC | Rename WITH_PTX to WITH_NVPTX so all WITH_* macros match LLVM component names | 04 August 2020, 20:10:53 UTC |
6da8008 | Andrew Adams | 04 August 2020, 17:22:05 UTC | clang-format | 04 August 2020, 17:22:05 UTC |
3433d49 | Andrew Adams | 04 August 2020, 17:07:13 UTC | Add missing overload for boundary conditions on a buffer The following syntax did not work due to the lack of an appropriate overload: repeat_edge(buffer, {{0, W}, {0, H}}). The version that worked was: repeat_edge(buffer, 0, W, 0, H), but that syntax has fallen out of favor. This PR adds an overload to make the former syntax work, and explicitly deprecates the latter. Also removed inline qualifiers on some templated functions, because they're already inline, and changed an arg to be by const ref instead of pointlessly by value. | 04 August 2020, 17:07:13 UTC |
682d73f | Steven Johnson | 03 August 2020, 21:28:07 UTC | Merge pull request #5153 from halide/srj-tidy2 Fix clang-tidy presubmit | 03 August 2020, 21:28:07 UTC |
34f105d | Steven Johnson | 03 August 2020, 21:21:33 UTC | Fix clang-tidy presubmit Need to run `sudo apt-get update` before `sudo apt-get install` to avoid possibly stale install info. | 03 August 2020, 21:21:33 UTC |
0d6d7e0 | Steven Johnson | 30 July 2020, 20:11:35 UTC | Merge pull request #5150 from halide/srj-roundingmode Remove RoundingMode.h | 30 July 2020, 20:11:35 UTC |
5a4d098 | Steven Johnson | 30 July 2020, 17:00:06 UTC | Remove RoundingMode.h It is never included anywhere. | 30 July 2020, 17:00:06 UTC |
d63a800 | Andrew Adams | 30 July 2020, 16:36:57 UTC | Merge pull request #5148 from halide/abadams/fix_autoschedule_feature_transposition Fix transposed variable names | 30 July 2020, 16:36:57 UTC |
7dc9790 | Volodymyr Kysenko | 30 July 2020, 06:36:38 UTC | Merge pull request #5149 from halide/vksnk/c-no-asserts Fix 'unused variable' warning in the code generated by CodeGen_C with 'no_asserts' feature | 30 July 2020, 06:36:38 UTC |
f5efe01 | Volodymyr Kysenko | 30 July 2020, 00:53:32 UTC | Updated test | 30 July 2020, 00:53:32 UTC |
b481633 | Volodymyr Kysenko | 29 July 2020, 22:38:27 UTC | Remove create_assertion(string, string) | 29 July 2020, 22:38:27 UTC |
dcc56b7 | Volodymyr Kysenko | 29 July 2020, 21:29:43 UTC | Fix 'unused variable' warning in the code generated by CodeGen_C with 'no_asserts' feature | 29 July 2020, 21:33:43 UTC |
0e361d4 | Andrew Adams | 29 July 2020, 18:19:45 UTC | Fix transposed variable names | 29 July 2020, 18:19:45 UTC |
83b452f | Steven Johnson | 29 July 2020, 17:24:31 UTC | Merge pull request #5097 from halide/srj-wabt WebAssembly Updates | 29 July 2020, 17:24:31 UTC |
419c2dd | Steven Johnson | 28 July 2020, 22:12:36 UTC | Merge branch 'master' into srj-wabt | 28 July 2020, 22:12:36 UTC |
cc281d9 | Volodymyr Kysenko | 28 July 2020, 21:42:59 UTC | Merge pull request #5142 from halide/vksnk/frees-are-free-after-all-2 Handle Allocate without matching Free in the CodeGen_C | 28 July 2020, 21:42:59 UTC |
aa3041f | Alex Reinking | 28 July 2020, 17:21:13 UTC | Merge branch 'master' into srj-wabt | 28 July 2020, 17:21:13 UTC |
3d7854d | Alex Reinking | 28 July 2020, 17:12:00 UTC | fix option name in dependent option | 28 July 2020, 17:12:00 UTC |
0bb6172 | Steven Johnson | 28 July 2020, 17:07:47 UTC | Merge pull request #5137 from halide/tests-env Set HL_{JIT_}TARGET from CMake Halide_TARGET variable. | 28 July 2020, 17:07:47 UTC |
9db0d79 | Alex Reinking | 28 July 2020, 16:57:46 UTC | Fix case when LLVM reports WebAssembly capability, but liblld is not installed | 28 July 2020, 16:57:46 UTC |
f923ef7 | Steven Johnson | 28 July 2020, 16:40:47 UTC | Merge pull request #5143 from halide/vksnk/update-alloc-doc Update comment description for the Allocate node | 28 July 2020, 16:40:47 UTC |
fae67fc | Volodymyr Kysenko | 28 July 2020, 16:16:27 UTC | Update comment | 28 July 2020, 16:16:27 UTC |
1cd8bde | Volodymyr Kysenko | 28 July 2020, 15:00:51 UTC | make format | 28 July 2020, 15:00:51 UTC |
43c8f8d | Volodymyr Kysenko | 28 July 2020, 06:20:14 UTC | Update comment description for the Allocate node | 28 July 2020, 06:20:14 UTC |
3c977d6 | Volodymyr Kysenko | 28 July 2020, 06:02:53 UTC | Handle Allocate without matching Free in the CodeGen_C | 28 July 2020, 06:06:38 UTC |
f3b2abd | Andrew Adams | 26 July 2020, 22:08:24 UTC | Merge pull request #5139 from halide/abadams/no_asserts Don't codegen asserts in blocks when noasserts is on | 26 July 2020, 22:08:24 UTC |
3d4f56a | Andrew Adams | 25 July 2020, 19:00:56 UTC | Don't codegen asserts in blocks when noasserts is on Fixes #5138 | 25 July 2020, 19:00:56 UTC |
baa5e04 | Alex Reinking | 24 July 2020, 19:14:13 UTC | Set HL_{JIT_}TARGET from Halide_TARGET in tests. | 24 July 2020, 19:14:13 UTC |
30f2f3c | Alex Reinking | 24 July 2020, 19:13:15 UTC | Drop APPS_INTERNAL because it is confusing. | 24 July 2020, 19:13:15 UTC |
4232fe2 | Steven Johnson | 24 July 2020, 17:48:30 UTC | fixes | 24 July 2020, 17:48:30 UTC |
ef3724b | Steven Johnson | 24 July 2020, 17:47:10 UTC | Merge branch 'master' into srj-wabt | 24 July 2020, 17:47:10 UTC |
e9084ae | Steven Johnson | 24 July 2020, 17:33:29 UTC | Merge pull request #5124 from halide/srj-halide-target Rename the CMake var HL_TARGET -> Halide_TARGET, to disambiguate from the env var of the same name | 24 July 2020, 17:33:29 UTC |
a51673e | Steven Johnson | 23 July 2020, 23:57:15 UTC | Tweaks | 23 July 2020, 23:57:15 UTC |
e61d821 | Steven Johnson | 23 July 2020, 23:52:42 UTC | Update README_webassembly.md | 23 July 2020, 23:52:42 UTC |
18001d2 | Steven Johnson | 23 July 2020, 23:50:51 UTC | HALIDE_ -> Halide_ | 23 July 2020, 23:50:51 UTC |
bdaf9b8 | Steven Johnson | 23 July 2020, 21:58:07 UTC | Update CMakeLists.txt | 23 July 2020, 21:58:07 UTC |
33852a6 | Steven Johnson | 23 July 2020, 21:28:26 UTC | Update presubmit.yml | 23 July 2020, 21:28:26 UTC |
a617746 | Steven Johnson | 23 July 2020, 21:26:56 UTC | Update README_webassembly.md | 23 July 2020, 21:26:56 UTC |
02e89f7 | Steven Johnson | 20 July 2020, 22:35:21 UTC | Revamp WebAssembly Support | 23 July 2020, 21:20:33 UTC |
4978fdc | Steven Johnson | 23 July 2020, 21:15:37 UTC | Merge branch 'master' into srj-halide-target | 23 July 2020, 21:15:37 UTC |
4bd2b37 | Steven Johnson | 23 July 2020, 21:15:27 UTC | HL -> HALIDE_ in CMake files | 23 July 2020, 21:15:27 UTC |
d6a35d6 | Alex Reinking | 23 July 2020, 00:04:46 UTC | Merge pull request #5136 from halide/fix-doxygen Fix Doxygen warnings in runtime | 23 July 2020, 00:04:46 UTC |
c9eec41 | Alex Reinking | 22 July 2020, 20:32:10 UTC | Make Doxygen comments consistent (prefer ///, /** to //!, /*!) | 23 July 2020, 00:00:30 UTC |
b1015b3 | Alex Reinking | 22 July 2020, 20:28:42 UTC | Fix descriptions | 23 July 2020, 00:00:29 UTC |
995c3a4 | Alex Reinking | 22 July 2020, 20:17:48 UTC | Fix Doxygen comments | 23 July 2020, 00:00:29 UTC |
3a84528 | Steven Johnson | 22 July 2020, 17:23:44 UTC | Merge pull request #5131 from halide/srj-hvx-check Improve HVX codegen error reporting | 22 July 2020, 17:23:44 UTC |
8918446 | Steven Johnson | 21 July 2020, 23:16:16 UTC | Merge pull request #5133 from halide/alexreinking-patch-1 Get rid of stale Travis CI build info from README. | 21 July 2020, 23:16:16 UTC |
daf7aa7 | Alex Reinking | 21 July 2020, 22:38:25 UTC | Get rid of stale Travis CI build info from README. | 21 July 2020, 22:38:25 UTC |
54f854e | Steven Johnson | 21 July 2020, 18:44:09 UTC | Update CodeGen_Hexagon.cpp | 21 July 2020, 18:44:09 UTC |
0fb2489 | Steven Johnson | 21 July 2020, 18:41:56 UTC | Update CodeGen_Hexagon.cpp | 21 July 2020, 18:41:56 UTC |
26bfbb4 | Steven Johnson | 21 July 2020, 18:06:00 UTC | Update CodeGen_Hexagon.cpp | 21 July 2020, 18:06:00 UTC |
3237682 | Steven Johnson | 21 July 2020, 17:34:16 UTC | Update CodeGen_Hexagon.cpp | 21 July 2020, 17:34:16 UTC |
4fcae20 | Steven Johnson | 20 July 2020, 23:47:49 UTC | Improve HVX codegen error reporting If you try to compile HVX standalone code with HL_TARGET=hexagon-32-noos, you will die because necessary glue functions are defined in hvx_64 or hvx_128 but not 'baseline' hvx. Add an assertion check with a helpful error message to avoid just segfaulting deep inside LLVM. | 20 July 2020, 23:47:49 UTC |
41a756e | Steven Johnson | 20 July 2020, 21:55:21 UTC | Merge pull request #5129 from halide/srj-mkdir Add a couple of missing 'mkdir' usages in Makefile | 20 July 2020, 21:55:21 UTC |
96ca7f3 | Steven Johnson | 20 July 2020, 21:55:08 UTC | Merge branch 'master' into srj-mkdir | 20 July 2020, 21:55:08 UTC |
a30f220 | Steven Johnson | 20 July 2020, 21:54:54 UTC | Merge pull request #5128 from halide/srj-llvm Fix for trunk LLVM | 20 July 2020, 21:54:54 UTC |
937f797 | Steven Johnson | 20 July 2020, 21:09:28 UTC | Add a couple of missing 'mkdir' usages in Makefile | 20 July 2020, 21:09:28 UTC |
df33922 | Steven Johnson | 20 July 2020, 20:25:13 UTC | Fix for trunk LLVM PrintMachineCode has been removed in LLVM 12/trunk | 20 July 2020, 20:25:13 UTC |
554e1dd | Andrew Adams | 20 July 2020, 18:10:59 UTC | Merge pull request #5125 from halide/abadams/rungenmain_error Add an error message if you forget to compile RunGenMain with a registration file | 20 July 2020, 18:10:59 UTC |
eb99441 | Alex Reinking | 17 July 2020, 21:58:04 UTC | Merge pull request #5126 from halide/shoaibkamil/llvm_clone_tag Update README to suggest cloning a release of LLVM, not a branch | 17 July 2020, 21:58:04 UTC |
50c947b | Shoaib Kamil | 17 July 2020, 20:48:39 UTC | Update README to suggest cloning a release of LLVM, not a branch | 17 July 2020, 20:48:39 UTC |
43f94b3 | Andrew Adams | 17 July 2020, 20:38:21 UTC | Add an error message if you forget to compile RunGenMain with a registration file | 17 July 2020, 20:38:21 UTC |
c7393ad | Steven Johnson | 16 July 2020, 17:45:20 UTC | Merge pull request #5122 from halide/srj-clangfmt Upgrade clang-format to v10 | 16 July 2020, 17:45:20 UTC |
33ecc3f | Steven Johnson | 16 July 2020, 17:29:14 UTC | Upgrade clang-format to v10 Upgrade the clang-format checks to clang-format-10, and reformat code accordingly. Also add a way to specify the clang-format version for `make format`; it defaults to the version of Clang for the current LLVM, but since clang-format doesn't provide stable formatting across versions, this might be wrong. | 16 July 2020, 17:44:49 UTC |
fb4b536 | Steven Johnson | 16 July 2020, 17:35:34 UTC | Merge pull request #5117 from Infinoid/python-vector-tile Add vector tile methods to the Python bindings | 16 July 2020, 17:35:34 UTC |
3eac5c6 | Steven Johnson | 15 July 2020, 22:25:13 UTC | Rename the CMake var HL_TARGET -> HALIDE_TARGET, to disambiguate from the env var of the same name | 15 July 2020, 22:25:13 UTC |
0ddd4e1 | Steven Johnson | 15 July 2020, 21:40:09 UTC | Merge pull request #5123 from halide/srj-clangtidy Update clang-tidy checks to use clang-tidy-10 | 15 July 2020, 21:40:09 UTC |
f678add | Steven Johnson | 15 July 2020, 19:45:32 UTC | Update clang-tidy checks to use clang-tidy-10 | 15 July 2020, 20:12:38 UTC |
4bd8b09 | Steven Johnson | 15 July 2020, 18:16:06 UTC | Merge pull request #5120 from halide/srj-llvm12 Allow LLVM v12 | 15 July 2020, 18:16:06 UTC |
45ac38a | Steven Johnson | 15 July 2020, 18:07:32 UTC | Update Makefile | 15 July 2020, 18:07:32 UTC |
8eda1fd | Steven Johnson | 15 July 2020, 18:07:08 UTC | Also update GHA scripts | 15 July 2020, 18:07:08 UTC |