2e48dad | Jing Pu | 21 August 2018, 19:51:41 UTC | sync the test format to #2996 | 21 August 2018, 19:51:41 UTC |
ee3b29d | Jing Pu | 21 August 2018, 19:41:45 UTC | Merge branch 'master' into simplify_mod | 21 August 2018, 19:41:45 UTC |
4518a15 | Steven Johnson | 21 August 2018, 17:38:23 UTC | Merge pull request #3224 from halide/reorder_rvar Support reordering RVar if a function is associative. | 21 August 2018, 17:38:23 UTC |
f571a02 | Patricia Suriana | 21 August 2018, 16:13:47 UTC | Merge pull request #3226 from halide/bug_inline_unbounded Fix inlining unbounded functions | 21 August 2018, 16:13:47 UTC |
4e716a5 | Jing Pu | 21 August 2018, 06:44:17 UTC | Merge pull request #3222 from inazarenko/fix_scope Sync ScopedBinding<void> to the main template. | 21 August 2018, 06:44:17 UTC |
a5b0ab4 | Jing Pu | 20 August 2018, 21:22:22 UTC | Check commutativity. | 20 August 2018, 21:22:22 UTC |
37637fe | Patricia Suriana | 20 August 2018, 20:02:29 UTC | Fix inlining unbounded functions | 20 August 2018, 20:02:29 UTC |
3ca81a2 | Jing Pu | 20 August 2018, 18:08:39 UTC | Support reordering Rvars of associative functions. | 20 August 2018, 18:08:39 UTC |
83fa9aa | Igor Nazarenko | 16 August 2018, 20:20:38 UTC | Sync ScopedBinding<void> to the main template. Adds a constructor that allows to skip binding the name, which is already present for other types, but not in the specialization for void. | 16 August 2018, 20:20:38 UTC |
b225e0d | Steven Johnson | 14 August 2018, 22:43:00 UTC | Merge pull request #3219 from halide/revert-3212-srj-codegen Revert "Ensure calls to codegen() are well-ordered" | 14 August 2018, 22:43:00 UTC |
50f4a13 | Steven Johnson | 14 August 2018, 22:10:22 UTC | Revert "Ensure calls to codegen() are well-ordered" | 14 August 2018, 22:10:22 UTC |
29ba9d2 | Dillon Sharlet | 14 August 2018, 21:58:53 UTC | Merge pull request #3212 from halide/srj-codegen Ensure calls to codegen() are well-ordered | 14 August 2018, 21:58:53 UTC |
e5d3125 | Steven Johnson | 14 August 2018, 18:17:27 UTC | Merge pull request #3217 from halide/srj-tidy Hygiene: move nontrivial Target method bodies into .cpp file | 14 August 2018, 18:17:27 UTC |
51895c9 | Steven Johnson | 13 August 2018, 22:44:29 UTC | Hygiene: move nontrivial Target method bodies into .cpp file Not at all important, but some of these methods have grown pretty long over the years. | 13 August 2018, 22:44:29 UTC |
47203ba | Steven Johnson | 13 August 2018, 21:17:48 UTC | Merge pull request #3216 from halide/srj-sf Special-case strict_float in Bounds.cpp to avoid n^2 behavior | 13 August 2018, 21:17:48 UTC |
c273948 | Steven Johnson | 13 August 2018, 19:50:58 UTC | Merge branch 'master' into srj-codegen | 13 August 2018, 19:50:58 UTC |
e538132 | Steven Johnson | 13 August 2018, 19:49:57 UTC | Reorder code | 13 August 2018, 19:49:57 UTC |
5c19d27 | Steven Johnson | 13 August 2018, 18:32:34 UTC | Special-case strict_float in Bounds.cpp to avoid n^2 behavior | 13 August 2018, 18:32:34 UTC |
c4dc874 | Zalman Stern | 11 August 2018, 22:28:15 UTC | Merge pull request #3213 from halide/srj-select Remove special-case for int32 from CodeGen_LLVM::visit(const Select *op) | 11 August 2018, 22:28:15 UTC |
6977295 | Zalman Stern | 11 August 2018, 18:06:06 UTC | Merge pull request #3105 from halide/unsafe_promises Add unsafe promises feature. | 11 August 2018, 18:06:06 UTC |
57120fc | Zalman Stern | 11 August 2018, 00:02:25 UTC | Merge branch 'master' into unsafe_promises | 11 August 2018, 00:02:25 UTC |
f981846 | Steven Johnson | 10 August 2018, 23:06:46 UTC | Reorder to to be closer to original | 10 August 2018, 23:06:46 UTC |
a915469 | Steven Johnson | 10 August 2018, 19:40:30 UTC | Remove special-case for int32 from CodeGen_LLVM::visit(const Select *op) LLVM circa v3.2 or so had a bug wherein complex pipeline would compile extremely slowly, so a workaround was inserted; I can't replicate this as of LLVM5 or later (which is all we are supporting now), so let's consider backing out this workaround. | 10 August 2018, 19:40:30 UTC |
6d61433 | Steven Johnson | 10 August 2018, 19:17:15 UTC | Ensure calls to codegen() are well-ordered C++11 doesn't guarantee that function arguments are evaluated in a particular order, thus calls of the form `builder->CreateFoo(codegen(a), codegen(b))` might generate LLVM IR with either a-then-b or b-then-a; at best, this makes comparing IR between compilers a nuisance; at worst, it can trigger subtle bugs and make them harder to find (see https://github.com/halide/Halide/issues/3203). This PR looks for all calls that evalutate codegen() more than once as a function arg and rearranges code to use temporaries to ensure a well-defined order. (Note that a few with only a single call to codegen() were also pulled into temporaries where I thought it improved clarity or helped forestall reinsertion of the bad code pattern by future edits.) | 10 August 2018, 19:17:15 UTC |
8c2f090 | Steven Johnson | 10 August 2018, 17:04:45 UTC | Merge pull request #3199 from halide/srj-rt-11 Explicitly specify -std=gnu++98 for runtime .cpp building | 10 August 2018, 17:04:45 UTC |
b83d7e7 | Shoaib Kamil | 10 August 2018, 13:59:18 UTC | Merge pull request #3205 from halide/kamil/coverity_issue Appease Coverity | 10 August 2018, 13:59:18 UTC |
3dfadd9 | Jing Pu | 10 August 2018, 05:46:50 UTC | Merge pull request #3204 from halide/fix_comment Fix a typo in comment. | 10 August 2018, 05:46:50 UTC |
25dedcf | Zalman Stern | 09 August 2018, 22:10:48 UTC | Merge pull request #3184 from halide/gpu_types_for_kernel_run Add support for passing typecode, not just size, in args to GPU kernel run routine. | 09 August 2018, 22:10:48 UTC |
890df56 | Shoaib Kamil | 09 August 2018, 20:43:47 UTC | Appease Coverity | 09 August 2018, 20:43:47 UTC |
a20a60a | Jing Pu | 09 August 2018, 18:06:27 UTC | Fix a typo in comment. | 09 August 2018, 18:06:27 UTC |
7ddda78 | Jing Pu | 09 August 2018, 04:43:30 UTC | Merge pull request #3201 from halide/expr_uses_var Add an argument of a scope of variable binding to expr_uses_var. | 09 August 2018, 04:43:30 UTC |
27f4deb | Jing Pu | 09 August 2018, 00:03:10 UTC | pass scope argument. | 09 August 2018, 00:03:10 UTC |
68a316b | Jing Pu | 09 August 2018, 00:00:43 UTC | Add an argument of scope of variable binding to stmt_uses_var. | 09 August 2018, 00:00:43 UTC |
e5aecbe | Steven Johnson | 08 August 2018, 01:04:48 UTC | Explicitly specify -std=gnu++98 for runtime .cpp building We don't want c++11, as we don't want static locals to get thread-sync helper code (which c++11 mandates); previously, we didn't specify any version, which left us at the mercy of the system compiler (which might have defaulted to c++11). Choosing gnu++98 is arbitrary but gives us a predictable baseline across common build environments. | 08 August 2018, 01:04:48 UTC |
793734e | Z Stern | 07 August 2018, 19:30:21 UTC | Address review feedback. | 07 August 2018, 19:30:21 UTC |
2caad82 | Z Stern | 07 August 2018, 19:17:29 UTC | Merge branch 'master' into gpu_types_for_kernel_run | 07 August 2018, 19:17:29 UTC |
ea9c863 | Steven Johnson | 07 August 2018, 17:19:49 UTC | Merge pull request #3195 from halide/srj-uint64 call to halide_hexagon_run(): arg_sizes should be uint64*, not size_t* | 07 August 2018, 17:19:49 UTC |
9c53d76 | Steven Johnson | 07 August 2018, 17:19:37 UTC | Merge pull request #3196 from halide/srj-stable Use std::stable_sort in LoopCarry | 07 August 2018, 17:19:37 UTC |
ddb1619 | Steven Johnson | 06 August 2018, 21:39:18 UTC | Use std::stable_sort in LoopCarry This is really just for debugging hygiene: use of std::sort means that chains of same length can be in different orders in different C++ compilers; this doesn't appear to affect correctness of output, but having the sort be stable does make comparing IR from different compiler versions easier. | 06 August 2018, 21:39:18 UTC |
e21e61a | Steven Johnson | 06 August 2018, 17:43:08 UTC | call to halide_hexagon_run(): arg_sizes should be uint64*, not size_t* | 06 August 2018, 17:43:08 UTC |
1289ad3 | Steven Johnson | 03 August 2018, 22:27:18 UTC | Merge pull request #3190 from halide/srj-foreach2 Make Buffer::for_each_value const-correct | 03 August 2018, 22:27:18 UTC |
a7215a6 | Steven Johnson | 03 August 2018, 18:25:35 UTC | Make Buffer::for_each_value const-correct Declare it const, then add perfect-forwarding to some helper methods; this allows us to call it on `const Buffer<T>` (which are fine to mutate if T is not const) and also `Buffer<const T>` (which are now correctly prohibited from mutating the values via nonconst reference args to the lambda). | 03 August 2018, 18:25:35 UTC |
0b97b66 | Z Stern | 02 August 2018, 23:31:33 UTC | Appease compiler pedantry. | 02 August 2018, 23:31:33 UTC |
6704b56 | Z Stern | 02 August 2018, 18:58:44 UTC | Fix plain-C issue in HalideRuntimeOpenGLCompute.h . | 02 August 2018, 18:58:44 UTC |
366f949 | Steven Johnson | 02 August 2018, 17:36:59 UTC | Merge pull request #3041 from matthiaskramm/ctlsz Make CodeGen_C aware of count_leading_zeros | 02 August 2018, 17:36:59 UTC |
fd47a65 | Steven Johnson | 02 August 2018, 17:16:35 UTC | Merge pull request #3118 from SanderVocke/disable_leakcheck Disable asan leak checking during code generation. | 02 August 2018, 17:16:35 UTC |
2101955 | Steven Johnson | 02 August 2018, 16:48:42 UTC | Merge pull request #3173 from halide/srj-copy-as Add some HalideBuffer convenience methods (Issue #3113) | 02 August 2018, 16:48:42 UTC |
2e7fb15 | Andrew Adams | 02 August 2018, 15:19:52 UTC | Merge pull request #3172 from halide/fix_warp_shuffles Fix some pathological behavior in warp shuffles | 02 August 2018, 15:19:52 UTC |
aa70aaa | Steven Johnson | 01 August 2018, 23:58:33 UTC | Merge pull request #3183 from halide/bump_llvm_version LLVM is bumping their version number to 8 | 01 August 2018, 23:58:33 UTC |
6c3d30b | Steven Johnson | 01 August 2018, 23:08:18 UTC | Change copy() to just return nonconst | 01 August 2018, 23:08:18 UTC |
6ca35a9 | Z Stern | 01 August 2018, 22:55:56 UTC | Add support for passing typecode insitead of just size with args to GPU kernel run routine. Eventually all GPU runtimes should probably use this one mechanism, but for now it is made optional per backedn to prevent breaking applications which have replaced the weak linked kernel run routine for a given API. OpenGL Compute is switched to the types based model as it was previously broken for types other than int32_t. Remove tabs in some files. | 01 August 2018, 22:55:56 UTC |
e0316d1 | Steven Johnson | 01 August 2018, 22:43:58 UTC | Remove nonstatic make_with_shape_of() | 01 August 2018, 22:43:58 UTC |
d8917c4 | Andrew Adams | 01 August 2018, 22:17:50 UTC | Some build bots don't have the requisite cuda version | 01 August 2018, 22:17:50 UTC |
ac129fe | Andrew Adams | 01 August 2018, 22:00:29 UTC | LLVM is bumping their version number to 8 | 01 August 2018, 22:00:29 UTC |
f39ea35 | Andrew Adams | 01 August 2018, 18:12:42 UTC | Merge remote-tracking branch 'origin/master' into fix_warp_shuffles | 01 August 2018, 18:12:42 UTC |
8c772dd | Andrew Adams | 01 August 2018, 17:26:01 UTC | Merge pull request #3165 from halide/quieter_test_correctness only report failures in test_correctness | 01 August 2018, 17:26:01 UTC |
0122fd7 | Sander Vocke | 01 August 2018, 11:13:40 UTC | Merge remote-tracking branch 'origin/master' into disable_leakcheck | 01 August 2018, 11:13:40 UTC |
92a16ea | Zalman Stern | 01 August 2018, 01:05:52 UTC | Merge pull request #3177 from halide/readme_update Update README.md to reflect a slightly more current version of target support. | 01 August 2018, 01:05:52 UTC |
a9de173 | Zalman Stern | 01 August 2018, 00:52:25 UTC | Merge pull request #3178 from halide/srj-warn Don't emit #pragma message in D3D12 backend | 01 August 2018, 00:52:25 UTC |
58d2c8a | Steven Johnson | 01 August 2018, 00:48:46 UTC | Don't emit #pragma message in D3D12 backend | 01 August 2018, 00:48:46 UTC |
4829e0a | Zalman Stern | 01 August 2018, 00:38:35 UTC | Typo. | 01 August 2018, 00:38:35 UTC |
b61de00 | Zalman Stern | 01 August 2018, 00:33:25 UTC | Update README.md to reflect a slightly more current version of target support. | 01 August 2018, 00:33:25 UTC |
fd5947a | Steven Johnson | 31 July 2018, 20:47:11 UTC | Fix comment | 31 July 2018, 20:47:11 UTC |
d8edcae | Dillon Sharlet | 31 July 2018, 19:51:15 UTC | Merge pull request #3166 from halide/faster_cascaded_filters Speed up cascaded filters test | 31 July 2018, 19:51:15 UTC |
f43288e | Steven Johnson | 31 July 2018, 18:21:15 UTC | Merge branch 'master' into unsafe_promises | 31 July 2018, 18:21:15 UTC |
39c5839 | Steven Johnson | 31 July 2018, 18:19:02 UTC | Add some HalideBuffer convenience methods (Issue #3113) - augment copy() to allow copying-to-a-different-type - add nonstatic variant of make_with_shape_of() - add reset() | 31 July 2018, 18:19:02 UTC |
e0e73f1 | Andrew Adams | 31 July 2018, 18:10:17 UTC | Fix some pathological behavior in warp shuffles | 31 July 2018, 18:10:17 UTC |
ddbf7e6 | Andrew Adams | 31 July 2018, 18:08:57 UTC | Merge remote-tracking branch 'origin/master' into quieter_test_correctness | 31 July 2018, 18:08:57 UTC |
51fd2c8 | Steven Johnson | 31 July 2018, 17:04:29 UTC | Merge pull request #3155 from suyogsarda/master Update image info API's for halide_buffer_t | 31 July 2018, 17:04:29 UTC |
92fe75a | Steven Johnson | 31 July 2018, 16:51:49 UTC | Merge pull request #3168 from halide/srj-buf Buffer move-construct ctor was missing assert_can_convert_from | 31 July 2018, 16:51:49 UTC |
9269c83 | Andrew Adams | 31 July 2018, 15:12:28 UTC | Merge branch 'unsafe_promises' of https://github.com/halide/Halide into unsafe_promises | 31 July 2018, 15:12:28 UTC |
ffb8196 | Andrew Adams | 31 July 2018, 15:12:17 UTC | Lower unsafe promises earlier This gives codegen a chance to peephole things like dense vector loads Also fix a minor bug that showed up in bounds. | 31 July 2018, 15:12:17 UTC |
a3ee5f7 | Steven Johnson | 31 July 2018, 00:56:30 UTC | Merge pull request #3158 from halide/srj-incl Add missing #include to HexagonAlignment.h | 31 July 2018, 00:56:30 UTC |
01cad4f | Steven Johnson | 31 July 2018, 00:37:46 UTC | Buffer move-construct ctor was missing assert_can_convert_from | 31 July 2018, 00:37:46 UTC |
db256bd | Zalman Stern | 30 July 2018, 23:40:17 UTC | Merge pull request #3160 from halide/openglcompute-location-fix Add explicit location layout specifier to uniforms. | 30 July 2018, 23:40:17 UTC |
30ab4c4 | Steven Johnson | 30 July 2018, 23:35:27 UTC | Merge branch 'master' into faster_cascaded_filters | 30 July 2018, 23:35:27 UTC |
0295c73 | Steven Johnson | 30 July 2018, 23:34:51 UTC | Merge branch 'master' into quieter_test_correctness | 30 July 2018, 23:34:51 UTC |
9652625 | Steven Johnson | 30 July 2018, 23:30:57 UTC | Merge pull request #3159 from halide/srj-d3d Fix D3D12 issues | 30 July 2018, 23:30:57 UTC |
d8ab812 | Steven Johnson | 30 July 2018, 20:02:54 UTC | Merge branch 'master' into srj-d3d | 30 July 2018, 20:02:54 UTC |
49136b3 | Steven Johnson | 30 July 2018, 18:11:39 UTC | Merge branch 'master' into openglcompute-location-fix | 30 July 2018, 18:11:39 UTC |
3cc9e27 | Steven Johnson | 30 July 2018, 18:11:03 UTC | Merge branch 'master' into unsafe_promises | 30 July 2018, 18:11:03 UTC |
77ed646 | Steven Johnson | 30 July 2018, 18:09:19 UTC | Merge pull request #3081 from tbennun/master Makefile LLVM static library fix | 30 July 2018, 18:09:19 UTC |
a861bf9 | Andrew Adams | 30 July 2018, 17:25:47 UTC | Speed up cascaded filters test | 30 July 2018, 17:25:47 UTC |
dbca67d | Andrew Adams | 30 July 2018, 17:11:36 UTC | only failures in test_correctness | 30 July 2018, 17:11:36 UTC |
98f0b1a | Matthias Kramm | 14 June 2018, 01:17:54 UTC | always use inline implementation of clz | 30 July 2018, 05:46:50 UTC |
bf76726 | Matthias Kramm | 11 June 2018, 12:53:22 UTC | use "sizeof(long long) * 8" instead of "64" | 30 July 2018, 05:46:50 UTC |
9f5dc7c | Matthias Kramm | 09 June 2018, 18:08:52 UTC | add tests for 8/16/32/64 bit | 30 July 2018, 05:46:50 UTC |
a1c9683 | Matthias Kramm | 09 June 2018, 02:17:17 UTC | use clzll instead of clz, for 64 bit integers | 30 July 2018, 05:46:50 UTC |
f3a26ef | Matthias Kramm | 08 June 2018, 21:57:55 UTC | Make CodeGen_C aware of count_leading_zeros | 30 July 2018, 05:46:50 UTC |
d14d3a1 | Z Stern | 28 July 2018, 00:28:59 UTC | Add explicit location layout specifier to uniforms as GLSL does not guarantee they are in any particular order otherwise. | 28 July 2018, 00:28:59 UTC |
be66f83 | Steven Johnson | 27 July 2018, 21:38:42 UTC | Remove redundant 'return' to silence ClangTidy | 27 July 2018, 21:38:42 UTC |
866c3ce | Steven Johnson | 27 July 2018, 21:09:16 UTC | Fix D3D12 issues - avoid the use of fallthru-without-break in CodeGen_D3D12Compute_Dev.cpp (some compiler settings assume this is an error, and the conventional equivalent is fine here) - allow runtime linker to compile when WITH_D3D12 is not defined | 27 July 2018, 21:09:16 UTC |
6a7ff1b | Steven Johnson | 27 July 2018, 20:20:23 UTC | Add missing #include to HexagonAlignment.h | 27 July 2018, 20:20:23 UTC |
cd95b27 | Steven Johnson | 27 July 2018, 16:56:13 UTC | Merge pull request #3116 from halide/srj-features Rework runtime to allow more than 64 Target::Features (Issue #2911) | 27 July 2018, 16:56:13 UTC |
78fde6c | Suyog Sarda | 27 July 2018, 15:48:04 UTC | Update image info API's for halide_buffer_t Replace buffer_t with halide_buffer_t and other data structure. | 27 July 2018, 15:48:04 UTC |
14f594e | Tal Ben-Nun | 27 July 2018, 11:52:25 UTC | Merge pull request #2 from halide/master Sync | 27 July 2018, 11:52:25 UTC |
d734dc0 | Steven Johnson | 26 July 2018, 22:29:32 UTC | Merge branch 'master' into srj-features | 26 July 2018, 22:29:32 UTC |
d91647c | Dillon Sharlet | 26 July 2018, 16:15:07 UTC | Merge pull request #2913 from suyogsarda/master Add v65 instructions for HVX | 26 July 2018, 16:15:07 UTC |
b6b9d6b | Dillon Sharlet | 26 July 2018, 16:14:11 UTC | Merge pull request #3152 from halide/interleave-bug Interleave bug | 26 July 2018, 16:14:11 UTC |
457d333 | Marcos Slomp | 26 July 2018, 14:18:23 UTC | Merge pull request #2755 from halide/d3d12 Direct3D 12 (compute) GPU back-end | 26 July 2018, 14:18:23 UTC |