c34fe9e | Patricia Adriana Suriana | 20 September 2018, 15:44:00 UTC | Merge branch 'master' of https://github.com/halide/Halide into compute_with_remove_is_right_level | 20 September 2018, 15:44:00 UTC |
0e39b46 | Dillon Sharlet | 19 September 2018, 23:10:43 UTC | Merge pull request #3266 from halide/pdb_add_precompiled_sim_qurt Add binary libsim_qurt.a to hexagon_remote | 19 September 2018, 23:10:43 UTC |
caf9f8d | Patricia Suriana | 19 September 2018, 21:42:39 UTC | Update comments for compute_with | 19 September 2018, 21:42:43 UTC |
96195f7 | Patricia Suriana | 19 September 2018, 21:35:37 UTC | Remove is_the_right_level since compute_at aliasing at fused group is not currently supported | 19 September 2018, 21:35:37 UTC |
2de2f25 | Andrew Adams | 19 September 2018, 03:25:07 UTC | Merge pull request #3278 from halide/fix_warp_shuffles_2 Add new simplifier rules to handle warp shuffle breakage | 19 September 2018, 03:25:07 UTC |
45ae466 | Andrew Adams | 18 September 2018, 18:58:13 UTC | Add new simplifier rules to handle warp shuffle breakage | 18 September 2018, 18:58:13 UTC |
e48e6e2 | Andrew Adams | 14 September 2018, 23:08:50 UTC | Merge pull request #2845 from halide/metaprogrammed_simplifier_rules Metaprogrammed simplifier rules | 14 September 2018, 23:08:50 UTC |
a5117ef | Andrew Adams | 14 September 2018, 16:28:15 UTC | Merge pull request #3264 from halide/use_host_cuda_capability_v3 Use host cuda capability v3 | 14 September 2018, 16:28:15 UTC |
44bddf8 | Andrew Adams | 14 September 2018, 16:15:15 UTC | Merge pull request #3265 from halide/fix_3061_v2 Fix 3061 v2 | 14 September 2018, 16:15:15 UTC |
a9276a6 | Andrew Adams | 14 September 2018, 15:38:25 UTC | Merge branch 'metaprogrammed_simplifier_rules' of https://github.com/halide/Halide into metaprogrammed_simplifier_rules | 14 September 2018, 15:38:25 UTC |
7dafba1 | Andrew Adams | 14 September 2018, 15:38:17 UTC | Make failed proof probing conditional on debug level | 14 September 2018, 15:38:17 UTC |
c072dcc | Andrew Adams | 14 September 2018, 04:04:59 UTC | Switch to unordered_map in scope. It's slightly faster | 14 September 2018, 04:04:59 UTC |
2a39b9f | Andrew Adams | 13 September 2018, 17:20:36 UTC | Add some missing library loads | 13 September 2018, 17:20:36 UTC |
5c517cf | Andrew Adams | 13 September 2018, 17:09:20 UTC | Merge remote-tracking branch 'origin/master' into fix_3061_v2 | 13 September 2018, 17:09:20 UTC |
676d8bf | Steven Johnson | 12 September 2018, 18:42:44 UTC | Merge pull request #3270 from halide/srj-bug Add correct LLVM bug to simd_op_check comment | 12 September 2018, 18:42:44 UTC |
c43fcea | Steven Johnson | 12 September 2018, 18:03:10 UTC | Add correct LLVM bug to simd_op_check comment | 12 September 2018, 18:03:10 UTC |
a39b712 | Andrew Adams | 12 September 2018, 15:48:53 UTC | Merge remote-tracking branch 'origin/master' into fix_3061_v2 | 12 September 2018, 15:48:53 UTC |
7ba586a | Andrew Adams | 12 September 2018, 15:39:24 UTC | Fix generator_aot_acquire_release for cuda | 12 September 2018, 15:39:24 UTC |
bf93ae1 | Steven Johnson | 11 September 2018, 20:10:28 UTC | Merge pull request #3268 from halide/srj-addus Re-enable addus/subus in simd_op_check | 11 September 2018, 20:10:28 UTC |
04c4440 | Andrew Adams | 11 September 2018, 17:30:38 UTC | Wrap long line | 11 September 2018, 17:30:38 UTC |
47cdabc | Andrew Adams | 11 September 2018, 17:29:48 UTC | Add Halide namespace to test | 11 September 2018, 17:29:48 UTC |
8264d19 | Steven Johnson | 11 September 2018, 17:04:47 UTC | Re-enable addus/subus in simd_op_check Underlying bug in LLVM fixed in trunk. Note that pcmp is still disabled due to related bug which is not yet fixed. | 11 September 2018, 17:04:47 UTC |
723a1c3 | Pranav Bhandarkar | 10 September 2018, 22:06:47 UTC | A previous PR had moved libsim_qurt.a to from Halide/tools to Halide/src/runtime/hexagon_remote. However, we had forgotten to include a library binary in the PR. This made the PR inconsistent with our previous approach of leaving the binary in place in the repo. This PR fixes that. | 10 September 2018, 22:06:47 UTC |
74f1678 | Andrew Adams | 10 September 2018, 21:15:22 UTC | Merge remote-tracking branch 'origin/master' into fix_3061 | 10 September 2018, 21:15:22 UTC |
dabd88c | Andrew Adams | 07 September 2018, 16:59:08 UTC | Merge remote-tracking branch 'origin/master' into use_host_cuda_capability_v2 | 07 September 2018, 16:59:08 UTC |
b4ea2b0 | Andrew Adams | 07 September 2018, 16:57:48 UTC | Merge branch 'master' into metaprogrammed_simplifier_rules | 07 September 2018, 16:57:48 UTC |
09dc5d5 | Steven Johnson | 07 September 2018, 01:45:09 UTC | Merge pull request #3260 from halide/srj-fill Buffer::fill() and similar should return ref-to-self | 07 September 2018, 01:45:09 UTC |
ffa7e0e | Steven Johnson | 06 September 2018, 23:17:45 UTC | Merge pull request #3261 from halide/srj-asconst Add Buffer::as_const() | 06 September 2018, 23:17:45 UTC |
a98714f | Steven Johnson | 06 September 2018, 23:17:26 UTC | Merge pull request #3258 from halide/srj-alias Add Buffer<>::alias() | 06 September 2018, 23:17:26 UTC |
37581d3 | Pranav Bhandarkar | 06 September 2018, 20:18:34 UTC | Merge pull request #3259 from aankit-ca/sim_qurt_cleanup Changed the directory structure for tools/sim_qurt | 06 September 2018, 20:18:34 UTC |
6b2d2de | Ankit Aggarwal | 06 September 2018, 19:18:59 UTC | Moved libsim_qurt.a from bin/v60/sim_qurt to bin/v60 | 06 September 2018, 19:18:59 UTC |
0bf8b5c | Ankit Aggarwal | 06 September 2018, 18:44:48 UTC | Minor change | 06 September 2018, 18:44:48 UTC |
7622ba7 | Steven Johnson | 06 September 2018, 18:38:57 UTC | Address review comments | 06 September 2018, 18:38:57 UTC |
6e298e9 | Steven Johnson | 06 September 2018, 18:20:06 UTC | Add Buffer::as_const() This is just syntactic sugar for `as<const T>()`, for the case of having a Buffer<T>& (or Buffer<T>*) and needing the const variation; it avoid needing to recapitulate the type. (Looks likely to be used commonly enough in my use case to be worthwhile, but if there is pushback as too-much-sugar, LMK.) Also, drive-by addition of HALIDE_ALWAYS_INLINE. | 06 September 2018, 18:20:06 UTC |
b83ead4 | Steven Johnson | 05 September 2018, 23:28:29 UTC | Add proper return-type forwarding for Halide::Buffer as well | 05 September 2018, 23:28:29 UTC |
5d05655 | Steven Johnson | 05 September 2018, 23:05:57 UTC | Merge pull request #3257 from halide/srj-copyfrom Buffer<const T>::copy_from() should be illegal | 05 September 2018, 23:05:57 UTC |
d1becce | Steven Johnson | 05 September 2018, 22:52:48 UTC | Buffer::fill() and similar should return ref-to-self This allows for somewhat easier initializing of `Buffer<const T>` via use of fill(), for_each_value(), etc while minimizing the use of named temporaries. (Related: would it make sense to add a reference-to-self as a final argument to the callback for `for_each_element`?) | 05 September 2018, 22:52:48 UTC |
9976385 | Ankit Aggarwal | 05 September 2018, 11:32:20 UTC | Changed the install_qc target in top level Makefile to adjust for new location of libsim_qurt.a | 05 September 2018, 11:32:20 UTC |
645bc1a | Ankit Aggarwal | 05 September 2018, 10:21:28 UTC | Changed the directory structure for tools/sim_qurt 1. Moved tools/sim_qurt/sim_qurt.cpp to src/runtime/hexagon_remote/ 2. Modified the src/runtime/hexagon_remote/Makefile to build libsim_qurt.a 3. Removed duplicate code from sim_remote.cpp 4. Link libsim_qurt.a with hexagon_sim_remote for definitions of qurt_hvx_lock and qurt_hvx_unlock functions | 05 September 2018, 10:21:28 UTC |
dc47203 | Steven Johnson | 04 September 2018, 23:59:21 UTC | Add Buffer<>::alias() Per discussion in https://github.com/halide/Halide/pull/3253, add an alias() method as syntactic sugar to make a shallow Buffer copy. (Not sure if ready to land or not; putting it out for reference.) | 04 September 2018, 23:59:21 UTC |
c84c065 | Steven Johnson | 04 September 2018, 22:05:40 UTC | Buffer<const T>::copy_from() should be illegal It should not be legal to call copy_from() on a Buffer with a const value type, but (thanks to explicit casting in the implementation which subverts the usual checks), it is. Added a static_cast() to prevent this. Also, a drive-by fix to make is_bounds_query() a const method. | 04 September 2018, 22:05:40 UTC |
cd76c21 | Andrew Adams | 04 September 2018, 19:28:50 UTC | Merge pull request #3230 from halide/solve_div_simplifier_test Add a simplify test from #3229. | 04 September 2018, 19:28:50 UTC |
1114608 | Andrew Adams | 04 September 2018, 19:28:00 UTC | Merge remote-tracking branch 'origin/master' into fix_3061 | 04 September 2018, 19:28:00 UTC |
540fe6e | Andrew Adams | 04 September 2018, 18:48:14 UTC | Merge pull request #3246 from halide/srj-feat Add missing Feature enums to PyEnums and halide.cmake | 04 September 2018, 18:48:14 UTC |
efa9ae3 | Andrew Adams | 04 September 2018, 18:47:49 UTC | Merge pull request #3223 from jn80842/verification_pr rewrite x/x to 1 only if we can show x != 0 | 04 September 2018, 18:47:49 UTC |
51790cc | Andrew Adams | 04 September 2018, 18:47:01 UTC | Merge remote-tracking branch 'origin/master' into use_host_cuda_capability_v2 | 04 September 2018, 18:47:01 UTC |
c682592 | Andrew Adams | 04 September 2018, 18:38:25 UTC | Merge remote-tracking branch 'origin/master' into metaprogrammed_simplifier_rules | 04 September 2018, 18:38:25 UTC |
10c4bbe | Zalman Stern | 29 August 2018, 00:23:30 UTC | Merge pull request #3251 from halide/op_table_bugs Fix type conversion bugs in associative ops table | 29 August 2018, 00:23:30 UTC |
792f7e0 | Steven Johnson | 28 August 2018, 21:05:45 UTC | Merge branch 'master' into op_table_bugs | 28 August 2018, 21:05:45 UTC |
3f8f48c | Zalman Stern | 28 August 2018, 20:08:50 UTC | Merge pull request #3249 from halide/srj-order Ensure calls to codegen() are well-ordered | 28 August 2018, 20:08:50 UTC |
d887783 | Steven Johnson | 28 August 2018, 17:44:22 UTC | Merge branch 'master' into srj-feat | 28 August 2018, 17:44:22 UTC |
d751ca1 | Steven Johnson | 28 August 2018, 17:43:56 UTC | Merge branch 'master' into srj-order | 28 August 2018, 17:43:56 UTC |
1604d13 | Patricia Suriana | 28 August 2018, 17:00:59 UTC | Fix type conversion bugs in associative ops table | 28 August 2018, 17:00:59 UTC |
1e2169e | Steven Johnson | 28 August 2018, 17:00:04 UTC | Merge pull request #3243 from halide/srj-addus Add LL helpers for paddus/psubus on LLVM8.0+ | 28 August 2018, 17:00:04 UTC |
72f9758 | Zalman Stern | 28 August 2018, 16:36:30 UTC | Merge pull request #3250 from inazarenko/skip_stages Don't skip stages that produce values of some other func. | 28 August 2018, 16:36:30 UTC |
ea88418 | Zalman Stern | 28 August 2018, 02:54:05 UTC | Merge pull request #3248 from halide/rvar_reorder_only_prove_assoc_if_needed Change RVar reordering code to not do associativity/commutativity | 28 August 2018, 02:54:05 UTC |
8b0bb22 | Steven Johnson | 27 August 2018, 23:37:22 UTC | Ensure calls to codegen() are well-ordered C++11 doesn't guarantee that function arguments are evaluated in a particular order, thus calls of the form builder->CreateFoo(codegen(a), codegen(b)) might generate LLVM IR with either a-then-b or b-then-a; at best, this makes comparing IR between compilers a nuisance; at worst, it can trigger subtle bugs and make them harder to find (see #3203). This PR looks for all calls that evalutate codegen() more than once as a function arg and rearranges code to use temporaries to ensure a well-defined order. (Note that a few with only a single call to codegen() were also pulled into temporaries where I thought it improved clarity or helped forestall reinsertion of the bad code pattern by future edits.) | 27 August 2018, 23:37:22 UTC |
2956ee6 | Igor Nazarenko | 27 August 2018, 17:56:40 UTC | Don't skip stages that produce values of some other func. When determining whether it's safe to skip producing a func, consider stores to non-local buffers of other funcs inside the Producer stage. The predicate for skipping the computation must imply that conditions around those stores are false. | 27 August 2018, 22:50:45 UTC |
7b3e2b7 | Steven Johnson | 27 August 2018, 21:49:01 UTC | Temporarily disable some tests in simd_op_check | 27 August 2018, 21:49:01 UTC |
79ae87f | Z Stern | 27 August 2018, 21:00:21 UTC | Change RVar reordering code to not do associativity/commutativity proof if it is not needed for a particular reordering. The proof is somewhat expensive and also results in possibly infinite loops on some pieces of code so running it all the time is problematic. | 27 August 2018, 21:00:21 UTC |
174c3dd | Steven Johnson | 27 August 2018, 19:43:15 UTC | Add missing Feature enums to PyEnums and halide.cmake | 27 August 2018, 19:43:15 UTC |
35544ab | Zalman Stern | 27 August 2018, 18:25:41 UTC | Merge pull request #3241 from halide/another_buffer_copy_fix Fix cases where halide_buffer_copy could copy to/from a NULL host pointer... | 27 August 2018, 18:25:41 UTC |
99cdb03 | Steven Johnson | 27 August 2018, 18:11:17 UTC | Add LL helpers for paddus/psubus on LLVM8.0+ Needed to work properly with https://reviews.llvm.org/D46179#1211902; unfortunately this patch is (still) broken by https://bugs.llvm.org/show_bug.cgi?id=38691, and probably shouldn't land until that is resolved. Posting here as a PR to capture the work-in-progress, but not really ready to review yet. | 27 August 2018, 18:11:17 UTC |
37540bb | jn80842 | 27 August 2018, 16:27:23 UTC | fixed merge | 27 August 2018, 16:27:23 UTC |
b3e18ee | jn80842 | 27 August 2018, 16:21:23 UTC | merge | 27 August 2018, 16:21:23 UTC |
06f4c1a | jn80842 | 27 August 2018, 15:16:09 UTC | assume that rewritten expressions contain no division by zero | 27 August 2018, 15:16:09 UTC |
d19f5c2 | jn80842 | 17 August 2018, 19:47:04 UTC | whitespace | 27 August 2018, 14:56:43 UTC |
2ef21d2 | jn80842 | 17 August 2018, 19:45:28 UTC | revert term reordering fixes | 27 August 2018, 14:56:43 UTC |
df8cb94 | jn80842 | 17 August 2018, 19:27:29 UTC | rewrite x/x to 1 only if we can show x != 0 | 27 August 2018, 14:56:43 UTC |
20eee94 | jn80842 | 16 August 2018, 18:17:28 UTC | fixing rules that cannot match because exprs are reordered/rewritten before matching | 27 August 2018, 14:56:43 UTC |
7701abe | Z Stern | 27 August 2018, 09:04:43 UTC | Fix cases where halide_buffer_copy could copy to/from a host pointer that was NULL where the case was valid by compying from the device allocation. Add tests for these cases. Change name of do_multidimensional_copy in opencl and cuda runtimes to be unique to each runtime as the opencl runtime was calling the cuda do_multidimensional_copy despite both being in anonymous namespaces inside their respective files. Weak linking and C++ namespaces and our unusual runtime linking and probably at least one bug somewhere caused this to go badly. Required trying to use both cuda and opencl at the same time. | 27 August 2018, 09:04:43 UTC |
1e69e8b | Zalman Stern | 24 August 2018, 18:31:23 UTC | Merge pull request #3238 from halide/empty_pipeline_bound Ignore function with empty pipeline bounds when creating partitioner | 24 August 2018, 18:31:23 UTC |
5481f15 | Jing Pu | 24 August 2018, 17:47:40 UTC | Merge pull request #2995 from halide/simplify_mod Add simplify rules for a modulo of a difference. | 24 August 2018, 17:47:40 UTC |
1f43e2f | Zalman Stern | 24 August 2018, 17:35:20 UTC | Merge pull request #3236 from halide/buffer_copy_fix Fix halide_buffer_copy crash introduced by latest changes for host to… | 24 August 2018, 17:35:20 UTC |
f44f2df | Zalman Stern | 24 August 2018, 17:33:04 UTC | Merge pull request #3237 from halide/avx_512_alignment Move x86 to 64-byte alignment because of AVX-512. | 24 August 2018, 17:33:04 UTC |
0e6a58d | Z Stern | 24 August 2018, 06:30:03 UTC | Move x86 to 64-byte alignment because of AVX-512. | 24 August 2018, 06:30:03 UTC |
e0d693f | Z Stern | 24 August 2018, 06:14:36 UTC | Fix halide_buffer_copy crash introduced by latest changes for host to host copy. Add a test to cover this simple case. | 24 August 2018, 06:14:36 UTC |
e1f1408 | Patricia Suriana | 23 August 2018, 23:00:19 UTC | Ignore function with empty pipeline bounds when creating partitioner | 23 August 2018, 23:00:19 UTC |
fd1b76f | Zalman Stern | 23 August 2018, 22:56:30 UTC | Merge pull request #3176 from halide/buffer_copy_improvements Buffer copy improvements | 23 August 2018, 22:56:30 UTC |
14a4ae3 | Z Stern | 23 August 2018, 19:55:59 UTC | Merge branch 'master' into buffer_copy_improvements | 23 August 2018, 19:55:59 UTC |
9d97eaa | Zalman Stern | 23 August 2018, 19:54:31 UTC | Merge pull request #3233 from zanbri/acquire_release_cuda_context_bug Incorrect acquire_context and release_context function names in test script | 23 August 2018, 19:54:31 UTC |
f9d3254 | Alexander (Zan) Bridi | 23 August 2018, 16:49:51 UTC | Fixed acquire_release_bug: issue #3232 | 23 August 2018, 16:49:51 UTC |
0cb5825 | Z Stern | 22 August 2018, 23:37:09 UTC | Address review feedback. This makes an error condition actually return an error in one case as well. | 22 August 2018, 23:37:09 UTC |
d9973ca | Z Stern | 22 August 2018, 23:20:38 UTC | Merge branch 'master' into buffer_copy_improvements | 22 August 2018, 23:20:38 UTC |
331df56 | Jing Pu | 21 August 2018, 20:16:29 UTC | Add a simplify test from #3229. | 21 August 2018, 20:16:29 UTC |
2e48dad | Jing Pu | 21 August 2018, 19:51:41 UTC | sync the test format to #2996 | 21 August 2018, 19:51:41 UTC |
ee3b29d | Jing Pu | 21 August 2018, 19:41:45 UTC | Merge branch 'master' into simplify_mod | 21 August 2018, 19:41:45 UTC |
4518a15 | Steven Johnson | 21 August 2018, 17:38:23 UTC | Merge pull request #3224 from halide/reorder_rvar Support reordering RVar if a function is associative. | 21 August 2018, 17:38:23 UTC |
f571a02 | Patricia Suriana | 21 August 2018, 16:13:47 UTC | Merge pull request #3226 from halide/bug_inline_unbounded Fix inlining unbounded functions | 21 August 2018, 16:13:47 UTC |
4e716a5 | Jing Pu | 21 August 2018, 06:44:17 UTC | Merge pull request #3222 from inazarenko/fix_scope Sync ScopedBinding<void> to the main template. | 21 August 2018, 06:44:17 UTC |
a5b0ab4 | Jing Pu | 20 August 2018, 21:22:22 UTC | Check commutativity. | 20 August 2018, 21:22:22 UTC |
37637fe | Patricia Suriana | 20 August 2018, 20:02:29 UTC | Fix inlining unbounded functions | 20 August 2018, 20:02:29 UTC |
3ca81a2 | Jing Pu | 20 August 2018, 18:08:39 UTC | Support reordering Rvars of associative functions. | 20 August 2018, 18:08:39 UTC |
26e2625 | jn80842 | 17 August 2018, 19:47:04 UTC | whitespace | 17 August 2018, 19:47:04 UTC |
72c58f3 | jn80842 | 17 August 2018, 19:45:28 UTC | revert term reordering fixes | 17 August 2018, 19:45:28 UTC |
6a6d11a | jn80842 | 17 August 2018, 19:27:29 UTC | rewrite x/x to 1 only if we can show x != 0 | 17 August 2018, 19:27:29 UTC |
416ea2c | jn80842 | 17 August 2018, 19:25:52 UTC | Merge remote-tracking branch 'upstream/metaprogrammed_simplifier_rules' into verification_pr | 17 August 2018, 19:25:52 UTC |
50fa921 | Andrew Adams | 17 August 2018, 19:21:42 UTC | Back off the stack size limit on the fft app slightly | 17 August 2018, 19:21:42 UTC |
17d549c | Andrew Adams | 17 August 2018, 19:20:57 UTC | Merge remote-tracking branch 'origin/master' into metaprogrammed_simplifier_rules | 17 August 2018, 19:20:57 UTC |
83fa9aa | Igor Nazarenko | 16 August 2018, 20:20:38 UTC | Sync ScopedBinding<void> to the main template. Adds a constructor that allows to skip binding the name, which is already present for other types, but not in the specialization for void. | 16 August 2018, 20:20:38 UTC |