https://github.com/halide/Halide

sort by:
Revision Author Date Message Commit Date
c646ef2 fix comment. 20 September 2018, 00:39:00 UTC
8eb5c62 Merge branch 'master' into solve_div_master 19 September 2018, 18:48:40 UTC
2de2f25 Merge pull request #3278 from halide/fix_warp_shuffles_2 Add new simplifier rules to handle warp shuffle breakage 19 September 2018, 03:25:07 UTC
45ae466 Add new simplifier rules to handle warp shuffle breakage 18 September 2018, 18:58:13 UTC
e852bb0 fixed a overflow issue. 15 September 2018, 02:41:27 UTC
1aec3c8 Merge branch 'master' into solve_div_master 15 September 2018, 02:36:45 UTC
e48e6e2 Merge pull request #2845 from halide/metaprogrammed_simplifier_rules Metaprogrammed simplifier rules 14 September 2018, 23:08:50 UTC
a5117ef Merge pull request #3264 from halide/use_host_cuda_capability_v3 Use host cuda capability v3 14 September 2018, 16:28:15 UTC
44bddf8 Merge pull request #3265 from halide/fix_3061_v2 Fix 3061 v2 14 September 2018, 16:15:15 UTC
a9276a6 Merge branch 'metaprogrammed_simplifier_rules' of https://github.com/halide/Halide into metaprogrammed_simplifier_rules 14 September 2018, 15:38:25 UTC
7dafba1 Make failed proof probing conditional on debug level 14 September 2018, 15:38:17 UTC
c072dcc Switch to unordered_map in scope. It's slightly faster 14 September 2018, 04:04:59 UTC
2a39b9f Add some missing library loads 13 September 2018, 17:20:36 UTC
5c517cf Merge remote-tracking branch 'origin/master' into fix_3061_v2 13 September 2018, 17:09:20 UTC
676d8bf Merge pull request #3270 from halide/srj-bug Add correct LLVM bug to simd_op_check comment 12 September 2018, 18:42:44 UTC
c43fcea Add correct LLVM bug to simd_op_check comment 12 September 2018, 18:03:10 UTC
a39b712 Merge remote-tracking branch 'origin/master' into fix_3061_v2 12 September 2018, 15:48:53 UTC
7ba586a Fix generator_aot_acquire_release for cuda 12 September 2018, 15:39:24 UTC
bf93ae1 Merge pull request #3268 from halide/srj-addus Re-enable addus/subus in simd_op_check 11 September 2018, 20:10:28 UTC
04c4440 Wrap long line 11 September 2018, 17:30:38 UTC
47cdabc Add Halide namespace to test 11 September 2018, 17:29:48 UTC
8264d19 Re-enable addus/subus in simd_op_check Underlying bug in LLVM fixed in trunk. Note that pcmp is still disabled due to related bug which is not yet fixed. 11 September 2018, 17:04:47 UTC
74f1678 Merge remote-tracking branch 'origin/master' into fix_3061 10 September 2018, 21:15:22 UTC
dabd88c Merge remote-tracking branch 'origin/master' into use_host_cuda_capability_v2 07 September 2018, 16:59:08 UTC
b4ea2b0 Merge branch 'master' into metaprogrammed_simplifier_rules 07 September 2018, 16:57:48 UTC
09dc5d5 Merge pull request #3260 from halide/srj-fill Buffer::fill() and similar should return ref-to-self 07 September 2018, 01:45:09 UTC
ffa7e0e Merge pull request #3261 from halide/srj-asconst Add Buffer::as_const() 06 September 2018, 23:17:45 UTC
a98714f Merge pull request #3258 from halide/srj-alias Add Buffer<>::alias() 06 September 2018, 23:17:26 UTC
37581d3 Merge pull request #3259 from aankit-ca/sim_qurt_cleanup Changed the directory structure for tools/sim_qurt 06 September 2018, 20:18:34 UTC
6b2d2de Moved libsim_qurt.a from bin/v60/sim_qurt to bin/v60 06 September 2018, 19:18:59 UTC
0bf8b5c Minor change 06 September 2018, 18:44:48 UTC
7622ba7 Address review comments 06 September 2018, 18:38:57 UTC
6e298e9 Add Buffer::as_const() This is just syntactic sugar for `as<const T>()`, for the case of having a Buffer<T>& (or Buffer<T>*) and needing the const variation; it avoid needing to recapitulate the type. (Looks likely to be used commonly enough in my use case to be worthwhile, but if there is pushback as too-much-sugar, LMK.) Also, drive-by addition of HALIDE_ALWAYS_INLINE. 06 September 2018, 18:20:06 UTC
b83ead4 Add proper return-type forwarding for Halide::Buffer as well 05 September 2018, 23:28:29 UTC
5d05655 Merge pull request #3257 from halide/srj-copyfrom Buffer<const T>::copy_from() should be illegal 05 September 2018, 23:05:57 UTC
d1becce Buffer::fill() and similar should return ref-to-self This allows for somewhat easier initializing of `Buffer<const T>` via use of fill(), for_each_value(), etc while minimizing the use of named temporaries. (Related: would it make sense to add a reference-to-self as a final argument to the callback for `for_each_element`?) 05 September 2018, 22:52:48 UTC
9976385 Changed the install_qc target in top level Makefile to adjust for new location of libsim_qurt.a 05 September 2018, 11:32:20 UTC
645bc1a Changed the directory structure for tools/sim_qurt 1. Moved tools/sim_qurt/sim_qurt.cpp to src/runtime/hexagon_remote/ 2. Modified the src/runtime/hexagon_remote/Makefile to build libsim_qurt.a 3. Removed duplicate code from sim_remote.cpp 4. Link libsim_qurt.a with hexagon_sim_remote for definitions of qurt_hvx_lock and qurt_hvx_unlock functions 05 September 2018, 10:21:28 UTC
dc47203 Add Buffer<>::alias() Per discussion in https://github.com/halide/Halide/pull/3253, add an alias() method as syntactic sugar to make a shallow Buffer copy. (Not sure if ready to land or not; putting it out for reference.) 04 September 2018, 23:59:21 UTC
c84c065 Buffer<const T>::copy_from() should be illegal It should not be legal to call copy_from() on a Buffer with a const value type, but (thanks to explicit casting in the implementation which subverts the usual checks), it is. Added a static_cast() to prevent this. Also, a drive-by fix to make is_bounds_query() a const method. 04 September 2018, 22:05:40 UTC
cd76c21 Merge pull request #3230 from halide/solve_div_simplifier_test Add a simplify test from #3229. 04 September 2018, 19:28:50 UTC
1114608 Merge remote-tracking branch 'origin/master' into fix_3061 04 September 2018, 19:28:00 UTC
540fe6e Merge pull request #3246 from halide/srj-feat Add missing Feature enums to PyEnums and halide.cmake 04 September 2018, 18:48:14 UTC
efa9ae3 Merge pull request #3223 from jn80842/verification_pr rewrite x/x to 1 only if we can show x != 0 04 September 2018, 18:47:49 UTC
51790cc Merge remote-tracking branch 'origin/master' into use_host_cuda_capability_v2 04 September 2018, 18:47:01 UTC
c682592 Merge remote-tracking branch 'origin/master' into metaprogrammed_simplifier_rules 04 September 2018, 18:38:25 UTC
10c4bbe Merge pull request #3251 from halide/op_table_bugs Fix type conversion bugs in associative ops table 29 August 2018, 00:23:30 UTC
792f7e0 Merge branch 'master' into op_table_bugs 28 August 2018, 21:05:45 UTC
3f8f48c Merge pull request #3249 from halide/srj-order Ensure calls to codegen() are well-ordered 28 August 2018, 20:08:50 UTC
d887783 Merge branch 'master' into srj-feat 28 August 2018, 17:44:22 UTC
d751ca1 Merge branch 'master' into srj-order 28 August 2018, 17:43:56 UTC
1604d13 Fix type conversion bugs in associative ops table 28 August 2018, 17:00:59 UTC
1e2169e Merge pull request #3243 from halide/srj-addus Add LL helpers for paddus/psubus on LLVM8.0+ 28 August 2018, 17:00:04 UTC
72f9758 Merge pull request #3250 from inazarenko/skip_stages Don't skip stages that produce values of some other func. 28 August 2018, 16:36:30 UTC
ea88418 Merge pull request #3248 from halide/rvar_reorder_only_prove_assoc_if_needed Change RVar reordering code to not do associativity/commutativity 28 August 2018, 02:54:05 UTC
8b0bb22 Ensure calls to codegen() are well-ordered C++11 doesn't guarantee that function arguments are evaluated in a particular order, thus calls of the form builder->CreateFoo(codegen(a), codegen(b)) might generate LLVM IR with either a-then-b or b-then-a; at best, this makes comparing IR between compilers a nuisance; at worst, it can trigger subtle bugs and make them harder to find (see #3203). This PR looks for all calls that evalutate codegen() more than once as a function arg and rearranges code to use temporaries to ensure a well-defined order. (Note that a few with only a single call to codegen() were also pulled into temporaries where I thought it improved clarity or helped forestall reinsertion of the bad code pattern by future edits.) 27 August 2018, 23:37:22 UTC
2956ee6 Don't skip stages that produce values of some other func. When determining whether it's safe to skip producing a func, consider stores to non-local buffers of other funcs inside the Producer stage. The predicate for skipping the computation must imply that conditions around those stores are false. 27 August 2018, 22:50:45 UTC
7b3e2b7 Temporarily disable some tests in simd_op_check 27 August 2018, 21:49:01 UTC
79ae87f Change RVar reordering code to not do associativity/commutativity proof if it is not needed for a particular reordering. The proof is somewhat expensive and also results in possibly infinite loops on some pieces of code so running it all the time is problematic. 27 August 2018, 21:00:21 UTC
174c3dd Add missing Feature enums to PyEnums and halide.cmake 27 August 2018, 19:43:15 UTC
35544ab Merge pull request #3241 from halide/another_buffer_copy_fix Fix cases where halide_buffer_copy could copy to/from a NULL host pointer... 27 August 2018, 18:25:41 UTC
99cdb03 Add LL helpers for paddus/psubus on LLVM8.0+ Needed to work properly with https://reviews.llvm.org/D46179#1211902; unfortunately this patch is (still) broken by https://bugs.llvm.org/show_bug.cgi?id=38691, and probably shouldn't land until that is resolved. Posting here as a PR to capture the work-in-progress, but not really ready to review yet. 27 August 2018, 18:11:17 UTC
37540bb fixed merge 27 August 2018, 16:27:23 UTC
b3e18ee merge 27 August 2018, 16:21:23 UTC
06f4c1a assume that rewritten expressions contain no division by zero 27 August 2018, 15:16:09 UTC
d19f5c2 whitespace 27 August 2018, 14:56:43 UTC
2ef21d2 revert term reordering fixes 27 August 2018, 14:56:43 UTC
df8cb94 rewrite x/x to 1 only if we can show x != 0 27 August 2018, 14:56:43 UTC
20eee94 fixing rules that cannot match because exprs are reordered/rewritten before matching 27 August 2018, 14:56:43 UTC
7701abe Fix cases where halide_buffer_copy could copy to/from a host pointer that was NULL where the case was valid by compying from the device allocation. Add tests for these cases. Change name of do_multidimensional_copy in opencl and cuda runtimes to be unique to each runtime as the opencl runtime was calling the cuda do_multidimensional_copy despite both being in anonymous namespaces inside their respective files. Weak linking and C++ namespaces and our unusual runtime linking and probably at least one bug somewhere caused this to go badly. Required trying to use both cuda and opencl at the same time. 27 August 2018, 09:04:43 UTC
1e69e8b Merge pull request #3238 from halide/empty_pipeline_bound Ignore function with empty pipeline bounds when creating partitioner 24 August 2018, 18:31:23 UTC
5481f15 Merge pull request #2995 from halide/simplify_mod Add simplify rules for a modulo of a difference. 24 August 2018, 17:47:40 UTC
1f43e2f Merge pull request #3236 from halide/buffer_copy_fix Fix halide_buffer_copy crash introduced by latest changes for host to… 24 August 2018, 17:35:20 UTC
f44f2df Merge pull request #3237 from halide/avx_512_alignment Move x86 to 64-byte alignment because of AVX-512. 24 August 2018, 17:33:04 UTC
0e6a58d Move x86 to 64-byte alignment because of AVX-512. 24 August 2018, 06:30:03 UTC
e0d693f Fix halide_buffer_copy crash introduced by latest changes for host to host copy. Add a test to cover this simple case. 24 August 2018, 06:14:36 UTC
e1f1408 Ignore function with empty pipeline bounds when creating partitioner 23 August 2018, 23:00:19 UTC
fd1b76f Merge pull request #3176 from halide/buffer_copy_improvements Buffer copy improvements 23 August 2018, 22:56:30 UTC
14a4ae3 Merge branch 'master' into buffer_copy_improvements 23 August 2018, 19:55:59 UTC
9d97eaa Merge pull request #3233 from zanbri/acquire_release_cuda_context_bug Incorrect acquire_context and release_context function names in test script 23 August 2018, 19:54:31 UTC
f9d3254 Fixed acquire_release_bug: issue #3232 23 August 2018, 16:49:51 UTC
0cb5825 Address review feedback. This makes an error condition actually return an error in one case as well. 22 August 2018, 23:37:09 UTC
d9973ca Merge branch 'master' into buffer_copy_improvements 22 August 2018, 23:20:38 UTC
331df56 Add a simplify test from #3229. 21 August 2018, 20:16:29 UTC
e0eb519 Add a division visitor method to SolveExpression. 21 August 2018, 20:06:28 UTC
2e48dad sync the test format to #2996 21 August 2018, 19:51:41 UTC
ee3b29d Merge branch 'master' into simplify_mod 21 August 2018, 19:41:45 UTC
4518a15 Merge pull request #3224 from halide/reorder_rvar Support reordering RVar if a function is associative. 21 August 2018, 17:38:23 UTC
f571a02 Merge pull request #3226 from halide/bug_inline_unbounded Fix inlining unbounded functions 21 August 2018, 16:13:47 UTC
4e716a5 Merge pull request #3222 from inazarenko/fix_scope Sync ScopedBinding<void> to the main template. 21 August 2018, 06:44:17 UTC
a5b0ab4 Check commutativity. 20 August 2018, 21:22:22 UTC
37637fe Fix inlining unbounded functions 20 August 2018, 20:02:29 UTC
3ca81a2 Support reordering Rvars of associative functions. 20 August 2018, 18:08:39 UTC
26e2625 whitespace 17 August 2018, 19:47:04 UTC
72c58f3 revert term reordering fixes 17 August 2018, 19:45:28 UTC
6a6d11a rewrite x/x to 1 only if we can show x != 0 17 August 2018, 19:27:29 UTC
416ea2c Merge remote-tracking branch 'upstream/metaprogrammed_simplifier_rules' into verification_pr 17 August 2018, 19:25:52 UTC
50fa921 Back off the stack size limit on the fft app slightly 17 August 2018, 19:21:42 UTC
17d549c Merge remote-tracking branch 'origin/master' into metaprogrammed_simplifier_rules 17 August 2018, 19:20:57 UTC
83fa9aa Sync ScopedBinding<void> to the main template. Adds a constructor that allows to skip binding the name, which is already present for other types, but not in the specialization for void. 16 August 2018, 20:20:38 UTC
back to top