https://github.com/halide/Halide

sort by:
Revision Author Date Message Commit Date
2e48dad sync the test format to #2996 21 August 2018, 19:51:41 UTC
ee3b29d Merge branch 'master' into simplify_mod 21 August 2018, 19:41:45 UTC
4518a15 Merge pull request #3224 from halide/reorder_rvar Support reordering RVar if a function is associative. 21 August 2018, 17:38:23 UTC
f571a02 Merge pull request #3226 from halide/bug_inline_unbounded Fix inlining unbounded functions 21 August 2018, 16:13:47 UTC
4e716a5 Merge pull request #3222 from inazarenko/fix_scope Sync ScopedBinding<void> to the main template. 21 August 2018, 06:44:17 UTC
a5b0ab4 Check commutativity. 20 August 2018, 21:22:22 UTC
37637fe Fix inlining unbounded functions 20 August 2018, 20:02:29 UTC
3ca81a2 Support reordering Rvars of associative functions. 20 August 2018, 18:08:39 UTC
83fa9aa Sync ScopedBinding<void> to the main template. Adds a constructor that allows to skip binding the name, which is already present for other types, but not in the specialization for void. 16 August 2018, 20:20:38 UTC
b225e0d Merge pull request #3219 from halide/revert-3212-srj-codegen Revert "Ensure calls to codegen() are well-ordered" 14 August 2018, 22:43:00 UTC
50f4a13 Revert "Ensure calls to codegen() are well-ordered" 14 August 2018, 22:10:22 UTC
29ba9d2 Merge pull request #3212 from halide/srj-codegen Ensure calls to codegen() are well-ordered 14 August 2018, 21:58:53 UTC
e5d3125 Merge pull request #3217 from halide/srj-tidy Hygiene: move nontrivial Target method bodies into .cpp file 14 August 2018, 18:17:27 UTC
51895c9 Hygiene: move nontrivial Target method bodies into .cpp file Not at all important, but some of these methods have grown pretty long over the years. 13 August 2018, 22:44:29 UTC
47203ba Merge pull request #3216 from halide/srj-sf Special-case strict_float in Bounds.cpp to avoid n^2 behavior 13 August 2018, 21:17:48 UTC
c273948 Merge branch 'master' into srj-codegen 13 August 2018, 19:50:58 UTC
e538132 Reorder code 13 August 2018, 19:49:57 UTC
5c19d27 Special-case strict_float in Bounds.cpp to avoid n^2 behavior 13 August 2018, 18:32:34 UTC
c4dc874 Merge pull request #3213 from halide/srj-select Remove special-case for int32 from CodeGen_LLVM::visit(const Select *op) 11 August 2018, 22:28:15 UTC
6977295 Merge pull request #3105 from halide/unsafe_promises Add unsafe promises feature. 11 August 2018, 18:06:06 UTC
57120fc Merge branch 'master' into unsafe_promises 11 August 2018, 00:02:25 UTC
f981846 Reorder to to be closer to original 10 August 2018, 23:06:46 UTC
a915469 Remove special-case for int32 from CodeGen_LLVM::visit(const Select *op) LLVM circa v3.2 or so had a bug wherein complex pipeline would compile extremely slowly, so a workaround was inserted; I can't replicate this as of LLVM5 or later (which is all we are supporting now), so let's consider backing out this workaround. 10 August 2018, 19:40:30 UTC
6d61433 Ensure calls to codegen() are well-ordered C++11 doesn't guarantee that function arguments are evaluated in a particular order, thus calls of the form `builder->CreateFoo(codegen(a), codegen(b))` might generate LLVM IR with either a-then-b or b-then-a; at best, this makes comparing IR between compilers a nuisance; at worst, it can trigger subtle bugs and make them harder to find (see https://github.com/halide/Halide/issues/3203). This PR looks for all calls that evalutate codegen() more than once as a function arg and rearranges code to use temporaries to ensure a well-defined order. (Note that a few with only a single call to codegen() were also pulled into temporaries where I thought it improved clarity or helped forestall reinsertion of the bad code pattern by future edits.) 10 August 2018, 19:17:15 UTC
8c2f090 Merge pull request #3199 from halide/srj-rt-11 Explicitly specify -std=gnu++98 for runtime .cpp building 10 August 2018, 17:04:45 UTC
b83d7e7 Merge pull request #3205 from halide/kamil/coverity_issue Appease Coverity 10 August 2018, 13:59:18 UTC
3dfadd9 Merge pull request #3204 from halide/fix_comment Fix a typo in comment. 10 August 2018, 05:46:50 UTC
25dedcf Merge pull request #3184 from halide/gpu_types_for_kernel_run Add support for passing typecode, not just size, in args to GPU kernel run routine. 09 August 2018, 22:10:48 UTC
890df56 Appease Coverity 09 August 2018, 20:43:47 UTC
a20a60a Fix a typo in comment. 09 August 2018, 18:06:27 UTC
7ddda78 Merge pull request #3201 from halide/expr_uses_var Add an argument of a scope of variable binding to expr_uses_var. 09 August 2018, 04:43:30 UTC
27f4deb pass scope argument. 09 August 2018, 00:03:10 UTC
68a316b Add an argument of scope of variable binding to stmt_uses_var. 09 August 2018, 00:00:43 UTC
e5aecbe Explicitly specify -std=gnu++98 for runtime .cpp building We don't want c++11, as we don't want static locals to get thread-sync helper code (which c++11 mandates); previously, we didn't specify any version, which left us at the mercy of the system compiler (which might have defaulted to c++11). Choosing gnu++98 is arbitrary but gives us a predictable baseline across common build environments. 08 August 2018, 01:04:48 UTC
793734e Address review feedback. 07 August 2018, 19:30:21 UTC
2caad82 Merge branch 'master' into gpu_types_for_kernel_run 07 August 2018, 19:17:29 UTC
ea9c863 Merge pull request #3195 from halide/srj-uint64 call to halide_hexagon_run(): arg_sizes should be uint64*, not size_t* 07 August 2018, 17:19:49 UTC
9c53d76 Merge pull request #3196 from halide/srj-stable Use std::stable_sort in LoopCarry 07 August 2018, 17:19:37 UTC
ddb1619 Use std::stable_sort in LoopCarry This is really just for debugging hygiene: use of std::sort means that chains of same length can be in different orders in different C++ compilers; this doesn't appear to affect correctness of output, but having the sort be stable does make comparing IR from different compiler versions easier. 06 August 2018, 21:39:18 UTC
e21e61a call to halide_hexagon_run(): arg_sizes should be uint64*, not size_t* 06 August 2018, 17:43:08 UTC
1289ad3 Merge pull request #3190 from halide/srj-foreach2 Make Buffer::for_each_value const-correct 03 August 2018, 22:27:18 UTC
a7215a6 Make Buffer::for_each_value const-correct Declare it const, then add perfect-forwarding to some helper methods; this allows us to call it on `const Buffer<T>` (which are fine to mutate if T is not const) and also `Buffer<const T>` (which are now correctly prohibited from mutating the values via nonconst reference args to the lambda). 03 August 2018, 18:25:35 UTC
0b97b66 Appease compiler pedantry. 02 August 2018, 23:31:33 UTC
6704b56 Fix plain-C issue in HalideRuntimeOpenGLCompute.h . 02 August 2018, 18:58:44 UTC
366f949 Merge pull request #3041 from matthiaskramm/ctlsz Make CodeGen_C aware of count_leading_zeros 02 August 2018, 17:36:59 UTC
fd47a65 Merge pull request #3118 from SanderVocke/disable_leakcheck Disable asan leak checking during code generation. 02 August 2018, 17:16:35 UTC
2101955 Merge pull request #3173 from halide/srj-copy-as Add some HalideBuffer convenience methods (Issue #3113) 02 August 2018, 16:48:42 UTC
2e7fb15 Merge pull request #3172 from halide/fix_warp_shuffles Fix some pathological behavior in warp shuffles 02 August 2018, 15:19:52 UTC
aa70aaa Merge pull request #3183 from halide/bump_llvm_version LLVM is bumping their version number to 8 01 August 2018, 23:58:33 UTC
6c3d30b Change copy() to just return nonconst 01 August 2018, 23:08:18 UTC
6ca35a9 Add support for passing typecode insitead of just size with args to GPU kernel run routine. Eventually all GPU runtimes should probably use this one mechanism, but for now it is made optional per backedn to prevent breaking applications which have replaced the weak linked kernel run routine for a given API. OpenGL Compute is switched to the types based model as it was previously broken for types other than int32_t. Remove tabs in some files. 01 August 2018, 22:55:56 UTC
e0316d1 Remove nonstatic make_with_shape_of() 01 August 2018, 22:43:58 UTC
d8917c4 Some build bots don't have the requisite cuda version 01 August 2018, 22:17:50 UTC
ac129fe LLVM is bumping their version number to 8 01 August 2018, 22:00:29 UTC
f39ea35 Merge remote-tracking branch 'origin/master' into fix_warp_shuffles 01 August 2018, 18:12:42 UTC
8c772dd Merge pull request #3165 from halide/quieter_test_correctness only report failures in test_correctness 01 August 2018, 17:26:01 UTC
0122fd7 Merge remote-tracking branch 'origin/master' into disable_leakcheck 01 August 2018, 11:13:40 UTC
92a16ea Merge pull request #3177 from halide/readme_update Update README.md to reflect a slightly more current version of target support. 01 August 2018, 01:05:52 UTC
a9de173 Merge pull request #3178 from halide/srj-warn Don't emit #pragma message in D3D12 backend 01 August 2018, 00:52:25 UTC
58d2c8a Don't emit #pragma message in D3D12 backend 01 August 2018, 00:48:46 UTC
4829e0a Typo. 01 August 2018, 00:38:35 UTC
b61de00 Update README.md to reflect a slightly more current version of target support. 01 August 2018, 00:33:25 UTC
fd5947a Fix comment 31 July 2018, 20:47:11 UTC
d8edcae Merge pull request #3166 from halide/faster_cascaded_filters Speed up cascaded filters test 31 July 2018, 19:51:15 UTC
f43288e Merge branch 'master' into unsafe_promises 31 July 2018, 18:21:15 UTC
39c5839 Add some HalideBuffer convenience methods (Issue #3113) - augment copy() to allow copying-to-a-different-type - add nonstatic variant of make_with_shape_of() - add reset() 31 July 2018, 18:19:02 UTC
e0e73f1 Fix some pathological behavior in warp shuffles 31 July 2018, 18:10:17 UTC
ddbf7e6 Merge remote-tracking branch 'origin/master' into quieter_test_correctness 31 July 2018, 18:08:57 UTC
51fd2c8 Merge pull request #3155 from suyogsarda/master Update image info API's for halide_buffer_t 31 July 2018, 17:04:29 UTC
92fe75a Merge pull request #3168 from halide/srj-buf Buffer move-construct ctor was missing assert_can_convert_from 31 July 2018, 16:51:49 UTC
9269c83 Merge branch 'unsafe_promises' of https://github.com/halide/Halide into unsafe_promises 31 July 2018, 15:12:28 UTC
ffb8196 Lower unsafe promises earlier This gives codegen a chance to peephole things like dense vector loads Also fix a minor bug that showed up in bounds. 31 July 2018, 15:12:17 UTC
a3ee5f7 Merge pull request #3158 from halide/srj-incl Add missing #include to HexagonAlignment.h 31 July 2018, 00:56:30 UTC
01cad4f Buffer move-construct ctor was missing assert_can_convert_from 31 July 2018, 00:37:46 UTC
db256bd Merge pull request #3160 from halide/openglcompute-location-fix Add explicit location layout specifier to uniforms. 30 July 2018, 23:40:17 UTC
30ab4c4 Merge branch 'master' into faster_cascaded_filters 30 July 2018, 23:35:27 UTC
0295c73 Merge branch 'master' into quieter_test_correctness 30 July 2018, 23:34:51 UTC
9652625 Merge pull request #3159 from halide/srj-d3d Fix D3D12 issues 30 July 2018, 23:30:57 UTC
d8ab812 Merge branch 'master' into srj-d3d 30 July 2018, 20:02:54 UTC
49136b3 Merge branch 'master' into openglcompute-location-fix 30 July 2018, 18:11:39 UTC
3cc9e27 Merge branch 'master' into unsafe_promises 30 July 2018, 18:11:03 UTC
77ed646 Merge pull request #3081 from tbennun/master Makefile LLVM static library fix 30 July 2018, 18:09:19 UTC
a861bf9 Speed up cascaded filters test 30 July 2018, 17:25:47 UTC
dbca67d only failures in test_correctness 30 July 2018, 17:11:36 UTC
98f0b1a always use inline implementation of clz 30 July 2018, 05:46:50 UTC
bf76726 use "sizeof(long long) * 8" instead of "64" 30 July 2018, 05:46:50 UTC
9f5dc7c add tests for 8/16/32/64 bit 30 July 2018, 05:46:50 UTC
a1c9683 use clzll instead of clz, for 64 bit integers 30 July 2018, 05:46:50 UTC
f3a26ef Make CodeGen_C aware of count_leading_zeros 30 July 2018, 05:46:50 UTC
d14d3a1 Add explicit location layout specifier to uniforms as GLSL does not guarantee they are in any particular order otherwise. 28 July 2018, 00:28:59 UTC
be66f83 Remove redundant 'return' to silence ClangTidy 27 July 2018, 21:38:42 UTC
866c3ce Fix D3D12 issues - avoid the use of fallthru-without-break in CodeGen_D3D12Compute_Dev.cpp (some compiler settings assume this is an error, and the conventional equivalent is fine here) - allow runtime linker to compile when WITH_D3D12 is not defined 27 July 2018, 21:09:16 UTC
6a7ff1b Add missing #include to HexagonAlignment.h 27 July 2018, 20:20:23 UTC
cd95b27 Merge pull request #3116 from halide/srj-features Rework runtime to allow more than 64 Target::Features (Issue #2911) 27 July 2018, 16:56:13 UTC
78fde6c Update image info API's for halide_buffer_t Replace buffer_t with halide_buffer_t and other data structure. 27 July 2018, 15:48:04 UTC
14f594e Merge pull request #2 from halide/master Sync 27 July 2018, 11:52:25 UTC
d734dc0 Merge branch 'master' into srj-features 26 July 2018, 22:29:32 UTC
d91647c Merge pull request #2913 from suyogsarda/master Add v65 instructions for HVX 26 July 2018, 16:15:07 UTC
b6b9d6b Merge pull request #3152 from halide/interleave-bug Interleave bug 26 July 2018, 16:14:11 UTC
457d333 Merge pull request #2755 from halide/d3d12 Direct3D 12 (compute) GPU back-end 26 July 2018, 14:18:23 UTC
back to top