029ec23 | Pranav Bhandarkar | 13 December 2019, 23:59:41 UTC | Fix access to Hexagon intrinsic enums | 13 December 2019, 23:59:41 UTC |
255e2e1 | Pranav Bhandarkar | 13 December 2019, 22:07:01 UTC | Merge branch 'master' of https://github.com/halide/Halide | 13 December 2019, 22:07:01 UTC |
18304a7 | Volodymyr Kysenko | 11 December 2019, 18:40:10 UTC | Merge pull request #4467 from halide/vksnk/pos_inf-memory-assert Check if shared memory allocation is bounded | 11 December 2019, 18:40:10 UTC |
51d7413 | Dillon Sharlet | 11 December 2019, 16:42:15 UTC | Merge pull request #4468 from halide/fix-llvm Fix for trunk LLVM. | 11 December 2019, 16:42:15 UTC |
a553fd5 | dsharletg | 11 December 2019, 07:28:01 UTC | Fix for trunk LLVM. | 11 December 2019, 07:28:01 UTC |
429ab1b | Volodymyr Kysenko | 11 December 2019, 00:44:49 UTC | Use has_upper_bound() to check if allocation is bounded | 11 December 2019, 00:44:49 UTC |
ecbc8d3 | Andrew Adams | 11 December 2019, 00:31:52 UTC | Merge pull request #4464 from halide/vksnk/increase-device-num Increase max device count to query in OpenCL runtime | 11 December 2019, 00:31:52 UTC |
c0cf355 | Volodymyr Kysenko | 11 December 2019, 00:30:27 UTC | Check if shared memory allocation is bounded | 11 December 2019, 00:30:27 UTC |
cf526b5 | Volodymyr Kysenko | 10 December 2019, 19:25:54 UTC | Increase max device count to query (OpenCL) | 10 December 2019, 19:25:54 UTC |
5555078 | Andrew Adams | 09 December 2019, 17:36:03 UTC | Merge pull request #4458 from halide/avoid_type_punning_when_making_shapes Avoid type punning in the generated C for buffer shapes | 09 December 2019, 17:36:03 UTC |
717171a | Steven Johnson | 09 December 2019, 17:07:21 UTC | Merge pull request #4402 from aankit-ca/bounds_expr_check Calculate Expr bounds using function_value_bounds | 09 December 2019, 17:07:21 UTC |
cc5af85 | Andrew Adams | 08 December 2019, 02:44:44 UTC | Merge pull request #4445 from halide/apps_from_autoscheduler Apps from autoscheduler | 08 December 2019, 02:44:44 UTC |
3cf4d7e | Andrew Adams | 07 December 2019, 21:05:33 UTC | Avoid type punning in the generated C for buffer shapes | 07 December 2019, 21:05:33 UTC |
336a3b4 | Andrew Adams | 07 December 2019, 20:20:17 UTC | Don't rely on weak linkage | 07 December 2019, 20:20:17 UTC |
585dbad | Steven Johnson | 07 December 2019, 00:04:54 UTC | Merge pull request #4456 from halide/srj-msan-stringify stringify() should call halide_msan_annotate_memory_is_initialized() | 07 December 2019, 00:04:54 UTC |
adc5b8c | Steven Johnson | 06 December 2019, 21:04:23 UTC | stringify() should call halide_msan_annotate_memory_is_initialized() calls to halide_print() -- including those injected by Target::Debug -- will trigger MSAN failures currently. When the target includes MSAN, we should mark the buffer as initializaed to avoid false failures. | 06 December 2019, 21:04:23 UTC |
e54e455 | Steven Johnson | 06 December 2019, 18:36:21 UTC | Merge pull request #4452 from benoitsteiner/master Added a test to cover the featurization of reductions | 06 December 2019, 18:36:21 UTC |
f667032 | Steven Johnson | 06 December 2019, 18:35:32 UTC | Merge pull request #4454 from halide/srj-mustuse Add HALIDE_MUST_USE_RESULT macro to HalideRuntime.h and AOT .h | 06 December 2019, 18:35:32 UTC |
f460f77 | Pranav Bhandarkar | 06 December 2019, 17:29:23 UTC | Merge branch 'master' of https://github.com/halide/Halide | 06 December 2019, 17:29:23 UTC |
dba12cc | Benoit Steiner | 06 December 2019, 00:50:01 UTC | Merge pull request #4447 from benoitsteiner/extern_funcs Fixed the types of the args of extern functions | 06 December 2019, 00:50:01 UTC |
eb83cc9 | Benoit Steiner | 06 December 2019, 00:48:01 UTC | Code cleanup | 06 December 2019, 00:48:01 UTC |
2d274de | Steven Johnson | 06 December 2019, 00:25:31 UTC | Add HALIDE_MUST_USE_RESULT macro to HalideRuntime.h and AOT .h This is designed to make it easier to enforce examination of the result code of calls into Halide AOT code; by providing the HALIDE_MUST_USE_RESULT as part of the generated .h files, you can enable this by simply adding `-DHALIDE_FUNCTION_ATTRS=HALIDE_MUST_USE_RESULT` to your build options (rather than having to ensure that other dependencies ahead of other includes). This mimics the definition used by Abseil, in which the C++17 [[nodiscard]] attribute is preferred when available. (It's quite possible that HALIDE_MUST_USE_RESULT could be profitably applied to Halide APIs as well, both internal and external; I haven't attempted to look for candidates at this time.) | 06 December 2019, 00:25:31 UTC |
cc2efde | Benoit Steiner | 05 December 2019, 23:01:00 UTC | Added a test to cover the featurization of reductions | 05 December 2019, 23:01:00 UTC |
7549e41 | Steven Johnson | 05 December 2019, 18:16:45 UTC | Merge pull request #4448 from halide/srj-range-i32 Range() requires Exprs to be losslessly convertible to int32 | 05 December 2019, 18:16:45 UTC |
2e902ee | Benoit Steiner | 05 December 2019, 18:05:48 UTC | Merge pull request #4428 from benoitsteiner/master Improved generation of Pipeline features for extern functions | 05 December 2019, 18:05:48 UTC |
6b6f76f | Steven Johnson | 05 December 2019, 00:45:47 UTC | Range -> int32 coercion | 05 December 2019, 00:45:47 UTC |
7e14857 | Benoit Steiner | 04 December 2019, 22:19:41 UTC | Fixed indentation | 04 December 2019, 22:19:41 UTC |
7a2b03d | Andrew Adams | 04 December 2019, 21:05:24 UTC | Include base LDFLAGS in GENERATOR_LDFLAGS | 04 December 2019, 21:05:24 UTC |
ffb4c71 | Benoit Steiner | 04 December 2019, 19:47:54 UTC | Changed the api of Function::define_extern to take a vector of vars instead of a vector of strings | 04 December 2019, 19:47:54 UTC |
acbdc37 | Steven Johnson | 04 December 2019, 18:56:48 UTC | Merge pull request #4426 from halide/srj-wasmlld Fix lld::wasm::link() call for trunk LLVM | 04 December 2019, 18:56:48 UTC |
8531e7a | Benoit Steiner | 04 December 2019, 18:36:56 UTC | Made sure the arguments of extern functions are of type Var instead of string immediates. | 04 December 2019, 18:36:56 UTC |
fd69bd0 | Benoit Steiner | 04 December 2019, 18:26:20 UTC | Merge branch 'master' into master | 04 December 2019, 18:26:20 UTC |
e8acdea | Andrew Adams | 04 December 2019, 17:52:57 UTC | Add f suffix to float constant | 04 December 2019, 17:52:57 UTC |
52ba0f7 | Andrew Adams | 04 December 2019, 17:26:10 UTC | Merge remote-tracking branch 'origin/master' into apps_from_autoscheduler | 04 December 2019, 17:26:10 UTC |
6656f57 | Steven Johnson | 04 December 2019, 01:44:29 UTC | Merge pull request #4443 from halide/srj-format Apply clang-format to apps/ | 04 December 2019, 01:44:29 UTC |
880f6f9 | Andrew Adams | 04 December 2019, 01:18:01 UTC | Add missing file | 04 December 2019, 01:18:01 UTC |
5aa2879 | Steven Johnson | 04 December 2019, 01:08:47 UTC | Merge pull request #4442 from halide/srj-range2 Use Halide::Range instead of std::pair<Expr, Expr> in Derivative (Issue #4381) | 04 December 2019, 01:08:47 UTC |
01746ed | Andrew Adams | 04 December 2019, 01:04:17 UTC | Add interpolate cmakefile | 04 December 2019, 01:04:17 UTC |
f8be786 | Andrew Adams | 04 December 2019, 00:45:44 UTC | Delete more rotted code | 04 December 2019, 00:45:44 UTC |
8322145 | Andrew Adams | 04 December 2019, 00:43:34 UTC | Merge remote-tracking branch 'origin/master' into apps_from_autoscheduler | 04 December 2019, 00:43:34 UTC |
c2b6da2 | Andrew Adams | 04 December 2019, 00:43:10 UTC | clang-format | 04 December 2019, 00:43:10 UTC |
aafd8b8 | Andrew Adams | 04 December 2019, 00:25:32 UTC | Add new apps to cmake build | 04 December 2019, 00:25:32 UTC |
d504e18 | Andrew Adams | 03 December 2019, 23:35:20 UTC | Uniform ldflags for generators across all app makefiles | 03 December 2019, 23:35:20 UTC |
282b88a | Andrew Adams | 03 December 2019, 23:25:08 UTC | Rework interpolate. Add IIR blur. | 03 December 2019, 23:25:08 UTC |
3b06e6e | Steven Johnson | 03 December 2019, 23:24:49 UTC | Various tweaks | 03 December 2019, 23:24:49 UTC |
b465318 | Steven Johnson | 03 December 2019, 22:40:35 UTC | Apply clang-format to the rest of apps/ | 03 December 2019, 22:40:35 UTC |
c633f4d | Steven Johnson | 03 December 2019, 22:25:35 UTC | Apply clang-format to apps/autoscheduler No changes, purely formatting. | 03 December 2019, 22:25:35 UTC |
fad4616 | Steven Johnson | 03 December 2019, 19:29:57 UTC | Use Halide::Range instead of std::pair<Expr, Expr> in Derivative (Issue #4381) | 03 December 2019, 19:34:59 UTC |
865af6a | Steven Johnson | 03 December 2019, 18:41:22 UTC | Merge pull request #4407 from halide/srj-copy-to-interleaved Add Buffer::copy_to_interleaved() and Buffer::copy_to_planar() | 03 December 2019, 18:41:22 UTC |
78dd627 | Steven Johnson | 03 December 2019, 18:41:06 UTC | Merge branch 'master' into srj-wasmlld | 03 December 2019, 18:41:06 UTC |
9b8c9bc | Steven Johnson | 03 December 2019, 18:40:36 UTC | Merge pull request #4436 from halide/igarcia_tutorial_fixes Tutorial comment fixes from igarcia@mathworks.com | 03 December 2019, 18:40:36 UTC |
4248238 | Benoit Steiner | 03 December 2019, 16:50:29 UTC | Don't try to build the header files | 03 December 2019, 16:50:29 UTC |
ba067d5 | Andrew Adams | 03 December 2019, 02:27:53 UTC | Add notes on runtimes on the platforms I optimized this on | 03 December 2019, 02:27:53 UTC |
31289fc | Andrew Adams | 03 December 2019, 02:20:02 UTC | Add and reschedule harris corner detector | 03 December 2019, 02:20:02 UTC |
ef8e375 | Andrew Adams | 03 December 2019, 01:57:08 UTC | Delete defunct unsharp test | 03 December 2019, 01:57:08 UTC |
fef1c9d | Andrew Adams | 03 December 2019, 01:56:53 UTC | Add unsharp app, with fresh manual scheduling | 03 December 2019, 01:56:53 UTC |
511600e | Steven Johnson | 03 December 2019, 01:42:39 UTC | Merge pull request #4438 from halide/srj-autograd Increase autograd estimates/sizes from 32 -> 64 | 03 December 2019, 01:42:39 UTC |
feaef86 | Benoit Steiner | 02 December 2019, 23:15:33 UTC | Fixed bad merge | 02 December 2019, 23:15:33 UTC |
e193a5d | Steven Johnson | 02 December 2019, 23:15:02 UTC | Merge branch 'master' into srj-copy-to-interleaved | 02 December 2019, 23:15:02 UTC |
e5fb975 | Steven Johnson | 02 December 2019, 23:13:36 UTC | Fix lld::wasm::link() call for trunk LLVM | 02 December 2019, 23:13:36 UTC |
031afb4 | Benoit Steiner | 22 November 2019, 00:28:59 UTC | Use the proxy (if specified) to generate pipeline features for extern function calls | 02 December 2019, 23:11:14 UTC |
269acc7 | Benoit Steiner | 21 November 2019, 23:33:27 UTC | Added a test for FunctionDAG | 02 December 2019, 23:11:14 UTC |
060650e | Benoit Steiner | 22 November 2019, 00:28:59 UTC | Use the proxy (if specified) to generate pipeline features for extern function calls | 02 December 2019, 23:10:02 UTC |
ceb03a2 | Benoit Steiner | 21 November 2019, 23:33:27 UTC | Added a test for FunctionDAG | 02 December 2019, 23:08:54 UTC |
eb7bf82 | Benoit Steiner | 22 November 2019, 16:53:50 UTC | Added test_function_dag to the CMake file | 02 December 2019, 23:07:19 UTC |
e211d0f | Benoit Steiner | 22 November 2019, 00:28:59 UTC | Use the proxy (if specified) to generate pipeline features for extern function calls | 02 December 2019, 23:06:08 UTC |
c69a9cd | Benoit Steiner | 21 November 2019, 23:33:27 UTC | Added a test for FunctionDAG | 02 December 2019, 23:05:08 UTC |
21219e2 | Benoit Steiner | 02 December 2019, 17:00:52 UTC | Deleted unecessary depedency | 02 December 2019, 23:03:26 UTC |
0825c7f | Benoit Steiner | 25 November 2019, 21:42:42 UTC | Fixed missing symbol issue | 02 December 2019, 23:03:26 UTC |
7bde6eb | Benoit Steiner | 22 November 2019, 16:53:50 UTC | Added test_function_dag to the CMake file | 02 December 2019, 23:03:26 UTC |
9785f20 | Benoit Steiner | 22 November 2019, 03:27:35 UTC | Improved code formatting | 02 December 2019, 23:03:26 UTC |
45df0ce | Benoit Steiner | 22 November 2019, 00:28:59 UTC | Use the proxy (if specified) to generate pipeline features for extern function calls | 02 December 2019, 23:03:26 UTC |
98f1e06 | Benoit Steiner | 21 November 2019, 23:33:27 UTC | Added a test for FunctionDAG | 02 December 2019, 23:03:26 UTC |
5e6bfa2 | Steven Johnson | 02 December 2019, 21:46:56 UTC | Increase autograd estimates/sizes from 32 -> 64 The classic autoscheduler can over-vectorize (e.g., on Skylake it will vectorize to width=64, which will fail for inputs of width<64). This is a somewhat-known issue; rather than try to fix it here, I am just going to widen the buffers to the autograd test to avoid failure on Skylake+ machines. (Note that autograd shouldn't be using the classic autoscheduler in the first place; it should be using the gradient autoscheduler, but packaging issues make that impractical just yet.) | 02 December 2019, 21:52:03 UTC |
039b7d8 | Steven Johnson | 02 December 2019, 21:32:18 UTC | Merge pull request #4437 from halide/fix_region Fix dangling MinAndExtent | 02 December 2019, 21:32:18 UTC |
bf37fa6 | Andrew Adams | 02 December 2019, 18:49:12 UTC | Fix dangling MinAndExtent | 02 December 2019, 18:49:12 UTC |
3dcb411 | Steven Johnson | 02 December 2019, 18:11:18 UTC | Merge pull request #4393 from halide/srj-range Use Halide::Range instead of std::pair<Expr, Expr> (Issue #4381) | 02 December 2019, 18:11:18 UTC |
baf6cd1 | Andrew Adams | 02 December 2019, 18:00:00 UTC | Tutorial comment fixes from igarcia@mathworks.com | 02 December 2019, 18:00:00 UTC |
6f30479 | Steven Johnson | 21 November 2019, 22:02:30 UTC | Merge pull request #4412 from halide/srj-grad Add option to create gradient-descent variant of a Generator | 21 November 2019, 22:02:30 UTC |
93761b5 | Steven Johnson | 21 November 2019, 21:12:13 UTC | Merge pull request #4425 from benoitsteiner/master Bounds tracking for StringImm | 21 November 2019, 21:12:13 UTC |
723ed55 | Benoit Steiner | 21 November 2019, 20:18:47 UTC | Bounds tracking for StringImm | 21 November 2019, 20:18:47 UTC |
cd0cbf7 | Steven Johnson | 21 November 2019, 17:48:34 UTC | Merge branch 'master' into srj-range | 21 November 2019, 17:48:34 UTC |
a81089f | Steven Johnson | 21 November 2019, 17:42:05 UTC | Merge pull request #4421 from halide/srj-mp Cherrypick some changes from autoscheduler-plugins branch | 21 November 2019, 17:42:05 UTC |
8565db9 | Steven Johnson | 21 November 2019, 02:14:28 UTC | Merge pull request #4420 from benoitsteiner/master Small code simplifications at the beginning of the FunctionDAG constructor | 21 November 2019, 02:14:28 UTC |
5259702 | Steven Johnson | 21 November 2019, 00:25:37 UTC | Allow for things with Derivatives | 21 November 2019, 00:25:37 UTC |
525fda4 | Steven Johnson | 20 November 2019, 23:35:46 UTC | Remove unnecessary fill() | 20 November 2019, 23:35:46 UTC |
a1e261e | Steven Johnson | 20 November 2019, 23:31:50 UTC | Support float16, bfloat | 20 November 2019, 23:31:50 UTC |
a6b2c15 | Steven Johnson | 20 November 2019, 23:22:13 UTC | Use correct bounds for propagate_adjoint | 20 November 2019, 23:22:13 UTC |
b27923b | Steven Johnson | 20 November 2019, 22:26:49 UTC | Update Pipeline.cpp | 20 November 2019, 22:26:49 UTC |
0332911 | Steven Johnson | 20 November 2019, 22:24:46 UTC | Cherrypick some changes from autoscheduler-plugins branch - move MachineParams from AutoSchedule.h to PIpeline.h - move some helper functions from AutoSchedule.h to AutoScheduleUtils.h - minor compilation fix in HalideBuffer.h | 20 November 2019, 22:24:46 UTC |
5e999ca | Benoit Steiner | 20 November 2019, 22:18:05 UTC | Improved formatting of the code | 20 November 2019, 22:18:05 UTC |
5194280 | Benoit Steiner | 20 November 2019, 20:25:41 UTC | Small code simplifications at the beginning of the FunctionDAG constructor: Avoid setting node.func in two separate locations. The second initialization is redundant and confusing Streamlined the initialization of node.stages by avoiding extra stage creation and movements. Also applied clang-format, which accounts for all the other code changes | 20 November 2019, 20:25:41 UTC |
ec9d85b | Steven Johnson | 20 November 2019, 19:19:25 UTC | Merge pull request #4404 from vksnk/opencl-build-opt Adds the ability to pass extra build options for OpenCL kernels. | 20 November 2019, 19:19:25 UTC |
747ab36 | Steven Johnson | 20 November 2019, 19:18:12 UTC | Merge pull request #4414 from benoitsteiner/master Split the code from AutoSchedule.cpp in two files: | 20 November 2019, 19:18:12 UTC |
5827901 | Benoit Steiner | 20 November 2019, 01:28:06 UTC | Fixed the order of the #includes | 20 November 2019, 01:28:06 UTC |
58053ef | Benoit Steiner | 20 November 2019, 01:22:49 UTC | Improved formatting of the code and deleted dead code. | 20 November 2019, 01:22:49 UTC |
4331a6f | Benoit Steiner | 20 November 2019, 01:17:28 UTC | Updated the CMake file to compile LoopNest.cpp when needed | 20 November 2019, 01:17:28 UTC |
310a5b3 | Steven Johnson | 19 November 2019, 23:39:57 UTC | Merge pull request #4408 from timo-zeiss-research/host_dirty_on_device_malloc Bugfix: Empty device buffer on corner case | 19 November 2019, 23:39:57 UTC |
7a57d2d | Steven Johnson | 19 November 2019, 23:13:43 UTC | Merge pull request #4415 from ashishUthama/master Use std::abs instead of abs to prevent ambigious match | 19 November 2019, 23:13:43 UTC |
64b4ee0 | Benoit Steiner | 19 November 2019, 22:07:56 UTC | Moved most of the implementation of the methods of the LoopNest class to LoopNest.cpp | 19 November 2019, 22:07:56 UTC |