sort by:
Revision Author Date Message Commit Date
5f5aaf4 More remove_likelies() use 19 August 2019, 21:50:22 UTC
9b8b1a5 consolidate remove_likelies() 19 August 2019, 21:45:58 UTC
0aa3766 remove HL_PERMIT_FAILED_UNROLL 19 August 2019, 21:36:33 UTC
e110b79 UnrollLoops: remove likely() (etc) for the one-last-chance case So that extents of the form `(a - likely(a))` can correctly be deduced as constant. 19 August 2019, 21:34:07 UTC
56e2041 Merge pull request #4146 from halide/srj-tsan-features Make can_use_target TSAN-safe (Issue #4140) 19 August 2019, 17:24:53 UTC
8c7338a Make can_use_target TSAN-safe (Issue #4140) 16 August 2019, 23:22:24 UTC
60f1c5c Merge pull request #4143 from halide/fix_simplifier_bounds_overflow Avoid 64-bit overflow when tracking constant bounds in simplifier 16 August 2019, 17:28:17 UTC
7fa1028 Avoid 64-bit overflow when tracking constant bounds in simplifier These are no-overflow types, so treat such cases not as things that overflow (in which an overflow in the upper bound would affect the lower bound), but rather as things which still have upper/lower bounds that we can no longer represent. 15 August 2019, 20:38:15 UTC
4628a24 Merge pull request #4137 from halide/srj-has-feat Allow Target::has_feature() to operator on halide_target_feature_t 15 August 2019, 18:58:21 UTC
1d49dbd Allow Target::has_feature() to operator on halide_target_feature_t This is really a workaround to allow generated .schedule.h files to compile happily in all cases. A better long-term answer would be for those files to use the Target::Feature names (rather than the halide_target_feature_t names), but doing so will require an additional value->name map (which must be maintained and could get out of sync), so I'm reluctant to do that without a little more thought. 15 August 2019, 17:38:11 UTC
07c3c27 Merge pull request #4138 from halide/srj-llvm-fix Fix for https://reviews.llvm.org/rL368647, from alinas@ 15 August 2019, 17:07:02 UTC
70485a7 Update CodeGen_LLVM.cpp 15 August 2019, 17:03:16 UTC
98991c0 Fix for https://reviews.llvm.org/rL368647, from alinas@ 15 August 2019, 01:16:00 UTC
7eb5b24 Merge pull request #4131 from halide/srj-tic Change halide_toc_impl to use debug(1) instead of debug(0) 14 August 2019, 22:35:38 UTC
8766bc4 Merge pull request #4133 from halide/srj-aslog2 Clean up default autotuner output 14 August 2019, 21:13:43 UTC
5afe043 Merge pull request #4129 from benoitsteiner/master Improved onnx support 14 August 2019, 18:45:51 UTC
2576f3d Merge pull request #4134 from halide/srj-llvm-fix Fix for Trunk LLVM 14 August 2019, 18:09:11 UTC
19964e9 Fix for Trunk LLVM 14 August 2019, 17:03:19 UTC
7c78788 Update DefaultCostModel.cpp 14 August 2019, 16:44:04 UTC
cdec241 Back to dump 13 August 2019, 23:48:05 UTC
5f56d69 Update AutoSchedule.cpp 13 August 2019, 23:37:29 UTC
a6599e7 Update AutoSchedule.cpp 13 August 2019, 23:20:43 UTC
d99d640 Also emit the target being used 13 August 2019, 23:19:40 UTC
9808edc Simplify further 13 August 2019, 23:15:43 UTC
6dbfc57 Merge pull request #4132 from halide/srj-mod2 Tweak schedule.h output 13 August 2019, 22:51:46 UTC
ee60804 Fix indentation 13 August 2019, 22:51:33 UTC
90c52f8 Clean up default autotuner output This restructures so that the default output of the autoscheduler (at HL_DEBUG_AUTOSCHEDULE=0) is much smaller, with just some output for each pass. (All the previous output is still present for HL_DEBUG_AUTOSCHEDULE=1.) Not entire sure if this is the best "per-pass" summary; definitely open for suggestions here (even shorter/terser would be even nicer here IMHO). Also, a little drive-by cleanup to the progress bar (to avoid leaving them dangling) and to give slightly better feedback during the autotune script. 13 August 2019, 22:48:47 UTC
ed4add8 Tweak schedule.h output - avoid unnecessary blank lines - avoid lines that are whitespace-only 13 August 2019, 21:13:50 UTC
50f0861 Change halide_toc_impl to use debug(1) instead of debug(0) While handy for debugging, this means the autoscheduler currently spams stdout with timing info by default, which is puzzling to people not actually working on the autoscheduler. (alternately: add debug-level option to HALIDE_TIC, or just comment out the calls to it in the autoscheduler) 13 August 2019, 18:04:21 UTC
5700dbb Merge pull request #4128 from halide/srj-rg2 RunGen: bounds-query failure shouldn't matter if we use estimates anyway 12 August 2019, 23:46:03 UTC
14f9fb2 Improved conversion of onnx to halide: * avoid the creation of identity functions that slow down the scheduling. * recusrively inline and simplify shape expressions to make them more amenable to analysis during scheduling 12 August 2019, 23:15:00 UTC
eda5da8 Fixed the code that converts numpy arrays to Halide input buffers 12 August 2019, 23:08:09 UTC
2a4f364 RunGen: bounds-query failure shouldn't matter if we use estimates anyway bounds_query_input_shapes() agressively fails if we can't complete the bounds-query, which can happen if the constraints on inputs are nontrivial. While we can (and should) improve the bounds-query logic to make this more robust, we shouldn't aggressively fail here in the first place, as the bounds-query shape(s) end up unused if we have estimates for the inputs (which we usually do). This just adds a ShapePromise type that wraps access to the resulting Shapes inside a function; if we never need the shape, the failure doesn't matter and never happens. 12 August 2019, 21:09:20 UTC
a8128b1 Merge pull request #4123 from halide/srj-weights Convert autoscheduler weights to a single file 09 August 2019, 22:49:37 UTC
d4ec333 Update Weights.cpp 09 August 2019, 18:13:22 UTC
94dd5d7 Add versions to the weights file 09 August 2019, 18:11:21 UTC
0f24963 Convert autoscheduler weights to a single file The main gist of this change is to put all of the weights used by the autoscheduler into a single file (instead of a directory of multiple files); the goal is to make saving 'overnight' tuned weights for a pipeline simpler, via having only a single `.weights` file rather than a directory of things. There are various drive-by fixes, of course: - The 'Weights' struct moved into its own file, and now manages its own loading/saving, in both the classic dir-of-files format, and new, single-file format. (The single-file format is very simpleminded but likely more than adequate; it has a trivial signature at the front so some smart upgrading over time should be easy to do.) - DefaultCostModel.cpp got its own .h file, instead of being incestuously connected to the ABC it implements - existing `weights` folder was deleted; the equivalent data is now in `baseline.weights` - added the `weightsdir_to_weightsfile` utility for any downstream consumers that have custom weights in the 'old' format that they need to update to the 'new' format. - Removed some unused code here and there. Note that HL_WIEGHTS_DIR (and HL_WEIGHTS_OUT_DIR) are still used, but are 'smart' for now; if you pass a pathname that ends in `.weights` it loads as a single file, otherwise it assumes an old-style directory. (I didn't try to smarten the env var names because we really need to eradicate the use of env vars for this purpose anyway. but that's a job for another day.) 09 August 2019, 01:02:37 UTC
6a36290 Merge pull request #4120 from halide/srj-autotune Pass $HALIDE_DISTRIB_PATH to autotune_loop.sh 08 August 2019, 21:41:42 UTC
d89c94b Update autotune_loop.sh 08 August 2019, 21:25:20 UTC
aa03f99 Pass $HALIDE_DISTRIB_PATH to autotune_loop.sh 08 August 2019, 21:23:11 UTC
3b518a6 Merge pull request #4119 from halide/srj-dag Avoid signed/unsigned warnings in FunctionDAG.h 08 August 2019, 21:20:15 UTC
c7fd30c Avoid signed/unsigned warnings in FunctionDAG.h 08 August 2019, 21:15:07 UTC
ac66a97 Merge pull request #4112 from halide/srj-dag Avoid crash in FunctionDAG.h (Issue #4110) 08 August 2019, 21:04:56 UTC
bf74fba Merge pull request #4114 from halide/srj-loopt Add EnableLLVMLoopOpt and DisableLLVMLoopOpt (Issue #4113) 08 August 2019, 20:41:11 UTC
a8e1303 Merge branch 'master' into srj-dag 08 August 2019, 20:17:49 UTC
0d50bc9 Merge pull request #4117 from halide/srj-incl PerfectHashMap.h should be standalone 08 August 2019, 20:16:16 UTC
7857ea6 Merge pull request #4102 from halide/srj-autoclean Strip 'irrelevant' features from the target comment when generating schedule.h files 08 August 2019, 20:15:33 UTC
adb5bde Update PerfectHashMap.h 08 August 2019, 18:58:31 UTC
01ecadc PerfectHashMap.h should be standalone We don't want it to require libHalide, but including Errors.h implicitly does that. Move the 'standalone' asserter into the template class and out of the test, so that the .h file will compile standalone (regardless of what is #defined before it). 08 August 2019, 18:55:12 UTC
b69fcc9 Add enable_llvm_loop_opt 08 August 2019, 17:58:13 UTC
a0d5f28 Combine disable-llvm-loop features into one 08 August 2019, 17:46:43 UTC
081141e Update Module.cpp 08 August 2019, 17:18:37 UTC
2558fa6 Merge pull request #4106 from halide/distinct_wrapper_names Wrappers need distinct names 08 August 2019, 17:16:04 UTC
39682df Avoid crash in FunctionDAG.h 08 August 2019, 17:14:26 UTC
02e43c6 Merge pull request #4063 from halide/srj-autosched-test Add apps/autoscheduler to test_apps target 08 August 2019, 17:08:28 UTC
1606bac Merge pull request #4109 from halide/srj-incl Add missing includes in apps/autoscheduler 08 August 2019, 03:31:49 UTC
2255a92 Merge pull request #4105 from halide/introspection_64_bit_offsets Fix for llvm trunk 08 August 2019, 01:34:18 UTC
e2b2524 Add missing includes in apps/autoscheduler also, drive-by sorting of includes 08 August 2019, 00:45:16 UTC
bfe3471 Merge branch 'introspection_64_bit_offsets' into srj-autosched-test 07 August 2019, 22:38:38 UTC
a785848 Use better typedef name from #4107 07 August 2019, 22:27:31 UTC
0468fc7 Wrappers need distinct names 07 August 2019, 22:15:19 UTC
ae74331 Fix for llvm trunk 07 August 2019, 22:06:57 UTC
d7126d8 Update autoscheduler.inc 07 August 2019, 21:51:07 UTC
dec777c Merge pull request #4097 from benoitsteiner/master Improved onnx support 07 August 2019, 18:47:23 UTC
31bb617 Add missing includes 07 August 2019, 18:41:37 UTC
a33cd9b Merge branch 'master' into srj-autosched-test 07 August 2019, 18:39:54 UTC
defb6ad Merge pull request #4099 from halide/srj-schar Add `signed char` handling to RunGen.h (Issue #3909) 07 August 2019, 18:37:13 UTC
81b02fa Merge pull request #3678 from halide/cuda_sticky_allocator Cuda sticky allocator 07 August 2019, 18:12:30 UTC
44c2b31 Update Module.cpp 07 August 2019, 00:40:56 UTC
be2233b Strip 'irrelevant' features from the target comment when generating .schedule.h files 06 August 2019, 23:54:29 UTC
6f3ac72 Improved formatting and fixed spelling 06 August 2019, 18:10:10 UTC
2c7160b Update Makefile 06 August 2019, 17:16:45 UTC
d737753 Merge branch 'master' into srj-autosched-test 06 August 2019, 17:16:29 UTC
8bbd9b3 Update Makefile 06 August 2019, 17:16:19 UTC
d380205 Fix copy-paste error 06 August 2019, 16:57:27 UTC
72412cb Add `signed char` handling to RunGen.h (Issue #3909) 06 August 2019, 16:56:37 UTC
fa8c508 Merge pull request #4094 from halide/srj-schedfile Upgrade autogenerated schedule outputs into legal C++ headers 06 August 2019, 16:45:10 UTC
458553b Update Module.cpp 06 August 2019, 16:44:20 UTC
e9283a4 Update Makefile 06 August 2019, 16:43:21 UTC
7cc1019 Merge branch 'master' into srj-schedfile 06 August 2019, 16:42:52 UTC
adcc283 Numerous fixes and enhancements to the onnx converter 06 August 2019, 15:00:35 UTC
24445a4 Get rid of the watchdog timer since it creates more problems than it solves 06 August 2019, 14:50:02 UTC
8c32eeb Merge pull request #4092 from halide/srj-est Allow Input<SomeType*>::set_estimate() 06 August 2019, 01:42:49 UTC
b283c61 Update PyModule.cpp 06 August 2019, 00:45:56 UTC
30b2cfb Revert mistaken change 06 August 2019, 00:26:40 UTC
cb96c1b Upgrade autogenerated schedule outputs into legal C++ headers 06 August 2019, 00:07:17 UTC
aa219db Added comment about eviction policy 05 August 2019, 23:09:01 UTC
31ef18c Address review comment 05 August 2019, 22:57:45 UTC
f743c3f Merge pull request #4090 from halide/fix_4082 Avoid testing expressions that exercise UB 05 August 2019, 22:53:43 UTC
5a84542 Various fixes 05 August 2019, 22:28:19 UTC
8a6d4c7 Merge branch 'master' into srj-autosched-test 05 August 2019, 20:48:01 UTC
b26726c Allow Input<SomeType*>::set_estimate() Previouslty this didn't compile. Add a wrapper that allows it to be set (but only allow nullptr as the estimate value for now). 05 August 2019, 20:33:11 UTC
aad9ba2 Merge remote-tracking branch 'origin/master' into cuda_sticky_allocator 05 August 2019, 20:00:43 UTC
9b59d0b Avoid testing expressions that exercise UB 05 August 2019, 16:06:56 UTC
3f88725 Merge pull request #4083 from halide/srj-unknown Restore -Wno-unknown-warning-option 05 August 2019, 16:06:42 UTC
652b279 Merge pull request #4084 from ngzhian/parallel-benchmarks Build benchmark apps in parallel (run in serial) 05 August 2019, 15:53:21 UTC
65c2b9d Build benchmark apps in parallel (run in serial) See https://www.gnu.org/software/make/manual/make.html#Phony-Targets for an explanation of this idea. 01 August 2019, 22:49:01 UTC
3e7fbb9 Restore -Wno-unknown-warning-option Apparently OSX Clang (at least) needs it to avoid complaining about -Wno-psabi 01 August 2019, 20:38:51 UTC
da3f59e Merge pull request #4081 from halide/srj-buf Use zero-init instead of memset for other.buf in HalideBuffer.h 01 August 2019, 19:27:10 UTC
1af9866 Merge pull request #4078 from halide/srj-est-api-2 Rationalize API for setting autoscheduler estimates. 01 August 2019, 19:11:41 UTC
back to top