a675193 | Steven Johnson | 23 February 2018, 00:55:09 UTC | Verify that Python stubs work only with new-style Generators Revised version of https://github.com/halide/Halide/pull/2768, but explicitly forbidding 'old' and 'partial-new' Generators for now. | 23 February 2018, 00:55:09 UTC |
4c4a6f6 | Dillon Sharlet | 21 February 2018, 00:16:50 UTC | Merge pull request #2760 from halide/add_fpic_to_runtime Add -fpic option to build of .ll files, the new dso_local handling in… | 21 February 2018, 00:16:50 UTC |
8ca6e97 | Steven Johnson | 20 February 2018, 23:20:37 UTC | Merge pull request #2764 from halide/srj-parammap Clean up OIR code in Parameter | 20 February 2018, 23:20:37 UTC |
d418cac | Steven Johnson | 20 February 2018, 19:02:14 UTC | Clean up OIR code in Parameter Should be the same functionality as before, just more compact and comprehensible. | 20 February 2018, 19:02:14 UTC |
684429e | Ron Lieberman | 19 February 2018, 18:50:40 UTC | Add -fpic option to build of .ll files, the new dso_local handling in clang needs to know if these are shared libs or not, for symbol visibility | 19 February 2018, 18:50:40 UTC |
fe85f6a | Steven Johnson | 15 February 2018, 17:40:42 UTC | Merge pull request #2751 from halide/srj-tsel Add tuple_select() to python bindings. | 15 February 2018, 17:40:42 UTC |
0409941 | Steven Johnson | 14 February 2018, 23:10:43 UTC | Merge branch 'master' into srj-tsel | 14 February 2018, 23:10:43 UTC |
46d8e9e | Steven Johnson | 14 February 2018, 23:10:30 UTC | Merge pull request #2750 from halide/srj-llvm Fix API change for trunk LLVM | 14 February 2018, 23:10:30 UTC |
c1087c6 | Steven Johnson | 14 February 2018, 23:01:31 UTC | Add tuple_select() to python bindings. | 14 February 2018, 23:01:31 UTC |
b848f79 | Andrew Adams | 14 February 2018, 21:58:51 UTC | Merge pull request #2738 from kernhanda/kernhanda/cmake-package-pr Rewrite CMake builds `install`/`distrib` targets. | 14 February 2018, 21:58:51 UTC |
7de2b4f | Steven Johnson | 14 February 2018, 20:35:01 UTC | Fix API change for trunk LLVM | 14 February 2018, 20:35:01 UTC |
d5621f5 | Dillon Sharlet | 14 February 2018, 16:56:55 UTC | Merge pull request #2635 from halide/nn-ops Replace hexagon_matmul with nn_ops | 14 February 2018, 16:56:55 UTC |
9bec1f6 | Kern Handa | 14 February 2018, 09:09:31 UTC | Fix `halide_config.cmake.tpl` | 14 February 2018, 09:09:50 UTC |
e31f3fd | Zalman Stern | 14 February 2018, 01:19:27 UTC | Merge pull request #2739 from halide/pybind11-build-fix Fix python_bindings build | 14 February 2018, 01:19:27 UTC |
b33b2f9 | Dillon Sharlet | 14 February 2018, 01:07:12 UTC | Add README.md. | 14 February 2018, 01:07:12 UTC |
0969dd2 | Zalman Stern | 14 February 2018, 00:02:15 UTC | Merge pull request #2745 from halide/slomp/bugfix removed the hard limit on the length of environment variable queries … | 14 February 2018, 00:02:15 UTC |
ce9e258 | Kern Handa (KERN) | 13 February 2018, 22:15:41 UTC | disable apps/camera_pipe on windows | 13 February 2018, 22:15:41 UTC |
34e4d9f | Marcos Slomp | 13 February 2018, 21:19:28 UTC | more code review fixes | 13 February 2018, 21:19:28 UTC |
f2133e9 | Marcos Slomp | 13 February 2018, 20:42:14 UTC | more code review fixes | 13 February 2018, 20:42:14 UTC |
be58e82 | Marcos Slomp | 13 February 2018, 19:57:56 UTC | code review changes | 13 February 2018, 19:57:56 UTC |
a5a0495 | Marcos Slomp | 13 February 2018, 19:32:19 UTC | removed the hard limit on the length of environment variable queries on Windows (which was particularly detrimental to HL_WINDOWS_TMP_DIR) | 13 February 2018, 19:32:19 UTC |
31568a0 | Zalman Stern | 13 February 2018, 00:34:50 UTC | Merge pull request #2737 from halide/thread_safe_jit_call Add a test per issue https://github.com/halide/Halide/issues/2731 . | 13 February 2018, 00:34:50 UTC |
21ba752 | Kern Handa (KERN) | 12 February 2018, 23:51:27 UTC | cr feedback | 12 February 2018, 23:51:27 UTC |
483e70d | Z Stern | 12 February 2018, 21:03:37 UTC | Add a comment to indicate why a mutex is acquired in the case where only one thread is compiling. | 12 February 2018, 21:03:37 UTC |
264d341 | Kern Handa | 12 February 2018, 05:13:32 UTC | trigger | 12 February 2018, 05:13:32 UTC |
0701de3 | Jonathan Ragan-Kelley | 12 February 2018, 02:42:40 UTC | Fix python_bindings build Fixes compatibility with various Python environments (seen in several Mac Anaconda Python 3.6 distributions), following the [PyBind11 build instructions](https://github.com/pybind/pybind11/blob/master/docs/compiling.rst#building-manually) more carefully. In particular, don’t pre-link libpython, but rely on lazy linking at load time of the module in a given interpreter. | 12 February 2018, 02:42:40 UTC |
7c43565 | Kern Handa | 12 February 2018, 00:34:57 UTC | Missed HalideRuntime*.h | 12 February 2018, 00:34:57 UTC |
7f18d20 | Kern Handa | 11 February 2018, 20:07:51 UTC | Fix up halide.cmake to be more platform agnostic | 11 February 2018, 20:07:51 UTC |
923d651 | Z Stern | 11 February 2018, 20:02:14 UTC | Add a test per issue https://github.com/halide/Halide/issues/2731 . Also serves as documentation on passing parameters to JITted routines and avoiding recompilation for each realize. Suggestions on further comments for documentation are particularly welcome. | 11 February 2018, 20:02:14 UTC |
6d9ab57 | Kern Handa (KERN) | 11 February 2018, 19:01:03 UTC | Rewrite CMake builds `install`/`distrib` targets. This change adds support for a proper `install` target for CMake builds. This allows for the usage of CPack to create packages for various systems and platforms. Once Halide is ready to do these things, the settings can be adjusted accordingly. For now, the focus is on creating tgz and zip packages for POSIX systems and Windows, respectively. This is done using CPack. The `distrib` target that was previously available has been rewritten in terms of the `install` target and taking advantage of being able to call CPack through CMake. As was done previously, a Halide.tgz package is created for POSIX systems while a Halide.zip package is created for Windows systems. For Windows builds specifically, the following changes were made: * Enable support for no RTTI builds * Enable support for OpenMP builds * Enable building of all but one app in the `apps/` folder * Fix emitted code for the C codegen to not result in an "inconsistent dll linkage" warning Builds were confirmed to be building with VS2017 on Windows. | 11 February 2018, 19:07:15 UTC |
5125fa3 | Andrew Adams | 09 February 2018, 23:05:04 UTC | Merge pull request #2733 from halide/ger_cleanup reschedule ger | 09 February 2018, 23:05:04 UTC |
b7176fb | Andrew Adams | 09 February 2018, 17:09:28 UTC | parallel -> true | 09 February 2018, 17:09:28 UTC |
1a83604 | Andrew Adams | 08 February 2018, 22:13:46 UTC | reschedule ger Change ger to be in-place, and simplify the schedule. Performance now closer to openblas for me on the larger sizes, and is much faster on the smaller sizes. Rightmost column is proportional to flops Halide sger 32 0.081833 13.2953 Halide sger 64 0.226842 18.6209 Halide sger 128 1.073608 15.4991 Halide sger 288 7.498786 11.1378 Halide sger 544 29.680935 10.0072 Halide sger 1056 121.922739 9.16357 Halide sger 2080 1147.310148 3.77453 OpenBLAS sger 32 35.603869 0.0305585 OpenBLAS sger 64 68.888618 0.0613164 OpenBLAS sger 128 43.106477 0.386021 OpenBLAS sger 288 92.307125 0.904806 OpenBLAS sger 544 186.297631 1.59435 OpenBLAS sger 1056 363.545260 3.0732 OpenBLAS sger 2080 1035.606023 4.18167 | 08 February 2018, 22:13:46 UTC |
de84004 | Steven Johnson | 08 February 2018, 18:23:38 UTC | Convert all nn_ops Generators to new-style | 08 February 2018, 18:23:38 UTC |
9cbdbaf | Steven Johnson | 08 February 2018, 18:23:20 UTC | Don't include malloc.h | 08 February 2018, 18:23:20 UTC |
bef8bc2 | Zalman Stern | 08 February 2018, 17:30:30 UTC | Merge pull request #2730 from adityaatluri/rocm-make added makefile changes to build for amdgpu | 08 February 2018, 17:30:30 UTC |
d2dbcd7 | dsharletg | 08 February 2018, 05:00:15 UTC | Merge branch 'master' of https://github.com/halide/Halide into nn-ops | 08 February 2018, 05:00:15 UTC |
aad17fb | Steven Johnson | 08 February 2018, 00:38:52 UTC | Merge pull request #2719 from halide/srj-pycallgen Augment the Python Bindings to allow using precompiled Generators | 08 February 2018, 00:38:52 UTC |
e604193 | Steven Johnson | 07 February 2018, 21:54:37 UTC | Merge branch 'master' into srj-pycallgen | 07 February 2018, 21:54:37 UTC |
60c5e5c | Steven Johnson | 07 February 2018, 21:52:31 UTC | Merge branch 'master' into srj-pycallgen | 07 February 2018, 21:52:31 UTC |
ea1d830 | Steven Johnson | 07 February 2018, 21:52:04 UTC | Merge pull request #2728 from halide/srj-llset Revise GeneratorParam<LoopLevel>::set() to work for both before-and-after generate() usage | 07 February 2018, 21:52:04 UTC |
bbada39 | Steven Johnson | 07 February 2018, 18:14:10 UTC | Merge branch 'master' into srj-llset | 07 February 2018, 18:14:10 UTC |
f8c177b | Steven Johnson | 07 February 2018, 18:13:40 UTC | Merge pull request #2729 from halide/srj-unused Remove unused 'using's | 07 February 2018, 18:13:40 UTC |
6b3df35 | Aditya Atluri | 07 February 2018, 03:09:26 UTC | changed cmake in src directory to enable WITH_AMDGPU macro for compilation | 07 February 2018, 03:09:26 UTC |
10a9b3e | Aditya Atluri | 07 February 2018, 02:59:24 UTC | Made changes to cmake to build amdgpu and added comments about amdgpu build status | 07 February 2018, 02:59:24 UTC |
f31df5f | Aditya Atluri | 07 February 2018, 02:41:37 UTC | added makefile changes to build for amdgpu 1. Changed Makefile to enable compile flag WITH_AMDGPU 2. Changed CodeGen_LLVM file to check if initializing amdgpu works or not | 07 February 2018, 02:41:37 UTC |
504a7fc | Steven Johnson | 06 February 2018, 23:57:55 UTC | Remove unused 'using's | 06 February 2018, 23:57:55 UTC |
7c5d674 | Steven Johnson | 06 February 2018, 23:51:11 UTC | Add explicit Func name | 06 February 2018, 23:51:11 UTC |
a8a2178 | Steven Johnson | 06 February 2018, 23:50:48 UTC | Add explicit Func name | 06 February 2018, 23:50:48 UTC |
b4bdb05 | Steven Johnson | 06 February 2018, 23:17:12 UTC | Patch PR#2728 | 06 February 2018, 23:17:12 UTC |
2f4c511 | Steven Johnson | 06 February 2018, 23:16:42 UTC | Add missing 'using' | 06 February 2018, 23:16:42 UTC |
95a4c88 | Steven Johnson | 06 February 2018, 23:05:03 UTC | Merge pull request #2727 from halide/srj-tupleselect tuple_select cleanup | 06 February 2018, 23:05:03 UTC |
979b56e | Steven Johnson | 06 February 2018, 23:01:45 UTC | Merge branch 'master' into srj-pycallgen | 06 February 2018, 23:01:45 UTC |
e207f95 | Steven Johnson | 06 February 2018, 23:00:39 UTC | Revise GeneratorParam<LoopLevel>::set() to work for both before-and-after generate() usage A much-simpler alternative to PR#2720. Note that the interesting change is really all in the override to set() in GeneratorParam_LoopLevel (the changes to the base class implementation are just drive-by simplification to make that override a bit easier to reason about). | 06 February 2018, 23:00:39 UTC |
0e001cc | Steven Johnson | 06 February 2018, 20:01:21 UTC | Clarify and rename methods related to Outputs | 06 February 2018, 20:01:21 UTC |
2ba8955 | Steven Johnson | 06 February 2018, 19:23:48 UTC | Remove unused stuff | 06 February 2018, 19:23:48 UTC |
93ea595 | Steven Johnson | 06 February 2018, 19:18:56 UTC | Fix comment | 06 February 2018, 19:18:56 UTC |
74c4398 | Steven Johnson | 06 February 2018, 18:26:28 UTC | Restore all-are-convertible test | 06 February 2018, 18:26:28 UTC |
f1b7707 | Steven Johnson | 06 February 2018, 18:13:32 UTC | Merge branch 'master' into srj-tupleselect | 06 February 2018, 18:13:32 UTC |
18cd57e | Steven Johnson | 05 February 2018, 21:59:11 UTC | Move non-templated select() and tuple_select() impls into IROperator.cpp | 05 February 2018, 21:59:11 UTC |
137e2b2 | Zalman Stern | 05 February 2018, 21:51:20 UTC | Merge pull request #2726 from halide/zero_init_buffer_shape zero-initialize all fields of a Halide::Runtime::Buffer | 05 February 2018, 21:51:20 UTC |
9709513 | Steven Johnson | 05 February 2018, 21:03:26 UTC | tuple_select cleanup -- move to IROperator.h, next to select(), for better discoverability -- add multiway version of tuple_select(), to match select() -- add test | 05 February 2018, 21:03:26 UTC |
b7959eb | Andrew Adams | 05 February 2018, 19:14:35 UTC | Try to avoid gcc 4.8 internal compiler error | 05 February 2018, 19:14:35 UTC |
ef8ff74 | Andrew Adams | 05 February 2018, 17:50:48 UTC | zero-initialize all fields of a Halide::Runtime::Buffer | 05 February 2018, 17:50:48 UTC |
8cdc8cd | Andrew Adams | 05 February 2018, 02:29:13 UTC | Merge pull request #2725 from halide/export_codegen_internal Don't include LLVM_Headers in CodeGen_Internal.h | 05 February 2018, 02:29:13 UTC |
41157da | Andrew Adams | 04 February 2018, 03:07:00 UTC | Don't include LLVM_Headers in CodeGen_Internal.h, so that it can be put in Halide.h | 04 February 2018, 03:07:00 UTC |
db4927a | Andrew Adams | 04 February 2018, 00:32:11 UTC | Merge pull request #2724 from halide/simplify_const_cast Remove unnecessary template function | 04 February 2018, 00:32:11 UTC |
980daf9 | Andrew Adams | 03 February 2018, 02:50:51 UTC | Remove unnecessary template function | 03 February 2018, 02:50:51 UTC |
81de819 | Steven Johnson | 03 February 2018, 01:50:05 UTC | Merge pull request #2722 from halide/srj-cast Fix error in commit c2fd18 | 03 February 2018, 01:50:05 UTC |
d414e1b | Steven Johnson | 02 February 2018, 19:05:06 UTC | Fix error in commit c2fd18 copy-paste error; several of the changes should have been <void> rather than <T>, since remove_const must match the type. | 02 February 2018, 19:05:06 UTC |
5e432c6 | Steven Johnson | 02 February 2018, 02:06:47 UTC | Merge pull request #2721 from halide/srj-tidy Fix minor clang-tidy warnings | 02 February 2018, 02:06:47 UTC |
71f8594 | Steven Johnson | 01 February 2018, 23:14:16 UTC | Fix minor clang-tidy warnings Various declaration-vs-definition-names-differ warnings to silence downstream nags | 01 February 2018, 23:14:16 UTC |
6c00f11 | Steven Johnson | 31 January 2018, 23:33:31 UTC | Turn off debugging | 31 January 2018, 23:33:31 UTC |
9518686 | Steven Johnson | 30 January 2018, 21:50:11 UTC | Augment the Python Bindings to allow using a precompiled Generator (written in C++) at JIT time via the Python bindings, mimicking the C++ stub functionality. | 31 January 2018, 23:30:49 UTC |
6908edb | Steven Johnson | 31 January 2018, 17:27:09 UTC | Merge pull request #2717 from halide/srj-movereg Add move ctor/op= for Parameter | 31 January 2018, 17:27:09 UTC |
3b0e1dc | Andrew Adams | 31 January 2018, 16:09:48 UTC | Merge pull request #2715 from halide/add_rpath_for_llvm When building Halide without llvm, add an rpath to the llvm used | 31 January 2018, 16:09:48 UTC |
528ec2b | Steven Johnson | 31 January 2018, 01:24:36 UTC | Add move ctor/op= for Parameter The implicitly-defined move ctor for Parameter didn't handle object instance registry tracking, which it must do. (Other usages should never be copyable or movable; move ops were deleted to ensure this.) | 31 January 2018, 01:24:36 UTC |
658e965 | Andrew Adams | 30 January 2018, 22:16:33 UTC | Add explanatory comment | 30 January 2018, 22:16:33 UTC |
22a85d1 | Zalman Stern | 30 January 2018, 21:22:16 UTC | Merge pull request #2716 from halide/srj-ret in_gpu_loop wasn't being restored | 30 January 2018, 21:22:16 UTC |
56a5608 | Steven Johnson | 30 January 2018, 18:08:29 UTC | in_gpu_loop wasn't being restored See previous line 454. Switched all usage to ScopedValue to avoid needing manual cleanup. | 30 January 2018, 18:08:29 UTC |
edab342 | Jing Pu | 30 January 2018, 04:06:59 UTC | Merge pull request #2714 from halide/lift_reinterpret Allow substituting reinterpret calls back in LICM. | 30 January 2018, 04:06:59 UTC |
17bad9b | Andrew Adams | 30 January 2018, 02:28:27 UTC | When building Halide without llvm, add an rpath to the llvm used | 30 January 2018, 02:28:27 UTC |
68e2538 | Andrew Adams | 30 January 2018, 01:20:06 UTC | Merge pull request #2645 from halide/export_all Expose internal symbols | 30 January 2018, 01:20:06 UTC |
6d26b97 | Andrew Adams | 29 January 2018, 22:18:07 UTC | A single space character counts as true, apparently | 29 January 2018, 22:18:07 UTC |
022f01a | Jing Pu | 29 January 2018, 21:59:18 UTC | Allow lifting reinterpret calls in LICM. | 29 January 2018, 21:59:18 UTC |
92cedc3 | Andrew Adams | 29 January 2018, 19:46:56 UTC | Take RTTI settings from LLVM by default But allow the configuration of LLVM with RTTI + Halide without RTTI, via WITH_RTTI in the Makefile and HALIDE_ENABLE_RTTI in cmake | 29 January 2018, 19:46:56 UTC |
9730393 | Andrew Adams | 27 January 2018, 04:11:16 UTC | export symbols on linux with cmake | 27 January 2018, 04:11:16 UTC |
bc83c0f | Andrew Adams | 26 January 2018, 22:45:47 UTC | Missing endif | 26 January 2018, 22:45:47 UTC |
90c092a | Andrew Adams | 26 January 2018, 22:36:07 UTC | Adding dllexport to global vars | 26 January 2018, 22:36:07 UTC |
16328f0 | Andrew Adams | 26 January 2018, 21:23:02 UTC | Merge remote-tracking branch 'origin/master' into export_all | 26 January 2018, 21:23:02 UTC |
9b03ec7 | Andrew Adams | 26 January 2018, 21:20:01 UTC | Flip sense of incorrect ifdef | 26 January 2018, 21:20:01 UTC |
e40ffc8 | Andrew Adams | 26 January 2018, 21:18:02 UTC | Merge pull request #2526 from halide/cuda_register_shuffle Support for warp shuffle instructions on cuda | 26 January 2018, 21:18:02 UTC |
a3a318f | Andrew Adams | 25 January 2018, 22:10:06 UTC | Forgot to actually define HALIDE_EXPORT | 25 January 2018, 22:10:06 UTC |
68a2c60 | Andrew Adams | 25 January 2018, 18:28:08 UTC | Spacing fix | 25 January 2018, 18:28:08 UTC |
9a08969 | Andrew Adams | 25 January 2018, 18:26:15 UTC | Indentation fix | 25 January 2018, 18:26:15 UTC |
b6595bd | Andrew Adams | 25 January 2018, 18:10:32 UTC | Remove EXPORT on functions only | 25 January 2018, 18:10:32 UTC |
59bca3c | Andrew Adams | 25 January 2018, 16:38:19 UTC | Review comments | 25 January 2018, 16:38:19 UTC |
ce02cd9 | Andrew Adams | 25 January 2018, 16:31:02 UTC | Merge remote-tracking branch 'origin' into export_all | 25 January 2018, 16:31:02 UTC |
eeaae61 | Andrew Adams | 25 January 2018, 16:30:05 UTC | Merge pull request #2695 from halide/clone_shares_cache_key A clone of a Func should share cache entries with the original | 25 January 2018, 16:30:05 UTC |
446267d | Andrew Adams | 25 January 2018, 16:29:47 UTC | Merge pull request #2710 from halide/srj-ub Avoid UBSan failures | 25 January 2018, 16:29:47 UTC |