swh:1:snp:2c68c8bd649bf1bd2cf3bf7bd4f98d247b82b5dc

sort by:
Revision Author Date Message Commit Date
a675193 Verify that Python stubs work only with new-style Generators Revised version of https://github.com/halide/Halide/pull/2768, but explicitly forbidding 'old' and 'partial-new' Generators for now. 23 February 2018, 00:55:09 UTC
4c4a6f6 Merge pull request #2760 from halide/add_fpic_to_runtime Add -fpic option to build of .ll files, the new dso_local hadling in… 21 February 2018, 00:16:50 UTC
8ca6e97 Merge pull request #2764 from halide/srj-parammap Clean up OIR code in Parameter 20 February 2018, 23:20:37 UTC
d418cac Clean up OIR code in Parameter Should be the same functionality as before, just more compact and comprehensible. 20 February 2018, 19:02:14 UTC
684429e Add -fpic option to build of .ll files, thew new dso_local hadling in clang needs to know if these are shared libs or not, for symbol visibility 19 February 2018, 18:50:40 UTC
fe85f6a Merge pull request #2751 from halide/srj-tsel Add tuple_select() to python bindings. 15 February 2018, 17:40:42 UTC
0409941 Merge branch 'master' into srj-tsel 14 February 2018, 23:10:43 UTC
46d8e9e Merge pull request #2750 from halide/srj-llvm Fix API change for trunk LLVM 14 February 2018, 23:10:30 UTC
c1087c6 Add tuple_select() to python bindings. 14 February 2018, 23:01:31 UTC
b848f79 Merge pull request #2738 from kernhanda/kernhanda/cmake-package-pr Rewrite CMake builds `install`/`distrib` targets. 14 February 2018, 21:58:51 UTC
7de2b4f Fix API change for trunk LLVM 14 February 2018, 20:35:01 UTC
d5621f5 Merge pull request #2635 from halide/nn-ops Replace hexagon_matmul with nn_ops 14 February 2018, 16:56:55 UTC
9bec1f6 Fix `halide_config.cmake.tpl` 14 February 2018, 09:09:50 UTC
e31f3fd Merge pull request #2739 from halide/pybind11-build-fix Fix python_bindings build 14 February 2018, 01:19:27 UTC
b33b2f9 Add README.md. 14 February 2018, 01:07:12 UTC
0969dd2 Merge pull request #2745 from halide/slomp/bugfix removed the hard limit on the length of environment variable queries … 14 February 2018, 00:02:15 UTC
ce9e258 disable apps/camera_pipe on windows 13 February 2018, 22:15:41 UTC
34e4d9f more code review fixes 13 February 2018, 21:19:28 UTC
f2133e9 more code review fixes 13 February 2018, 20:42:14 UTC
be58e82 code review changes 13 February 2018, 19:57:56 UTC
a5a0495 removed the hard limit on the length of environment variable queries on Windows (which was particularly detrimental to HL_WINDOWS_TMP_DIR) 13 February 2018, 19:32:19 UTC
31568a0 Merge pull request #2737 from halide/thread_safe_jit_call Add a test per issue https://github.com/halide/Halide/issues/2731 . 13 February 2018, 00:34:50 UTC
21ba752 cr feedback 12 February 2018, 23:51:27 UTC
483e70d Add a comment to indicate why a mutex is acquired in the case where only one thread is compiling. 12 February 2018, 21:03:37 UTC
264d341 trigger 12 February 2018, 05:13:32 UTC
0701de3 Fix python_bindings build Fixes compatibility with various Python environments (seen in several Mac Anaconda Python 3.6 distributions), following the [PyBind11 build instructions](https://github.com/pybind/pybind11/blob/master/docs/compiling.rst#building-manually) more carefully. In particular, don’t pre-link libpython, but rely on lazy linking at load time of the module in a given interpreter. 12 February 2018, 02:42:40 UTC
7c43565 Missed HalideRuntime*.h 12 February 2018, 00:34:57 UTC
7f18d20 Fix up halide.cmake to be more platform agnostic 11 February 2018, 20:07:51 UTC
923d651 Add a test per issue https://github.com/halide/Halide/issues/2731 . Also serves as documnetation on passing parameters to JITted routines and avoiding recompilation for each realize. Suggestions on further comments for documentation are particularly welcome. 11 February 2018, 20:02:14 UTC
6d9ab57 Rewrite CMake builds `install`/`distrib` targets. This change adds support for a proper `install` target for CMake builds. This allows for the usage of CPack to create packages for various systems and platforms. Once Halide is ready to do these things, the settings can be adjusted accordingly. For now, the focus is on creating tgz and zip packages for POSIX systems and Windows, respectively. This is done using CPack. The `distrib` target that was previously available has been rewritten in terms of the `install` target and taking advantage of being able to call CPack through CMake. As was done previously, a Halide.tgz package is created for POSIX systems while a Halide.zip package is created for Windows systems. For Windows builds specifically, the following changes were made: * Enable support for no RTTI builds * Enable support for OpenMP builds * Enable building of all but one app in the `apps/` folder * Fix emitted code for the C codegen to not result in an "inconsistent dll linkage" warning Builds were confirmed to be building with VS2017 on Windows. 11 February 2018, 19:07:15 UTC
5125fa3 Merge pull request #2733 from halide/ger_cleanup reschedule ger 09 February 2018, 23:05:04 UTC
b7176fb parallel -> true 09 February 2018, 17:09:28 UTC
1a83604 reschedule ger Change ger to be in-place, and simplify the schedule. Performance now closer to openblas for me on the larger sizes, and is much faster on the smaller sizes. Rightmost column is proportional to flops Halide sger 32 0.081833 13.2953 Halide sger 64 0.226842 18.6209 Halide sger 128 1.073608 15.4991 Halide sger 288 7.498786 11.1378 Halide sger 544 29.680935 10.0072 Halide sger 1056 121.922739 9.16357 Halide sger 2080 1147.310148 3.77453 OpenBLAS sger 32 35.603869 0.0305585 OpenBLAS sger 64 68.888618 0.0613164 OpenBLAS sger 128 43.106477 0.386021 OpenBLAS sger 288 92.307125 0.904806 OpenBLAS sger 544 186.297631 1.59435 OpenBLAS sger 1056 363.545260 3.0732 OpenBLAS sger 2080 1035.606023 4.18167 08 February 2018, 22:13:46 UTC
de84004 Convert all nn_ops Generators to new-style 08 February 2018, 18:23:38 UTC
9cbdbaf Don't include malloc.h 08 February 2018, 18:23:20 UTC
bef8bc2 Merge pull request #2730 from adityaatluri/rocm-make added makefile changes to build for amdgpu 08 February 2018, 17:30:30 UTC
d2dbcd7 Merge branch 'master' of https://github.com/halide/Halide into nn-ops 08 February 2018, 05:00:15 UTC
aad17fb Merge pull request #2719 from halide/srj-pycallgen Augment the Python Bindings to allow using precompiled Generators 08 February 2018, 00:38:52 UTC
e604193 Merge branch 'master' into srj-pycallgen 07 February 2018, 21:54:37 UTC
60c5e5c Merge branch 'master' into srj-pycallgen 07 February 2018, 21:52:31 UTC
ea1d830 Merge pull request #2728 from halide/srj-llset Revise GeneratorParam<LoopLevel>::set() work for both before-and-after generate() usage 07 February 2018, 21:52:04 UTC
bbada39 Merge branch 'master' into srj-llset 07 February 2018, 18:14:10 UTC
f8c177b Merge pull request #2729 from halide/srj-unused Remove unused 'using's 07 February 2018, 18:13:40 UTC
6b3df35 changed cmake in src directory to enable WITH_AMDGPU macro for compilation 07 February 2018, 03:09:26 UTC
10a9b3e Made changes to cmake to build amdgpu and added comments about amdgpu build status 07 February 2018, 02:59:24 UTC
f31df5f added makefile changes to build for amdgpu 1. Changed Makefile to enable compile flag WITH_AMDGPU 2. Changed CodeGen_LLVM file to check if intializing amdgpu works or not 07 February 2018, 02:41:37 UTC
504a7fc Remove unused 'using's 06 February 2018, 23:57:55 UTC
7c5d674 Add explicit Func name 06 February 2018, 23:51:11 UTC
a8a2178 Add explicit Func name 06 February 2018, 23:50:48 UTC
b4bdb05 Patch PR#2728 06 February 2018, 23:17:12 UTC
2f4c511 Add missing 'using' 06 February 2018, 23:16:42 UTC
95a4c88 Merge pull request #2727 from halide/srj-tupleselect tuple_select cleanup 06 February 2018, 23:05:03 UTC
979b56e Merge branch 'master' into srj-pycallgen 06 February 2018, 23:01:45 UTC
e207f95 Revise GeneratorParam<LoopLevel>::set() work for both before-and-after generate() usage A much-simpler alternative to PR#2720. Note that the interesting change is really all in the override to set() in GeneratorParam_LoopLevel (the changes to the base class implementation are just drive-by simplification to make that override a bit easier to reason about). 06 February 2018, 23:00:39 UTC
0e001cc Clarify and rename methods related to Outputs 06 February 2018, 20:01:21 UTC
2ba8955 Remove unused stuff 06 February 2018, 19:23:48 UTC
93ea595 Fix comment 06 February 2018, 19:18:56 UTC
74c4398 Restore all-are-convertible test 06 February 2018, 18:26:28 UTC
f1b7707 Merge branch 'master' into srj-tupleselect 06 February 2018, 18:13:32 UTC
18cd57e Move non-templated select() and tuple_select() impls into IROperator.cpp 05 February 2018, 21:59:11 UTC
137e2b2 Merge pull request #2726 from halide/zero_init_buffer_shape zero-initialize all fields of a Halide::Runtime::Buffer 05 February 2018, 21:51:20 UTC
9709513 tuple_select cleanup -- move to IROperator.h, next to select(), for better discoverability -- add multiway verison of tuple_select(), to match select() -- add test 05 February 2018, 21:03:26 UTC
b7959eb Try to avoid gcc 4.8 internal compiler error 05 February 2018, 19:14:35 UTC
ef8ff74 zero-initialize all fields of a Halide::Runtime::Buffer 05 February 2018, 17:50:48 UTC
8cdc8cd Merge pull request #2725 from halide/export_codegen_internal Don't include LLVM_Headers in CodeGen_Internal.h 05 February 2018, 02:29:13 UTC
41157da Don't include LLVM_Headers in CodeGen_Internal.h, so that it can be put in Halide.h 04 February 2018, 03:07:00 UTC
db4927a Merge pull request #2724 from halide/simplify_const_cast Remove unnecessary template function 04 February 2018, 00:32:11 UTC
980daf9 Remove unnecessary template function 03 February 2018, 02:50:51 UTC
81de819 Merge pull request #2722 from halide/srj-cast Fix error in commit c2fd18 03 February 2018, 01:50:05 UTC
d414e1b Fix error in commit c2fd18 copy-paste error; several of the changes should have been <void> rather than <T>, since remove_const must match the type. 02 February 2018, 19:05:06 UTC
5e432c6 Merge pull request #2721 from halide/srj-tidy Fix minor clang-tidy warnings 02 February 2018, 02:06:47 UTC
71f8594 Fix minor clang-tidy warnings Various declaration-vs-definition-names-differ warnings to silence downstream nags 01 February 2018, 23:14:16 UTC
6c00f11 Turn off debugging 31 January 2018, 23:33:31 UTC
9518686 Augment the Python Bindings to allow using a precompiled Generator (written in C++) at JIT time via the Python bindings, mimicking the C++ stub functionality. 31 January 2018, 23:30:49 UTC
6908edb Merge pull request #2717 from halide/srj-movereg Add move ctor/op= for Parameter 31 January 2018, 17:27:09 UTC
3b0e1dc Merge pull request #2715 from halide/add_rpath_for_llvm When building Halide without llvm, add an rpath to the llvm used 31 January 2018, 16:09:48 UTC
528ec2b Add move ctor/op= for Parameter The implicitly-define move ctor for Parameter didn't handle object instance registry tracking, which is must do. (Other usages should never be copyable or movable; move ops were deleted to ensure this.) 31 January 2018, 01:24:36 UTC
658e965 Add explanatory comment 30 January 2018, 22:16:33 UTC
22a85d1 Merge pull request #2716 from halide/srj-ret in_gpu_loop wasn't being restore 30 January 2018, 21:22:16 UTC
56a5608 in_gpu_loop wasn't being restore See previous line 454. Switched all usage to ScopedValue to avoid needing manual cleanup. 30 January 2018, 18:08:29 UTC
edab342 Merge pull request #2714 from halide/lift_reinterpret Allow substituting reinterpret calls back in LICM. 30 January 2018, 04:06:59 UTC
17bad9b When building Halide without llvm, add an rpath to the llvm used 30 January 2018, 02:28:27 UTC
68e2538 Merge pull request #2645 from halide/export_all Expose internal symbols 30 January 2018, 01:20:06 UTC
6d26b97 A single space character counts as true, apparently 29 January 2018, 22:18:07 UTC
022f01a Allow lifting reinterpret calls in LICM. 29 January 2018, 21:59:18 UTC
92cedc3 Take RTTI settings from LLVM by default But allow the configuration LLVM with RTTI + Halide without RTTI, via WITH_RTTI in the Makefile and HALIDE_ENABLE_RTTI in cmake 29 January 2018, 19:46:56 UTC
9730393 export symbols on linux with cmake 27 January 2018, 04:11:16 UTC
bc83c0f Missing endif 26 January 2018, 22:45:47 UTC
90c092a Adding dllexport to global vars 26 January 2018, 22:36:07 UTC
16328f0 Merge remote-tracking branch 'origin/master' into export_all 26 January 2018, 21:23:02 UTC
9b03ec7 Flip sense of incorrect ifdef 26 January 2018, 21:20:01 UTC
e40ffc8 Merge pull request #2526 from halide/cuda_register_shuffle Support for warp shuffle instructions on cuda 26 January 2018, 21:18:02 UTC
a3a318f Forgot to actually define HALIDE_EXPORT 25 January 2018, 22:10:06 UTC
68a2c60 Spacing fix 25 January 2018, 18:28:08 UTC
9a08969 Indentation fix 25 January 2018, 18:26:15 UTC
b6595bd Remove EXPORT on functions only 25 January 2018, 18:10:32 UTC
59bca3c Review comments 25 January 2018, 16:38:19 UTC
ce02cd9 Merge remote-tracking branch 'origin' into export_all 25 January 2018, 16:31:02 UTC
eeaae61 Merge pull request #2695 from halide/clone_shares_cache_key A clone of a Func should share cache entries with the original 25 January 2018, 16:30:05 UTC
446267d Merge pull request #2710 from halide/srj-ub Avoid UBSan failures 25 January 2018, 16:29:47 UTC
back to top