Revision history - refs/heads/srj/fix-pytorch - origin: https://github.com/halide/Halide

visit type:

Newer
Older

Revision	Author	Date	Message	Commit Date
d76970a	Steven Johnson	09 February 2021, 22:32:19 UTC	Fix apps/HelloPyTorch	09 February 2021, 22:32:19 UTC
fe0888b	Steven Johnson	09 February 2021, 18:00:58 UTC	Refactor code for dealing with default values of scalar Params (#5720) The "default" value for a scalar param is rarely used -- it's currently only possible to specify foran Input in a Generator, and that value only shows up in the generated metadata for AOT compilation. This PR refactors this so that instead of being maintained solely as a hack in Generator data structures, it's moved into Parameter as its own field. This seems like a lot of work some something of marginal use, but I'm reluctant to suggest removing it entirely (it's possible it could break someone's code), and refactoring it in this way will make some subsequent Generator refactoring easier to understand and review.	09 February 2021, 18:00:58 UTC
3fbb12a	John Lawson	09 February 2021, 18:00:02 UTC	Add support for AVX512 BF16 dot product (#5712) * Add support for AVX512 BF16 dot product * Match on f32f32 Remove f32 check	09 February 2021, 18:00:02 UTC
3e034d6	Steven Johnson	08 February 2021, 23:05:30 UTC	Remove unnecessary #include from RDom.cpp (#5718)	08 February 2021, 23:05:30 UTC
1b22dfe	John Lawson	05 February 2021, 20:06:34 UTC	Add support for AVX512 f32x32 to bf16x32 conversion (#5711) The vcvtne2ps2bf16 instruction combines two f32x16 vectors and converts them to one bf16x32 vector. We can use this to support converting a f32x32 vector to bf16x32 vector by splitting the input vector into two.	05 February 2021, 20:06:34 UTC
8ee7f4c	Andrew Adams	05 February 2021, 20:01:32 UTC	Add mux intrinsic (#5707) Add mux intrinsic	05 February 2021, 20:01:32 UTC
27be859	Steven Johnson	05 February 2021, 19:58:54 UTC	Deprecate old-style realize() methods (#5676) * Deprecate old-style realize() methods We had 5 extra variants of realize() (for 0-dim thru 4-dim cases); these are a holdover from both having a limit of 4 dimensions (ie, pre-halide_buffer_t) and also from pre-C++11 (ie, passing in initializer-lists of int was less convenient). Let's deprecated these for Halide 12 and remove them in Halide 13.	05 February 2021, 19:58:54 UTC
32d5f71	Andrew Adams	05 February 2021, 00:44:07 UTC	Don't make new IR nodes if nothing changed when simplifying (#5698) * Don't make new IR nodes if nothing changed Some statements and expressions get crafted anew each time when repeatedly resimplified. This PR fixes all the cases I could find.	05 February 2021, 00:44:07 UTC
3c0b9e4	John Lawson	04 February 2021, 18:43:22 UTC	Provide wrapper around 128 bit cvtneps2bf16 (#5704) * Provide wrapper around 128bit cvtneps2bf16 * Include module with AVX512 feature Co-authored-by: Steven Johnson <srj@google.com>	04 February 2021, 18:43:22 UTC
793b7f6	cimes-isi	04 February 2021, 17:40:41 UTC	HalideBuffer: cast offset operand to ptrdiff_t to avoid overflow (#5706) * HalideBuffer: cast offset operand to ptrdiff_t to avoid overflow * HalideBuffer: cast offset_of operand to ptrdiff_t to avoid overflow * HalideRuntime: ptrdiff_t casts to avoid overflow	04 February 2021, 17:40:41 UTC
a0fc7fa	Alexander Root	04 February 2021, 01:13:53 UTC	Add fixes to overflow analysis in bounds inference (#5618) * add fixes to overflow analysis in bounds inference Co-authored-by: Steven Johnson <srj@google.com>	04 February 2021, 01:13:53 UTC
dadbcbf	John Lawson	03 February 2021, 21:58:00 UTC	Check Sapphire Rapids AVX512BF16 support in runtime (#5702) The Sapphire Rapids target feature controls whether BF16 and VVNI X86 instructions are emitted. Support for both of these is checked in the Halide compiler, but BF16 is not checked in the runtime as it required extending the cpuid functionality. Now we have that cpuid functionality we can add the BF16 check.	03 February 2021, 21:58:00 UTC
f64525e	aankit-ca	03 February 2021, 19:29:29 UTC	[HVX] Correct simd-op-check-hvx (#5703) Add "+hvxv6x" to mattrs Correct isa_version in simd_op_check_hvx Co-authored-by: Ankit Aggarwal <aankit@quicinc.com>	03 February 2021, 19:29:29 UTC
d8c95dd	Andrew Adams	03 February 2021, 17:38:21 UTC	Capture Exprs by ref in IRMatch (#5696) * Capture Exprs by ref in IRMatch * Forbid rvalue Exprs passed to IRMatcher nodes * Clarify what is_const refers to * Fix min int constant * Add explanatory comment * Remove assert that was blowing up simplifier stack frames	03 February 2021, 17:38:21 UTC
c3cb54b	Volodymyr Kysenko	03 February 2021, 16:49:38 UTC	Add missing headers (#5700)	03 February 2021, 16:49:38 UTC
bdfa994	Volodymyr Kysenko	03 February 2021, 16:47:16 UTC	async deserves its own line (#5701)	03 February 2021, 16:47:16 UTC
3ba8691	Alexander Root	03 February 2021, 00:21:48 UTC	fix lens_blur estimates (#5694)	03 February 2021, 00:21:48 UTC
265f2c7	John Lawson	02 February 2021, 22:04:41 UTC	Add initial support for Sapphire Rapids AVX512 features (#5677) * Add Sapphire Rapids target feature * Add initial avx512_sr test * Guard against earlier LLVM versions * Move feature to other AVX512 features * Set earlier features when SapphireRapids selected * trigger buildbots * Add SapphireRapids to get_runtime_compatible_target * Add issue link to TODO comments * Add user errors if using unsupported feature	02 February 2021, 22:04:41 UTC
89329d3	Steven Johnson	02 February 2021, 19:36:21 UTC	Disable generator_aot_gpu_multi_context_threaded under wasm (#5692) Re-enabled wasm testing, which revealed breakage of the generator_aot_gpu_multi_context_threaded target (the runtime isn't being linked properly). Since this test isn't useful under wasm at the present time anyway (relies on GPU support), just skipping it entirely for those targets.	02 February 2021, 19:36:21 UTC
99c5583	Alexander Root	02 February 2021, 17:49:31 UTC	[adams2019] Restructure autoscheduler + add timer (#5654) * add feature caching and block caching to adams2019 autoscheduler * added caching verification for feautures * clean up TODOs and commented out src * add docstring * clean up final TODOs * rm double declaration * make clang format happy * rm stats that caused linker error * fix cmake * fix clang format too * remove caching from adams2019 restructuring * move unnecessary member functions + add descriptions for State member functions * fix clang tidy in LoopNest.* * add top level comment to State struct	02 February 2021, 17:49:31 UTC
100bc76	John Lawson	02 February 2021, 17:47:34 UTC	Add ecx support for runtime X86 cpuid (#5684) * Add ecx support for runtime X86 cpuid Some newer X86 extensions require setting ecx when calling cpuid, for example AVX512BF16 support is queried using cpuid(eax=7, ecx=1). * trigger buildbots Co-authored-by: Steven Johnson <srj@google.com>	02 February 2021, 17:47:34 UTC
f941376	Volodymyr Kysenko	02 February 2021, 04:12:21 UTC	Lower halving_* intrinsics without widening (#5686) * Lower halving_* intrinsics without widening * Fix typo * Extend the test to check for correctness of the halving_* lowering * Change vector to array	02 February 2021, 04:12:21 UTC
25eeea1	Steven Johnson	01 February 2021, 23:11:24 UTC	ScopedFile in write_debug_image needs sanity check (#5689) Specifically, don't call fclose() on a null ptr, as that can crash. Also added some code to avoid implicit int->bool conversions.	01 February 2021, 23:11:24 UTC
a471e59	Andrew Adams	01 February 2021, 21:01:56 UTC	Add a spin before the cond_wait in the thread pool (#5408) * Add a spin before the cond_wait in the thread pool * Spin on a mutex even if someone is parked	01 February 2021, 21:01:56 UTC
9743fca	Zalman Stern	01 February 2021, 19:50:40 UTC	Make GPU kernel compilation caching consistent across GPU backends. (#5546) Make GPU context handling more consistent and use a common compilation cache for kernels. Introduces a finalization routine for kernel compilation to indicate when kernels are not strictly required to be defined. Thus allowing them to be unloaded or discarded, but not when they are needed. Co-authored-by: Steven Johnson <srj@google.com> Co-authored-by: Marcos Slomp <slomp@adobe.com>	01 February 2021, 19:50:40 UTC
e517946	Steven Johnson	01 February 2021, 19:27:24 UTC	Delete llvm_builder.yml (#5685)	01 February 2021, 19:27:24 UTC
1155bda	Steven Johnson	29 January 2021, 21:31:06 UTC	Avoid bogus out-of-memory error for multiple_scatter under wasm (#5681)	29 January 2021, 21:31:06 UTC
6118a62	Steven Johnson	29 January 2021, 17:27:30 UTC	Disable a few more wasm-simd ops in simd_op_check (#5679) Recent changes to the final wasm-simd spec means that some instructions aren't being generated (and may not even exist in the same form). Commented out for now; we need to revisit this once the LLVM backend for wasm gets closer to up-to-date with the final spec.	29 January 2021, 17:27:30 UTC
288526c	Dillon Sharlet	28 January 2021, 17:52:16 UTC	Encapsulate a few more symbols (#5672) * Encapsulate more symbols.	28 January 2021, 17:52:16 UTC
f427ad1	Steven Johnson	28 January 2021, 17:51:11 UTC	Remove deprecated variants of infer_input_bounds() in the Python bindings (#5673) * Remove deprecated variants of infer_input_bounds() in the Python bindings The C++ versions were removed for Halide 12 already, but I missed the Python wrappers. * trigger buildbots	28 January 2021, 17:51:11 UTC
813eadc	Alex Reinking	28 January 2021, 09:41:44 UTC	Fix target detection for i686 (#5675)	28 January 2021, 09:41:44 UTC
a8299b5	Steven Johnson	27 January 2021, 21:22:40 UTC	Allow LLVM-13 and Clang-13 (#5674)	27 January 2021, 21:22:40 UTC
6e3fb56	Steven Johnson	26 January 2021, 04:09:53 UTC	FIx intermittent OSX Python crash (#5667) * FIx intermittent OSX Python crash The OSX buildbot has been crashing intermittently on some python tests; debugging showed that in some situations, Introspection's calls to `backtrace()` include bogus addresses (eg 0x08), which cause segfaults when you try to inspect memory near them. The reasons for this aren't entirely clear -- for instance, it only seems to repeat reliably when using the Makefile rather than CMake, and only when doing an 'out-of-tree' build. Rather than try to run this to ground further, this PR just checks for address fields that seem obviously unreasonable (first 256 bytes of address space) and ignore them. * Add -fno-omit-frame-pointer, update sanity check * Update Introspection.cpp	26 January 2021, 04:09:53 UTC
d4c27ca	Dillon Sharlet	25 January 2021, 21:51:57 UTC	Lower saturating arithmetic without widening (#5662) * Lower saturating arithmetic without widening, and handle it in lower_intrinsic. * clang-format, fix saturating sub * cout -> cerr * trigger buildbots Co-authored-by: Alex Reinking <alex.reinking@gmail.com> Co-authored-by: Steven Johnson <srj@google.com>	25 January 2021, 21:51:57 UTC
38be3e3	aankit-ca	23 January 2021, 01:06:20 UTC	Add rounding shift right instructions (#5664) Co-authored-by: Ankit Aggarwal <aankit@quicinc.com>	23 January 2021, 01:06:20 UTC
7cff481	Dillon Sharlet	22 January 2021, 21:18:47 UTC	Fix VSX min/max intrinsics. Fixes #5661. (#5663)	22 January 2021, 21:18:47 UTC
6b398a3	Andrew Adams	22 January 2021, 18:23:48 UTC	Better codegen for switch-statement-like if-else chains (#5595) Better codegen for switch-statement-like if-else chains And added a test that demonstrates writing a little interpreter in Halide and scheduling it.	22 January 2021, 18:23:48 UTC
8c57a1a	Steven Johnson	21 January 2021, 22:07:20 UTC	Use linker tools on OSX & Linux to limit exports (#4651) (#5659) * Use linker scripts on OSX & Linux to limit exports * Write script to detect appropriate linker flags. Co-authored-by: Alex Reinking <alex.reinking@gmail.com>	21 January 2021, 22:07:20 UTC
be7a6a3	Alexander Root	21 January 2021, 21:23:43 UTC	is_positive_const and is_negative_const broken for (some) casts (#5615) * let signed_const checkers fail for non-widening integral casts Co-authored-by: Steven Johnson <srj@google.com>	21 January 2021, 21:23:43 UTC
0ca0415	Steven Johnson	21 January 2021, 01:59:10 UTC	Remove all deprecated methods for Halide 12 (#5656) * Remove all deprecated methods for Halide 12 These were all marked as deprecated in Halide 11 (and probably Halide 10 too); let's go ahead and remove them in Halide 12. * Remove function bodies too	21 January 2021, 01:59:10 UTC