https://github.com/halide/Halide
Revision 26619d246acc418bd4bd5ce12543dc2aedbbfb1f authored by Pranav Bhandarkar on 18 September 2023, 19:48:34 UTC, committed by GitHub on 18 September 2023, 19:48:34 UTC
* Dump the IR more frequently in HexagonOptimize.cpp

* Fix 8bit unsigned saturating downcasts for HVX

We do not have a way of reliably lowering the following expression
to LLVM bitcode for HVX.

u8_sat(uint16x)

where uint16x is a vector (preferably a HVX double vector) with
element type uint16.
Since there is no native HVX instruction to do this, this patch
introduces two helper functions in hvx_128.ll to perform this
operation. One function interleaves its input (trunc_satub.vuh) and the
other does not (pack_satub.vuh)

This patch also removes declaration of some intrinsics not use any
longer in hvx_128.ll

* Make IR dump messages in HexagonOptimize.cpp consistent with those in CodeGen_Hexagon.cpp

* fix clang-format complaints

---------

Co-authored-by: Steven Johnson <srj@google.com>
1 parent 68a0341
History
Tip revision: 26619d246acc418bd4bd5ce12543dc2aedbbfb1f authored by Pranav Bhandarkar on 18 September 2023, 19:48:34 UTC
[Hexagon] - Fix 8-bit unsigned saturating downcasts for HVX (Fixes #7806) (#7825)
Tip revision: 26619d2
File Mode Size
.github
apps
cmake
dependencies
doc
packaging
python_bindings
src
test
tools
tutorial
util
.clang-format -rw-r--r-- 1.4 KB
.clang-format-ignore -rw-r--r-- 375 bytes
.clang-tidy -rw-r--r-- 6.9 KB
.gitattributes -rw-r--r-- 342 bytes
.gitignore -rw-r--r-- 4.9 KB
.gitmodules -rw-r--r-- 0 bytes
CMakeLists.txt -rw-r--r-- 6.6 KB
CMakePresets.json -rw-r--r-- 6.8 KB
CODE_OF_CONDUCT.md -rw-r--r-- 3.5 KB
LICENSE.txt -rw-r--r-- 14.4 KB
MANIFEST.in -rw-r--r-- 159 bytes
Makefile -rw-r--r-- 106.1 KB
README.md -rw-r--r-- 16.5 KB
README_cmake.md -rw-r--r-- 77.9 KB
README_fuzz_testing.md -rw-r--r-- 3.9 KB
README_python.md -rw-r--r-- 31.8 KB
README_rungen.md -rw-r--r-- 12.1 KB
README_vulkan.md -rw-r--r-- 11.4 KB
README_webassembly.md -rw-r--r-- 10.5 KB
README_webgpu.md -rw-r--r-- 4.4 KB
pyproject.toml -rw-r--r-- 196 bytes
requirements.txt -rw-r--r-- 130 bytes
run-clang-format.sh -rwxr-xr-x 1.4 KB
run-clang-tidy.sh -rwxr-xr-x 3.8 KB
setup.py -rw-r--r-- 1.2 KB

README.md

back to top