https://github.com/CNugteren/CLTune

sort by:
Revision Author Date Message Commit Date
d0ec5a1 Merge pull request #45 from CNugteren/development Update to version 2.5.0 27 September 2016, 19:04:58 UTC
82dd234 Updated to version 2.5.0 27 September 2016, 18:53:32 UTC
2a56722 Made the number of runs for averaging a setting configurable by the user 27 September 2016, 18:49:47 UTC
bb4ba83 Updated to version 8.0 of CLCudaAPI 27 September 2016, 18:48:10 UTC
492c362 Updated Travis CI to use the system OpenCL instead of compiling our own OpenCL library 03 August 2016, 18:21:59 UTC
68cb1d4 Updated to version 7.0 of the CLCudaAPI header 03 August 2016, 18:17:41 UTC
a6cb325 Merge pull request #44 from williamjshipman/development Fix bug in Kernel::LocalMemUsage on Intel CPU runtime 03 August 2016, 18:00:03 UTC
d8318a5 Fix bug in Kernel::LocalMemUsage where Intel CPU runtime returns a size of 0 if the in the first call to clGetKernelWorkGroupInfo. Cause seems to be an ambiguity in the OpenCL standard. 30 July 2016, 23:31:22 UTC
86dbb2e Merge pull request #42 from CNugteren/development Update to version 2.4.0 29 June 2016, 17:50:22 UTC
a001605 Minor fix to the AppVeyor CI build 29 June 2016, 16:21:25 UTC
45b2c52 Updated to version 2.4.0 29 June 2016, 16:10:46 UTC
0526f9d Made it possible to run some of the GEMM kernels using CUDA (those without shared memory) 29 June 2016, 16:08:10 UTC
6177c14 Updated to version 6.0 of the CLCudaAPI header 29 June 2016, 15:50:12 UTC
609ea4c Removed building of tests for AppVeyor CI 29 June 2016, 15:49:52 UTC
fca2ad1 Added Appveyor CI and added OS X compilation for Travis 29 June 2016, 15:03:42 UTC
48719a2 Fixed the RPATH settings for OSX 16 June 2016, 18:20:44 UTC
b516ef7 Added a VERBOSE option to CMake to get additional diagnostic messages 16 June 2016, 18:18:49 UTC
e95c158 Unit-tests are now based on string-kernels instead of external-file-kernels to make it possible to run the unit test executables anywhere 31 May 2016, 18:37:17 UTC
f1b0900 Merge pull request #39 from CNugteren/development Update to version 2.3.1 25 May 2016, 11:03:26 UTC
ebb3085 Updated to version 2.3.1 (bug-fix release) 25 May 2016, 10:14:35 UTC
53a05ba Fixed computing the validation error for half-precision fp16 data-types 24 May 2016, 09:58:26 UTC
e9f43b5 Fixed a bug where an output buffer could not be used as input at the same time 24 May 2016, 09:56:15 UTC
b887e1e Merge pull request #38 from CNugteren/development Update to version 2.3.0 22 May 2016, 15:05:41 UTC
ae12ebe Updated to version 2.3.0 22 May 2016, 15:01:18 UTC
921271c Fixed CMake to compare strings properly; made MSVC link the runtime libraries statically 22 May 2016, 15:00:41 UTC
f923a17 Fixed a bug where failed results would still show up in the JSON files 22 May 2016, 14:41:10 UTC
86d701c Fixed a bug where failed results would still show up in the final results 16 May 2016, 10:14:18 UTC
ccf5ce2 Added support for short integers and cl_half fp16 as kernel arguments 14 May 2016, 15:59:17 UTC
cba89a4 Merge pull request #37 from CNugteren/development Update to version 2.2.0 27 April 2016, 09:03:34 UTC
2da8be1 Updated to version 2.2.0 27 April 2016, 08:56:54 UTC
acc110a Made the new samples work for CUDA as well 27 April 2016, 08:55:13 UTC
5f645e8 Fixed a typo in the API documentation 27 April 2016, 08:42:47 UTC
8b76ad1 Added API documentation to the repository 27 April 2016, 08:39:18 UTC
122cbb9 Minor fixes related to the newly added samples 27 April 2016, 07:59:27 UTC
9801a1e Added two much simpler examples to improve documentation 25 April 2016, 00:13:47 UTC
eac490c Updated the documentation 24 April 2016, 02:59:02 UTC
54df67a Updated headers to version 5.0 of the CLCudaAPI 24 April 2016, 02:58:00 UTC
8752c44 Updated Travis to reflect the latest Travis and Khronos changes 24 April 2016, 02:45:10 UTC
b306cf1 Merge pull request #36 from williamjshipman/development Only use OpenCL 2.x functions on OpenCL 2.x devices 03 April 2016, 22:41:46 UTC
bf1821b - Add VersionNumber function for querying device OpenCL version number as an integer (e.g. 120 for OpenCL 1.2). - Clean up OpenCL 2.0 check in Queue constructor. 03 April 2016, 01:20:33 UTC
33ba3ef Merge pull request #2 from CNugteren/development Development 02 April 2016, 19:37:23 UTC
da97040 Prepared the changelog for the next release 31 March 2016, 04:11:58 UTC
ad94a3d Merge branch 'development' 31 March 2016, 04:09:37 UTC
5802148 Updated to version 2.1.0 31 March 2016, 04:08:35 UTC
0110efc Merge branch 'development' of https://github.com/williamjshipman/CLTune into development 26 March 2016, 13:53:07 UTC
bccd8ac Add runtime check for OpenCL 2 before using OpenCL 2 function. 26 March 2016, 13:37:30 UTC
4698d4a Add runtime check for OpenCL 2 before using OpenCL 2 function. 26 March 2016, 13:21:37 UTC
0dc2a99 Updated the README 21 March 2016, 21:27:54 UTC
1ad3bb2 Merge branch 'development' of github.com:CNugteren/CLTune into development specially if it merges an updated upstream into a topic branch. 21 March 2016, 21:15:24 UTC
0b90c0c Fixes for minor warnings under Visual Studio 21 March 2016, 19:57:35 UTC
1d3c159 Added dllexport to be able to build a DLL under Windows 21 March 2016, 19:56:35 UTC
b170354 Merge pull request #35 from williamjshipman/development Add command line parameter for platform index to conv and gemm samples in line with description in README. 31 January 2016, 17:38:09 UTC
59faefa Updated the README to show that the platform ID is one of the command line parameters and updated the samples so that the order of the parameters matches all parts of the README. 30 January 2016, 23:07:16 UTC
b5a3a8b Samples now support a platform parameter in their command lines, in addition to the device number. 30 January 2016, 22:48:10 UTC
dcddd80 Updated FindOpenCL for Intel Linux OpenCL paths 23 January 2016, 15:08:14 UTC
d643731 Prepared the changelog for the next release 22 November 2015, 11:19:55 UTC
9e401f4 Merge pull request #33 from CNugteren/development Added machine learning, new CLCudaAPI, CUDA, Catch, and MSVC support 22 November 2015, 11:18:09 UTC
8bc6684 Updated to version 2.0.0 22 November 2015, 11:16:46 UTC
b22dce2 Updated the readme 22 November 2015, 11:15:35 UTC
a21d4a5 Merge pull request #32 from CNugteren/catch_tests Replaced GTest with Catch unit testing 21 November 2015, 13:33:50 UTC
400752b Updated changelog and readme 21 November 2015, 13:29:27 UTC
8757c9e Updated the 'KernelInfo' class to use Catch 21 November 2015, 13:27:45 UTC
b74dcef Updated the 'tuner' class tests to use Catch 19 November 2015, 20:03:50 UTC
3526cc8 Removed GTest, added Catch, added CLCudaAPI tests 15 November 2015, 15:20:41 UTC
e9984c3 Merge pull request #31 from CNugteren/msvc_support MSVC 2015 support 15 November 2015, 15:08:07 UTC
d3c961f Updated changelog 15 November 2015, 15:06:23 UTC
dbb096a Fixed a warning and error for MSVC 14 November 2015, 14:58:21 UTC
f701c1f Prepared for MSVC support 14 November 2015, 14:41:08 UTC
279d1eb Merge pull request #30 from CNugteren/clcudaapi Updated the CLCudaAPI header 10 November 2015, 19:33:30 UTC
c34d456 Fixes CL to CUDA translation header to make the simple example work 08 November 2015, 11:04:53 UTC
3456bf4 Fixed a header inclusion error 08 November 2015, 10:36:28 UTC
c526a84 Added experimental support for CUDA kernels 08 November 2015, 10:29:20 UTC
97cb535 Updated the changelog 07 November 2015, 16:36:21 UTC
a5c5150 Now using version 4.0 of the CLCudaAPI header 07 November 2015, 16:33:41 UTC
1d2701f Disabled additional warnings for Clang 07 November 2015, 16:32:47 UTC
5fbcb3a Merge pull request #29 from CNugteren/machine_learning Machine learning models 07 November 2015, 11:32:02 UTC
d9ea76d Prepared changelog for next version 07 November 2015, 11:26:51 UTC
d644df1 Added a 3-layer neural network model 29 October 2015, 14:12:58 UTC
dfc3651 Prepared for addition of a neural network model 29 October 2015, 09:58:15 UTC
32eb552 Merge pull request #28 from CNugteren/json_metadata Added additional device properties to JSON-output 25 October 2015, 14:51:30 UTC
0a8e371 Added additional device properties to JSON-output 25 October 2015, 14:25:46 UTC
406f3f7 Fixed warnings and zero-range bug 22 September 2015, 08:00:23 UTC
52b34eb Now using ML models to predict best configurations + using logarithmic data 21 September 2015, 15:15:17 UTC
e6e195d Merge pull request #27 from CNugteren/publication CLTune publication and capitalization 07 September 2015, 11:55:26 UTC
1e677ac Added a reference to the CLTune paper 07 September 2015, 11:50:51 UTC
4109e58 Updated name from cltune to CLTune for Travis 07 September 2015, 11:44:54 UTC
790ca0c Cosmetic updates to linear regression 31 August 2015, 15:09:14 UTC
02af290 Added verification based on the cost function 31 August 2015, 14:24:20 UTC
6ddc479 Added regularization support to linear regression 26 August 2015, 15:14:27 UTC
b17861f Completed the function to add extra polynomial features 26 August 2015, 14:24:55 UTC
7c10c8b Added mean value normalization 26 August 2015, 13:37:19 UTC
4d29b06 Minor optimizations to population of permutation configurations 26 August 2015, 13:36:55 UTC
137a93f Merge pull request #26 from CNugteren/travis_ci Added Travis continuous integration 26 August 2015, 12:17:22 UTC
52c0c4f Made Travis always build pushes to the master branch 26 August 2015, 12:11:36 UTC
bf48395 Added travis continuous integration 21 August 2015, 15:25:56 UTC
dec4610 Added initial version of a linear regression machine learning model 21 August 2015, 15:20:48 UTC
15782dd Merge pull request #25 from CNugteren/claduc_and_json Claduc C++11 interface & method to output JSON 03 August 2015, 15:16:14 UTC
43a0e9a Updated to version 1.7.0 03 August 2015, 15:15:04 UTC
2028cfd Added a method to print all results in JSON to file 03 August 2015, 15:09:40 UTC
3d44a9d Now using the new Claduc C++11 OpenCL header 27 July 2015, 11:52:07 UTC
back to top