73ed6c3 | Cedric Nugteren | 22 October 2016, 14:42:10 UTC | Added an option to compile a static library | 22 October 2016, 14:42:10 UTC |
bdbf353 | Cedric Nugteren | 12 October 2016, 19:43:01 UTC | Fixed a const/constexpr issue caused by the previous commit | 12 October 2016, 19:43:01 UTC |
083a5e2 | Cedric Nugteren | 12 October 2016, 19:36:20 UTC | Added support for compilation under Visual Studio 2013 (MSVC++ 12.0) | 12 October 2016, 19:36:20 UTC |
0ed56a1 | Cedric Nugteren | 02 October 2016, 11:44:46 UTC | It is now possible to set the OpenCL compiler options through an environment variable | 02 October 2016, 11:44:46 UTC |
5219183 | Cedric Nugteren | 02 October 2016, 11:38:45 UTC | Execution time measurement is no longer based on events but uses CPU timers instead, to also include the (varying) kernel launch time overhead and other overheads (if any) | 02 October 2016, 11:38:45 UTC |
82dd234 | Cedric Nugteren | 27 September 2016, 18:53:32 UTC | Updated to version 2.5.0 | 27 September 2016, 18:53:32 UTC |
2a56722 | Cedric Nugteren | 27 September 2016, 18:49:47 UTC | Made the number of runs for averaging a setting configurable by the user | 27 September 2016, 18:49:47 UTC |
bb4ba83 | Cedric Nugteren | 27 September 2016, 18:48:10 UTC | Updated to version 8.0 of CLCudaAPI | 27 September 2016, 18:48:10 UTC |
492c362 | Cedric Nugteren | 03 August 2016, 18:21:59 UTC | Updated Travis CI to use the system OpenCL instead of compiling our own OpenCL library | 03 August 2016, 18:21:59 UTC |
68cb1d4 | Cedric Nugteren | 03 August 2016, 18:17:41 UTC | Updated to version 7.0 of the CLCudaAPI header | 03 August 2016, 18:17:41 UTC |
a6cb325 | Cedric Nugteren | 03 August 2016, 18:00:03 UTC | Merge pull request #44 from williamjshipman/development Fix bug in Kernel::LocalMemUsage on Intel CPU runtime | 03 August 2016, 18:00:03 UTC |
d8318a5 | williamjshipman | 30 July 2016, 23:31:22 UTC | Fix bug in Kernel::LocalMemUsage where Intel CPU runtime returns a size of 0 in the first call to clGetKernelWorkGroupInfo. Cause seems to be an ambiguity in the OpenCL standard. | 30 July 2016, 23:31:22 UTC |
a001605 | Cedric Nugteren | 29 June 2016, 16:21:25 UTC | Minor fix to the AppVeyor CI build | 29 June 2016, 16:21:25 UTC |
45b2c52 | Cedric Nugteren | 29 June 2016, 16:10:46 UTC | Updated to version 2.4.0 | 29 June 2016, 16:10:46 UTC |
0526f9d | Cedric Nugteren | 29 June 2016, 16:08:10 UTC | Made it possible to run some of the GEMM kernels using CUDA (those without shared memory) | 29 June 2016, 16:08:10 UTC |
6177c14 | Cedric Nugteren | 29 June 2016, 15:50:12 UTC | Updated to version 6.0 of the CLCudaAPI header | 29 June 2016, 15:50:12 UTC |
609ea4c | Cedric Nugteren | 29 June 2016, 15:49:52 UTC | Removed building of tests for AppVeyor CI | 29 June 2016, 15:49:52 UTC |
fca2ad1 | Cedric Nugteren | 29 June 2016, 15:03:42 UTC | Added Appveyor CI and added OS X compilation for Travis | 29 June 2016, 15:03:42 UTC |
48719a2 | Cedric Nugteren | 16 June 2016, 18:20:44 UTC | Fixed the RPATH settings for OSX | 16 June 2016, 18:20:44 UTC |
b516ef7 | Cedric Nugteren | 16 June 2016, 18:18:49 UTC | Added a VERBOSE option to CMake to get additional diagnostic messages | 16 June 2016, 18:18:49 UTC |
e95c158 | Cedric Nugteren | 31 May 2016, 18:37:17 UTC | Unit-tests are now based on string-kernels instead of external-file-kernels to make it possible to run the unit test executables anywhere | 31 May 2016, 18:37:17 UTC |
ebb3085 | Cedric Nugteren | 25 May 2016, 10:14:35 UTC | Updated to version 2.3.1 (bug-fix release) | 25 May 2016, 10:14:35 UTC |
53a05ba | Cedric Nugteren | 24 May 2016, 09:58:26 UTC | Fixed computing the validation error for half-precision fp16 data-types | 24 May 2016, 09:58:26 UTC |
e9f43b5 | Cedric Nugteren | 24 May 2016, 09:56:15 UTC | Fixed a bug where an output buffer could not be used as input at the same time | 24 May 2016, 09:56:15 UTC |
ae12ebe | Cedric Nugteren | 22 May 2016, 15:01:18 UTC | Updated to version 2.3.0 | 22 May 2016, 15:01:18 UTC |
921271c | Cedric Nugteren | 22 May 2016, 15:00:41 UTC | Fixed CMake to compare strings properly; made MSVC link the runtime libraries statically | 22 May 2016, 15:00:41 UTC |
f923a17 | Cedric Nugteren | 22 May 2016, 14:41:10 UTC | Fixed a bug where failed results would still show up in the JSON files | 22 May 2016, 14:41:10 UTC |
86d701c | Cedric Nugteren | 16 May 2016, 10:14:18 UTC | Fixed a bug where failed results would still show up in the final results | 16 May 2016, 10:14:18 UTC |
ccf5ce2 | Cedric Nugteren | 14 May 2016, 15:59:17 UTC | Added support for short integers and cl_half fp16 as kernel arguments | 14 May 2016, 15:59:17 UTC |
2da8be1 | Cedric Nugteren | 27 April 2016, 08:56:54 UTC | Updated to version 2.2.0 | 27 April 2016, 08:56:54 UTC |
acc110a | Cedric Nugteren | 27 April 2016, 08:55:13 UTC | Made the new samples work for CUDA as well | 27 April 2016, 08:55:13 UTC |
5f645e8 | Cedric Nugteren | 27 April 2016, 08:42:47 UTC | Fixed a typo in the API documentation | 27 April 2016, 08:42:47 UTC |
8b76ad1 | Cedric Nugteren | 27 April 2016, 08:39:18 UTC | Added API documentation to the repository | 27 April 2016, 08:39:18 UTC |
122cbb9 | Cedric Nugteren | 27 April 2016, 07:59:27 UTC | Minor fixes related to the newly added samples | 27 April 2016, 07:59:27 UTC |
9801a1e | cnugteren | 25 April 2016, 00:13:47 UTC | Added two much simpler examples to improve documentation | 25 April 2016, 00:13:47 UTC |
eac490c | cnugteren | 24 April 2016, 02:59:02 UTC | Updated the documentation | 24 April 2016, 02:59:02 UTC |
54df67a | cnugteren | 24 April 2016, 02:58:00 UTC | Updated headers to version 5.0 of the CLCudaAPI | 24 April 2016, 02:58:00 UTC |
8752c44 | cnugteren | 24 April 2016, 02:45:10 UTC | Updated Travis to reflect the latest Travis and Khronos changes | 24 April 2016, 02:45:10 UTC |
b306cf1 | Cedric Nugteren | 03 April 2016, 22:41:46 UTC | Merge pull request #36 from williamjshipman/development Only use OpenCL 2.x functions on OpenCL 2.x devices | 03 April 2016, 22:41:46 UTC |
bf1821b | williamjshipman | 03 April 2016, 01:20:33 UTC | - Add VersionNumber function for querying device OpenCL version number as an integer (e.g. 120 for OpenCL 1.2). - Clean up OpenCL 2.0 check in Queue constructor. | 03 April 2016, 01:20:33 UTC |
33ba3ef | William John Shipman | 02 April 2016, 19:37:23 UTC | Merge pull request #2 from CNugteren/development Development | 02 April 2016, 19:37:23 UTC |
da97040 | cnugteren | 31 March 2016, 04:11:58 UTC | Prepared the changelog for the next release | 31 March 2016, 04:11:58 UTC |
5802148 | cnugteren | 31 March 2016, 04:08:35 UTC | Updated to version 2.1.0 | 31 March 2016, 04:08:35 UTC |
0110efc | williamjshipman | 26 March 2016, 13:53:07 UTC | Merge branch 'development' of https://github.com/williamjshipman/CLTune into development | 26 March 2016, 13:53:07 UTC |
bccd8ac | williamjshipman | 26 March 2016, 13:21:37 UTC | Add runtime check for OpenCL 2 before using OpenCL 2 function. | 26 March 2016, 13:37:30 UTC |
4698d4a | williamjshipman | 26 March 2016, 13:21:37 UTC | Add runtime check for OpenCL 2 before using OpenCL 2 function. | 26 March 2016, 13:21:37 UTC |
0dc2a99 | Cedric Nugteren | 21 March 2016, 21:27:54 UTC | Updated the README | 21 March 2016, 21:27:54 UTC |
1ad3bb2 | Cedric Nugteren | 21 March 2016, 21:15:24 UTC | Merge branch 'development' of github.com:CNugteren/CLTune into development | 21 March 2016, 21:15:24 UTC |
0b90c0c | CNugteren | 21 March 2016, 19:57:35 UTC | Fixes for minor warnings under Visual Studio | 21 March 2016, 19:57:35 UTC |
1d3c159 | CNugteren | 21 March 2016, 19:56:35 UTC | Added dllexport to be able to build a DLL under Windows | 21 March 2016, 19:56:35 UTC |
b170354 | Cedric Nugteren | 31 January 2016, 17:38:09 UTC | Merge pull request #35 from williamjshipman/development Add command line parameter for platform index to conv and gemm samples in line with description in README. | 31 January 2016, 17:38:09 UTC |
59faefa | williamjshipman | 30 January 2016, 23:07:16 UTC | Updated the README to show that the platform ID is one of the command line parameters and updated the samples so that the order of the parameters matches all parts of the README. | 30 January 2016, 23:07:16 UTC |
b5a3a8b | williamjshipman | 30 January 2016, 22:48:10 UTC | Samples now support a platform parameter in their command lines, in addition to the device number. | 30 January 2016, 22:48:10 UTC |
dcddd80 | Cedric Nugteren | 23 January 2016, 15:08:14 UTC | Updated FindOpenCL for Intel Linux OpenCL paths | 23 January 2016, 15:08:14 UTC |
d643731 | Cedric Nugteren | 22 November 2015, 11:19:55 UTC | Prepared the changelog for the next release | 22 November 2015, 11:19:55 UTC |
8bc6684 | Cedric Nugteren | 22 November 2015, 11:16:46 UTC | Updated to version 2.0.0 | 22 November 2015, 11:16:46 UTC |
b22dce2 | Cedric Nugteren | 22 November 2015, 11:15:35 UTC | Updated the readme | 22 November 2015, 11:15:35 UTC |
a21d4a5 | Cedric Nugteren | 21 November 2015, 13:33:50 UTC | Merge pull request #32 from CNugteren/catch_tests Replaced GTest with Catch unit testing | 21 November 2015, 13:33:50 UTC |
400752b | Cedric Nugteren | 21 November 2015, 13:29:27 UTC | Updated changelog and readme | 21 November 2015, 13:29:27 UTC |
8757c9e | Cedric Nugteren | 21 November 2015, 13:27:45 UTC | Updated the 'KernelInfo' class to use Catch | 21 November 2015, 13:27:45 UTC |
b74dcef | Cedric Nugteren | 19 November 2015, 20:03:50 UTC | Updated the 'tuner' class tests to use Catch | 19 November 2015, 20:03:50 UTC |
3526cc8 | Cedric Nugteren | 15 November 2015, 15:20:41 UTC | Removed GTest, added Catch, added CLCudaAPI tests | 15 November 2015, 15:20:41 UTC |
e9984c3 | Cedric Nugteren | 15 November 2015, 15:08:07 UTC | Merge pull request #31 from CNugteren/msvc_support MSVC 2015 support | 15 November 2015, 15:08:07 UTC |
d3c961f | Cedric Nugteren | 15 November 2015, 15:06:23 UTC | Updated changelog | 15 November 2015, 15:06:23 UTC |
dbb096a | Cedric Nugteren | 14 November 2015, 14:58:21 UTC | Fixed a warning and error for MSVC | 14 November 2015, 14:58:21 UTC |
f701c1f | Cedric Nugteren | 14 November 2015, 14:41:08 UTC | Prepared for MSVC support | 14 November 2015, 14:41:08 UTC |
279d1eb | Cedric Nugteren | 10 November 2015, 19:33:30 UTC | Merge pull request #30 from CNugteren/clcudaapi Updated the CLCudaAPI header | 10 November 2015, 19:33:30 UTC |
c34d456 | CNugteren | 08 November 2015, 11:04:53 UTC | Fixes CL to CUDA translation header to make the simple example work | 08 November 2015, 11:04:53 UTC |
3456bf4 | CNugteren | 08 November 2015, 10:36:28 UTC | Fixed a header inclusion error | 08 November 2015, 10:36:28 UTC |
c526a84 | Cedric Nugteren | 08 November 2015, 10:29:20 UTC | Added experimental support for CUDA kernels | 08 November 2015, 10:29:20 UTC |
97cb535 | Cedric Nugteren | 07 November 2015, 16:36:21 UTC | Updated the changelog | 07 November 2015, 16:36:21 UTC |
a5c5150 | Cedric Nugteren | 07 November 2015, 16:33:41 UTC | Now using version 4.0 of the CLCudaAPI header | 07 November 2015, 16:33:41 UTC |
1d2701f | Cedric Nugteren | 07 November 2015, 16:32:47 UTC | Disabled additional warnings for Clang | 07 November 2015, 16:32:47 UTC |
5fbcb3a | Cedric Nugteren | 07 November 2015, 11:32:02 UTC | Merge pull request #29 from CNugteren/machine_learning Machine learning models | 07 November 2015, 11:32:02 UTC |
d9ea76d | Cedric Nugteren | 07 November 2015, 11:26:51 UTC | Prepared changelog for next version | 07 November 2015, 11:26:51 UTC |
d644df1 | CNugteren | 29 October 2015, 14:12:58 UTC | Added a 3-layer neural network model | 29 October 2015, 14:12:58 UTC |
dfc3651 | CNugteren | 29 October 2015, 09:58:15 UTC | Prepared for addition of a neural network model | 29 October 2015, 09:58:15 UTC |
32eb552 | Cedric Nugteren | 25 October 2015, 14:51:30 UTC | Merge pull request #28 from CNugteren/json_metadata Added additional device properties to JSON-output | 25 October 2015, 14:51:30 UTC |
0a8e371 | CNugteren | 25 October 2015, 14:25:46 UTC | Added additional device properties to JSON-output | 25 October 2015, 14:25:46 UTC |
406f3f7 | CNugteren | 22 September 2015, 08:00:23 UTC | Fixed warnings and zero-range bug | 22 September 2015, 08:00:23 UTC |
52b34eb | CNugteren | 21 September 2015, 15:15:17 UTC | Now using ML models to predict best configurations + using logarithmic data | 21 September 2015, 15:15:17 UTC |
e6e195d | Cedric Nugteren | 07 September 2015, 11:55:26 UTC | Merge pull request #27 from CNugteren/publication CLTune publication and capitalization | 07 September 2015, 11:55:26 UTC |
1e677ac | CNugteren | 07 September 2015, 11:50:51 UTC | Added a reference to the CLTune paper | 07 September 2015, 11:50:51 UTC |
4109e58 | CNugteren | 07 September 2015, 11:44:54 UTC | Updated name from cltune to CLTune for Travis | 07 September 2015, 11:44:54 UTC |
790ca0c | CNugteren | 31 August 2015, 15:09:14 UTC | Cosmetic updates to linear regression | 31 August 2015, 15:09:14 UTC |
02af290 | CNugteren | 31 August 2015, 14:24:20 UTC | Added verification based on the cost function | 31 August 2015, 14:24:20 UTC |
6ddc479 | CNugteren | 26 August 2015, 15:14:27 UTC | Added regularization support to linear regression | 26 August 2015, 15:14:27 UTC |
b17861f | CNugteren | 26 August 2015, 14:24:55 UTC | Completed the function to add extra polynomial features | 26 August 2015, 14:24:55 UTC |
7c10c8b | CNugteren | 26 August 2015, 13:37:19 UTC | Added mean value normalization | 26 August 2015, 13:37:19 UTC |
4d29b06 | CNugteren | 26 August 2015, 13:36:55 UTC | Minor optimizations to population of permutation configurations | 26 August 2015, 13:36:55 UTC |
137a93f | Cedric Nugteren | 26 August 2015, 12:17:22 UTC | Merge pull request #26 from CNugteren/travis_ci Added Travis continuous integration | 26 August 2015, 12:17:22 UTC |
52c0c4f | CNugteren | 26 August 2015, 12:11:36 UTC | Made Travis always build pushes to the master branch | 26 August 2015, 12:11:36 UTC |
bf48395 | CNugteren | 21 August 2015, 15:25:56 UTC | Added travis continuous integration | 21 August 2015, 15:25:56 UTC |
dec4610 | CNugteren | 21 August 2015, 15:20:48 UTC | Added initial version of a linear regression machine learning model | 21 August 2015, 15:20:48 UTC |
15782dd | Cedric Nugteren | 03 August 2015, 15:16:14 UTC | Merge pull request #25 from CNugteren/claduc_and_json Claduc C++11 interface & method to output JSON | 03 August 2015, 15:16:14 UTC |
43a0e9a | CNugteren | 03 August 2015, 15:15:04 UTC | Updated to version 1.7.0 | 03 August 2015, 15:15:04 UTC |
2028cfd | CNugteren | 03 August 2015, 15:09:40 UTC | Added a method to print all results in JSON to file | 03 August 2015, 15:09:40 UTC |
3d44a9d | CNugteren | 27 July 2015, 11:52:07 UTC | Now using the new Claduc C++11 OpenCL header | 27 July 2015, 11:52:07 UTC |
0cd4295 | CNugteren | 27 July 2015, 11:01:32 UTC | Added initial version of JSON output | 27 July 2015, 11:01:32 UTC |
039d9ea | Cedric Nugteren | 28 May 2015, 12:40:21 UTC | Merge pull request #24 from CNugteren/reduced_requirements Reduced requirements and warning fixes | 28 May 2015, 12:40:21 UTC |