https://github.com/paboyle/Grid

Revision Author Date Message Commit Date
a00ae98 Fence propagation from SYCL 29 March 2023, 19:00:40 UTC
3f2fd49 Merge branch 'develop' of https://github.com/paboyle/Grid into develop 28 March 2023, 00:29:54 UTC
0efa107 Script update 28 March 2023, 00:29:43 UTC
8feedb4 Include files moved 28 March 2023, 00:29:21 UTC
05e562e Move the copy synch out to stencil and do one per call instead of one per packet 28 March 2023, 00:28:38 UTC
dd3bbb8 MOve the synchronise out to the stencil so one call instead of one call per packet 28 March 2023, 00:27:45 UTC
2fbcf13 SYCL fix 27 March 2023, 21:25:14 UTC
4ea48ef Merge pull request #419 from lehner/feature/gpt Separate rankSum from sum 24 March 2023, 19:42:16 UTC
546be72 Merge pull request #421 from UniOfLeicester/feature/accel_Copy_plane Populate the Cshift_table in the GPU 24 March 2023, 16:04:06 UTC
481bbaf Interface to query memory use 23 March 2023, 16:55:31 UTC
2814886 WriteDiscard on construct 23 March 2023, 14:28:50 UTC
bae0f8e Merge pull request #425 from rrhodgson/feature/CacheLogging Huge Cache 21 March 2023, 12:59:08 UTC
bbbcd36 Merge pull request #426 from rrhodgson/feature/LCDeflation Batched Local Coherence Tools 21 March 2023, 12:58:40 UTC
39c0815 WriteDiscard 21 March 2023, 12:57:29 UTC
a3e935c Batched block project/promote size checks 27 February 2023, 11:38:16 UTC
7731c7d Add huge cache type and allow Ncache==0 26 February 2023, 14:15:28 UTC
ff97340 Expose cached bytes 26 February 2023, 12:22:45 UTC
920a514 Added batched Mixed precision CG 14 February 2023, 17:04:13 UTC
be528b6 Add batched block project/promote functions 14 February 2023, 14:37:10 UTC
796abfa Merge pull request #422 from fjosw/fix/NVCC_DIAG_PRAGMA_SUPPORT Disable diagnostic pragma warnings for CUDA 12+ 17 January 2023, 14:34:49 UTC
ad0270a fix: diagnostic pragma warnings fixed for CUDA 12+ 12 January 2023, 12:36:30 UTC
7d62f1d Populate the Cshift_table in the GPU. Cshift is allocated in Unified memory and used in the LambdaApply kernels but also populated from the host. This creates a lot of Unified HtoD and DtoH mem operations and has a negative effect in performance. With this commit we populate the Cshift table in the device with the populate_Cshift_table() kernel. 11 January 2023, 21:26:25 UTC
458c943 merged upstream 31 December 2022, 09:16:21 UTC
88015b0 Split sum in rankSum and GlobalSum 26 December 2022, 09:01:32 UTC
4ca1bf7 Added gauge invariance test 21 December 2022, 12:23:16 UTC
2ff868f CPU open doesn't need to free space 20 December 2022, 10:10:23 UTC
ede02b6 Memory manager debug Felix case 20 December 2022, 10:10:23 UTC
1822ced Bug fix 20 December 2022, 10:10:23 UTC
37ba327 More logging 20 December 2022, 10:10:23 UTC
99b3697 More loggin 20 December 2022, 10:10:23 UTC
43a45ec SSC_START 20 December 2022, 10:10:23 UTC
b00a414 A=A fix 20 December 2022, 10:10:23 UTC
3791bc5 Logging pulled in from dirichlet branch 20 December 2022, 10:10:23 UTC
d8c29f5 Updated FFT test for PETSc 18 December 2022, 17:05:00 UTC
281f810 Matt FFT test 18 December 2022, 01:35:33 UTC
07acfe8 Merge pull request #417 from rrhodgson/feature/fermtoprop Feature/fermtoprop 06 December 2022, 17:45:03 UTC
40234f5 FermToProp accelerator_for -> thread_for 06 December 2022, 17:34:51 UTC
d49694f PropToFerm fix 06 December 2022, 15:48:54 UTC
97a0986 FermToProp 30 November 2022, 20:36:35 UTC
e13930c Faster fermtoprop case 30 November 2022, 20:11:29 UTC
0655dab Open MP on host enabled 08 November 2022, 21:38:54 UTC
7f097bc Merge branch 'develop' of https://github.com/paboyle/Grid into develop 08 November 2022, 21:23:40 UTC
5c75aa5 Device mem 08 November 2022, 21:22:57 UTC
1873101 PVC 08 November 2022, 21:22:45 UTC
63fd1df Config on PVC 08 November 2022, 21:22:09 UTC
bd68861 SYCL sum 08 November 2022, 20:49:26 UTC
82e959f SYCL reduction 08 November 2022, 20:45:25 UTC
62e52de Merge pull request #414 from fjosw/feat/eCloverGPU Compact Exponential Cloverterm on GPU 01 November 2022, 13:15:44 UTC
184adee feat: renamed open_boundaries to fixedBoundaries 26 October 2022, 11:53:46 UTC
5fa6a8b docs: CompactClover debug info generalized. 26 October 2022, 11:41:14 UTC
a2a879b docs: CompactClover Debug Info improved. 25 October 2022, 16:20:42 UTC
9317d89 docs: details about inversion of CompactClover term added. 25 October 2022, 16:10:06 UTC
86075fd feat: MassTerm and ExponentiateClover merged into InstantiateClover 25 October 2022, 16:05:34 UTC
b36442e feat: CloverHelpers::InvertClover implemented which handles the inversion of the Clover term depending on clover type and the boundary conditions. 25 October 2022, 15:57:01 UTC
513d797 fix: signature of CompactWilsonCloverHelpers::Exponentiate fixed. 25 October 2022, 15:17:22 UTC
9e4835a feat: changed CompactWilsonExpClover exponentiation to Taylor expansion with Horner scheme. 25 October 2022, 14:19:43 UTC
477ebf2 Merge branch 'develop' of https://github.com/paboyle/Grid into develop 04 October 2022, 18:19:43 UTC
0d5639f Run script update 04 October 2022, 18:13:41 UTC
413312f Benchmark the halo construction. THe bye counts are out and should be doubled for SIMD directions 04 October 2022, 18:12:59 UTC
0350844 Remove verbose 04 October 2022, 18:12:15 UTC
e1e5c75 Stencil gather improvements - SVM was running slow and used for a pointer array that wasn't needed to be in SVM 04 October 2022, 18:11:10 UTC
9296299 Better commenting 04 October 2022, 18:10:34 UTC
913fbca Merge pull request #410 from gkanwar/photon_and_sha_patches Photon.h and SHA256 patches 31 August 2022, 22:01:45 UTC
60dfb49 Remove FP16 tests when FP16 is disabled 21 August 2022, 15:29:55 UTC
554c238 Update OpenSSL digest to use high-level methods. This avoids deprecation warnings when compiling against OpenSSL 3.0 but should still be backwards compatible. It is the recommended way to use the digest API going forward. 21 August 2022, 15:28:57 UTC
f922adf Fix Photon ComplexField type 21 August 2022, 14:16:18 UTC
188d2c7 PVC default, ignore ATS 02 August 2022, 15:38:53 UTC
17d7177 Files for SYCL 02 August 2022, 15:33:39 UTC
bb0a0da inon blocking caution due to SYCL 02 August 2022, 15:09:43 UTC
8411016 Fix the fence 02 August 2022, 15:00:43 UTC
d32b923 Fencing on a stream in SYCL is needed. Didn't know that ... gulp 02 August 2022, 14:58:04 UTC
2ab1af5 Ensure no synchronize and not optoin dependent 19 July 2022, 16:51:06 UTC
5f8892b Mistake pointed out by Camilo 19 July 2022, 16:31:51 UTC
f14e7e5 Grid accelerator 12 July 2022, 17:56:22 UTC
042ab1a Update GridStd.h 27 June 2022, 17:21:39 UTC
2df98a9 Merge pull request #406 from giordano/patch-1 Update default value of gen-simd-width in README 14 June 2022, 21:46:25 UTC
315ea18 Update default value of gen-simd-width in README 14 June 2022, 21:41:05 UTC
a9c2e1d Merge pull request #404 from rrhodgson/feature/json_nvcc Feature/json nvcc 25 May 2022, 17:30:11 UTC
da4daea Updated json to latest release 3.10.5 24 May 2022, 15:16:06 UTC
af3b065 Merge pull request #403 from fjosw/fix/cuda_11_5_warnings Fixed nvcc 11.5+ warnings 24 May 2022, 15:10:02 UTC
e346154 Updated json CUDA compile guards 24 May 2022, 14:48:01 UTC
7937ac2 fix: conditional pragmas according to new NVCC_DIAG_PRAGMA_SUPPORT standard in pugixml/pugixml.cc 24 May 2022, 14:31:03 UTC
e909aee fix: conditional pragmas according to new NVCC_DIAG_PRAGMA_SUPPORT standard in Grid_Eigen_Dense.h 24 May 2022, 14:29:42 UTC
bab8aa8 fix: conditional pragmas according to new NVCC_DIAG_PRAGMA_SUPPORT standard in DisableWarnings.h 24 May 2022, 14:27:40 UTC
38b22f0 Merge pull request #402 from fjosw/fix/clover_warnings fixed clover warnings 24 May 2022, 14:05:27 UTC
3ca0de1 Fix json write for vector<string> 24 May 2022, 13:37:33 UTC
c7205d2 Removed nvcc guards for json 24 May 2022, 13:30:26 UTC
617c536 fix: fixed warning: missing return statement at end of non-void function in CloverHelpers 24 May 2022, 10:37:33 UTC
083b58e Merge pull request #401 from JPRichings/LocalCoheranceDeflation Local coherance batch deflation 20 May 2022, 15:44:22 UTC
633427a Merge pull request #400 from JPRichings/wilson_sweep bench wilson sweep fix 20 May 2022, 15:43:40 UTC
2031d69 Merge branch 'paboyle:develop' into wilson_sweep 20 May 2022, 15:20:23 UTC
79e34b3 Local Coherence batch deflation 19 May 2022, 13:53:17 UTC
4f3d581 Merge branch 'paboyle:develop' into LocalCoheranceDeflation 19 May 2022, 13:46:17 UTC
d16427b Merge pull request #399 from fjosw/fix/Nc_neq_3 fix: assert for dimensions of compact Wilson clover moved to constructor 17 May 2022, 13:03:42 UTC
4b1997e wilson sweep test 16 May 2022, 14:58:33 UTC
8939d5d bugfix: eo operator called in correct location 15 May 2022, 23:28:28 UTC
b051e00 Additional Local Coherance Deflation operator() 15 May 2022, 23:25:13 UTC
8aa75b4 Merge branch 'develop' into fix/Nc_neq_3 10 May 2022, 13:22:03 UTC
0274f40 Merge pull request #389 from mbruno46/mbruno-eclover Feature/expClover 10 May 2022, 13:18:19 UTC
77aa147 Merge branch 'develop' into mbruno-eclover 10 May 2022, 13:16:53 UTC