https://github.com/paboyle/Grid

Revision Message Commit Date
da59379 Large reg file for double 26 March 2024, 17:03:20 UTC
3ef2a41 ifdef guard ommitted 26 March 2024, 14:50:32 UTC
aa96f42 Acclerator ware MPI guard on the Unix domain sockets 26 March 2024, 14:41:25 UTC
49e9e4e Fences 26 March 2024, 14:14:06 UTC
f7b8163 Deterministic MPI reduce options 26 March 2024, 14:11:40 UTC
93769ea Updated configure for bounce through host 26 March 2024, 14:10:24 UTC
59b0cc1 REduce the time in single 26 March 2024, 00:42:40 UTC
f32c275 Updated config options for MPI not being aware of GPU 26 March 2024, 00:42:00 UTC
5404fc6 Merge needs a fence on SYCL 26 March 2024, 00:38:41 UTC
1f53458 Options to bounce through a host buffer if --disable-accelerator-aware-mpi 26 March 2024, 00:37:19 UTC
434c3e7 We have a choice of GET or PUT across NVlink 25 March 2024, 14:32:44 UTC
500b119 Deterministic MPI 22 March 2024, 15:55:23 UTC
4b87259 New config command for sunspot 22 March 2024, 15:43:49 UTC
503dec3 This appears working now on Sunspot 22 March 2024, 15:43:30 UTC
d1e9fe5 Xor csum for repro testing 22 March 2024, 15:42:57 UTC
d01e5fa Improved FlightRecorder 22 March 2024, 15:42:32 UTC
a477c25 Sunspot repro tests 22 March 2024, 15:42:11 UTC
1bd20cd FlightRecorder 22 March 2024, 15:40:01 UTC
e49e95b Upgrade of the Britney test with flight recorder and fast xor checksum 22 March 2024, 15:39:27 UTC
6f59fed Flight recorder, resurrecting the "world famous" Britney test 22 March 2024, 15:32:32 UTC
60b7f6c Flight recorder, resurrecting the "world famous" Britney test 22 March 2024, 15:32:26 UTC
b92dfcc Flight recorder, resurrecting the "world famous" Britney test 22 March 2024, 15:30:27 UTC
f6fd6dd Flight recorder, resurrecting the "world famous" Britney test 22 March 2024, 15:30:01 UTC
79ad567 Merge branch 'develop' of https://github.com/paboyle/Grid into develop 19 March 2024, 15:43:42 UTC
fab1efb More britney logging improvements 19 March 2024, 14:36:21 UTC
660eb76 FFTW from OneAPI 19 March 2024, 14:28:33 UTC
62e7bf0 Updated flight logging for Britney test 12 March 2024, 20:10:04 UTC
95f3d69 Extra hardware test hook 12 March 2024, 20:09:37 UTC
89c0519 Repro test 12 March 2024, 16:11:33 UTC
2704b82 Merge branch 'develop' of https://github.com/paboyle/Grid into develop 12 March 2024, 15:16:24 UTC
cf8632b Britney test option 12 March 2024, 15:15:35 UTC
d224297 PBS scripts 12 March 2024, 15:15:16 UTC
a4d11a6 Merge pull request #458 from paboyle/fix/HOST_NAME_MAX fallback to _POSIX_HOST_NAME_MAX if HOST_NAME_MAX is not defined 07 March 2024, 12:50:25 UTC
2b4399f more HOST_NAME_MAX fix 07 March 2024, 06:26:01 UTC
f17b8de fallback to _POSIX_HOST_NAME_MAX if HOST_NAME_MAX is not defined 07 March 2024, 06:22:08 UTC
7e5bd46 Booster update 06 March 2024, 18:03:45 UTC
228bbb9 Benchmark results 06 March 2024, 18:03:35 UTC
b812a7b Staggered launch script 06 March 2024, 01:32:40 UTC
891a366 Repro CG script 06 March 2024, 01:22:55 UTC
10116b3 Force device copyable and tell SYCL to shut it. 06 March 2024, 01:13:27 UTC
a46a0f0 force device copyable and don't take crap from SYCL 06 March 2024, 01:12:49 UTC
a26a8a3 Merge branch 'develop' of https://github.com/paboyle/Grid into develop 06 March 2024, 00:05:00 UTC
7435315 More blasted shell variables 06 March 2024, 00:03:59 UTC
9b5f741 Reproducing CG can be more useful now 06 March 2024, 00:03:16 UTC
517822f SPR HBM benchmarking right and also PVC batched GEMM 06 March 2024, 00:02:27 UTC
1b93a9b Print out the hostname 06 March 2024, 00:01:58 UTC
783a66b Deterministic reduction please 06 March 2024, 00:01:37 UTC
976c3e9 Hack for flight logging CG inner products. Can be made to work, but could put in some more serious infrastructure for repro testing and blame attribution (Britney test) if necessary 05 March 2024, 23:59:57 UTC
f8ca971 Use of a bare PRECISION macro is not namespace safe and collides with SYCL 05 March 2024, 23:59:13 UTC
21bc8c2 OneMKL batched blas starting 05 March 2024, 23:58:20 UTC
3022821 SYCL conflict with Eigen 05 March 2024, 23:56:10 UTC
2ae980a Update sourceme.sh 05 March 2024, 18:39:18 UTC
6153dec Update setup.sh 05 March 2024, 18:38:32 UTC
c805f86 USQCD benchmark 01 March 2024, 05:05:04 UTC
04ca065 Only one rank opens 01 March 2024, 01:09:11 UTC
88d8fa4 Benchmark development 01 March 2024, 01:01:44 UTC
3c49762 Propagate in the blas routine 29 February 2024, 20:33:06 UTC
436bf1d Merge pull request #455 from clarkedavida/hisq_fat_links Hisq fat links 29 February 2024, 20:29:39 UTC
f70df6e changed NO_SHIFT and BACKWARD_CONST from define to enum 29 February 2024, 19:29:30 UTC
fce3852 Merge pull request #451 from paboyle/feature/eigen-3.4.0-update updating Eigen to 3.4.0 28 February 2024, 23:03:37 UTC
ee1b8bb Merge pull request #454 from edbennett/adjoint-broke fix HMC for non-fundamental representations 28 February 2024, 19:05:27 UTC
3f16366 Merge pull request #453 from dbollweg/feature/sliceSum_gpu Feature/slice sum gpu 28 February 2024, 19:04:43 UTC
2e570f5 Merge pull request #457 from lehner/feature/gpt Import GPT-related updates 28 February 2024, 18:59:04 UTC
9f89486 remove unnecessary code path 28 February 2024, 18:56:23 UTC
22b43b8 Make GPT test suite work with SYCL 28 February 2024, 11:57:17 UTC
3c90126 CUDA cub refuses to reduce vSpinColourMatrix, breaking up into smaller parts like already done for HIP case. 27 February 2024, 17:41:45 UTC
b507fe2 Added SpinColourMatrix case to sliceSum Test 27 February 2024, 16:28:32 UTC
6cd2d8f Replace cuda/hip memcpy with Grid functions 26 February 2024, 14:55:07 UTC
b02d022 fixed race condition (thx michael) 24 February 2024, 00:14:28 UTC
94581e3 accelerator_for is broken 23 February 2024, 22:58:33 UTC
88b52cc Merge branch 'develop' into hisq_fat_links 23 February 2024, 21:47:15 UTC
0a816b5 Merge branch 'feature/sliceSum_gpu' of https://github.com/dbollweg/Grid into feature/sliceSum_gpu 23 February 2024, 02:43:06 UTC
1c8b807 free malloc'd memory 23 February 2024, 02:42:44 UTC
66391f8 Merge branch 'feature/gpt' of ../Grid into develop 21 February 2024, 18:05:00 UTC
97f7a9e fix HMC for non-fundamental representations 21 February 2024, 08:27:55 UTC
15878f7 sliceSumReduction_cub_large now also faster than CPU on Frontier 16 February 2024, 18:55:21 UTC
e0d5e3c Merge branch 'paboyle:develop' into feature/sliceSum_gpu 16 February 2024, 18:16:37 UTC
6f34559 Adding sliceSumReduction_cub_small/large since hipcub cannot deal with arb. large vobjs 16 February 2024, 18:15:02 UTC
56827d6 accelerator_inline bug 14 February 2024, 20:56:57 UTC
73c0b29 Merge branch 'develop' of https://github.com/paboyle/Grid into develop 13 February 2024, 20:19:32 UTC
303b83c Scaling benchmarks, verbosity and MPICH aware in acceleratorInit() For some reason Dirichlet benchmark fails on several nodes; need to debug this. 13 February 2024, 19:48:03 UTC
5ef4da3 Silence verbose 13 February 2024, 19:47:36 UTC
1502860 Benchmark scripts 13 February 2024, 19:47:02 UTC
585efc6 More benchmark scripts 13 February 2024, 19:40:49 UTC
62055e0 missing semicolon generates error with some compilers 13 February 2024, 17:18:27 UTC
e4a641b removing old Eigen tensor patch 13 February 2024, 09:37:14 UTC
8849f18 updating Eigen to 3.4.0 13 February 2024, 09:30:22 UTC
db42052 fix Simd::Nsimd typo 12 February 2024, 22:03:53 UTC
b5659d1 more test cases 09 February 2024, 18:37:14 UTC
4b43307 Undo include path changes for level zero api header 09 February 2024, 18:07:56 UTC
09af8c2 Merge branch 'paboyle:develop' into feature/sliceSum_gpu 09 February 2024, 18:02:59 UTC
9514035 refactor slicesum: slicesum uses GPU version by default now 09 February 2024, 18:02:28 UTC
2da09ae acceleration compiles and doesn't break scalar mode 07 February 2024, 01:40:13 UTC
a38fb0e first effort toward accelerators 07 February 2024, 01:24:55 UTC
7019916 RNG seed change safer for large volumes; this is a long term solution 07 February 2024, 00:56:39 UTC
1514b4f slicesum_sycl passes test 07 February 2024, 00:08:44 UTC
91cf5ee Updated bench script 06 February 2024, 23:45:10 UTC
0a6e2f4 small amount of cleanup 06 February 2024, 23:32:07 UTC
ab2de13 work towards sliceSum for sycl backend 06 February 2024, 18:24:45 UTC
5bfa88b Aurora MPI standalone benchmake and options that work well 06 February 2024, 16:28:40 UTC