Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
260 commits
Select commit Hold shift + click to select a range
0bbfa77
Merge pull request #473 from LLNL/task/rhornung67/script-updates
rhornung67 Aug 16, 2024
b927cec
Merge branch 'develop' into bugfix/rhornung67/windows-build
rhornung67 Aug 16, 2024
a9f3243
Fix include statement
rhornung67 Aug 16, 2024
266c9d4
Merge branch 'develop' into bugfix/rhornung67/windows-build
rhornung67 Aug 16, 2024
bf56743
Merge branch 'bugfix/rhornung67/windows-build' of github.com:LLNL/RAJ…
rhornung67 Aug 16, 2024
ccab003
Merge pull request #475 from LLNL/bugfix/rhornung67/windows-build
rhornung67 Aug 16, 2024
b655bd2
Add new issue templates, monor change to PR template
rhornung67 Aug 16, 2024
66a6599
Minor reorg of top-level CMake file, add option for short test run
rhornung67 Aug 16, 2024
0cb653c
Add GH Actions testing, split up tests between azure and GH Actions.
rhornung67 Aug 16, 2024
11b5d5f
Make build type explicit for CI checks
rhornung67 Aug 16, 2024
735a81f
Merge pull request #476 from LLNL/task/rhornung67/azureupdate-addGHac…
rhornung67 Aug 16, 2024
fad8028
Update GitLab CI content to match Adrien's RAJA PR
rhornung67 Aug 30, 2024
1c96029
Update files used in GitLab CI
rhornung67 Aug 30, 2024
1e1535c
Update RAJA
adrienbernede Sep 4, 2024
262dcbb
Use GitLab container registry as a Spack build cache (#453)
adrienbernede Sep 5, 2024
5676a48
Update lassen jobs w.r.t. changes in radiuss-spack-configs
adrienbernede Sep 18, 2024
5896fc5
Merge branch 'develop' into task/rhornung67/update-ci-toolchains
adrienbernede Sep 18, 2024
ea7a9de
Update and standardize RADIUSS packages
adrienbernede Sep 18, 2024
5d0fcd1
build_and_test.sh: allow to control whether or not to push to registry
adrienbernede Sep 19, 2024
77cf546
From RAJA: Update RSC with fixes
adrienbernede Sep 19, 2024
3ee1f8f
Fix package name
adrienbernede Sep 19, 2024
99dd23d
Update RAJA
adrienbernede Sep 25, 2024
6a82d52
Fix Spack branch name
adrienbernede Sep 25, 2024
c06f8c8
From RAJA: From RSC: Add missing sycl variant to RAJAPerf and fix c++…
adrienbernede Sep 25, 2024
f7653e1
Fix tests variant value in sycl spec
adrienbernede Sep 26, 2024
a27dcfe
Turn off openmp support in sycl job on corona (same as RAJA CI)
adrienbernede Sep 26, 2024
e3267f7
turn off benchmarks
adrienbernede Sep 26, 2024
aae34ed
Allow failure of sycl job on corona for now
adrienbernede Sep 26, 2024
bb2e498
Fix cmake version in build script; what was there does not exist
rhornung67 Sep 27, 2024
6f33971
Fix memory issue in SYCL variants of FIR kernel -- fixes checksum
rhornung67 Sep 27, 2024
186d419
Merge pull request #481 from LLNL/bugfix/rhornung67/FIR-memory-issue
rhornung67 Sep 27, 2024
9caaef0
Merge branch 'develop' into woptim/spack-update
adrienbernede Sep 30, 2024
98e70e4
From RAJA: From RSC: RAJAPerf: Fix CMake variable for sycl support
adrienbernede Oct 1, 2024
83e4124
Update Spack ref (waiting for next snapshot tag)
adrienbernede Oct 1, 2024
8f50606
Fix ompt variants of HISTOGRAM and MULTI_REDUCE
MrBurmark Oct 3, 2024
85d0c5a
Only use new reductions in OMP Target variants
rhornung67 Oct 3, 2024
45e7d4b
Fix erroneous deletions.
rhornung67 Oct 3, 2024
9a9bef8
Merge branch 'bugfix/burmark1/multireduce_ompt' into task/rhornung67/…
rhornung67 Oct 3, 2024
2b02fe0
Squash unused arg warnings
rhornung67 Oct 3, 2024
b379712
Update RSC to main
adrienbernede Oct 4, 2024
460a946
From RAJA: Update RAJA and Update RSC to main
adrienbernede Oct 4, 2024
1d164e4
Change variant string name to be consistent with enum name
rhornung67 Oct 4, 2024
70f8c16
Remove extraneous lambda and make string message consistent with othe…
rhornung67 Oct 4, 2024
537db6f
Make checksum scale factor a class member.
rhornung67 Oct 4, 2024
c87862c
Remove unused member function declarations
rhornung67 Oct 4, 2024
d124c33
Update Spack to appropriate snapshot branch
adrienbernede Oct 8, 2024
dba431f
Merge pull request #483 from LLNL/bugfix/burmark1/multireduce_ompt
rhornung67 Oct 8, 2024
b6bbf64
Merge branch 'develop' into task/rhornung67/rm-ompt-reduction-tunings
rhornung67 Oct 8, 2024
9af20b3
Merge pull request #484 from LLNL/task/rhornung67/rm-ompt-reduction-t…
rhornung67 Oct 10, 2024
173bb92
Merge branch 'develop' into woptim/spack-update
rhornung67 Oct 11, 2024
dcce169
Update to RAJA develop and convert Seq and OMP reductions to new inte…
rhornung67 Oct 11, 2024
7505865
Update RAJA to merge of similar PR
adrienbernede Oct 14, 2024
7696a9f
Convert CUDA variants to new reduction interface
rhornung67 Oct 15, 2024
bd434a5
Convert OpenMP target variants of reduction kernels to new interface.
rhornung67 Oct 15, 2024
9dfca7d
Convert Sycl kernels with reductions to new val-loc interface
rhornung67 Oct 15, 2024
e43bd0a
Pull in latest changes in RAJA develop
rhornung67 Oct 16, 2024
c045ae8
Update HIP kernels to val-op interface
rhornung67 Oct 16, 2024
353e552
Update CI to essentially match RAJA
rhornung67 Oct 16, 2024
161da8f
Revert "Update CI to essentially match RAJA"
rhornung67 Oct 16, 2024
86784f1
Update to raja@develop
adrienbernede Oct 16, 2024
a4e2ef1
Merge branch 'task/rhornung67/new-reductions' into woptim/spack-update
adrienbernede Oct 16, 2024
8b6f167
Allow failure for intel 2023 and remove superfluous job
adrienbernede Oct 16, 2024
ad7fd42
Update RAJA with "fix" for intel 2023
adrienbernede Oct 21, 2024
8c6849f
use -01 and fp-precise with intel 2023
adrienbernede Oct 21, 2024
f1b7375
From RAJA: Point at new RSC main commit
adrienbernede Oct 21, 2024
abfa26e
Do not allow failure with intel 2023, update RAJA
adrienbernede Oct 21, 2024
46d07d9
Update RAJA to new develop commit
adrienbernede Oct 22, 2024
91a0ac9
Add resource arg to Seq and OpenMP RAJA variants
rhornung67 Oct 22, 2024
67462ee
Update to RAJA develop
rhornung67 Oct 23, 2024
f478a84
Merge branch 'woptim/spack-update' of github.com:LLNL/RAJAPerf into w…
rhornung67 Oct 23, 2024
979a806
Add resource arg to Seq and OpenMP variants
rhornung67 Oct 23, 2024
c6abab1
Try to get more info about corona SYCL job
rhornung67 Oct 23, 2024
34da9d1
Add resource to RAJA Seq and OMP variants
rhornung67 Oct 23, 2024
4bb3f45
Fix test variant syntax
adrienbernede Oct 24, 2024
ab7d71b
Pull in RAJA branch with radiuss-spack-configs branch that should fix…
rhornung67 Oct 24, 2024
bb596d5
SYCL CI fix attempt
rhornung67 Oct 24, 2024
28e585a
Apply changes to second occurence of ENABLE_SYCL
adrienbernede Oct 24, 2024
29d7746
Add resource for Seq and OpenMP variants
rhornung67 Oct 24, 2024
af481c0
Add resource for Seq and OpenMP RAJA variants
rhornung67 Oct 24, 2024
7e0fa93
Add resource for Seq and OpenMP RAJA variants
rhornung67 Oct 24, 2024
2fb6bf1
Add resource to Seq and OpenMP RAJA variants
rhornung67 Oct 24, 2024
53c5468
Add resource to Seq and OpenMP RAJA::launch kernels
rhornung67 Oct 24, 2024
afddcb8
Pull in latest RAJA and don't allow jobs to fail
rhornung67 Oct 25, 2024
e1c42e6
From RAJA: From RSC: Set both ENABLE_SYCL and RAJA_ENABLE_SYCL
adrienbernede Oct 25, 2024
017c214
Add resource arg to remaining RAJA Seq and OpenMP variants
rhornung67 Oct 25, 2024
01bf5b2
Don't unload rocm module on corona
rhornung67 Oct 28, 2024
58ff1da
Pull in new radiuss-spack-configs via RAJA
rhornung67 Oct 29, 2024
314f510
Update some specs and remove allow failure
rhornung67 Oct 29, 2024
097718c
Add missing CUDA and HIP resources
rhornung67 Oct 29, 2024
cb55a10
Add resource to omp target variants
rhornung67 Oct 30, 2024
4d72e98
add resource to omp target variants
rhornung67 Oct 30, 2024
82d6de7
add resource to omp target variants
rhornung67 Oct 30, 2024
9203121
add resource to omp target variants + minor code formatting
rhornung67 Oct 30, 2024
3e97895
Add resource in omp target variants
rhornung67 Oct 30, 2024
4625664
Add resource to omp target kernel variants
rhornung67 Oct 30, 2024
81d9f43
Fix typo
rhornung67 Oct 31, 2024
89f1618
Type and code consistency
rhornung67 Oct 31, 2024
ebe577b
Merge pull request #490 from LLNL/task/rhornung67/add-resource
rhornung67 Oct 31, 2024
0c61dd7
Merge branch 'develop' into woptim/spack-update
rhornung67 Oct 31, 2024
b54d8f7
Update to latest RAJA develop
rhornung67 Oct 31, 2024
3fcfd83
Update GH Actions to use ROCm 6
rhornung67 Oct 31, 2024
9dd203f
Update test to pass with OpenMP Target back-end
rhornung67 Oct 31, 2024
964e21b
[WIP] Update and standardize implementation of packages, in sync with…
adrienbernede Nov 4, 2024
8baeab9
Merge branch 'develop' into task/rhornung67/new-ci-checks
rhornung67 Nov 4, 2024
3602695
Mods to make test run successfully with omp target enabled
rhornung67 Nov 4, 2024
e4b6b88
Add CI jobs, remove irrelevant jobs
rhornung67 Nov 4, 2024
f6f50a7
Add comment about reduce test tolerance
rhornung67 Nov 4, 2024
abb0779
Merge pull request #492 from LLNL/task/rhornung67/new-ci-checks
rhornung67 Nov 4, 2024
d69a856
Update copyright year in all files to 2025
rhornung67 Jan 10, 2025
83aed13
Merge pull request #498 from LLNL/task/rhornung67/copyright-2025
rhornung67 Jan 13, 2025
0c564e3
Add some caliper scripts
MrBurmark Jan 17, 2025
7a1173b
Add symlink to RAJA host-configs
MrBurmark Jan 17, 2025
a9310e6
Prevent usage of CALI_CONFIG
Feb 11, 2025
d0986de
Add P3HPC paper
pearce8 Feb 12, 2025
0b16472
Undo throw error. Parse CALI_CONFIG into config manager channels.
Feb 14, 2025
1f07a3a
Unset CALI_CONFIG to disable automatic config
Feb 19, 2025
0d99200
Improve wording of output message when CALI_CONFIG is provided
Feb 19, 2025
8cd4abf
Initial FEM based linear sweep port.
rchen20 Feb 19, 2025
cae14f3
Merge pull request #502 from LLNL/pearce8-patch-1
MrBurmark Feb 21, 2025
53078ed
OpenMP backend.
rchen20 Feb 23, 2025
3175a04
Merge branch 'develop' into fix/cali_config
michaelmckinsey1 Mar 6, 2025
0cea63b
Various updates in RADIUSS Spack Configs (#496)
adrienbernede Mar 17, 2025
770ca4b
Update debug print with better fp accuracy.
rchen20 Mar 19, 2025
58173a5
Ordering of builds.
rchen20 Mar 19, 2025
f758024
Removing mathematical effect of F array temporarily.
rchen20 Mar 19, 2025
0d8a1f4
Some cleanup.
rchen20 Mar 19, 2025
4d0bad8
Proper HIP block size.
rchen20 Mar 19, 2025
8d3790b
OpenMP bakcend implementation.
rchen20 Mar 19, 2025
cb1bfee
Fixes for modifying F array.
rchen20 Mar 20, 2025
23c1cca
Clean up some unused args.
rchen20 Mar 20, 2025
2999660
Remove extraneous copy functionality.
rchen20 Mar 20, 2025
5747a70
Clarifying comments on RHS perturbation.
rchen20 Mar 21, 2025
b247f9d
Corrected FLOP count.
rchen20 Mar 24, 2025
820876f
More sweep cleanup.
rchen20 Mar 24, 2025
a798ad4
Add new Complexity.
rchen20 Mar 24, 2025
277085e
Remove some input vars and keep as constexpr.
rchen20 Mar 24, 2025
c338cb9
Various formatting changes.
rchen20 Mar 24, 2025
eca8673
Merge branch 'develop' into task/chen59/femsweep
rchen20 Mar 24, 2025
3f625df
Further detail explaining F and Ffactor.
rchen20 Mar 25, 2025
25ec26b
Add empty kernel
MrBurmark Mar 27, 2025
1692d5c
fix ompt build and add sycl method decls
MrBurmark Mar 27, 2025
1b211cc
Improve asm guards for msvc
MrBurmark Mar 27, 2025
581e8ac
Use a volatile local with windows builds
MrBurmark Mar 27, 2025
485f596
Add occupancy grid stride tunings for cuda and hip
MrBurmark Mar 28, 2025
c491677
Remove user defined mesh parameters.
rchen20 Mar 28, 2025
bf93055
Remove complexity.
rchen20 Mar 28, 2025
ea3b408
Encapsulate mesh data in class.
rchen20 Mar 28, 2025
15c38c6
Combine host alloc and copy function.
rchen20 Mar 28, 2025
f75187c
Use templated block size.
rchen20 Mar 28, 2025
31cd85c
Various cleanup and consolidation.
rchen20 Mar 28, 2025
8c90da4
Remove Launch parameters for sequential.
rchen20 Mar 28, 2025
334648e
enable optimizations on a few caliper builds
berg-michael Mar 28, 2025
9add864
Checksum temporary hack for AMD CPU inaccuracy.
rchen20 Apr 1, 2025
5efbda8
Separate variable definitions.
rchen20 Apr 3, 2025
7797092
Scale HIP checksum rather than truncate.
rchen20 Apr 3, 2025
88310fd
Use fixed block size GPU policies.
rchen20 Apr 3, 2025
87ad272
Calculate faces with actual mesh dimensions.
rchen20 Apr 3, 2025
76add38
Scale checksums for all platforms.
rchen20 Apr 3, 2025
afcfd7e
Add empty OMPTarget and Sycl variants.
rchen20 Apr 3, 2025
323fb1b
Merge pull request #504 from LLNL/feature/burmark1/noop_kernel
MrBurmark Apr 4, 2025
69120cd
Correct SYCL variant.
rchen20 Apr 4, 2025
527bd07
Raising lassen CI time.
rchen20 Apr 4, 2025
2e60b5a
Merge branch 'develop' into pr-from-fork/505
artv3 Apr 4, 2025
04352ea
Correct omptarget function name.
rchen20 Apr 4, 2025
cd3a873
Merge branch 'develop' into task/chen59/femsweep
rchen20 Apr 4, 2025
0a00f96
Trigger CI.
rchen20 Apr 7, 2025
6923f61
Merge pull request #506 from LLNL/pr-from-fork/505
artv3 Apr 7, 2025
ba1b1f4
Add mesh dimension variables for clarity.
rchen20 Apr 7, 2025
cbc482a
Clarify unknown backend.
rchen20 Apr 7, 2025
5d10214
Increase CI time limits for FEM sweep.
rchen20 Apr 7, 2025
ebb28e9
Merge branch 'develop' into task/chen59/femsweep
rchen20 Apr 7, 2025
8b1bae1
Increase tioga time by 2 more minutes.
rchen20 Apr 7, 2025
68833b1
Add base GPU variants.
rchen20 Apr 7, 2025
7ee6e5b
Merge pull request #503 from LLNL/task/chen59/femsweep
rchen20 Apr 8, 2025
8d22fc1
Update to BLT v0.7.0.
rchen20 Apr 8, 2025
10613ee
Update to RAJA v2025.03.0.
rchen20 Apr 8, 2025
7cab9ea
Update copyright year in FEM sweep.
rchen20 Apr 8, 2025
d6e446a
Merge pull request #508 from LLNL/task/chen59/updateBLTRAJA2025
rchen20 Apr 9, 2025
4e3c744
Update RAJA to v2025.03.2 patch.
rchen20 May 16, 2025
9ad01ad
Merge pull request #512 from LLNL/task/chen59/updateTPLs2025
rchen20 May 16, 2025
1dca1df
Merge branch 'develop' into fix/cali_config
rhornung67 May 19, 2025
43f7f50
Allow custom naming of out cali file & do not add outdir to config (R…
michaelmckinsey1 May 19, 2025
1ab83f5
Throw error if trying to name multiple cali files with same name
michaelmckinsey1 May 19, 2025
494bb55
Merge branch 'develop' into feature/burmark1/scripts
MrBurmark May 20, 2025
4feb1ef
Make hip and cuda memcpy always synchronous
MrBurmark May 22, 2025
80e59f6
Change scopedMoveData to allocDataForSeqInit
MrBurmark May 22, 2025
3082542
Explicitly perform copies in HALO kernels
MrBurmark May 22, 2025
d423aad
Rename to ForInit
MrBurmark May 22, 2025
2053343
Update hip asan scripts
MrBurmark May 22, 2025
2a81e86
Merge branch 'develop' into refactor/burmark1/copies
MrBurmark May 22, 2025
ee07872
Merge branch 'develop' into feature/burmark1/hip_asan_scripts
MrBurmark May 22, 2025
a722b59
Merge pull request #513 from LLNL/feature/burmark1/hip_asan_scripts
MrBurmark May 22, 2025
fdb8fdb
Use host data in ATOMIC teardown
MrBurmark May 22, 2025
e9f3917
Add OpenMP target host id to KernelBase
MrBurmark May 22, 2025
81ff326
Merge branch 'develop' into refactor/burmark1/copies
MrBurmark May 22, 2025
5ee0432
Fix INT_PREDICT::updateChecksum
MrBurmark May 23, 2025
54deda6
Throw error to avoid silent fail and undefined behavior
michaelmckinsey1 May 28, 2025
4db7563
Unset env var before setting new env var
michaelmckinsey1 May 28, 2025
a13d005
Merge branch 'develop' into fix/cali_config
MrBurmark May 28, 2025
80bcbd1
Add launch tunings of LTIMES
MrBurmark Jun 6, 2025
d364750
Fix tunings with gpu variants
MrBurmark Jun 6, 2025
8bc9ff5
Add LTIMES_NOVIEW launch variants
MrBurmark Jun 6, 2025
a4128ec
Use policies implemented in sycl launch
MrBurmark Jun 6, 2025
47ca6ce
Fix policy mapping with sycl
MrBurmark Jun 6, 2025
c1ed07c
Swap order in sycl launch teams and threads
MrBurmark Jun 6, 2025
21c7a20
Use views and strongly typed indices throughout LTIMES
MrBurmark Jun 6, 2025
6054ecd
Merge pull request #515 from LLNL/feature/burmark1/launch_LTIMES
MrBurmark Jun 9, 2025
c0a8b0d
Merge remote-tracking branch 'origin' into feature/burmark1/LTIMES_us…
MrBurmark Jun 9, 2025
245664f
Fixup type in LTIMES::runCudaVariantImpl
MrBurmark Jun 10, 2025
7b018d4
Merge branch 'develop' into fix/cali_config
MrBurmark Jun 10, 2025
8e2e4f2
Remove is_device_ptr in LTIMES::runOpenMPTargetVariant
MrBurmark Jun 10, 2025
a0a2dfa
Merge branch 'develop' into refactor/burmark1/copies
rhornung67 Jun 10, 2025
dd5aaf8
Apply suggestions from code review
MrBurmark Jun 10, 2025
893a9c8
Merge pull request #501 from LLNL/fix/cali_config
MrBurmark Jun 10, 2025
cdc0058
Merge branch 'develop' into refactor/burmark1/copies
MrBurmark Jun 10, 2025
10b285f
Merge pull request #514 from LLNL/refactor/burmark1/copies
MrBurmark Jun 10, 2025
164cc70
Merge branch 'develop' into feature/burmark1/scripts
MrBurmark Jun 10, 2025
0a2939b
Merge pull request #499 from LLNL/feature/burmark1/scripts
MrBurmark Jun 10, 2025
1269b17
Merge branch 'develop' into feature/burmark1/LTIMES_use_views
MrBurmark Jun 10, 2025
3d1f273
Use hip wavesize from RAJA
MrBurmark Jun 12, 2025
4dc18b5
Merge pull request #519 from LLNL/feature/burmark1/raja_wavesize
MrBurmark Jun 12, 2025
237e386
Merge remote-tracking branch 'origin' into feature/burmark1/LTIMES_us…
MrBurmark Jun 12, 2025
49ce58e
Explicitly use firstprivate in LTIMES::runOpenMPTargetVariant
MrBurmark Jun 12, 2025
41005d9
Add option to enable custom scan, default to off
MrBurmark Jun 13, 2025
3ff3af2
Fix hang when warpSize is 32 in hip.
MrBurmark Jun 14, 2025
38f7620
Refactor cuda implemenation to match
MrBurmark Jun 14, 2025
a3980bd
Reenable custom scan by default
MrBurmark Jun 14, 2025
628fcda
Add ubuntu amdclang build script
MrBurmark Jun 14, 2025
2ef195e
Fix indentation
MrBurmark Jun 16, 2025
1f07bf8
Document enable_custom_scan
MrBurmark Jun 16, 2025
99e191a
Fix cuda grid scan
MrBurmark Jun 16, 2025
321150c
Merge pull request #517 from LLNL/feature/burmark1/LTIMES_use_views
MrBurmark Jun 17, 2025
5f0a77b
Merge branch 'develop' into feature/burmark1/custom_scan_flag
MrBurmark Jun 17, 2025
3667947
Add checksum to -sp
MrBurmark Jun 17, 2025
0916ad5
Move allocation of mpi buffers into base class
MrBurmark Jun 17, 2025
529f459
Fix cuda grid scan
MrBurmark Jun 17, 2025
30027db
Merge branch 'feature/burmark1/custom_scan_flag' of github.com:LLNL/R…
MrBurmark Jun 17, 2025
478dc5e
changing from array of auto to view_type, resolved issue with compila…
Yejashi Jun 17, 2025
634e6fc
Removing unnecessary runKokkosVariant invocations, causes compilation…
Yejashi Jun 17, 2025
7df76dc
pinning kokkos to tag:3.7.02, necessary for proper usage of Kokkos::M…
Yejashi Jun 17, 2025
07ded07
Merge branch 'develop' into feature/burmark1/comm_align
rhornung67 Jun 17, 2025
f0bfb41
Fix unused var warnings
MrBurmark Jun 17, 2025
cc15453
Fix cuda grid scan
MrBurmark Jun 17, 2025
94df5ef
Merge pull request #521 from LLNL/feature/burmark1/custom_scan_flag
MrBurmark Jun 18, 2025
d9ba29b
Merge branch 'develop' into feature/burmark1/comm_align
MrBurmark Jun 18, 2025
146ce91
Merge branch 'develop' into pr-from-fork/523
rhornung67 Jun 18, 2025
61f501f
Merge pull request #522 from LLNL/feature/burmark1/comm_align
MrBurmark Jun 18, 2025
264af8e
Merge branch 'develop' into pr-from-fork/523
MrBurmark Jun 18, 2025
3334142
Merge pull request #524 from LLNL/pr-from-fork/523
rhornung67 Jun 18, 2025
251712a
Update version number
rhornung67 Jun 18, 2025
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
The table of contents is too big for display.
Diff view
Diff view
  •  
  •  
  •  
25 changes: 25 additions & 0 deletions .github/ISSUE_TEMPLATE/bug_report.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,25 @@
---
name: Bug report
about: Create a report to help us improve

---

**Describe the bug**

A clear and concise description of what the bug is:

**To Reproduce**

Steps to reproduce the behavior:

**Expected behavior**

A clear and concise description of what you expected to happen:

**Compilers & Libraries (please complete the following information):**
- Compiler & version: [e.g. GCC 4.9.3]:
- CUDA version (if applicable):

**Additional context**

Add any other context about the problem here.
22 changes: 22 additions & 0 deletions .github/ISSUE_TEMPLATE/feature_request.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,22 @@
---
name: Feature request
about: Suggest an idea for this project

---

**Is your feature request related to a problem? Please describe.**

A clear and concise description of what the problem is.

**Describe the solution you'd like**

A clear and concise description of what you want to happen.

**Describe alternatives you've considered**

A clear and concise description of any alternative solutions or features you've considered.

**Additional context**

Add any other context or information about the feature request here.

2 changes: 1 addition & 1 deletion .github/PULL_REQUEST_TEMPLATE.md
Original file line number Diff line number Diff line change
@@ -1,4 +1,4 @@
# Summary (Write a short headline summary of PR)
# Summary

- This PR is a (refactoring, bugfix, feature, something else)
- It does the following (modify list as needed):
Expand Down
77 changes: 77 additions & 0 deletions .github/workflows/build.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,77 @@
on: push
name: Build
jobs:
build_docker:
strategy:
matrix:
target: [gcc12, gcc13, clang13, clang15, rocm6, rocm6_desul, intel2024, intel2024_debug, intel2024_sycl]
runs-on: ubuntu-latest
steps:
- run: |
sudo rm -rf /usr/share/dotnet
sudo rm -rf /usr/local/lib/android
sudo rm -rf /opt/ghc
sudo rm -rf /opt/hostedtoolcache/CodeQL
sudo docker image prune --all --force
- uses: actions/checkout@v2
with:
submodules: recursive
- uses: docker/setup-buildx-action@v3
- uses: docker/build-push-action@v6
with:
target: ${{ matrix.target }}
build_mac:
runs-on: macos-latest
steps:
- uses: actions/checkout@v2
with:
submodules: recursive
- uses: threeal/cmake-action@v1.3.0
with:
build-dir: build
options:
CMAKE_CXX_STANDARD=14
ENABLE_OPENMP=Off
CMAKE_BUILD_TYPE=Release
run-build: true
build-args: '--parallel 16'
- uses: threeal/ctest-action@v1.1.0
build_windows:
strategy:
matrix:
shared:
## ====================================
## Shared library build generated undefined symbol errors that are not
## understood -RDH
## - args:
## BUILD_SHARED_LIBS=On
## CMAKE_WINDOWS_EXPORT_ALL_SYMBOLS=On
- args: BUILD_SHARED_LIBS=Off

runs-on: windows-latest
steps:
- uses: actions/checkout@v2
with:
submodules: recursive
## ====================================
## Config and build action
- uses: threeal/cmake-action@v1.3.0
with:
build-dir: build
options:
ENABLE_WARNINGS_AS_ERRORS=Off
BLT_CXX_STD=c++17
CMAKE_BUILD_TYPE=Release
PERFSUITE_RUN_SHORT_TEST=On
${{ matrix.shared.args }}
run-build: true
build-args: '--parallel 16'
## ====================================
## Print the contents of the test directory in the build space (debugging)
## - run: |
## dir -r D:\a\RAJA\RAJA\build\test
## ====================================
## Run tests action
- uses: threeal/ctest-action@v1.1.0
with:
build-config: Debug
4 changes: 2 additions & 2 deletions .gitlab-ci.yml
Original file line number Diff line number Diff line change
Expand Up @@ -72,7 +72,7 @@ stages:
include:
- local: '.gitlab/custom-jobs-and-variables.yml'
- project: 'radiuss/radiuss-shared-ci'
ref: 'v2024.06.0'
ref: 'v2024.12.0'
file: 'pipelines/${CI_MACHINE}.yml'
- artifact: '${CI_MACHINE}-jobs.yml'
job: 'generate-job-lists'
Expand All @@ -85,7 +85,7 @@ include:
file: 'id_tokens.yml'
# [Optional] checks preliminary to running the actual CI test
#- project: 'radiuss/radiuss-shared-ci'
# ref: 'v2024.06.0'
# ref: 'v2024.12.0'
# file: 'utilities/preliminary-ignore-draft-pr.yml'
# pipelines subscribed by the project
- local: '.gitlab/subscribed-pipelines.yml'
27 changes: 21 additions & 6 deletions .gitlab/custom-jobs-and-variables.yml
Original file line number Diff line number Diff line change
@@ -1,5 +1,5 @@
###############################################################################
# Copyright (c) 2017-24, Lawrence Livermore National Security, LLC
# Copyright (c) 2017-25, Lawrence Livermore National Security, LLC
# and RAJA Performance Suite project contributors.
# See the RAJAPerf/LICENSE file for details.
#
Expand All @@ -15,7 +15,7 @@ variables:

# Ruby
# Arguments for top level allocation
RUBY_SHARED_ALLOC: "--exclusive --reservation=ci --time=40 --nodes=1"
RUBY_SHARED_ALLOC: "--exclusive --reservation=ci --time=45 --nodes=1"
# Arguments for job level allocation
# Note: We repeat the reservation, necessary when jobs are manually re-triggered.
RUBY_JOB_ALLOC: "--reservation=ci --nodes=1"
Expand All @@ -26,7 +26,7 @@ variables:

# Poodle
# Arguments for top level allocation
POODLE_SHARED_ALLOC: "--exclusive --time=40 --nodes=1"
POODLE_SHARED_ALLOC: "--exclusive --time=45 --nodes=1"
# Arguments for job level allocation
POODLE_JOB_ALLOC: "--nodes=1"
# Project specific variants for poodle
Expand All @@ -36,7 +36,7 @@ variables:

# Corona
# Arguments for top level allocation
CORONA_SHARED_ALLOC: "--exclusive --time-limit=12m --nodes=1 -o per-resource.count=2"
CORONA_SHARED_ALLOC: "--exclusive --time-limit=25m --nodes=1 -o per-resource.count=2"
# Arguments for job level allocation
CORONA_JOB_ALLOC: "--nodes=1 --begin-time=+5s"
# Project specific variants for corona
Expand All @@ -46,7 +46,7 @@ variables:

# Tioga
# Arguments for top level allocation
TIOGA_SHARED_ALLOC: "--queue=pci --exclusive --time-limit=26m --nodes=1 -o per-resource.count=2"
TIOGA_SHARED_ALLOC: "--queue=pci --exclusive --time-limit=31m --nodes=1 -o per-resource.count=2"
# Arguments for job level allocation
TIOGA_JOB_ALLOC: "--nodes=1 --begin-time=+5s"
# Project specific variants for tioga
Expand All @@ -57,7 +57,7 @@ variables:
# Lassen and Butte use a different job scheduler (spectrum lsf) that does not
# allow pre-allocation the same way slurm does.
# Arguments for job level allocation
LASSEN_JOB_ALLOC: "1 -W 20 -q pci"
LASSEN_JOB_ALLOC: "1 -W 25 -q pci"
# Project specific variants for lassen
PROJECT_LASSEN_VARIANTS: "~shared +openmp cuda_arch=70"
# Project specific deps for lassen
Expand All @@ -71,3 +71,18 @@ variables:
artifacts:
reports:
junit: junit.xml
name: "${CI_PROJECT_NAME}-${CI_MACHINE}-${CI_JOB_NAME}-${CI_PIPELINE_ID}"
paths:
- ./*.cmake

.reproducer_vars:
script:
- |
echo -e "
# Required variables \n
export MODULE_LIST=\"${MODULE_LIST}\" \n
export SPEC=\"${SPEC//\"/\\\"}\" \n
# Allow to set job script for debugging (only this differs from CI) \n
export DEBUG_MODE=true \n
# Using the CI build cache is optional and requires a token. Set it like so: \n
# export REGISTRY_TOKEN=\"<your token here>\" \n"
16 changes: 9 additions & 7 deletions .gitlab/jobs/corona.yml
Original file line number Diff line number Diff line change
@@ -1,5 +1,5 @@
###############################################################################
# Copyright (c) 2017-24, Lawrence Livermore National Security, LLC
# Copyright (c) 2017-25, Lawrence Livermore National Security, LLC
# and RAJA Performance Suite project contributors.
# See the RAJAPerf/LICENSE file for details.
#
Expand All @@ -9,9 +9,7 @@
# Override reproducer section to define project specific variables.
.corona_reproducer_vars:
script:
- |
echo -e "export MODULE_LIST=\"${MODULE_LIST}\""
echo -e "export SPEC=\"${SPEC//\"/\\\"}\""
- !reference [.reproducer_vars, script]

########################
# Overridden shared jobs
Expand All @@ -29,6 +27,10 @@
# ${PROJECT_<MACHINE>_DEPS} in the extra jobs. There is no reason not to fully
# describe the spec here.

# With GitLab CI, included files cannot be empty.
variables:
INCLUDED_FILE_CANNOT_BE_EMPTY: "True"
clang_20_0_0_sycl_gcc_10_3_1_rocmcc_6_0_2_hip:
variables:
SPEC: " ~shared +sycl ~openmp tests=basic %clang@=20.0.0 cxxflags==\"-w -fsycl -fsycl-unnamed-lambda -fsycl-targets=amdgcn-amd-amdhsa -Xsycl-target-backend --offload-arch=gfx906\" ^blt@develop"
MODULE_LIST: "rocm/6.0.2"
SYCL_PATH: "/usr/WS2/raja-dev/clang_sycl_730cd3a5275f_hip_gcc10.3.1_rocm6.0.2/"
LD_LIBRARY_PATH: "${SYCL_PATH}/lib:${SYCL_PATH}/lib64:${LD_LIBRARY_PATH}"
extends: .job_on_corona
40 changes: 15 additions & 25 deletions .gitlab/jobs/lassen.yml
Original file line number Diff line number Diff line change
@@ -1,5 +1,5 @@
###############################################################################
# Copyright (c) 2017-24, Lawrence Livermore National Security, LLC
# Copyright (c) 2017-25, Lawrence Livermore National Security, LLC
# and RAJA Performance Suite project contributors.
# See the RAJAPerf/LICENSE file for details.
#
Expand All @@ -9,9 +9,7 @@
# Override reproducer section to define project specific variables.
.lassen_reproducer_vars:
script:
- |
echo -e "export MODULE_LIST=\"${MODULE_LIST}\""
echo -e "export SPEC=\"${SPEC//\"/\\\"}\""
- !reference [.reproducer_vars, script]

########################
# Overridden shared jobs
Expand All @@ -20,14 +18,7 @@
# We keep ${PROJECT_<MACHINE>_VARIANTS} and ${PROJECT_<MACHINE>_DEPS} So that
# the comparison with the original job is easier.

# Overriding shared spec: Longer allocation + extra flags
xl_2022_08_19_gcc_8_3_1_cuda_11_2_0:
variables:
SPEC: "${PROJECT_LASSEN_VARIANTS} +cuda cxxflags==\"-qthreaded -std=c++14 -O3 -qstrict -qxlcompatmacros -qlanglvl=extended0x -qalias=noansi -qhot -qpic -qsmp=omp -qsuppress=1500-029 -qsuppress=1500-036\" %xl@=16.1.1.12.gcc.8.3.1 ^cuda@11.2.0+allow-unsupported-compilers ${PROJECT_LASSEN_DEPS}"
MODULE_LIST: "cuda/11.2.0"
LASSEN_JOB_ALLOC: "1 -W 60 -q pci"
extends: .job_on_lassen

# No jobs overridden

############
# Extra jobs
Expand All @@ -38,21 +29,21 @@ xl_2022_08_19_gcc_8_3_1_cuda_11_2_0:

gcc_8_3_1:
variables:
SPEC: " ~shared +openmp %gcc@=8.3.1 ${PROJECT_LASSEN_DEPS}"
SPEC: " ~shared +openmp %gcc@=8.3.1 ^blt@develop"
extends: .job_on_lassen

gcc_8_3_1_cuda_11_5_0_ats_disabled:
gcc_8_3_1_cuda_11_7_0_ats_disabled:
extends: .job_on_lassen
variables:
SPEC: " ~shared +openmp +cuda %gcc@=8.3.1 cuda_arch=70 ^cuda@11.5.0+allow-unsupported-compilers ${PROJECT_LASSEN_DEPS}"
MODULE_LIST: "cuda/11.5.0"
SPEC: " ~shared +openmp +cuda %gcc@=8.3.1 cuda_arch=70 ^cuda@11.7.0+allow-unsupported-compilers ^blt@develop"
MODULE_LIST: "cuda/11.7.0"
LASSEN_JOB_ALLOC: "1 --atsdisable -W 30 -q pci"

gcc_8_3_1_cuda_11_5_0_ats_disabled_mpi:
gcc_8_3_1_cuda_11_7_0_ats_disabled_mpi:
extends: .job_on_lassen
variables:
SPEC: " ~shared +openmp +cuda +mpi %gcc@=8.3.1 cuda_arch=70 ^cuda@11.5.0+allow-unsupported-compilers ^spectrum-mpi ${PROJECT_LASSEN_DEPS}"
MODULE_LIST: "cuda/11.5.0"
SPEC: " ~shared +openmp +cuda +mpi %gcc@=8.3.1 cuda_arch=70 ^cuda@11.7.0+allow-unsupported-compilers ^spectrum-mpi ^blt@develop"
MODULE_LIST: "cuda/11.7.0"
LASSEN_JOB_ALLOC: "1 --atsdisable -W 30 -q pci"

##########
Expand All @@ -61,7 +52,7 @@ gcc_8_3_1_cuda_11_5_0_ats_disabled_mpi:

clang_13_0_1_libcpp:
variables:
SPEC: " ~shared +openmp %clang@=13.0.1 cflags==\"-DGTEST_HAS_CXXABI_H_=0\" cxxflags==\"-stdlib=libc++ -DGTEST_HAS_CXXABI_H_=0\" ${PROJECT_LASSEN_DEPS}"
SPEC: " ~shared +openmp %clang@=13.0.1 cflags==\"-DGTEST_HAS_CXXABI_H_=0\" cxxflags==\"-stdlib=libc++ -DGTEST_HAS_CXXABI_H_=0\" ^blt@develop"
extends: .job_on_lassen

#clang_14_0_5_asan:
Expand All @@ -71,8 +62,7 @@ clang_13_0_1_libcpp:
# LSAN_OPTIONS: "suppressions=${CI_PROJECT_DIR}/tpl/RAJA/suppressions.asan"
# extends: .job_on_lassen

# Activated in RAJA, but we don't use desul atomics here
#gcc_8_3_1_cuda_10_1_168_desul_atomics:
# variables:
# SPEC: "+openmp +cuda +desul %gcc@=8.3.1 cuda_arch=70 cuda_arch=70 ^cuda@10.1.243+allow-unsupported-compilers ${PROJECT_LASSEN_DEPS}"
# extends: .job_on_lassen
clang_16_0_6_omptarget:
variables:
SPEC: " ~shared +openmp +omptarget %clang@=16.0.6.cuda.11.8.0.gcc.11.2.1 ^blt@develop"
extends: .job_on_lassen
Loading