Skip to content

Conversation

@natalie-perlin
Copy link
Collaborator

Description

Use the following modules to build spack-stack-1.9.2 environment on Derecho:

intel/2024.2.1
ncarenv/24.12
craype/2.7.31
libfabric/1.15.2.0

A modulefile cray-mpich/8.1.29 for the spack-stack to be updated to specify additional PE/PALS libraries for submitting parallel jobs

Dependencies

If there are PRs that need to be merged before or along with this one, please add "- [ ] waiting on LINK_TO_PR" for each of them.

Issues addressed

Addresses some issues encountered with Derecho configuration for the older environment, ncarenv/23.09.

ufs-community/ufs-weather-model#2033
ufs-community/ufs-weather-model#2177

A PR opened to the weather model (still in draft form); the issues appear to be addressed after using Derecho spack-stack 1.9.x with the new environment, ncarenv/24.12:

ufs-community/ufs-weather-model#2863

Applications affected

List all known applications (UFS WM, JEDI, SRW, etc.) intentionally or unintentionally affected by this PR.

UFS SRW
UFS Weather-Model

(possibly other applications on Derecho)

Systems affected

Derecho

Testing

At the moment, the spack-stack-1.9.x environment has been build in user space and successfully tested for most of the UFS WM Regression tests.

Logs from the WM cpld_control_ciceC_p8_intel test could be found here: /glade/derecho/scratch/nperlin/FV3_RT/rt_70201/
And attached few logs from compile and run times :

compile_out.txt
compile_err.txt
compile_job_card.txt
run_out.txt
run_err.txt
run_job_card.txt

  • CI: Note whether the automatic tests (GitHub actions tests that run automatically for every commit) pass or not
    • GitHub actions CI tests pass
    • GitHub actions CI tests do not pass (provide explanation)
    • GitHub actions CI tests skipped (provide explanation if necessary)
  • New tests added: List and describe any new tests added to GitHub actions
    • ...
  • Additional testing: Add information on any additional tests conducted
    • ...

Checklist

  • This PR addresses one issue/problem/enhancement or has a very good reason for not doing so.
  • These changes have been tested on the affected systems and applications.
  • All dependency PRs/issues have been resolved and this PR can be merged.
  • All necessary updates to the documentation on readthedocs are included in this PR
    • For site config updates, check in particular doc/source/PreConfiguredSites.rst and doc/source/MaintainersSection.rst
  • All necessary updates to the spack-stack wiki will be made when this PR is merged

@climbfuji
Copy link
Collaborator

@natalie-perlin Please let us know when you tested this with the recent met/metplus updates. Thanks!

@climbfuji
Copy link
Collaborator

@natalie-perlin Is this ready to merge?

Copy link
Collaborator

@rickgrubin-noaa rickgrubin-noaa left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

All tests pass, thanks.

@climbfuji climbfuji merged commit ab41aec into JCSDA:release/1.9.0 Aug 28, 2025
8 checks passed
@climbfuji
Copy link
Collaborator

@natalie-perlin Please make sure that any changes to the Derecho site config that are not yet in develop will find their way there. Thanks!

@natalie-perlin
Copy link
Collaborator Author

@climbfuji -
I still have some troubles with generating the cray-mpich modulefiles correctly. May need to reopen the PR

@natalie-perlin
Copy link
Collaborator Author

All tests pass, thanks.

Rick, how did you test it?..

@rickgrubin-noaa
Copy link
Collaborator

All tests pass, thanks.

Rick, how did you test it?..

PR-specific tests that are run when you create a PR / make changes / etc., as seen on the PR itself.

It was my assumption that the PR was ready because comments were resolved.

@climbfuji
Copy link
Collaborator

@climbfuji - I still have some troubles with generating the cray-mpich modulefiles correctly. May need to reopen the PR

Please submit a follow-up bug fix PR. Apologies for merging prematurely. I asked for confirmation in various places and got no reply, but saw the commit that applied the same libfabric changes that worked for Intel to the GNU config.

@natalie-perlin
Copy link
Collaborator Author

@climbfuji - Thanks, sure, will do!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

No open projects

Development

Successfully merging this pull request may close these issues.

3 participants