Turn C96C48_ufs_hybatmDA and C48mx500_3DVarAOWCDA into a regression test #3120
Looks good to me once the change has been made to use the same testing option flag as the other PR
@CoryMartin-NOAA Done!
I misunderstood how default yamls work. I'm merging Cory's branch into mine and redoing the test references.
This PR is a companion to GW PR [#3120](NOAA-EMC/global-workflow#3120). It does a couple of things:
1. 5 GW CI tests are added/extended as CTests in GDASApp, running through to the fcst jobs in the first full cycle. These CI tests are: ```C96C48_hybatmDA```, ```C96C48_ufs_hybatmDA```, ```C96C48_hybatmaerosnowDA```, ```C48mx500_3DVarAOWCDA```, and ```C48mx500_hybAOWCDA```.
2. Test references are added for ```C96C48_ufs_hybatmDA``` and ```C48mx500_3DVarAOWCDA```, so that we're actually testing the output.
3. These CTests are turned on by default in a workflow build, rather than having to mess with the ```CMakeCache.txt``` file and re-running make. This will allow us to use these tests in nightly testing.
4. ```test/gw-ci/CMakeLists.txt``` is refactored quite a bit.
5. There are 89 CTests for the 5 CI tests, but I added task dependencies so they can be run in parallel.

The primary motivation for this PR is that we can run CI for our nightly testing of GDASApp. Also, anyone with a PR can easily do CI testing through CTests.
---------
Co-authored-by: Russ-Treadon-NOAA <[email protected]>
Installed DavidNew-NOAA:feature/gw-ci at f74c93b on Hera. Run test_gdasapp ctests. All 133 tests pass. Run with 12 threads. All tests complete in 3479.51 seconds.
No objections
g-w CI tests on Hera
The C96C48_ufs_hybatmDA 20240224 00Z gfs_atmanlvar job failed due to the reference check.
The gfs and gdas use different data dumps. The gdas analysis assimilates more data than the gfs. We need two reference check files - one for gdas_atmanlvar and another for gfs_atmanlvar. If we don't want to test gfs_atmanlvar, we need to change the yaml used by the gfs_atmanlvar job.
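To illustrate why one reference file cannot serve both jobs, here is a toy reference check. It is only a sketch: the function name, the tolerance, and every number are invented for illustration, and the real JEDI test-reference comparison may work differently.

```python
import math

def matches_reference(test_values, ref_values, rel_tol=1e-6):
    """Return True when both sequences have the same length and agree within rel_tol."""
    if len(test_values) != len(ref_values):
        return False
    return all(math.isclose(t, r, rel_tol=rel_tol)
               for t, r in zip(test_values, ref_values))

# Hypothetical values: an observation count and a cost-function value.
gdas_reference = [1520.0, 0.8731]
gdas_output = [1520.0, 0.8731]
gfs_output = [1312.0, 0.9054]  # fewer observations assimilated, so the numbers differ

print(matches_reference(gdas_output, gdas_reference))  # gdas checked against its own reference
print(matches_reference(gfs_output, gdas_reference))   # gfs checked against the gdas reference
```

Any job that assimilates a different set of observations will produce different diagnostics, so it needs either its own reference or no reference check at all.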
Thanks, @RussTreadon-NOAA. I'll look into how to fix this.
Installed Run the following g-w CI configurations
with the following results
Three of the four streams successfully ran all jobs. The C96C48_ufs_hybatmDA (
This PR addresses the bug @RussTreadon-NOAA found that ```gfs_atmanlvar``` was being run as a regression test and using the same test reference as ```gdas_atmanlvar``` in GW PR [#3120](NOAA-EMC/global-workflow#3120). See NOAA-EMC/global-workflow#3120 (comment). I've moved all activation of testing mode in JCB out of the JCB base YAMLs and into the JCB algorithm YAMLs. I test the ```RUN``` variable to make sure it's not equal to ```gfs```. I re-ran all the regression tests, and they all passed.
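The gating described above can be sketched in a few lines of Python. This is not the JCB template logic itself: the function name is hypothetical, and the only things taken from this PR are the ```RUN``` value ```gfs``` and the "YES" setting of DO_TEST_MODE.

```python
def use_test_reference(run: str, do_test_mode: str) -> bool:
    """Enable the reference check only in test mode and only for non-gfs runs.

    `run` stands in for the workflow RUN variable and `do_test_mode` for the
    DO_TEST_MODE setting; both are passed as plain strings in this sketch.
    """
    return do_test_mode.upper() == "YES" and run != "gfs"

for run in ("gdas", "enkfgdas", "gfs"):
    print(run, use_test_reference(run, "YES"))
```

Keeping the check in one place per algorithm means a gfs job can reuse the same YAML as the gdas job without inheriting its reference comparison.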
GDASApp PR #1390 resolved the bug @RussTreadon-NOAA found and the GDASApp hash has been updated in this PR. This PR is ready for final review.
Hera C96C48_ufs_hybatmDA testing
Update
Resume cron running C96C48_ufs_hybatmDA (
The 20240224 06Z gdas_atmanlvar and enkfgdas_atmensanlobs failed the reference check.
gdas_atmanalvar.log (look in
enkfgdas_atmensanlobs.log
These failures make sense. The reference files in
@RussTreadon-NOAA I updated the JCB algorithm yamls in https://github.com/NOAA-EMC/GDASApp/tree/bugfix/gw-ci to only use test references at the appropriate cycle time. I updated the GDAS hash here to point to that branch. Can you retry the testing?
@DavidNew-NOAA, my working copy of The DEAD I will let the prjedi_pr3120 run to completion. After this I will make a clean run of C96C48_ufs_hybatmDA from start to finish.
g-w CI C96C48_ufs_hybatmDA
Install DavidNew-NOAA:feature/gw-ci at 4af258c on Hera. The following g-w CI successfully ran all jobs
/scratch1/NCEPDEV/stmp2/Russ.Treadon/EXPDIR/C48_ATM_pr3120
CYCLE STATE ACTIVATED DEACTIVATED
202103231200 Done Nov 28 2024 13:40:15 Nov 28 2024 15:00:26
202103231800 Done Nov 28 2024 13:40:15 Nov 28 2024 15:10:15
/scratch1/NCEPDEV/stmp2/Russ.Treadon/EXPDIR/C48_S2SW_pr3120
CYCLE STATE ACTIVATED DEACTIVATED
202103231200 Done Nov 28 2024 13:40:22 Nov 28 2024 17:09:49
202103231800 Done Nov 28 2024 13:40:22 Nov 28 2024 17:18:48
/scratch1/NCEPDEV/stmp2/Russ.Treadon/EXPDIR/C48mx500_3DVarAOWCDA_pr3120
CYCLE STATE ACTIVATED DEACTIVATED
202103241200 Done Nov 28 2024 13:40:23 Nov 28 2024 14:00:41
202103241800 Done Nov 28 2024 13:40:23 Nov 28 2024 14:55:49
/scratch1/NCEPDEV/stmp2/Russ.Treadon/EXPDIR/C96C48_hybatmDA_pr3120
CYCLE STATE ACTIVATED DEACTIVATED
202112201800 Done Nov 28 2024 13:40:25 Nov 28 2024 14:00:43
202112210000 Done Nov 28 2024 13:40:25 Nov 28 2024 16:15:30
202112210600 Done Nov 28 2024 13:40:25 Nov 28 2024 16:00:38
/scratch1/NCEPDEV/stmp2/Russ.Treadon/EXPDIR/C96C48_hybatmaerosnowDA_pr3120
CYCLE STATE ACTIVATED DEACTIVATED
202112201200 Done Nov 28 2024 13:40:26 Nov 28 2024 14:05:28
202112201800 Done Nov 28 2024 13:40:26 Nov 28 2024 16:20:28
202112210000 Done Nov 28 2024 13:40:26 Nov 28 2024 16:00:39
/scratch1/NCEPDEV/stmp2/Russ.Treadon/EXPDIR/C96C48_ufs_hybatmDA_pr3120
CYCLE STATE ACTIVATED DEACTIVATED
202402231800 Done Nov 28 2024 13:40:28 Nov 28 2024 14:00:47
202402240000 Done Nov 28 2024 13:40:28 Nov 28 2024 16:50:28
202402240600 Done Nov 28 2024 13:40:28 Nov 28 2024 16:40:28
/scratch1/NCEPDEV/stmp2/Russ.Treadon/EXPDIR/C96_atm3DVar_pr3120
CYCLE STATE ACTIVATED DEACTIVATED
202112201800 Done Nov 28 2024 13:40:31 Nov 28 2024 14:00:51
202112210000 Done Nov 28 2024 13:40:31 Nov 28 2024 16:15:37
202112210600 Done Nov 28 2024 13:40:31 Nov 28 2024 15:50:30
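For anyone who wants the elapsed time per cycle from listings like the ones above, a minimal Python sketch follows. It assumes each line has exactly the four columns shown (CYCLE, STATE, ACTIVATED, DEACTIVATED) with English month abbreviations; the function name is hypothetical.

```python
from datetime import datetime

STAMP = "%b %d %Y %H:%M:%S"  # e.g. "Nov 28 2024 13:40:28"

def cycle_wall_time(line: str):
    """Split one cycle-status line and return (cycle, state, elapsed timedelta)."""
    fields = line.split()
    cycle, state = fields[0], fields[1]
    activated = datetime.strptime(" ".join(fields[2:6]), STAMP)
    deactivated = datetime.strptime(" ".join(fields[6:10]), STAMP)
    return cycle, state, deactivated - activated

sample = "202402240000 Done Nov 28 2024 13:40:28 Nov 28 2024 16:50:28"
cycle, state, elapsed = cycle_wall_time(sample)
print(cycle, state, elapsed)
```

The difference is the time between cycle activation and deactivation, which includes queue wait, so it is an upper bound on the compute time rather than a job runtime.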
The following gefs g-w CI jobs died
/scratch1/NCEPDEV/stmp2/Russ.Treadon/EXPDIR/C48_S2SWA_gefs_pr3120
202103231200 gefs_fcst_mem000_seg0 3151343 DEAD 153 2 104.0
202103231200 gefs_fcst_mem001_seg0 3151344 DEAD 153 2 116.0
202103231200 gefs_fcst_mem002_seg0 3151345 DEAD 153 2 116.0
and
/scratch1/NCEPDEV/stmp2/Russ.Treadon/EXPDIR/C96_S2SWA_gefs_replay_ics_pr3120
202011010000 gefs_fcst_mem000_seg0 3151354 DEAD 174 2 117.0
202011010000 gefs_fcst_mem001_seg0 3151355 DEAD 174 2 147.0
202011010000 gefs_fcst_mem002_seg0 3151356 DEAD 174 2 132.0
The C48_S2SWA_gefs forecasts died with
24: Wave model ...
0: zeroing coupling accumulated fields at kdt= 1
0: zeroing coupling accumulated fields at kdt= 1
26: forrtl: severe (153): allocatable array or pointer is not allocated
26: Image PC Routine Line Source
26: ufs_model.x 00000000063BE842 Unknown Unknown Unknown
26: ufs_model.x 0000000001E6BBA8 pdlib_w3profsmd_m 7349 w3profsmd_pdlib.F90
26: ufs_model.x 0000000001BCA1D7 w3initmd_mp_w3ini 1244 w3initmd.F90
26: ufs_model.x 0000000001AE2CF7 wav_comp_nuopc_mp 1669 wav_comp_nuopc.F90
26: ufs_model.x 0000000000A9D754 Unknown Unknown Unknown
The C96_S2SWA_gefs_replay_ics forecast died with
132: Wave model ...
136: [h24c41:2969163:0:2969163] Caught signal 11 (Segmentation fault: address not mapped to object at address 0x4)
138: [h24c41:2969165:0:2969165] Caught signal 11 (Segmentation fault: address not mapped to object at address 0x4)
136: forrtl: severe (174): SIGSEGV, segmentation fault occurred
136: Image PC Routine Line Source
136: ufs_model.x 0000000006379DEA Unknown Unknown Unknown
136: libpthread-2.28.s 000014EC77722D10 Unknown Unknown Unknown
136: ufs_model.x 0000000001DF769E pdlib_field_vec_m 501 pdlib_field_vec.F90
136: ufs_model.x 0000000001C80E06 w3iorsmd_mp_w3ior 802 w3iorsmd.F90
136: ufs_model.x 0000000001BC626B w3initmd_mp_w3ini 961 w3initmd.F90
136: ufs_model.x 0000000001AE2CF7 wav_comp_nuopc_mp 1669 wav_comp_nuopc.F90
136: ufs_model.x 0000000000A9D754 Unknown Unknown Unknown
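A short Python sketch for pulling the Fortran runtime error lines out of logs like these. It assumes the line layout shown in the excerpts above (an MPI rank prefix followed by `forrtl: severe (N): message`); the function and pattern names are hypothetical.

```python
import re

# Rank prefix, then the Intel Fortran runtime "severe" code and its message.
FORRTL = re.compile(r"^\s*(\d+):\s+forrtl: severe \((\d+)\): (.+)$")

def fortran_runtime_errors(log_lines):
    """Return (rank, severe_code, message) for each forrtl severe line."""
    errors = []
    for line in log_lines:
        match = FORRTL.match(line)
        if match:
            rank, code, message = match.groups()
            errors.append((int(rank), int(code), message.strip()))
    return errors

log = [
    "26: forrtl: severe (153): allocatable array or pointer is not allocated",
    "26: ufs_model.x 0000000001E6BBA8 pdlib_w3profsmd_m 7349 w3profsmd_pdlib.F90",
    "136: forrtl: severe (174): SIGSEGV, segmentation fault occurred",
]
for rank, code, message in fortran_runtime_errors(log):
    print(rank, code, message)
```

The severe codes extracted this way (153 and 174) appear to match the value listed after DEAD in the job listings above.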
This PR does not alter gefs. Not sure if we expect these jobs to successfully run on Hera.
DA g-w CI passes on Hera. Approve.
Description
This PR is a companion to GDASApp PR #1365 (merged).
It turns C96C48_ufs_hybatmDA and C48mx500_3DVarAOWCDA into a regression test using the JEDI application testing feature. This feature is turned on using the new DO_TEST_MODE parameter added to config.base in GW PR #3115. This parameter is set to "YES" in the yaml defaults for the JEDI-based CI tests in GW.

The motivation for this PR is a need to catch changes in JEDI which alter the outputs of our applications.
Type of change
Change characteristics
How has this been tested?
C96C48_hybatmDA, C96C48_ufs_hybatmDA, C96C48_hybatmaerosnowDA, C48mx500_3DVarAOWCDA, and C48mx500_hybAOWCDA have been tested successfully on Hera
Checklist