
TwoStream solver crashes with mismatch at restart #195

@maritsandstad

Brief summary of bug

When restarting a specific case, FatesRestartInterfaceMod sets the restart vectors and, in doing so, runs update_3dpatch_radiation. This step calls the Solve method in TwoStreamMLPEMod, which fails on a mismatch in the forward solution. The failure seems to be widespread: many threads return the same error for various PFTs.

General bug information

Typical stack-trace:

 Unknown error submitted to shr_abort_abort.
1375: Image              PC                Routine            Line        Source             
1375: cesm.exe           00000000037D90E7  Unknown               Unknown  Unknown
1375: cesm.exe           000000000351FA62  shr_abort_mod_mp_         110  shr_abort_mod.F90
1375: cesm.exe           0000000002DFDF09  twostreammlpemod_         258  TwoStreamMLPEMod.F90
1375: cesm.exe           0000000002DFBF2E  twostreammlpemod_        1587  TwoStreamMLPEMod.F90
1375: cesm.exe           0000000002AB7AE6  fatesrestartinter        4213  FatesRestartInterfaceMod.F90
1375: cesm.exe           00000000025C59C1  clmfatesinterface        2120  clmfates_interfaceMod.F90
1375: cesm.exe           000000000259F01A  clm_instmod_mp_cl         620  clm_instMod.F90
1375: cesm.exe           00000000026FBA9D  restfilemod_mp_re         215  restFileMod.F90
1375: cesm.exe           000000000259B1BC  clm_initializemod         546  clm_initializeMod.F90
1375: cesm.exe           0000000002524E5B  lnd_comp_nuopc_mp         677  lnd_comp_nuopc.F90
1375: libesmf.so         0000149517B68F95  _ZN5ESMCI6FTable1     Unknown  Unknown
1375: libesmf.so         0000149517B6D86F  ESMCI_FTableCallE     Unknown  Unknown
1375: libesmf.so         000014951827D1CA  _ZN5ESMCI3VMK5ent     Unknown  Unknown
1375: libesmf.so         000014951829D63F  _ZN5ESMCI2VM5ente     Unknown  Unknown
1375: libesmf.so         0000149517B6AA4A  c_esmc_ftablecall     Unknown  Unknown
1375: libesmf.so         000014951853B5E0  esmf_compmod_mp_e     Unknown  Unknown
1375: libesmf.so         00001495187A9D81  esmf_gridcompmod_     Unknown  Unknown
1375: libesmf.so         0000149518DDEA00  nuopc_driver_mp_l     Unknown  Unknown
1375: libesmf.so         0000149518DFBAAE  nuopc_driver_mp_i     Unknown  Unknown
1375: libesmf.so         0000149517B68F95  _ZN5ESMCI6FTable1     Unknown  Unknown
1375: libesmf.so         0000149517B6D86F  ESMCI_FTableCallE     Unknown  Unknown
1375: libesmf.so         000014951827D1CA  _ZN5ESMCI3VMK5ent     Unknown  Unknown
1375: libesmf.so         000014951829D63F  _ZN5ESMCI2VM5ente     Unknown  Unknown
1375: libesmf.so         0000149517B6AA4A  c_esmc_ftablecall     Unknown  Unknown
1375: libesmf.so         000014951853B5E0  esmf_compmod_mp_e     Unknown  Unknown
1375: libesmf.so         00001495187A9D81  esmf_gridcompmod_     Unknown  Unknown
1375: libesmf.so         0000149518DDEA00  nuopc_driver_mp_l     Unknown  Unknown
1375: libesmf.so         0000149518DFBBC4  nuopc_driver_mp_i     Unknown  Unknown
1375: libesmf.so         0000149518E11401  nuopc_driver_mp_i     Unknown  Unknown
1375: libesmf.so         0000149517B68F95  _ZN5ESMCI6FTable1     Unknown  Unknown
1375: libesmf.so         0000149517B6D86F  ESMCI_FTableCallE     Unknown  Unknown
1375: libesmf.so         000014951827D1CA  _ZN5ESMCI3VMK5ent     Unknown  Unknown
1375: libesmf.so         000014951829D63F  _ZN5ESMCI2VM5ente     Unknown  Unknown
1375: libesmf.so         0000149517B6AA4A  c_esmc_ftablecall     Unknown  Unknown
1375: libesmf.so         000014951853B5E0  esmf_compmod_mp_e     Unknown  Unknown
1375: libesmf.so         00001495187A9D81  esmf_gridcompmod_     Unknown  Unknown
1375: cesm.exe           0000000000439E2C  MAIN__                    128  esmApp.F90
1375: cesm.exe           00000000004286A2  Unknown               Unknown  Unknown
1375: libc.so.6          0000149516A3FEB0  Unknown               Unknown  Unknown
1375: libc.so.6          0000149516A3FF60  __libc_start_main     Unknown  Unknown
1375: cesm.exe           00000000004285A5  Unknown               Unknown  Unknown
1375:  ESMF_Finalize: Error closing trace stream

Typical output from where the error happens:

1375: Poor forward solution on two-stream solver
1375:  isol (1=beam or 2=diff):            1
1375:  i (equation):            6
1375:  band index (1=vis,2=nir):            1
1375:  error (tau(i) - omega(i,:)*lambda(:))   -21.7470430886848     
1375:  Dumping Two-stream elements for band    
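
For readers unfamiliar with the check behind this message: the `error` line reports the residual of one equation of a linear system, `tau(i) - omega(i,:)*lambda(:)`. Below is a minimal Python sketch of such a per-equation residual check. It is illustrative only. The function names `forward_residuals` and `poor_rows`, the tolerance `tol`, and the toy 2x2 system are assumptions made for this example and are not taken from TwoStreamMLPEMod.F90.

```python
def forward_residuals(omega, lam, tau):
    """Return tau[i] - sum_j omega[i][j] * lam[j] for every equation i."""
    return [t - sum(o * l for o, l in zip(row, lam))
            for row, t in zip(omega, tau)]


def poor_rows(omega, lam, tau, tol=1.0e-6):
    """1-based indices of equations whose absolute residual exceeds tol.

    The tolerance is a hypothetical value chosen for the example.
    """
    res = forward_residuals(omega, lam, tau)
    return [i + 1 for i, r in enumerate(res) if abs(r) > tol]


# Toy system with made-up values: omega * lam should reproduce tau.
omega = [[2.0, 1.0], [1.0, 3.0]]
tau = [3.0, 5.0]
good_lam = [0.8, 1.4]  # solves the toy system
bad_lam = [0.8, 0.0]   # deliberately wrong second unknown

print(poor_rows(omega, good_lam, tau))
print(poor_rows(omega, bad_lam, tau))
```

A residual of about -21.7, as reported above, is far larger than round-off would normally produce in a well-conditioned system, so it may point to a singular or badly scaled system rather than precision noise.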

The error message is followed by multiple element dumps like this:

1375:  --           3           3 --
1375:  pft:           0
1375:  area:   0.106609674796023     
1375:  lai,sai:   0.000000000000000E+000  0.000000000000000E+000
1375:  Kb:   0.500000000000000     
1375:  Kb leaf:   0.500000000000000     
1375:  Kd:   0.500000000000000     
1375:  Rb0:   2.775859938347824E-002
1375:  om:   0.500000000000000     
1375:  betad:   0.500000000000000     
1375:  betab:  0.500000000000000     
1375:  a:   0.353553390593274     
1375:  RDiff Down @ bottom:  -2.775859938347825E-002
1375:  RDiff Up @ bottom:   0.000000000000000E+000
1375:  Rbeam @ bottom:   2.775859938347824E-002
1375:  ENDRUN:
1375:  ERROR in TwoStreamMLPEMod.F90 at line 1587     

and finally:

1375:  ENDRUN:
1375:  ERROR in TwoStreamMLPEMod.F90 at line 1587  

This is all repeated from multiple threads.
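
Because the same output is repeated by many threads, it may help to pull the element dumps out of the log programmatically. The Python sketch below assumes the line layout shown above (a prefix such as `1375:`, then `key: value`) and that the lines of one dump are contiguous in the log. The helper names `parse_elements` and `flag_empty` are hypothetical. Flagging elements with zero `lai` and `sai` is just one pattern visible in the sample block, not a confirmed cause of the failure.

```python
import re

# Matches "<prefix>:  <key>:  <values>" lines as they appear in the sample output.
LINE_RE = re.compile(r"^\s*(\d+):\s+(.+?):\s+(.*\S)\s*$")
# Matches the "--  3  3 --" header that opens each element block.
HEAD_RE = re.compile(r"^\s*(\d+):\s+--\s+(\d+)\s+(\d+)\s+--\s*$")


def parse_elements(lines):
    """Group dump lines into one dict per element block."""
    elements, current = [], None
    for line in lines:
        if "ENDRUN" in line:
            current = None  # the dump ends when the run aborts
            continue
        head = HEAD_RE.match(line)
        if head:
            current = {"prefix": int(head.group(1)),
                       "indices": (int(head.group(2)), int(head.group(3)))}
            elements.append(current)
            continue
        m = LINE_RE.match(line)
        if m and current is not None:
            key, raw = m.group(2), m.group(3)
            try:
                vals = [float(v) for v in raw.split()]
            except ValueError:
                continue  # skip lines whose value is not numeric
            current[key] = vals[0] if len(vals) == 1 else vals
    return elements


def flag_empty(elements):
    """Return elements whose lai and sai are both exactly zero."""
    return [e for e in elements if e.get("lai,sai") == [0.0, 0.0]]


# A shortened copy of the block above, used as sample input.
sample = """\
1375:  --           3           3 --
1375:  pft:           0
1375:  area:   0.106609674796023
1375:  lai,sai:   0.000000000000000E+000  0.000000000000000E+000
1375:  Rb0:   2.775859938347824E-002
1375:  ENDRUN:
""".splitlines()

elements = parse_elements(sample)
for e in flag_empty(elements):
    print(e["indices"], e["pft"], e["area"])
```

In the sample block the element has zero `lai` and `sai` but a nonzero `area`; whether that is related to the poor forward solution is not established here.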

CTSM version you are using: [output of git describe]

ctsm5.4.002_noresm_v3
sci.1.88.6_api.42.0.0_nor_sci2_api1

Does this bug cause significantly incorrect results in the model's science? [Yes / No]

Hard to tell; the run crashes before producing any results.

Configurations affected: [Fill this in if known.]

Details of bug

See above

Important details of your setup / configuration so we can reproduce the bug

This is not pervasive; it happens for a particular parameter set and overall setup.

Important output or errors that show the problem
