bartgol (Contributor) commented Sep 30, 2024

The small device allocations removed here were likely left over from when FieldLayout did not store device-friendly extents.

I had this branch sitting on my workstation, but forgot to open the PR. These small device allocs were found during the NERSC hackathon a few weeks back.

fixes #2958

tcclevenger previously approved these changes Sep 30, 2024
E3SM-Bot (Collaborator)

Status Flag 'Pull Request AutoTester' - Testing Jenkins Projects:

Pull Request Auto Testing STARTING

Build Information

Test Name: SCREAM_PullRequest_Autotester_Weaver

  • Build Num: 6087
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
PR_LABELS code quality;BFB;AT: AUTOMERGE;code cleanup
PULLREQUESTNUM 3021
SCREAM_SOURCE_REPO https://github.com/E3SM-Project/scream
SCREAM_SOURCE_SHA fa54928
SCREAM_TARGET_BRANCH master
SCREAM_TARGET_REPO https://github.com/E3SM-Project/scream
SCREAM_TARGET_SHA 5ad00c2
TEST_REPO_ALIAS SCREAM

Build Information

Test Name: SCREAM_PullRequest_Autotester_Mappy

  • Build Num: 5858
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
PR_LABELS code quality;BFB;AT: AUTOMERGE;code cleanup
PULLREQUESTNUM 3021
SCREAM_SOURCE_REPO https://github.com/E3SM-Project/scream
SCREAM_SOURCE_SHA fa54928
SCREAM_TARGET_BRANCH master
SCREAM_TARGET_REPO https://github.com/E3SM-Project/scream
SCREAM_TARGET_SHA 5ad00c2
TEST_REPO_ALIAS SCREAM

Using Repos:

Repo: SCREAM (E3SM-Project/scream)
  • Branch: bartgol/eamxx/remove-runtime-allocs
  • SHA: fa54928
  • Mode: TEST_REPO

Pull Request Author: bartgol

E3SM-Bot (Collaborator)

Status Flag 'Pull Request AutoTester' - Jenkins Testing: 1 or more Jobs FAILED

Note: Testing will normally be attempted again in approx. 2 Hrs. If a change to the PR source branch occurs, the testing will be attempted again on next available autotester run.

Pull Request Auto Testing has FAILED

Build Information

Test Name: SCREAM_PullRequest_Autotester_Weaver

  • Build Num: 6087
  • Status: FAILED

Jenkins Parameters

Parameter Name Value
PR_LABELS code quality;BFB;AT: AUTOMERGE;code cleanup
PULLREQUESTNUM 3021
SCREAM_SOURCE_REPO https://github.com/E3SM-Project/scream
SCREAM_SOURCE_SHA fa54928
SCREAM_TARGET_BRANCH master
SCREAM_TARGET_REPO https://github.com/E3SM-Project/scream
SCREAM_TARGET_SHA 5ad00c2
TEST_REPO_ALIAS SCREAM

Build Information

Test Name: SCREAM_PullRequest_Autotester_Mappy

  • Build Num: 5858
  • Status: FAILED

Jenkins Parameters

Parameter Name Value
PR_LABELS code quality;BFB;AT: AUTOMERGE;code cleanup
PULLREQUESTNUM 3021
SCREAM_SOURCE_REPO https://github.com/E3SM-Project/scream
SCREAM_SOURCE_SHA fa54928
SCREAM_TARGET_BRANCH master
SCREAM_TARGET_REPO https://github.com/E3SM-Project/scream
SCREAM_TARGET_SHA 5ad00c2
TEST_REPO_ALIAS SCREAM

SCREAM_PullRequest_Autotester_Weaver # 6087 FAILED (last 100 lines of console output)

209 | integer :: nsteps, n0_qdp, np1_qdp, ie, i, j, geometry_type, nerr
| 1
Warning: Unused variable 'j' declared at (1) [-Wunused-variable]
/home/e3sm-jenkins/weaver/workspace/SCREAM_PullRequest_Autotester_Weaver/6087/scream/components/homme/src/share/compose_test_mod.F90:184:8:

184 | use dimensions_mod, only: ne, np, nlev, qsize, qsize_d, nelemd
| 1
Warning: Unused module variable 'ne' which has been explicitly imported at (1) [-Wunused-variable]
/home/e3sm-jenkins/weaver/workspace/SCREAM_PullRequest_Autotester_Weaver/6087/scream/components/homme/src/share/compose_test_mod.F90:182:8:

182 | use thread_mod, only: hthreads, vthreads, omp_set_num_threads, omp_get_thread_num
| 1
Warning: Unused module variable 'vthreads' which has been explicitly imported at (1) [-Wunused-variable]
/home/e3sm-jenkins/weaver/workspace/SCREAM_PullRequest_Autotester_Weaver/6087/scream/components/homme/src/share/compose_test_mod.F90:86:8:

86 | use thread_mod, only: hthreads, vthreads, omp_set_num_threads, omp_get_thread_num
| 1
Warning: Unused module variable 'vthreads' which has been explicitly imported at (1) [-Wunused-variable]
[ 52%] Linking Fortran static library libtheta-l_kokkos_4_128_10.a
[ 52%] Building Fortran object src/dynamics/homme/tests/CMakeFiles/theta-l_kokkos_4_72_10.dir/home/e3sm-jenkins/weaver/workspace/SCREAM_PullRequest_Autotester_Weaver/6087/scream/components/homme/src/share/compose_test_mod.F90.o
[ 52%] Built target theta-l_kokkos_4_128_10
/home/e3sm-jenkins/weaver/workspace/SCREAM_PullRequest_Autotester_Weaver/6087/scream/components/homme/src/share/compose_test_mod.F90:182:8:

182 | use thread_mod, only: hthreads, vthreads, omp_set_num_threads, omp_get_thread_num
| 1
Warning: Unused module variable 'hthreads' which has been explicitly imported at (1) [-Wunused-variable]
/home/e3sm-jenkins/weaver/workspace/SCREAM_PullRequest_Autotester_Weaver/6087/scream/components/homme/src/share/compose_test_mod.F90:209:48:

209 | integer :: nsteps, n0_qdp, np1_qdp, ie, i, j, geometry_type, nerr
| 1
Warning: Unused variable 'j' declared at (1) [-Wunused-variable]
/home/e3sm-jenkins/weaver/workspace/SCREAM_PullRequest_Autotester_Weaver/6087/scream/components/homme/src/share/compose_test_mod.F90:184:8:

184 | use dimensions_mod, only: ne, np, nlev, qsize, qsize_d, nelemd
| 1
Warning: Unused module variable 'ne' which has been explicitly imported at (1) [-Wunused-variable]
/home/e3sm-jenkins/weaver/workspace/SCREAM_PullRequest_Autotester_Weaver/6087/scream/components/homme/src/share/compose_test_mod.F90:182:8:

182 | use thread_mod, only: hthreads, vthreads, omp_set_num_threads, omp_get_thread_num
| 1
Warning: Unused module variable 'vthreads' which has been explicitly imported at (1) [-Wunused-variable]
/home/e3sm-jenkins/weaver/workspace/SCREAM_PullRequest_Autotester_Weaver/6087/scream/components/homme/src/share/compose_test_mod.F90:86:8:

86 | use thread_mod, only: hthreads, vthreads, omp_set_num_threads, omp_get_thread_num
| 1
Warning: Unused module variable 'vthreads' which has been explicitly imported at (1) [-Wunused-variable]
[ 52%] Linking Fortran static library libtheta-l_kokkos_4_72_10.a
[ 52%] Building Fortran object tests/multi-process/dynamics_physics/mam/homme_shoc_cld_p3_mam_optics_rrtmgp/CMakeFiles/theta-l_kokkos_4_72_41.dir/home/e3sm-jenkins/weaver/workspace/SCREAM_PullRequest_Autotester_Weaver/6087/scream/components/homme/src/share/compose_test_mod.F90.o
/home/e3sm-jenkins/weaver/workspace/SCREAM_PullRequest_Autotester_Weaver/6087/scream/components/homme/src/share/compose_test_mod.F90:182:8:

182 | use thread_mod, only: hthreads, vthreads, omp_set_num_threads, omp_get_thread_num
| 1
Warning: Unused module variable 'hthreads' which has been explicitly imported at (1) [-Wunused-variable]
/home/e3sm-jenkins/weaver/workspace/SCREAM_PullRequest_Autotester_Weaver/6087/scream/components/homme/src/share/compose_test_mod.F90:209:48:

209 | integer :: nsteps, n0_qdp, np1_qdp, ie, i, j, geometry_type, nerr
| 1
Warning: Unused variable 'j' declared at (1) [-Wunused-variable]
/home/e3sm-jenkins/weaver/workspace/SCREAM_PullRequest_Autotester_Weaver/6087/scream/components/homme/src/share/compose_test_mod.F90:184:8:

184 | use dimensions_mod, only: ne, np, nlev, qsize, qsize_d, nelemd
| 1
Warning: Unused module variable 'ne' which has been explicitly imported at (1) [-Wunused-variable]
/home/e3sm-jenkins/weaver/workspace/SCREAM_PullRequest_Autotester_Weaver/6087/scream/components/homme/src/share/compose_test_mod.F90:182:8:

182 | use thread_mod, only: hthreads, vthreads, omp_set_num_threads, omp_get_thread_num
| 1
Warning: Unused module variable 'vthreads' which has been explicitly imported at (1) [-Wunused-variable]
/home/e3sm-jenkins/weaver/workspace/SCREAM_PullRequest_Autotester_Weaver/6087/scream/components/homme/src/share/compose_test_mod.F90:86:8:

86 | use thread_mod, only: hthreads, vthreads, omp_set_num_threads, omp_get_thread_num
| 1
Warning: Unused module variable 'vthreads' which has been explicitly imported at (1) [-Wunused-variable]
[ 52%] Linking Fortran static library libtheta-l_kokkos_4_72_41.a
[ 52%] Built target theta-l_kokkos_4_72_10
[ 52%] Built target theta-l_kokkos_4_72_41
gmake: *** [Makefile:166: all] Error 2

Error(s) occurred during test phase
OVERALL STATUS: FAIL
Starting analysis on weaver with cmd: cd /home/e3sm-jenkins/weaver/workspace/SCREAM_PullRequest_Autotester_Weaver/6087/scream/components/eamxx && source /etc/profile.d/modules.sh && module purge && module load cmake/3.25.1 git/2.39.1 python/3.10.8 py-netcdf4/1.5.8 gcc/11.3.0 cuda/11.8.0 openmpi netcdf-c netcdf-fortran parallel-netcdf netlib-lapack && export HDF5_USE_FILE_LOCKING=FALSE && true && bsub -I -q rhel8 -n 4 -gpu num=4 ./scripts/test-all-scream --baseline-dir AUTO $compiler -p -c EKAT_DISABLE_TPL_WARNINGS=ON -m weaver
RUN: cd /home/e3sm-jenkins/weaver/workspace/SCREAM_PullRequest_Autotester_Weaver/6087/scream/components/eamxx && source /etc/profile.d/modules.sh && module purge && module load cmake/3.25.1 git/2.39.1 python/3.10.8 py-netcdf4/1.5.8 gcc/11.3.0 cuda/11.8.0 openmpi netcdf-c netcdf-fortran parallel-netcdf netlib-lapack && export HDF5_USE_FILE_LOCKING=FALSE && true && bsub -I -q rhel8 -n 4 -gpu num=4 ./scripts/test-all-scream --baseline-dir AUTO $compiler -p -c EKAT_DISABLE_TPL_WARNINGS=ON -m weaver
FROM: /home/e3sm-jenkins/weaver/workspace/SCREAM_PullRequest_Autotester_Weaver/6087/scream/components/eamxx
weaver failed
######################################################
Build step 'Execute shell' marked build as failure
Performing Post build task...
Match found for : : True
Logical operation result is TRUE
Running script : #!/bin/bash -le

cd $WORKSPACE/${BUILD_ID}/

./scream/components/eamxx/scripts/jenkins/jenkins_cleanup.sh
[SCREAM_PullRequest_Autotester_Weaver] $ /bin/bash -le /tmp/jenkins4180747085295428996.sh
POST BUILD TASK : SUCCESS
END OF POST BUILD TASK : 0
Sending e-mails to: lbertag@sandia.gov
Finished: FAILURE

SCREAM_PullRequest_Autotester_Mappy # 5858 FAILED (last 100 lines of console output)

[ 48%] Linking Fortran static library libtheta-l_kokkos_4_72_10.a
[ 48%] Linking Fortran static library libtheta-l_kokkos_4_128_10.a
/home/e3sm-jenkins/jenkins-ws/workspace/SCREAM_PullRequest_Autotester_Mappy/5858/scream/components/homme/src/share/compose_test_mod.F90:182:8:

182 | use thread_mod, only: hthreads, vthreads, omp_set_num_threads, omp_get_thread_num
| 1
Warning: Unused module variable 'hthreads' which has been explicitly imported at (1) [-Wunused-variable]
/home/e3sm-jenkins/jenkins-ws/workspace/SCREAM_PullRequest_Autotester_Mappy/5858/scream/components/homme/src/share/compose_test_mod.F90:209:48:

209 | integer :: nsteps, n0_qdp, np1_qdp, ie, i, j, geometry_type, nerr
| 1
Warning: Unused variable 'j' declared at (1) [-Wunused-variable]
/home/e3sm-jenkins/jenkins-ws/workspace/SCREAM_PullRequest_Autotester_Mappy/5858/scream/components/homme/src/share/compose_test_mod.F90:184:8:

184 | use dimensions_mod, only: ne, np, nlev, qsize, qsize_d, nelemd
| 1
Warning: Unused module variable 'ne' which has been explicitly imported at (1) [-Wunused-variable]
/home/e3sm-jenkins/jenkins-ws/workspace/SCREAM_PullRequest_Autotester_Mappy/5858/scream/components/homme/src/share/compose_test_mod.F90:182:8:

182 | use thread_mod, only: hthreads, vthreads, omp_set_num_threads, omp_get_thread_num
| 1
Warning: Unused module variable 'vthreads' which has been explicitly imported at (1) [-Wunused-variable]
/home/e3sm-jenkins/jenkins-ws/workspace/SCREAM_PullRequest_Autotester_Mappy/5858/scream/components/homme/src/share/compose_test_mod.F90:86:8:

86 | use thread_mod, only: hthreads, vthreads, omp_set_num_threads, omp_get_thread_num
| 1
Warning: Unused module variable 'vthreads' which has been explicitly imported at (1) [-Wunused-variable]
[ 48%] Built target theta-l_kokkos_4_72_10
[ 48%] Built target theta-l_kokkos_4_128_10
[ 48%] Linking Fortran static library libtheta-l_kokkos_4_72_41.a
[ 48%] Built target theta-l_kokkos_4_72_41
gmake: *** [Makefile:166: all] Error 2

Error(s) occurred during test phase
OVERALL STATUS: FAIL
Starting analysis on mappy with cmd: cd /home/e3sm-jenkins/jenkins-ws/workspace/SCREAM_PullRequest_Autotester_Mappy/5858/scream/components/eamxx && source /projects/sems/modulefiles/utils/sems-modules-init.sh && module purge && module load sems-cmake/3.27.9 sems-git/2.42.0 sems-gcc/11.4.0 sems-openmpi-no-cuda/4.1.6 sems-netcdf-c/4.9.2 sems-netcdf-cxx/4.2 sems-netcdf-fortran/4.6.1 sems-parallel-netcdf/1.12.3 sems-openblas && export GATOR_INITIAL_MB=4000MB && export OMP_PROC_BIND=spread && true && ./scripts/test-all-scream --baseline-dir AUTO $compiler -p -c EKAT_DISABLE_TPL_WARNINGS=ON -m mappy
RUN: cd /home/e3sm-jenkins/jenkins-ws/workspace/SCREAM_PullRequest_Autotester_Mappy/5858/scream/components/eamxx && source /projects/sems/modulefiles/utils/sems-modules-init.sh && module purge && module load sems-cmake/3.27.9 sems-git/2.42.0 sems-gcc/11.4.0 sems-openmpi-no-cuda/4.1.6 sems-netcdf-c/4.9.2 sems-netcdf-cxx/4.2 sems-netcdf-fortran/4.6.1 sems-parallel-netcdf/1.12.3 sems-openblas && export GATOR_INITIAL_MB=4000MB && export OMP_PROC_BIND=spread && true && ./scripts/test-all-scream --baseline-dir AUTO $compiler -p -c EKAT_DISABLE_TPL_WARNINGS=ON -m mappy
FROM: /home/e3sm-jenkins/jenkins-ws/workspace/SCREAM_PullRequest_Autotester_Mappy/5858/scream/components/eamxx
mappy failed
SCREAM V1 TESTING FAILED!
Waiting for tests to finish
FAIL ERP_D_Lh4.ne4_ne4.F2010-SCREAMv1.mappy_gnu.scream-output-preset-1 (phase MODEL_BUILD)
Case dir: /home/e3sm-jenkins/acme/scratch/ERP_D_Lh4.ne4_ne4.F2010-SCREAMv1.mappy_gnu.scream-output-preset-1.C.20240930_124815_td30m6
FAIL ERP_Ln22.ne4pg2_ne4pg2.F2010-SCREAMv1.mappy_gnu.scream-output-preset-4 (phase MODEL_BUILD)
Case dir: /home/e3sm-jenkins/acme/scratch/ERP_Ln22.ne4pg2_ne4pg2.F2010-SCREAMv1.mappy_gnu.scream-output-preset-4.C.20240930_124815_td30m6
FAIL ERS_D_Ln22.ne4pg2_ne4pg2.F2010-SCREAMv1.mappy_gnu.scream-rad_frequency_2--scream-output-preset-5 (phase MODEL_BUILD)
Case dir: /home/e3sm-jenkins/acme/scratch/ERS_D_Ln22.ne4pg2_ne4pg2.F2010-SCREAMv1.mappy_gnu.scream-rad_frequency_2--scream-output-preset-5.C.20240930_124815_td30m6
FAIL ERS_Ln22.ne4pg2_ne4pg2.F2010-SCREAMv1.mappy_gnu.scream-small_kernels--scream-output-preset-5 (phase MODEL_BUILD)
Case dir: /home/e3sm-jenkins/acme/scratch/ERS_Ln22.ne4pg2_ne4pg2.F2010-SCREAMv1.mappy_gnu.scream-small_kernels--scream-output-preset-5.C.20240930_124815_td30m6
FAIL ERS_Ln22.ne4pg2_ne4pg2.F2010-SCREAMv1.mappy_gnu.scream-small_kernels_p3--scream-output-preset-5 (phase MODEL_BUILD)
Case dir: /home/e3sm-jenkins/acme/scratch/ERS_Ln22.ne4pg2_ne4pg2.F2010-SCREAMv1.mappy_gnu.scream-small_kernels_p3--scream-output-preset-5.C.20240930_124815_td30m6
FAIL ERS_Ln22.ne4pg2_ne4pg2.F2010-SCREAMv1.mappy_gnu.scream-small_kernels_shoc--scream-output-preset-5 (phase MODEL_BUILD)
Case dir: /home/e3sm-jenkins/acme/scratch/ERS_Ln22.ne4pg2_ne4pg2.F2010-SCREAMv1.mappy_gnu.scream-small_kernels_shoc--scream-output-preset-5.C.20240930_124815_td30m6
FAIL ERS_Ln9.ne4_ne4.F2000-SCREAMv1-AQP1.mappy_gnu.scream-output-preset-2 (phase MODEL_BUILD)
Case dir: /home/e3sm-jenkins/acme/scratch/ERS_Ln9.ne4_ne4.F2000-SCREAMv1-AQP1.mappy_gnu.scream-output-preset-2.C.20240930_124815_td30m6
FAIL ERS_P16_Ln22.ne30pg2_ne30pg2.FIOP-SCREAMv1-DP.mappy_gnu.scream-dpxx-arm97 (phase MODEL_BUILD)
Case dir: /home/e3sm-jenkins/acme/scratch/ERS_P16_Ln22.ne30pg2_ne30pg2.FIOP-SCREAMv1-DP.mappy_gnu.scream-dpxx-arm97.C.20240930_124815_td30m6
FAIL ERS_P16_Ln22.ne30pg2_ne30pg2.FIOP-SCREAMv1-DP.mappy_gnu.scream-dpxx-comble (phase MODEL_BUILD)
Case dir: /home/e3sm-jenkins/acme/scratch/ERS_P16_Ln22.ne30pg2_ne30pg2.FIOP-SCREAMv1-DP.mappy_gnu.scream-dpxx-comble.C.20240930_124815_td30m6
FAIL ERS_P16_Ln22.ne30pg2_ne30pg2.FIOP-SCREAMv1-DP.mappy_gnu.scream-dpxx-dycomsrf01 (phase MODEL_BUILD)
Case dir: /home/e3sm-jenkins/acme/scratch/ERS_P16_Ln22.ne30pg2_ne30pg2.FIOP-SCREAMv1-DP.mappy_gnu.scream-dpxx-dycomsrf01.C.20240930_124815_td30m6
FAIL ERS_P16_Ln22.ne30pg2_ne30pg2.FRCE-SCREAMv1-DP.mappy_gnu (phase MODEL_BUILD)
Case dir: /home/e3sm-jenkins/acme/scratch/ERS_P16_Ln22.ne30pg2_ne30pg2.FRCE-SCREAMv1-DP.mappy_gnu.C.20240930_124815_td30m6
FAIL PET_Ln9_P32x2.ne4pg2_ne4pg2.F2010-SCREAMv1.mappy_gnu.scream-output-preset-1 (phase MODEL_BUILD)
Case dir: /home/e3sm-jenkins/acme/scratch/PET_Ln9_P32x2.ne4pg2_ne4pg2.F2010-SCREAMv1.mappy_gnu.scream-output-preset-1.C.20240930_124815_td30m6
FAIL SMS_D_Ln5.ne4pg2_oQU480.F2010-SCREAMv1-MPASSI.mappy_gnu.scream-mam4xx-aci (phase MODEL_BUILD)
Case dir: /home/e3sm-jenkins/acme/scratch/SMS_D_Ln5.ne4pg2_oQU480.F2010-SCREAMv1-MPASSI.mappy_gnu.scream-mam4xx-aci.C.20240930_124815_td30m6
FAIL SMS_D_Ln5.ne4pg2_oQU480.F2010-SCREAMv1-MPASSI.mappy_gnu.scream-mam4xx-drydep (phase MODEL_BUILD)
Case dir: /home/e3sm-jenkins/acme/scratch/SMS_D_Ln5.ne4pg2_oQU480.F2010-SCREAMv1-MPASSI.mappy_gnu.scream-mam4xx-drydep.C.20240930_124815_td30m6
FAIL SMS_D_Ln5.ne4pg2_oQU480.F2010-SCREAMv1-MPASSI.mappy_gnu.scream-mam4xx-optics (phase MODEL_BUILD)
Case dir: /home/e3sm-jenkins/acme/scratch/SMS_D_Ln5.ne4pg2_oQU480.F2010-SCREAMv1-MPASSI.mappy_gnu.scream-mam4xx-optics.C.20240930_124815_td30m6
FAIL SMS_D_Ln5.ne4pg2_oQU480.F2010-SCREAMv1-MPASSI.mappy_gnu.scream-mam4xx-wetscav (phase MODEL_BUILD)
Case dir: /home/e3sm-jenkins/acme/scratch/SMS_D_Ln5.ne4pg2_oQU480.F2010-SCREAMv1-MPASSI.mappy_gnu.scream-mam4xx-wetscav.C.20240930_124815_td30m6
FAIL SMS_D_Ln9.ne4_ne4.F2010-SCREAMv1-noAero.mappy_gnu.scream-output-preset-3 (phase MODEL_BUILD)
Case dir: /home/e3sm-jenkins/acme/scratch/SMS_D_Ln9.ne4_ne4.F2010-SCREAMv1-noAero.mappy_gnu.scream-output-preset-3.C.20240930_124815_td30m6
test-scheduler took 509.3573594093323 seconds
######################################################
Build step 'Execute shell' marked build as failure
$ ssh-agent -k
unset SSH_AUTH_SOCK;
unset SSH_AGENT_PID;
echo Agent pid 4091072 killed;
[ssh-agent] Stopped.
Performing Post build task...
Match found for : : True
Logical operation result is TRUE
Running script : #!/bin/bash -le

cd $WORKSPACE/${BUILD_ID}/

./scream/components/eamxx/scripts/jenkins/jenkins_cleanup.sh

# We're having issues with some test-launcher jobs hanging forever, so let's make sure we clean up all pending test-launcher jobs

squeue -o"%.7i %u %40j" | grep e3sm-jenkins | grep test-launcher | awk '{ print $1 }' | xargs -r scancel

[SCREAM_PullRequest_Autotester_Mappy] $ /bin/bash -le /tmp/jenkins2161400527173379345.sh
POST BUILD TASK : SUCCESS
END OF POST BUILD TASK : 0
Sending e-mails to: lbertag@sandia.gov
Finished: FAILURE

mahf708 linked an issue Sep 30, 2024 that may be closed by this pull request
tcclevenger previously approved these changes Oct 1, 2024
E3SM-Bot (Collaborator) commented Oct 1, 2024

Status Flag 'Pull Request AutoTester' - Testing Jenkins Projects:

Pull Request Auto Testing STARTING

Build Information

Test Name: SCREAM_PullRequest_Autotester_Weaver

  • Build Num: 6090
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
PR_LABELS code quality;BFB;AT: AUTOMERGE;code cleanup
PULLREQUESTNUM 3021
SCREAM_SOURCE_REPO https://github.com/E3SM-Project/scream
SCREAM_SOURCE_SHA a01889e
SCREAM_TARGET_BRANCH master
SCREAM_TARGET_REPO https://github.com/E3SM-Project/scream
SCREAM_TARGET_SHA 5ad00c2
TEST_REPO_ALIAS SCREAM

Build Information

Test Name: SCREAM_PullRequest_Autotester_Mappy

  • Build Num: 5860
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
PR_LABELS code quality;BFB;AT: AUTOMERGE;code cleanup
PULLREQUESTNUM 3021
SCREAM_SOURCE_REPO https://github.com/E3SM-Project/scream
SCREAM_SOURCE_SHA a01889e
SCREAM_TARGET_BRANCH master
SCREAM_TARGET_REPO https://github.com/E3SM-Project/scream
SCREAM_TARGET_SHA 5ad00c2
TEST_REPO_ALIAS SCREAM

Using Repos:

Repo: SCREAM (E3SM-Project/scream)
  • Branch: bartgol/eamxx/remove-runtime-allocs
  • SHA: a01889e
  • Mode: TEST_REPO

Pull Request Author: bartgol

E3SM-Bot (Collaborator) commented Oct 1, 2024

Status Flag 'Pull Request AutoTester' - Jenkins Testing: 1 or more Jobs FAILED

Note: Testing will normally be attempted again in approx. 2 Hrs. If a change to the PR source branch occurs, the testing will be attempted again on next available autotester run.

Pull Request Auto Testing has FAILED

Build Information

Test Name: SCREAM_PullRequest_Autotester_Weaver

  • Build Num: 6090
  • Status: FAILED

Jenkins Parameters

Parameter Name Value
PR_LABELS code quality;BFB;AT: AUTOMERGE;code cleanup
PULLREQUESTNUM 3021
SCREAM_SOURCE_REPO https://github.com/E3SM-Project/scream
SCREAM_SOURCE_SHA a01889e
SCREAM_TARGET_BRANCH master
SCREAM_TARGET_REPO https://github.com/E3SM-Project/scream
SCREAM_TARGET_SHA 5ad00c2
TEST_REPO_ALIAS SCREAM

Build Information

Test Name: SCREAM_PullRequest_Autotester_Mappy

  • Build Num: 5860
  • Status: FAILED

Jenkins Parameters

Parameter Name Value
PR_LABELS code quality;BFB;AT: AUTOMERGE;code cleanup
PULLREQUESTNUM 3021
SCREAM_SOURCE_REPO https://github.com/E3SM-Project/scream
SCREAM_SOURCE_SHA a01889e
SCREAM_TARGET_BRANCH master
SCREAM_TARGET_REPO https://github.com/E3SM-Project/scream
SCREAM_TARGET_SHA 5ad00c2
TEST_REPO_ALIAS SCREAM

SCREAM_PullRequest_Autotester_Weaver # 6090 FAILED (last 100 lines of console output)

70:surface_upward_latent_heat_flux
72:aodvis
75:atm_backtend
80:homme_standalone_np1
81:homme_standalone_baseline_cmp
82:p3_standalone_np1
83:p3_tend_check_np1
84:p3_standalone_baseline_cmp
85:shoc_standalone_np1
86:shoc_tend_check_np1
87:check_U_V_slices_np1
88:check_surf_mom_flux_slices_np1
89:shoc_standalone_baseline_cmp
90:check_U_V_slices_fail_diff
91:check_U_V_slices_fail_layout
92:check_U_V_slices_fail_missing
94:spa_standalone_np1
95:spa_standalone_baseline_cmp
96:surface_coupling_np1
97:rrtmgp_standalone_unit
98:rrtmgp_standalone_not_chunked_np1
99:rrtmgp_standalone_chunked
100:rrtmgp_standalone_chunked_vs_not_chunked
101:rrtmgp_standalone_chunked_baseline_cmp
102:cosp_standalone_np1
103:cosp_standalone_baseline_cmp
104:mam4_optics_standalone_np1
105:mam4_optics_standalone_baseline_cmp
106:mam4_aci_standalone_np1
107:mam4_aci_standalone_baseline_cmp
108:mam4_drydep_standalone_np1
109:mam4_drydep_standalone_baseline_cmp
110:mam4_wetscav_standalone_np1
111:mam4_wetscav_standalone_baseline_cmp
112:mam4_srf_online_emiss_standalone_np1
113:mam4_srf_online_emiss_standalone_baseline_cmp
114:mam4_constituent_fluxes_standalone_np1
115:mam4_constituent_fluxes_standalone_baseline_cmp
116:shoc_cld_p3_rrtmgp_np1
117:shoc_cld_spa_p3_rrtmgp_np1
118:mam_optics_rrtmgp_np1
119:shoc_mam4_aci_np1
120:shoc_mam4_drydep_np1
121:shoc_cldfrac_mam4_aci_p3_np1
122:shoc_cldfrac_mam4_aci_p3_rrtmgp_np1
123:shoc_cldfrac_mam4_aci_p3_mam4_optics_rrtmgp_np1
124:p3_mam4_wetscav_np1
125:shoc_cldfrac_p3_wetscav_np1
126:mam4_srf_online_emiss_mam4_constituent_fluxes_np1
127:shoc_p3_subcycled
128:shoc_p3_monolithic
129:check_subcycling
130:check_subcycling_tend_check
132:shoc_p3_source
133:shoc_p3_nudged
134:shoc_p3_nudged_remapped
135:shoc_p3_nudging_glob_novert
136:homme_shoc_cld_p3_rrtmgp_np1
137:homme_shoc_cld_p3_rrtmgp_baseline_cmp
138:homme_shoc_cld_p3_rrtmgp_pg2_np1
139:homme_shoc_cld_p3_rrtmgp_pg2_baseline_cmp
140:model_baseline
141:model_initial
142:model_restart
143:restarted_vs_monolithic_check_np1
144:homme_shoc_cld_spa_p3_rrtmgp_np1
145:homme_shoc_cld_spa_p3_rrtmgp_baseline_cmp
146:homme_shoc_cld_spa_p3_rrtmgp_128levels_np1
147:homme_shoc_cld_spa_p3_rrtmgp_128levels_tend_check_np1
148:homme_shoc_cld_spa_p3_rrtmgp_128levels_baseline_cmp
149:homme_shoc_cld_spa_p3_rrtmgp_pg2_dp_np1
150:homme_shoc_cld_spa_p3_rrtmgp_pg2_dp_baseline_cmp
151:homme_shoc_cld_p3_mam_optics_rrtmgp_np1
152:homme_shoc_cld_p3_mam_optics_rrtmgp_baseline_cmp
153:homme_shoc_cld_mam_aci_p3_mam_optics_rrtmgp_mam_drydep_np1
154:homme_shoc_cld_mam_aci_p3_mam_optics_rrtmgp_mam_drydep_baseline_cmp
155:homme_shoc_cld_spa_p3_rrtmgp_mam4_wetscav_np1
156:homme_shoc_cld_spa_p3_rrtmgp_mam4_wetscav_baseline_cmp

Error(s) occurred during test phase
OVERALL STATUS: FAIL
Starting analysis on weaver with cmd: cd /home/e3sm-jenkins/weaver/workspace/SCREAM_PullRequest_Autotester_Weaver/6090/scream/components/eamxx && source /etc/profile.d/modules.sh && module purge && module load cmake/3.25.1 git/2.39.1 python/3.10.8 py-netcdf4/1.5.8 gcc/11.3.0 cuda/11.8.0 openmpi netcdf-c netcdf-fortran parallel-netcdf netlib-lapack && export HDF5_USE_FILE_LOCKING=FALSE && true && bsub -I -q rhel8 -n 4 -gpu num=4 ./scripts/test-all-scream --baseline-dir AUTO $compiler -p -c EKAT_DISABLE_TPL_WARNINGS=ON -m weaver
RUN: cd /home/e3sm-jenkins/weaver/workspace/SCREAM_PullRequest_Autotester_Weaver/6090/scream/components/eamxx && source /etc/profile.d/modules.sh && module purge && module load cmake/3.25.1 git/2.39.1 python/3.10.8 py-netcdf4/1.5.8 gcc/11.3.0 cuda/11.8.0 openmpi netcdf-c netcdf-fortran parallel-netcdf netlib-lapack && export HDF5_USE_FILE_LOCKING=FALSE && true && bsub -I -q rhel8 -n 4 -gpu num=4 ./scripts/test-all-scream --baseline-dir AUTO $compiler -p -c EKAT_DISABLE_TPL_WARNINGS=ON -m weaver
FROM: /home/e3sm-jenkins/weaver/workspace/SCREAM_PullRequest_Autotester_Weaver/6090/scream/components/eamxx
weaver failed
######################################################
Build step 'Execute shell' marked build as failure
Performing Post build task...
Match found for : : True
Logical operation result is TRUE
Running script : #!/bin/bash -le

cd $WORKSPACE/${BUILD_ID}/

./scream/components/eamxx/scripts/jenkins/jenkins_cleanup.sh
[SCREAM_PullRequest_Autotester_Weaver] $ /bin/bash -le /tmp/jenkins7823848783670956950.sh
POST BUILD TASK : SUCCESS
END OF POST BUILD TASK : 0
Sending e-mails to: lbertag@sandia.gov
Finished: FAILURE

SCREAM_PullRequest_Autotester_Mappy # 5860 FAILED (last 100 lines of console output)

+ V1_FAILURES_DETAILS+='Waiting for tests to finish
PASS ERP_D_Lh4.ne4_ne4.F2010-SCREAMv1.mappy_gnu.scream-output-preset-1 RUN
    Case dir: /home/e3sm-jenkins/acme/scratch/ERP_D_Lh4.ne4_ne4.F2010-SCREAMv1.mappy_gnu.scream-output-preset-1.C.20241001_095205_1cvv3v
PASS ERP_Ln22.ne4pg2_ne4pg2.F2010-SCREAMv1.mappy_gnu.scream-output-preset-4 RUN
    Case dir: /home/e3sm-jenkins/acme/scratch/ERP_Ln22.ne4pg2_ne4pg2.F2010-SCREAMv1.mappy_gnu.scream-output-preset-4.C.20241001_095205_1cvv3v
PASS ERS_D_Ln22.ne4pg2_ne4pg2.F2010-SCREAMv1.mappy_gnu.scream-rad_frequency_2--scream-output-preset-5 RUN
    Case dir: /home/e3sm-jenkins/acme/scratch/ERS_D_Ln22.ne4pg2_ne4pg2.F2010-SCREAMv1.mappy_gnu.scream-rad_frequency_2--scream-output-preset-5.C.20241001_095205_1cvv3v
PASS ERS_Ln22.ne4pg2_ne4pg2.F2010-SCREAMv1.mappy_gnu.scream-small_kernels--scream-output-preset-5 RUN
    Case dir: /home/e3sm-jenkins/acme/scratch/ERS_Ln22.ne4pg2_ne4pg2.F2010-SCREAMv1.mappy_gnu.scream-small_kernels--scream-output-preset-5.C.20241001_095205_1cvv3v
PASS ERS_Ln22.ne4pg2_ne4pg2.F2010-SCREAMv1.mappy_gnu.scream-small_kernels_p3--scream-output-preset-5 RUN
    Case dir: /home/e3sm-jenkins/acme/scratch/ERS_Ln22.ne4pg2_ne4pg2.F2010-SCREAMv1.mappy_gnu.scream-small_kernels_p3--scream-output-preset-5.C.20241001_095205_1cvv3v
PASS ERS_Ln22.ne4pg2_ne4pg2.F2010-SCREAMv1.mappy_gnu.scream-small_kernels_shoc--scream-output-preset-5 RUN
    Case dir: /home/e3sm-jenkins/acme/scratch/ERS_Ln22.ne4pg2_ne4pg2.F2010-SCREAMv1.mappy_gnu.scream-small_kernels_shoc--scream-output-preset-5.C.20241001_095205_1cvv3v
PASS ERS_Ln9.ne4_ne4.F2000-SCREAMv1-AQP1.mappy_gnu.scream-output-preset-2 RUN
    Case dir: /home/e3sm-jenkins/acme/scratch/ERS_Ln9.ne4_ne4.F2000-SCREAMv1-AQP1.mappy_gnu.scream-output-preset-2.C.20241001_095205_1cvv3v
DIFF ERS_P16_Ln22.ne30pg2_ne30pg2.FIOP-SCREAMv1-DP.mappy_gnu.scream-dpxx-arm97 (phase BASELINE)
    Case dir: /home/e3sm-jenkins/acme/scratch/ERS_P16_Ln22.ne30pg2_ne30pg2.FIOP-SCREAMv1-DP.mappy_gnu.scream-dpxx-arm97.C.20241001_095205_1cvv3v
DIFF ERS_P16_Ln22.ne30pg2_ne30pg2.FIOP-SCREAMv1-DP.mappy_gnu.scream-dpxx-comble (phase BASELINE)
    Case dir: /home/e3sm-jenkins/acme/scratch/ERS_P16_Ln22.ne30pg2_ne30pg2.FIOP-SCREAMv1-DP.mappy_gnu.scream-dpxx-comble.C.20241001_095205_1cvv3v
DIFF ERS_P16_Ln22.ne30pg2_ne30pg2.FIOP-SCREAMv1-DP.mappy_gnu.scream-dpxx-dycomsrf01 (phase BASELINE)
    Case dir: /home/e3sm-jenkins/acme/scratch/ERS_P16_Ln22.ne30pg2_ne30pg2.FIOP-SCREAMv1-DP.mappy_gnu.scream-dpxx-dycomsrf01.C.20241001_095205_1cvv3v
DIFF ERS_P16_Ln22.ne30pg2_ne30pg2.FRCE-SCREAMv1-DP.mappy_gnu (phase BASELINE)
    Case dir: /home/e3sm-jenkins/acme/scratch/ERS_P16_Ln22.ne30pg2_ne30pg2.FRCE-SCREAMv1-DP.mappy_gnu.C.20241001_095205_1cvv3v
PASS PET_Ln9_P32x2.ne4pg2_ne4pg2.F2010-SCREAMv1.mappy_gnu.scream-output-preset-1 RUN
    Case dir: /home/e3sm-jenkins/acme/scratch/PET_Ln9_P32x2.ne4pg2_ne4pg2.F2010-SCREAMv1.mappy_gnu.scream-output-preset-1.C.20241001_095205_1cvv3v
PASS SMS_D_Ln5.ne4pg2_oQU480.F2010-SCREAMv1-MPASSI.mappy_gnu.scream-mam4xx-aci RUN
    Case dir: /home/e3sm-jenkins/acme/scratch/SMS_D_Ln5.ne4pg2_oQU480.F2010-SCREAMv1-MPASSI.mappy_gnu.scream-mam4xx-aci.C.20241001_095205_1cvv3v
PASS SMS_D_Ln5.ne4pg2_oQU480.F2010-SCREAMv1-MPASSI.mappy_gnu.scream-mam4xx-drydep RUN
    Case dir: /home/e3sm-jenkins/acme/scratch/SMS_D_Ln5.ne4pg2_oQU480.F2010-SCREAMv1-MPASSI.mappy_gnu.scream-mam4xx-drydep.C.20241001_095205_1cvv3v
PASS SMS_D_Ln5.ne4pg2_oQU480.F2010-SCREAMv1-MPASSI.mappy_gnu.scream-mam4xx-optics RUN
    Case dir: /home/e3sm-jenkins/acme/scratch/SMS_D_Ln5.ne4pg2_oQU480.F2010-SCREAMv1-MPASSI.mappy_gnu.scream-mam4xx-optics.C.20241001_095205_1cvv3v
PASS SMS_D_Ln5.ne4pg2_oQU480.F2010-SCREAMv1-MPASSI.mappy_gnu.scream-mam4xx-wetscav RUN
    Case dir: /home/e3sm-jenkins/acme/scratch/SMS_D_Ln5.ne4pg2_oQU480.F2010-SCREAMv1-MPASSI.mappy_gnu.scream-mam4xx-wetscav.C.20241001_095205_1cvv3v
PASS SMS_D_Ln9.ne4_ne4.F2010-SCREAMv1-noAero.mappy_gnu.scream-output-preset-3 RUN
    Case dir: /home/e3sm-jenkins/acme/scratch/SMS_D_Ln9.ne4_ne4.F2010-SCREAMv1-noAero.mappy_gnu.scream-output-preset-3.C.20241001_095205_1cvv3v
test-scheduler took 1810.660885334015 seconds'
+ set +x
######################################################
FAILS DETECTED:
  SCREAM V1 TESTING FAILED!
Waiting for tests to finish
PASS ERP_D_Lh4.ne4_ne4.F2010-SCREAMv1.mappy_gnu.scream-output-preset-1 RUN
    Case dir: /home/e3sm-jenkins/acme/scratch/ERP_D_Lh4.ne4_ne4.F2010-SCREAMv1.mappy_gnu.scream-output-preset-1.C.20241001_095205_1cvv3v
PASS ERP_Ln22.ne4pg2_ne4pg2.F2010-SCREAMv1.mappy_gnu.scream-output-preset-4 RUN
    Case dir: /home/e3sm-jenkins/acme/scratch/ERP_Ln22.ne4pg2_ne4pg2.F2010-SCREAMv1.mappy_gnu.scream-output-preset-4.C.20241001_095205_1cvv3v
PASS ERS_D_Ln22.ne4pg2_ne4pg2.F2010-SCREAMv1.mappy_gnu.scream-rad_frequency_2--scream-output-preset-5 RUN
    Case dir: /home/e3sm-jenkins/acme/scratch/ERS_D_Ln22.ne4pg2_ne4pg2.F2010-SCREAMv1.mappy_gnu.scream-rad_frequency_2--scream-output-preset-5.C.20241001_095205_1cvv3v
PASS ERS_Ln22.ne4pg2_ne4pg2.F2010-SCREAMv1.mappy_gnu.scream-small_kernels--scream-output-preset-5 RUN
    Case dir: /home/e3sm-jenkins/acme/scratch/ERS_Ln22.ne4pg2_ne4pg2.F2010-SCREAMv1.mappy_gnu.scream-small_kernels--scream-output-preset-5.C.20241001_095205_1cvv3v
PASS ERS_Ln22.ne4pg2_ne4pg2.F2010-SCREAMv1.mappy_gnu.scream-small_kernels_p3--scream-output-preset-5 RUN
    Case dir: /home/e3sm-jenkins/acme/scratch/ERS_Ln22.ne4pg2_ne4pg2.F2010-SCREAMv1.mappy_gnu.scream-small_kernels_p3--scream-output-preset-5.C.20241001_095205_1cvv3v
PASS ERS_Ln22.ne4pg2_ne4pg2.F2010-SCREAMv1.mappy_gnu.scream-small_kernels_shoc--scream-output-preset-5 RUN
    Case dir: /home/e3sm-jenkins/acme/scratch/ERS_Ln22.ne4pg2_ne4pg2.F2010-SCREAMv1.mappy_gnu.scream-small_kernels_shoc--scream-output-preset-5.C.20241001_095205_1cvv3v
PASS ERS_Ln9.ne4_ne4.F2000-SCREAMv1-AQP1.mappy_gnu.scream-output-preset-2 RUN
    Case dir: /home/e3sm-jenkins/acme/scratch/ERS_Ln9.ne4_ne4.F2000-SCREAMv1-AQP1.mappy_gnu.scream-output-preset-2.C.20241001_095205_1cvv3v
DIFF ERS_P16_Ln22.ne30pg2_ne30pg2.FIOP-SCREAMv1-DP.mappy_gnu.scream-dpxx-arm97 (phase BASELINE)
    Case dir: /home/e3sm-jenkins/acme/scratch/ERS_P16_Ln22.ne30pg2_ne30pg2.FIOP-SCREAMv1-DP.mappy_gnu.scream-dpxx-arm97.C.20241001_095205_1cvv3v
DIFF ERS_P16_Ln22.ne30pg2_ne30pg2.FIOP-SCREAMv1-DP.mappy_gnu.scream-dpxx-comble (phase BASELINE)
    Case dir: /home/e3sm-jenkins/acme/scratch/ERS_P16_Ln22.ne30pg2_ne30pg2.FIOP-SCREAMv1-DP.mappy_gnu.scream-dpxx-comble.C.20241001_095205_1cvv3v
DIFF ERS_P16_Ln22.ne30pg2_ne30pg2.FIOP-SCREAMv1-DP.mappy_gnu.scream-dpxx-dycomsrf01 (phase BASELINE)
    Case dir: /home/e3sm-jenkins/acme/scratch/ERS_P16_Ln22.ne30pg2_ne30pg2.FIOP-SCREAMv1-DP.mappy_gnu.scream-dpxx-dycomsrf01.C.20241001_095205_1cvv3v
DIFF ERS_P16_Ln22.ne30pg2_ne30pg2.FRCE-SCREAMv1-DP.mappy_gnu (phase BASELINE)
    Case dir: /home/e3sm-jenkins/acme/scratch/ERS_P16_Ln22.ne30pg2_ne30pg2.FRCE-SCREAMv1-DP.mappy_gnu.C.20241001_095205_1cvv3v
PASS PET_Ln9_P32x2.ne4pg2_ne4pg2.F2010-SCREAMv1.mappy_gnu.scream-output-preset-1 RUN
    Case dir: /home/e3sm-jenkins/acme/scratch/PET_Ln9_P32x2.ne4pg2_ne4pg2.F2010-SCREAMv1.mappy_gnu.scream-output-preset-1.C.20241001_095205_1cvv3v
PASS SMS_D_Ln5.ne4pg2_oQU480.F2010-SCREAMv1-MPASSI.mappy_gnu.scream-mam4xx-aci RUN
    Case dir: /home/e3sm-jenkins/acme/scratch/SMS_D_Ln5.ne4pg2_oQU480.F2010-SCREAMv1-MPASSI.mappy_gnu.scream-mam4xx-aci.C.20241001_095205_1cvv3v
PASS SMS_D_Ln5.ne4pg2_oQU480.F2010-SCREAMv1-MPASSI.mappy_gnu.scream-mam4xx-drydep RUN
    Case dir: /home/e3sm-jenkins/acme/scratch/SMS_D_Ln5.ne4pg2_oQU480.F2010-SCREAMv1-MPASSI.mappy_gnu.scream-mam4xx-drydep.C.20241001_095205_1cvv3v
PASS SMS_D_Ln5.ne4pg2_oQU480.F2010-SCREAMv1-MPASSI.mappy_gnu.scream-mam4xx-optics RUN
    Case dir: /home/e3sm-jenkins/acme/scratch/SMS_D_Ln5.ne4pg2_oQU480.F2010-SCREAMv1-MPASSI.mappy_gnu.scream-mam4xx-optics.C.20241001_095205_1cvv3v
PASS SMS_D_Ln5.ne4pg2_oQU480.F2010-SCREAMv1-MPASSI.mappy_gnu.scream-mam4xx-wetscav RUN
    Case dir: /home/e3sm-jenkins/acme/scratch/SMS_D_Ln5.ne4pg2_oQU480.F2010-SCREAMv1-MPASSI.mappy_gnu.scream-mam4xx-wetscav.C.20241001_095205_1cvv3v
PASS SMS_D_Ln9.ne4_ne4.F2010-SCREAMv1-noAero.mappy_gnu.scream-output-preset-3 RUN
    Case dir: /home/e3sm-jenkins/acme/scratch/SMS_D_Ln9.ne4_ne4.F2010-SCREAMv1-noAero.mappy_gnu.scream-output-preset-3.C.20241001_095205_1cvv3v
test-scheduler took 1810.660885334015 seconds
######################################################
Build step 'Execute shell' marked build as failure
$ ssh-agent -k
unset SSH_AUTH_SOCK;
unset SSH_AGENT_PID;
echo Agent pid 690093 killed;
[ssh-agent] Stopped.
Performing Post build task...
Match found for : : True
Logical operation result is TRUE
Running script  : #!/bin/bash -le

cd $WORKSPACE/${BUILD_ID}/

./scream/components/eamxx/scripts/jenkins/jenkins_cleanup.sh

We're having issues with some test-launcher jobs hanging forever. So let's make sure we clean all pending test-launcher jobs

squeue -o"%.7i %u %40j" | grep e3sm-jenkins | grep test-launcher | awk '{ print $1 }' | xargs -r scancel

[SCREAM_PullRequest_Autotester_Mappy] $ /bin/bash -le /tmp/jenkins5110261980098509821.sh
POST BUILD TASK : SUCCESS
END OF POST BUILD TASK : 0
Sending e-mails to: lbertag@sandia.gov
Finished: FAILURE

@E3SM-Bot
Collaborator

E3SM-Bot commented Oct 3, 2024

Status Flag 'Pull Request AutoTester' - Testing Jenkins Projects:

Pull Request Auto Testing STARTING

Build Information

Test Name: SCREAM_PullRequest_Autotester_Weaver

  • Build Num: 6099
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
PR_LABELS code quality;BFB;AT: AUTOMERGE;code cleanup
PULLREQUESTNUM 3021
SCREAM_SOURCE_REPO https://github.com/E3SM-Project/scream
SCREAM_SOURCE_SHA 756ea4a
SCREAM_TARGET_BRANCH master
SCREAM_TARGET_REPO https://github.com/E3SM-Project/scream
SCREAM_TARGET_SHA 5ad00c2
TEST_REPO_ALIAS SCREAM

Build Information

Test Name: SCREAM_PullRequest_Autotester_Mappy

  • Build Num: 5869
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
PR_LABELS code quality;BFB;AT: AUTOMERGE;code cleanup
PULLREQUESTNUM 3021
SCREAM_SOURCE_REPO https://github.com/E3SM-Project/scream
SCREAM_SOURCE_SHA 756ea4a
SCREAM_TARGET_BRANCH master
SCREAM_TARGET_REPO https://github.com/E3SM-Project/scream
SCREAM_TARGET_SHA 5ad00c2
TEST_REPO_ALIAS SCREAM

Using Repos:

Repo: SCREAM (E3SM-Project/scream)
  • Branch: bartgol/eamxx/remove-runtime-allocs
  • SHA: 756ea4a
  • Mode: TEST_REPO

Pull Request Author: bartgol

@E3SM-Bot
Collaborator

E3SM-Bot commented Oct 3, 2024

Status Flag 'Pull Request AutoTester' - Jenkins Testing: all Jobs PASSED

Pull Request Auto Testing has PASSED

Build Information

Test Name: SCREAM_PullRequest_Autotester_Weaver

  • Build Num: 6099
  • Status: PASSED

Jenkins Parameters

Parameter Name Value
PR_LABELS code quality;BFB;AT: AUTOMERGE;code cleanup
PULLREQUESTNUM 3021
SCREAM_SOURCE_REPO https://github.com/E3SM-Project/scream
SCREAM_SOURCE_SHA 756ea4a
SCREAM_TARGET_BRANCH master
SCREAM_TARGET_REPO https://github.com/E3SM-Project/scream
SCREAM_TARGET_SHA 5ad00c2
TEST_REPO_ALIAS SCREAM

Build Information

Test Name: SCREAM_PullRequest_Autotester_Mappy

  • Build Num: 5869
  • Status: PASSED

Jenkins Parameters

Parameter Name Value
PR_LABELS code quality;BFB;AT: AUTOMERGE;code cleanup
PULLREQUESTNUM 3021
SCREAM_SOURCE_REPO https://github.com/E3SM-Project/scream
SCREAM_SOURCE_SHA 756ea4a
SCREAM_TARGET_BRANCH master
SCREAM_TARGET_REPO https://github.com/E3SM-Project/scream
SCREAM_TARGET_SHA 5ad00c2
TEST_REPO_ALIAS SCREAM

@E3SM-Bot
Collaborator

E3SM-Bot commented Oct 3, 2024

Status Flag 'Pre-Merge Inspection' - - This Pull Request Requires Inspection... The code must be inspected by a member of the Team before Testing/Merging
THE LAST COMMIT TO THIS PULL REQUEST HAS NOT BEEN REVIEWED YET!

@E3SM-Bot
Collaborator

E3SM-Bot commented Oct 3, 2024

All Jobs Finished; status = PASSED, target_sha=a269ef91a3da65ee4e23f4e33736684366046b4d, However Inspection must be performed before merge can occur...

@mahf708 mahf708 requested a review from ndkeen October 3, 2024 13:49
@mahf708
Contributor

mahf708 commented Oct 3, 2024

@ndkeen, recall these are the (small) mallocs in the IO. Should we perf-prof? (Removing the automerge label until we check the perf.)

@bartgol
Contributor Author

bartgol commented Oct 3, 2024

Up to you if you want to check performance, but given how busy we are, and given that a look at the code changes makes it fairly clear there is no perf regression, I wouldn't hold off the merge until perf is checked, especially since we're not talking about a 2x speedup/slowdown here. We can't start (manually) perf-checking every single PR, imho.

My 2 cents. If you guys want to run perf checks, feel free to do so.

@mahf708
Contributor

mahf708 commented Oct 3, 2024

I think we are only interested in checking things we came across during the hackathon, that's all... I agree this is not worth checking, but I will defer to @ndkeen to decide :)

The side goal is to keep a record of perf improvements related to NESAP/Hackathon/etc.

@E3SM-Bot
Collaborator

E3SM-Bot commented Oct 3, 2024

Status Flag 'Pull Request AutoTester' - AutoMerge IS ENABLED, but the Label AT: AUTOMERGE is not set. Either set Label AT: AUTOMERGE or manually merge the PR...

@mahf708 mahf708 merged commit 2763997 into master Oct 4, 2024
5 checks passed
@mahf708 mahf708 deleted the bartgol/eamxx/remove-runtime-allocs branch October 4, 2024 04:19
@ndkeen
Contributor

ndkeen commented Oct 4, 2024

Might be good to note that we found cuda malloc/frees during the file output section, specifically in vertical interpolation.
After this change, re-profiling a ne256 case with output, I don't see any malloc/frees in vertical interp.
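
For readers skimming the thread, here is a rough sketch of the general pattern (hoisting a small scratch allocation out of a per-call path). It is not the actual diff in this PR: `Scratch`, `VertInterp`, `interp_alloc_each_call`, `midpoint_sum`, and `g_alloc_count` are hypothetical names, and a host `std::vector` with a counter stands in for a device allocation.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical counter standing in for the device malloc/free calls a
// profiler would report. Illustration only.
static int g_alloc_count = 0;

// Hypothetical scratch buffer; host memory stands in for a device view.
struct Scratch {
  std::vector<double> data;
  explicit Scratch(std::size_t n) : data(n) { ++g_alloc_count; }
};

// Toy workload: store layer midpoints in tmp and return their sum.
static double midpoint_sum(const std::vector<double>& col,
                           std::vector<double>& tmp) {
  double sum = 0.0;
  for (std::size_t k = 0; k + 1 < col.size(); ++k) {
    tmp[k] = 0.5 * (col[k] + col[k + 1]);
    sum += tmp[k];
  }
  return sum;
}

// Pattern to avoid: a fresh temporary is allocated on every call.
double interp_alloc_each_call(const std::vector<double>& col) {
  Scratch tmp(col.size());
  return midpoint_sum(col, tmp.data);
}

// Alternative: allocate once at setup and reuse the buffer at run time.
class VertInterp {
public:
  explicit VertInterp(std::size_t nlev) : m_tmp(nlev) {}

  double run(const std::vector<double>& col) {
    // The preallocated buffer must be large enough for the column.
    assert(col.size() <= m_tmp.data.size());
    return midpoint_sum(col, m_tmp.data);
  }

private:
  Scratch m_tmp;
};
```

In this toy version, calling `interp_alloc_each_call` N times bumps the counter N times, while one `VertInterp` object bumps it once no matter how many times `run` is called.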

@ndkeen
Contributor

ndkeen commented Oct 4, 2024

For the timer a:EAMxx::IO::vert_remap, the time dropped from 1.7 seconds to 0.3 seconds in a ne256 case on 128 nodes for 5 days with Cess output. It clearly did something; it was just already a small portion.

Development

Successfully merging this pull request may close these issues.

Kokkos View allocations in vertical remap
