github actions utest #169

MinsukJi-NOAA · 2020-07-16T19:31:46Z

This PR addresses issue #168

Include continuous integration related files (scripts, github actions yaml file, dockerfile, etc.)
Include updates to the utest script

* Separate builds and tests into different jobs
* Separate input data from the build image; instead, use it as a volume before running tests
* Make a ci subdirectory to contain ci-related files

.github/workflows/main.yml

.dockerignore

MinsukJi-NOAA · 2020-09-01T01:40:27Z

RT on Orion keeps failing, due to compile_13 failing with this error message:

/work/noaa/stmp/jminsuk/stmp/jminsuk/FV3_RT/rt_420215/compile_13/build_fv3_13/FV3/ccpp/physics/ccpp_static_api.F90(966): catastrophic error: **Internal compiler error: internal abort** Please report this error along with the circumstances in which it occurred in a Software Problem Report. Note: File and line given may not be explicit cause of this error. compilation aborted for /work/noaa/stmp/jminsuk/stmp/jminsuk/FV3_RT/rt_420215/compile_13/build_fv3_13/FV3/ccpp/physics/ccpp_static_api.F90 (code 1)

climbfuji · 2020-09-01T01:57:43Z

Hmm. I have seen these internal compiler errors in other projects in the past; they often had to do with an overly complicated optimization task that the compiler was trying to accomplish. We need to figure out which COMPILE command build_13 corresponds to, and which line the compiler is complaining about. Which version of the Intel compiler are you using?

MinsukJi-NOAA · 2020-09-01T02:04:56Z

modulefiles/orion.intel/fv3 module loads intel/2018

compile 13 appears to be CCPP=Y and DEBUG=Y

climbfuji · 2020-09-01T02:26:26Z

Interesting. I recently made this change (and it worked on all machines, including orion) so that we have at least one compile command that tests compiling the code without providing any suites (i.e. compile all available suites). What you can do is to duplicate this line N times (where N is the number of machines, i.e. one line for hera.intel, one for cheyenne.intel, ...) and modify the line for orion to include just the suites that are required to run the following tests.
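The duplication suggested above could be sketched roughly as follows. This is only an illustration under stated assumptions: the pipe-delimited line layout, the helper name `expand_compile_line`, and the placeholder suite names `SUITE_A` and `SUITE_B` are hypothetical and are not taken from the repository's regression test configuration.

```python
from typing import Dict, List


def expand_compile_line(options: str, machines: List[str],
                        overrides: Dict[str, str]) -> List[str]:
    """Return one compile line per machine.

    Machines listed in `overrides` get their own option string; all
    others keep the shared `options`. The line layout is hypothetical.
    """
    lines = []
    for machine in machines:
        opts = overrides.get(machine, options)
        lines.append(f"COMPILE | {opts} | {machine} |")
    return lines


machines = ["hera.intel", "cheyenne.intel", "orion.intel"]
lines = expand_compile_line(
    "CCPP=Y DEBUG=Y",
    machines,
    # Hypothetical: restrict orion to the suites the following tests need.
    {"orion.intel": "CCPP=Y DEBUG=Y SUITES=SUITE_A,SUITE_B"},
)
for line in lines:
    print(line)
```

With this arrangement, every machine except Orion still exercises the "compile all available suites" path, while Orion builds only an explicit subset.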

MinsukJi-NOAA · 2020-09-01T11:49:20Z

That change allowed the RT to pass. Thanks.

MinsukJi-NOAA · 2020-09-03T12:37:40Z

Preliminary CI documentation can be found here: CI tests for UFS-weather-model

MinsukJi-NOAA · 2020-09-03T12:42:53Z

For an example of a CI test, see Add hera RT results

For an example of a skipped CI test, see Add orion RT results
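The skipped runs in this PR carry a `skip-ci` marker in the commit title (for example, "Add orion RT results. skip-ci"). A minimal sketch of how such a check might look, assuming it inspects `head_commit.message` in a push event payload; the function name `should_skip_ci` is hypothetical, and the actual logic in `tests/ci/json_helper.py` is not shown in this thread and may differ.

```python
import json

SKIP_MARKER = "skip-ci"  # marker seen in this PR's commit titles


def should_skip_ci(event_json: str) -> bool:
    """Return True if the head commit message contains the skip marker.

    Assumes a push-style payload with a `head_commit.message` field;
    a missing field is treated as "do not skip".
    """
    event = json.loads(event_json)
    head_commit = event.get("head_commit") or {}
    message = head_commit.get("message", "")
    return SKIP_MARKER in message


run_event = json.dumps({"head_commit": {"message": "Add hera RT results"}})
skip_event = json.dumps(
    {"head_commit": {"message": "Add orion RT results. skip-ci"}}
)
print(should_skip_ci(run_event))
print(should_skip_ci(skip_event))
```

Treating a missing commit message as "do not skip" keeps the default on the safe side: CI runs unless a commit explicitly opts out.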

.github/workflows/manage.yml

modulefiles/linux.gnu/fv3

.github/workflows/main.yml

tests/ci/json_helper.py

tests/run_test.sh

junwang-noaa · 2020-09-03T16:06:15Z

@DusanJovic-NOAA I remember Dom said that the GitHub Actions running time is unlimited for public repositories. Also, because of the nature of weather model forecast tests, we have to run jobs longer than a few minutes. So what are the issues?

On Thu, Sep 3, 2020 at 10:59 AM Dusan Jovic commented on this pull request, in tests/ci/json_helper.py:

Exactly. That's why I think these kind of tests (tests that run longer than few minutes) are simply not suitable for these "continuous integration" testing environments.

climbfuji · 2020-09-03T16:09:38Z

Have a look at https://github.com/pricing, scroll down to "Compare features" please.

MinsukJi-NOAA · 2020-09-04T00:15:12Z

Updated utest documentation can be found here: Unit Test for UFS-weather-model

MinsukJi-NOAA · 2020-09-08T22:17:43Z

All regression tests passed on Hera, Orion, and Dell. All unit tests except the restart test passed on Hera, Orion, and Dell. This PR is ready to merge.

climbfuji

Looks ok to me and my limited understanding of the CI design.

tests/utest

github actions utest (ufs-community#169)

MinsukJi-NOAA added 10 commits July 10, 2020 12:42

add ci-related files. update utest

b476468

Merge remote-tracking branch 'upstream/develop' into feature/ContInteg

489bfc0

modify to auto-run different cases based on ci.test

dc13d67

update to the latest develop and make changes accordingly

2e3410e

Improve CI workflow

d13dd8c

* Separate builds and tests into different jobs
* Separate input data from the build image; instead, use it as a volume before running tests
* Make a ci subdirectory to contain ci-related files

add clean-up after utest run

cb8cefa

fix docker container and image delete

27941d7

Merge remote-tracking branch 'upstream/develop' into feature/ContInteg

4d60345

use ci-test-base v3. modify utest to comply with recent weather changes

83a51f2

add workflow manage files

cbe85e8

MinsukJi-NOAA marked this pull request as ready for review August 31, 2020 21:30

DusanJovic-NOAA reviewed Aug 31, 2020

.github/workflows/main.yml

DusanJovic-NOAA reviewed Aug 31, 2020

.github/workflows/main.yml (outdated)

DusanJovic-NOAA requested review from climbfuji, aerorahul and junwang-noaa August 31, 2020 22:03

DusanJovic-NOAA reviewed Aug 31, 2020

.dockerignore

MinsukJi-NOAA and others added 2 commits August 31, 2020 22:25

Add hera RT results

499abe1

Add dell RT results. skip-ci

2cf9f90

Add orion RT results. skip-ci

f4c97c5

Merge remote-tracking branch 'upstream/develop' into feature/ContInteg

d5d7984

DusanJovic-NOAA reviewed Sep 3, 2020

.github/workflows/manage.yml

climbfuji reviewed Sep 3, 2020

modulefiles/linux.gnu/fv3

.github/workflows/main.yml

tests/ci/json_helper.py

tests/run_test.sh

MinsukJi-NOAA added 2 commits September 3, 2020 12:17

move parsing in main.yml to a separate script file

ab73f9f

revert utest mpi test back to original setting. modify usage

6b3b4b7

MinsukJi-NOAA and others added 6 commits September 4, 2020 14:14

In utest change Hera queue, and Orion baseline location. skip-ci

fb444d8

Temporarily turn off restart test in actions

11fb6b5

Attach logs for Hera Regression tests and Unit tests. skip-ci

c700269

Attach logs for Dell Regression tests and Unit tests. skip-ci

c0e281b

Attach logs for Orion Regression tests and Unit tests. skip-ci

a4ab091

Change actions workflow branch name to develop

7caa213

climbfuji approved these changes Sep 8, 2020

junwang-noaa reviewed Sep 9, 2020

tests/utest

junwang-noaa approved these changes Sep 9, 2020

DusanJovic-NOAA approved these changes Sep 9, 2020

DusanJovic-NOAA merged commit 407df4e into ufs-community:develop Sep 9, 2020

DavidHuber-NOAA added a commit to DavidHuber-NOAA/ufs-weather-model that referenced this pull request Sep 10, 2020

Merge pull request #1 from ufs-community/develop

e3a3489

github actions utest (ufs-community#169)

MinsukJi-NOAA deleted the feature/ContInteg branch October 19, 2020 21:22
