more testing #535

jbathegit · 2023-11-03T21:53:35Z

Part of #445

Note that the additional debufr test required a change to the source code for the utility, to ensure that allocated memory was freed following the execution of the newly-tested code. Otherwise, the address sanitizer would complain about memory leakage and segfault during the CI tests.

jbathegit · 2023-11-06T14:08:34Z

Now all of a sudden the MacOS runner is failing with a missing python module:

Just in case this was some temporary glitch, I tried rerunning it several times and on different days, but to no avail.

@AlexanderRichert-NOAA @aerorahul @climbfuji @edwardhartnett any suggestions?

jbathegit · 2023-11-06T14:18:12Z

BTW, I realize I could just change the -DENABLE_PYTHON=ON flag to -DENABLE_PYTHON=OFF in the cmake step of the build-bufr job in the MacOS.yml file, but I'm assuming somebody at some point had a good reason for enabling python testing on this platform(?)

Currently, we have python testing enabled in the develop, Linux, and MacOS runners, but it's currently disabled in the Intel runner.

jbathegit · 2023-11-07T20:59:45Z

And now all of a sudden the MacOs build is working again!?

jbathegit · 2023-11-07T21:48:46Z

In this latest commit I added more bort testing for nemtbb() and nemtbd(). But I'm struggling to write test cases for several of the bort options in those routines using dummied-up DX tables containing obvious errors. For example:

The 901 fail in both routines is if an FXY number isn't between 000000 and 063255. But any number outside of that range would already get diagnosed as an error in rdusdx() when the DX table is first read in via a call to openbf() or otherwise. And such a table needs to be previously read in because otherwise there's no data in the internal tabb and tabd arrays for nemtbb() and nemtbd() to work with. So can such an error ever really get triggered in either of those routines (i.e. do we really need this bort check in those routines)?
On a similar note, in subroutine elemdx() (which is called from rdusdx()), a maximum of 3 digits is stored for a scale factor for any Table B value in the tabb array, and a maximum of 10 digits is stored for any reference value. So how exactly could we ever have a 902 or 903 fail in nemtbb() for a stored scale factor not in the range of -999 to 999, or for a stored reference value not in the range of -1E11 to 1E11? In other words, how could either of those bort cases ever realistically get triggered in nemtbb(), given that such outlier values aren't even storable in the internal tabb array?
Furthermore, the 904 and 906 fail cases in nemtbd() couldn't seemingly ever get triggered either, because any such cases would already get diagnosed as an error within seqsdx() (which is also called from rdusdx()) when the DX table is first read in to the internal tabd array. So how exactly would those cases ever realistically occur?

In summary, it seems to me we could just get rid of a number of these bort tests in the nemtbb() and nemtbd() routines, because unless I'm missing something they don't really seem to be serving any practical purpose. @jack-woollen please chime in if you have any thoughts on this - thanks!

jbathegit · 2023-11-08T13:13:16Z

Thanks @jack-woollen, but before I merge this PR (and which in turn will close it :-) do you have any thoughts on my above suggestions to eliminate some of those bort tests in nemtbb() and nemtbd(). Do you agree with my thought process, or can you think of something I'm missing in my analysis?

jack-woollen · 2023-11-08T14:01:43Z

Jeff, a few thoughts. First, it's not impossible for those bort conditions to happen, its just impossible to test them. Because the only way the impossible conditions are possible is all the rules go out the window such as memory clobbering due to bad inputs or bad coding, or some other bad things. I think these types of tests are worthwhile and do not at all subscribe to the idea that they are useless. The only reason I think is important enough to consider maybe getting rid of some of them would be if they cost significant amounts of time. I was in the process of starting to check that out when dogwood switched to production. I'll pick it up again later today. It may be useful to classify the bort statements into those that check on function and those that check on dysfunction, to see the ratio, and calculate the actual costs of each type.

edwardhartnett · 2023-11-09T12:49:52Z

If code cannot actually be reached, then it should be removed. Nor can we try to capture errors caused my random memory overwrites or other totally unpredictable conditions.

What often happens in a testing campaign is that you realize some code can never be reached. So remove that code!

Nothing should stay in the code just because "it does no harm." Every line of code costs money and time to write and more importantly maintain. So if the line of code is not doing anything useful, remove it, and cut code size and maintenance costs.

Consider this: right now @jbathegit has figured out that these lines can't be reached, and raised questions about them, and that's caused us all to take some time to look at the problem. That's not free. This code is already costing NOAA extra time and money. So let's resolve the problem - if the lines are not needed, remove them.

Then, in 5 years, when this issue is long forgotten, NOAA won't incur the same costs by some programmer wondering if those lines of code could ever be reached and raising the whole issue all over again. Which will once again cause expense for NOAA for lines of code that we don't need.

jbathegit · 2023-11-22T15:48:36Z

After giving this some more thought, I'm leaning more towards Ed's way of thinking about this. I hear what you're saying Jack, but I agree with Ed that we shouldn't retain code that we can't test, and that all bets are off anyway if something unpredictable happens.

So, in the near future, I'll push up another commit with those unreachable bort tests removed.

jbathegit · 2023-11-22T19:44:00Z

I've made the aforementioned nemtbb and nemtbd changes to remove those untestable bort cases, and I've also merged in the changes from #539. But now I'm once again seeing CI errors for MacOS (with a different Python build error) and Intel (with cmake now apparently trying to build the oneAPI environment).

@AlexanderRichert-NOAA not sure if this is related in any way to what you're working on in #536, but obviously none of that has been merged into this branch yet, so I'm a bit stumped.

AlexanderRichert-NOAA

I suggest merging #542 first in order to test with Intel CI

jbathegit added 3 commits November 3, 2023 21:48

add test case for debufr utility

c54aa0f

fix verbiage in -v test for debufr and xbfmg utilities

1190302

a few more tweaks to test_debufr.sh

7a205cd

add bort test cases for upb8 and elemdx

f5665db

edwardhartnett previously approved these changes Nov 7, 2023

View reviewed changes

add bort test cases for nemtbb and nemtbd

09c7d86

jbathegit dismissed edwardhartnett’s stale review via 09c7d86 November 7, 2023 20:56

jbathegit requested review from edwardhartnett and jack-woollen November 7, 2023 21:52

jack-woollen previously approved these changes Nov 8, 2023

View reviewed changes

jbathegit mentioned this pull request Nov 22, 2023

Fix Python 3.12 issues (use f2py directly through cmake) #539

Merged

jbathegit added 2 commits November 22, 2023 19:09

Merge branch 'develop' into jba_moretesting

1b71c96

remove untestable bort cases from nemtbb and nemtbd

4d5449c

jbathegit dismissed jack-woollen’s stale review via 4d5449c November 22, 2023 19:28

jbathegit added 2 commits November 27, 2023 17:29

remove one more untestable bort case from nemtbd

8960bf2

clean up unused variable in nemtbd

300af72

jbathegit marked this pull request as ready for review November 30, 2023 16:56

jbathegit requested a review from AlexanderRichert-NOAA November 30, 2023 16:57

jbathegit mentioned this pull request Nov 30, 2023

updating tables directory with version 41 of WMO master tables #541

Merged

AlexanderRichert-NOAA approved these changes Nov 30, 2023

View reviewed changes

Merge branch 'develop' into jba_moretesting

d66b96e

jbathegit merged commit e145fe2 into develop Nov 30, 2023
6 checks passed

jbathegit deleted the jba_moretesting branch November 30, 2023 22:58

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

more testing #535

more testing #535

jbathegit commented Nov 3, 2023

jbathegit commented Nov 6, 2023

jbathegit commented Nov 6, 2023

jbathegit commented Nov 7, 2023

jbathegit commented Nov 7, 2023

jbathegit commented Nov 8, 2023

jack-woollen commented Nov 8, 2023

edwardhartnett commented Nov 9, 2023

jbathegit commented Nov 22, 2023

jbathegit commented Nov 22, 2023

AlexanderRichert-NOAA left a comment

more testing #535

more testing #535

Conversation

jbathegit commented Nov 3, 2023

jbathegit commented Nov 6, 2023

jbathegit commented Nov 6, 2023

jbathegit commented Nov 7, 2023

jbathegit commented Nov 7, 2023

jbathegit commented Nov 8, 2023

jack-woollen commented Nov 8, 2023

edwardhartnett commented Nov 9, 2023

jbathegit commented Nov 22, 2023

jbathegit commented Nov 22, 2023

AlexanderRichert-NOAA left a comment

Choose a reason for hiding this comment