nose testing: you cannot select individual tests (rather than scripts) #160

Open
danlipsa opened this issue Mar 29, 2017 · 22 comments

@danlipsa
Contributor

danlipsa commented Mar 29, 2017

Many tests share the same Python script, and certain scripts contain tens of tests. In our current implementation of the nose testing framework you can only select individual scripts to run, not individual tests.

For instance, in the ctest framework we used to be able to do 'ctest -R streamlines'.
Or, I could do ctest -R 'opacity|transparency' to quickly see if the transparency tests pass.

You cannot do that in the new framework as far as I can see.

Another problem is that a test script stops at the first failure; this prevents updating baselines with a script when there are many failures.

@doutriaux1
Contributor

@danlipsa yes, you're right.
1- I don't think nose lets you select individual tests within a class.
2- Some of our tests run a loop (basic_gms), which makes them look like one test to unittest anyway; this is due to a current limitation in the html/image-compare post-processing script. It is possible (although borderline hacky) to generate multiple tests within a loop for udunits.

I'm not sure it's worth the work though. In theory the test suite is run automatically, and if one of these breaks it is easy enough to run the loop only on the failing case while fixing it. Besides, these loops are usually closely related, and fixing one will likely fix most.

@danlipsa
Contributor Author

@doutriaux1 As suggested by @aashish24, you can define several tests inside a script using

def test...

Then you can run only these tests individually. Unfortunately you still have to pass the script name:
nosetests -s tests/test_vcs_vectors_robinson.py -m streamline
It would be cool to figure out how to run this kind of search without passing the script name.
See
#159
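
For illustration, a script with several nose-discoverable tests could look roughly like the sketch below (hypothetical test and file names, and a stand-in comparison helper instead of the real vcs image check):

import os

def _check_image(produced, baseline):
    # stand-in for the real image comparison the suite performs
    assert os.path.exists(produced)

def test_streamline():
    # first independently selectable test
    _check_image("test_vcs_streamline.png", "baselines/test_vcs_streamline.png")

def test_streamline_colored():
    # second test in the same script; nose discovers each def test_* separately
    _check_image("test_vcs_streamline_colored.png", "baselines/test_vcs_streamline_colored.png")

nose picks up each def test_* as its own test, which is what makes the -m streamline selection above possible.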

@doutriaux1
Contributor

@danlipsa I can add the search bit; it's easy enough. I'll let you and @aashish24 split the tests ;)

@danlipsa
Contributor Author

danlipsa commented Apr 4, 2017

@doutriaux1 @aashish24 Another related drawback of the new testing framework: I had to replace several baselines, so I had to execute the following operations multiple times: run the script, replace a baseline, run the script, replace the next baseline, ...

In the past, a test run generated all the failures and then I could run a for loop to replace all the baselines. After the release, we should make our testing framework behave the same way.

@doutriaux1
Contributor

@danlipsa all failures should be left in the test_pngs directory, but maybe we should add an option to automatically update baselines.
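
Such an option could be as simple as copying the non-diff PNGs left in test_pngs over the corresponding baselines; a minimal sketch, assuming the directory layout mentioned in this thread:

import glob
import os
import shutil

def update_baselines(test_pngs="test_pngs",
                     baselines="../uvcdat-testdata/baselines/vcs"):
    # copy every non-diff PNG produced by failing tests over its baseline
    for png in glob.glob(os.path.join(test_pngs, "*.png")):
        name = os.path.basename(png)
        if "diff" in name:
            continue
        shutil.copy(png, os.path.join(baselines, name))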

@danlipsa
Contributor Author

danlipsa commented Apr 4, 2017

@doutriaux1 The problem is that the script stops at the first failure, doesn't it? So you cannot generate all the new baselines and then copy them over.

@doutriaux1
Contributor

@danlipsa I see your point; yes, right now you have to update the baselines one at a time as you fix the tests.

@danlipsa
Contributor Author

danlipsa commented Apr 6, 2017

@doutriaux1 @aashish24 We can use a generator to create several tests in a for loop (inside the same script):
http://stackoverflow.com/questions/13611658/repeated-single-or-multiple-tests-with-nose

@doutriaux1 Do we need to worry about anything else? We might have one of our colleagues work on this.
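
In nose, a generator test yields a callable together with its arguments, and each yielded pair is reported as a separate test, so one failing case no longer stops the rest. A minimal sketch with hypothetical graphics-method names:

def check_gm(gm_name):
    # hypothetical per-method check; the real version would plot gm_name
    # with vcs and compare the result against its baseline image
    assert isinstance(gm_name, str)

def test_basic_gms():
    # nose runs each yielded (function, argument) pair as its own test
    for gm_name in ["boxfill", "isofill", "isoline"]:
        yield check_gm, gm_name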

@doutriaux1
Contributor

@danlipsa yes, I know about this; I even implemented some of it at first. The only real issue is the html generation and parsing all the mangled error messages to make sure many image comparisons are generated. I think we should break it down to one page per script, which then splits into N pages for each failed script, with each landing page being the current image_compare. If someone at Kitware wants to take the lead on this, that's great, but it's not urgent; I would rather we spend time fixing the seg fault on travis and circleci.

@danlipsa
Contributor Author

danlipsa commented Apr 6, 2017

@doutriaux1 Is this because we expect only one image failure per script? The output does not change as far as I can see, but I think it won't stop at the first failure.

@danlipsa
Contributor Author

danlipsa commented Apr 7, 2017

@doutriaux1 @aashish24 @sankhesh nosetests seems to test the vcs in the source directory rather than the vcs installed in the conda env. Should we worry about that? (I tested this by adding a print in the sources and running the test without installing the sources into conda.)

@doutriaux1
Contributor

@danlipsa it's due to setuptools. In the cdms2 version of run_test I have an option to run out of source; once I unify all the tests under one core runtests.py we can use this option for vcs as well.

@doutriaux1
Contributor

@danlipsa the post-processing step that parses the output of each .py file is only set up to look at the last "diff", so we would need to improve it to look for many of them. Also, I'm not 100% sure that the individual tests within a .py wouldn't have their output mangled together.

@danlipsa
Contributor Author

danlipsa commented May 1, 2017

@dorukozturk As we discussed, in this branch I created several tests per script for two scripts.
https://github.com/UV-CDAT/vcs/tree/many-tests-per-script
With this branch we should be able to:

  • run individual tests rather than whole scripts, for both static and dynamically generated tests.
  • run tests using pattern matching like in ctest.

@danlipsa
Contributor Author

danlipsa commented May 1, 2017

@doutriaux1 If we write the script to submit test results to cdash, do we still need to make the html generation work for running individual tests rather than running whole scripts and stopping at the first error?

@doutriaux1
Contributor

@danlipsa I guess, if we can tell cdash to dump the output to a local file so we can run this offline.

@danlipsa
Contributor Author

danlipsa commented May 2, 2017

@doutriaux1 Why would you want to run this off-line if it is available online and it is also stored for later?

@doutriaux1
Contributor

On a plane, at the beach, etc...

@doutriaux1
Contributor

or if the lab suddenly decides to block the server where the data are uploaded

@durack1
Member

durack1 commented May 2, 2017

@doutriaux1 FYI, last week raw.githubusercontent.com was blocked; it was unblocked this morning, but it caused some grief for @dnadeau4 and me on Friday and over the weekend.

@danlipsa
Contributor Author

danlipsa commented May 3, 2017

@doutriaux1 I never used the slider feature to be honest. I always did:

for i in `ls *.png | grep -v diff`; do eog $i ../uvcdat-testdata/baselines/vcs/$i; done

You can switch between the baseline and the new image using the arrow keys; Alt-F4 moves to the next set of differences. Replace eog with cp to copy the new images over the baselines.
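
Spelled out, that copy step (same layout as the loop above) would be:

for i in `ls *.png | grep -v diff`; do cp $i ../uvcdat-testdata/baselines/vcs/$i; done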

@doutriaux1
Contributor

Thanks @danlipsa, that's useful. I'll add it to my bashrc. The slider is actually very useful too; it's nice to have both.

@doutriaux1 doutriaux1 modified the milestone: 3.0 May 5, 2017
@doutriaux1 doutriaux1 modified the milestones: 3.0, post 3.0 Mar 29, 2018
@doutriaux1 doutriaux1 modified the milestones: 8.1, 8.2 Mar 27, 2019