New test 'TrilinosInstallTests_find_package_Trilinos' failing in ATS-2 PR builds with failure to find ShyLU_NodeConfig.cmake #10865
Comments
CC: @srajama1, @trilinos/shylu NOTE: This issue is undoubtedly related to the fact that the
This problem is discussed in detail in #9497. This might be a good time to design and implement the meta-package solution so that when someone configures with:
this will behave the way that someone would expect and where there is not
However, in the short term, it might just be better to comment out the Line 128 in e061ffc
I would rather finish the refactoring needed to complete TriBITSPub/TriBITS#63 before implementing support for meta-packages. The only downside of commenting out the definition of the
would need to enable:
And that would behave exactly the way that one would expect, such as was the case with #9497.
However, before I change anything, I want to try to reproduce this error and see what is happening. I also want to understand why this error only occurred in the ATS-2 build on 'vortex' PR-10775-test-ats2_cuda-10.1.243-gnu-8.3.1-spmpi-rolling_release_static_Volta70_Power9_no-asan_no-complex_no-fpic_mpi_pt_no-rdc_uvm_deprecated-on_no-package-enables-867 and not in any of the other builds that should have the exact same set of enables/disables shown here.
The set of initial enables/disables for the ATS-2 build looks to be different from all of the other builds, and this seems to be triggering the error (the details are in the analysis that follows). I think the simple solution is to update the GenConfig files to disable the ShyLU package in the ATS-2 build to get around this problem in the short term.
Analysis of enables/disables of the ATS-2 build compared to other PR builds:
Okay, I can see that the CUDA build on ATS-2 is strange in that it disables a bunch of packages up front, as shown here in the initial set of enables/disables:
So we see that a bunch of ShyLU_Node and ShyLU_DD subpackages are being disabled. Why? This results in the final set of enabled and disabled packages:
What is strange is that the other ascicgpu CUDA build shown here for the same PR iteration in the same PR shows a different initial set of enables/disables:
Note that only explicitly enabled SE packages in that build are
So, ShyLU subpackages and packages are getting enabled in the ATS-2 build as shown here by the enable/disable logic showing:
However, in the other CUDA build on that ascicgpu machine shown here we see the enable/disable logic showing:
This should avoid a strange find_package(Trilinos) error being generated about a missing ShyLUConfig.cmake. But since ShyLU is not a real TriBITS package (i.e. it does not contain any libs, tests, or subpackages, and there are no downstream dependencies on the pseudo package ShyLU), this will cause no loss of testing. The right solution will be to remove the pseudo ShyLU package and instead implement it as a TriBITS meta-package (see #9497).
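As a rough illustration of what "disabling ShyLU" means at the CMake level, the fragment below sets the standard TriBITS package-enable variable for ShyLU to OFF. This is only a sketch: the actual change was made in the GenConfig files, whose syntax is not shown in this thread, and the file name used here is hypothetical.

```cmake
# ats2-disable-shylu.cmake  (hypothetical file name, for illustration only)
#
# Turn off the top-level pseudo package ShyLU.
# ShyLU_Node and ShyLU_DD have their own enable variables,
# which this fragment does not set.
set(Trilinos_ENABLE_ShyLU OFF CACHE BOOL
  "Disable pseudo package ShyLU in ATS-2 builds (see #10865)")
```

A fragment like this could be preloaded with `cmake -C ats2-disable-shylu.cmake` ahead of the other configure options, or the same setting could be passed directly as `-D Trilinos_ENABLE_ShyLU=OFF`.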
FYI: I just pushed the branch 10865-ats2-disable-shylu which disables ShyLU for the ATS-2 build. I am going to suggest that it gets directly merged to the PR branch in #10775.
With the merge of branch 10865-ats2-disable-shylu into the branch for PR #10775, that test should be fixed in that PR.
This is resolved by PR #10872. Putting this in review.
…hylu Automatically Merged using Trilinos Pull Request AutoTester PR Title: Disable pseudo package ShyLU in ATS-2 builds (#10865) PR Author: bartlettroscoe
…s:develop' (517d301). * trilinos-develop: (46 commits) Panzer MiniEM: Adjust rebalancing targets for RefMaxwell sub-problems Disable pseudo package ShyLU in ATS-2 builds (trilinos#10865) Automatic snapshot commit from tribits at b00ab335 Panzer: Bug fix for PlaneMatcher param parsing. Returning unit test paramlist to original. Tacho - use the same destruction order as hip Tacho - reorder stream deallocation order Remove multi-processor spai meshtying test Panzer: Leave todo reminding us to change the default BBox flag but remove ownership. Panzer: Make the bbox flag and related function calls more descriptive. Clean up a few units tests. Addresses Roger's review. Panzer: Missing bit of logic in the acceptance test (STK Search). Panzer: Reorder var in constructor to fix warning. Panzer: Fix an old test for periodic bcs. Panzer: Acceptance test for STKSearch and search implementation. Panzer: Mesh factories need to pass periodic search info along downstream. Panzer: Source and tests for periodic bc refactor. Employs STKSearch. Tacho : include mkl.h to define __INTEL_MKL__ before including Tacho_Pardiso.hpp Tacho : fix error message Amesos2-Tacho : add parameters to set algorithmic variant for factorization and solver, also call initialize after analyze in symbolic Fix build part of test TrilinosInstallTests_simpleBuildAgainstTrilinos on 'ats2' (trilinos#10774, trilinos#10813) Fix running of mpiexec MyApp for rhel7_sems-intel-17.0.1-mpich-3.2-serial build (trilinos#10774, trilinos#10813) ...
CC: @csiefer2 FYI: PR #10872 was merged just 23 hours ago and since then, there have not been any failures. In fact, if you sort this query by build name, you will see this test went from failing to passing in PR builds run after the merge for PRs #10851, #10834, #10827, #10808, #10802, and #10751. That is pretty good evidence that this test is fixed. Closing as complete.
Description
After the merge of PR #10813, there are now install tests and a test that calls find_package(Trilinos) to see if there are problems with the TrilinosConfig.cmake file. All of those tests were run and passed with PR #10813.
For example, for the last testing iteration for PR #10775 shown here, we see the failing test TrilinosInstallTests_find_package_Trilinos here, which reports a failure to find ShyLU_NodeConfig.cmake.
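For readers unfamiliar with this kind of check, a downstream project that exercises TrilinosConfig.cmake can be sketched roughly as follows. This is not the project used by the actual test, which is not shown in this thread. The project name is made up, and the sketch assumes the Trilinos install prefix is on CMAKE_PREFIX_PATH and that suitable C and C++ compilers are available (an MPI build may need the MPI compiler wrappers).

```cmake
cmake_minimum_required(VERSION 3.17)

# Hypothetical project name; any name works.
project(FindTrilinosSketch LANGUAGES C CXX)

# Loads TrilinosConfig.cmake from the install tree. If that file
# pulls in a package config file that was never installed, the
# configure step fails at this call.
find_package(Trilinos REQUIRED)

# Report what was found, using variables set by TrilinosConfig.cmake.
message(STATUS "Trilinos_VERSION      = ${Trilinos_VERSION}")
message(STATUS "Trilinos_PACKAGE_LIST = ${Trilinos_PACKAGE_LIST}")
```

Configuring this project against an install tree, with `-D CMAKE_PREFIX_PATH=` pointing at the Trilinos install prefix, is enough to surface a missing package config file, since no build step is required.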