Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

MueLu: change default for "aggregation: deterministic" to true #12351

Open
wants to merge 1 commit into
base: develop
Choose a base branch
from

Conversation

GrahamBenHarper
Copy link
Contributor

@GrahamBenHarper GrahamBenHarper commented Oct 2, 2023

@trilinos/muelu @cgcgcg

Motivation

This has been a long-needed change, and has been suggested by applications (e.g. @rppawlo in #11026), but it needs careful evaluation. We discussed it at standup today, and we decided we will pursue this if we can be certain from a performance standpoint that we don't horribly increase aggregation time running with default settings. I'll follow-up with performance testing data in this PR once it's done. This is backwards incompatible, so it's appropriate to try to merge before Trilinos 15.0 is released if the performance looks reasonable.

I'll ping other relevant application contacts once I'm certain this is okay to merge.

Related Issues

Closes #11026.
See also issues linked in #11026, for example.

Testing

This was never covered by testing before, as our usual "fix" was to manually specify "aggregation: deterministic" = true in testing, c.f. #11022 for example. Between the autotester and the performance tests, we should have everything covered.

@trilinos-autotester
Copy link
Contributor

Status Flag 'Pre-Test Inspection' - Auto Inspected - Inspection is Not Necessary for this Pull Request.

@trilinos-autotester
Copy link
Contributor

Status Flag 'Pull Request AutoTester' - Testing Jenkins Projects:

Pull Request Auto Testing STARTING (click to expand)

Build Information

Test Name: Trilinos_PR_gcc-8.3.0

  • Build Num: 3026
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
FORCE_CLEAN true
GENCONFIG_BUILD_NAME rhel7_sems-gnu-8.3.0-openmpi-1.10.1-openmp_release-debug_static_no-kokkos-arch_no-asan_no-complex_no-fpic_mpi_no-pt_no-rdc_no-uvm_deprecated-on_no-package-enables
PR_LABELS pkg: MueLu
PULLREQUESTNUM 12351
PULLREQUEST_CDASH_TRACK Pull Request
TEST_REPO_ALIAS TRILINOS
TRILINOS_NODE_LABEL trilinos-any
TRILINOS_SOURCE_REPO https://github.com/GrahamBenHarper/Trilinos
TRILINOS_SOURCE_SHA 9fc7f46
TRILINOS_SRN_CONFIG true
TRILINOS_TARGET_BRANCH develop
TRILINOS_TARGET_REPO https://github.com/trilinos/Trilinos
TRILINOS_TARGET_SHA 9c30d79

Build Information

Test Name: Trilinos_PR_gcc-8.3.0-serial

  • Build Num: 1525
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
FORCE_CLEAN true
GENCONFIG_BUILD_NAME rhel7_sems-v2-gnu-8.3.0-serial_release-debug_shared_no-kokkos-arch_no-asan_no-complex_no-fpic_no-mpi_no-pt_no-rdc_no-uvm_deprecated-on_no-package-enables
PR_LABELS pkg: MueLu
PULLREQUESTNUM 12351
PULLREQUEST_CDASH_TRACK Pull Request
TEST_REPO_ALIAS TRILINOS
TRILINOS_NODE_LABEL trilinos-any
TRILINOS_SOURCE_REPO https://github.com/GrahamBenHarper/Trilinos
TRILINOS_SOURCE_SHA 9fc7f46
TRILINOS_SRN_CONFIG true
TRILINOS_TARGET_BRANCH develop
TRILINOS_TARGET_REPO https://github.com/trilinos/Trilinos
TRILINOS_TARGET_SHA 9c30d79

Build Information

Test Name: Trilinos_PR_gcc-8.3.0-debug

  • Build Num: 1515
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
FORCE_CLEAN true
GENCONFIG_BUILD_NAME rhel7_sems-gnu-8.3.0-openmpi-1.10.1-serial_debug_shared_no-kokkos-arch_no-asan_no-complex_no-fpic_mpi_no-pt_no-rdc_no-uvm_deprecated-on_no-package-enables
PR_LABELS pkg: MueLu
PULLREQUESTNUM 12351
PULLREQUEST_CDASH_TRACK Pull Request
TEST_REPO_ALIAS TRILINOS
TRILINOS_NODE_LABEL trilinos-any
TRILINOS_SOURCE_REPO https://github.com/GrahamBenHarper/Trilinos
TRILINOS_SOURCE_SHA 9fc7f46
TRILINOS_SRN_CONFIG true
TRILINOS_TARGET_BRANCH develop
TRILINOS_TARGET_REPO https://github.com/trilinos/Trilinos
TRILINOS_TARGET_SHA 9c30d79

Build Information

Test Name: Trilinos_PR_clang-11.0.1

  • Build Num: 1514
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
FORCE_CLEAN true
GENCONFIG_BUILD_NAME rhel7_sems-clang-11.0.1-openmpi-1.10.1-serial_release-debug_shared_no-kokkos-arch_no-asan_no-complex_no-fpic_mpi_no-pt_no-rdc_no-uvm_deprecated-on_no-package-enables
PR_LABELS pkg: MueLu
PULLREQUESTNUM 12351
PULLREQUEST_CDASH_TRACK Pull Request
TEST_REPO_ALIAS TRILINOS
TRILINOS_NODE_LABEL trilinos-any
TRILINOS_SOURCE_REPO https://github.com/GrahamBenHarper/Trilinos
TRILINOS_SOURCE_SHA 9fc7f46
TRILINOS_SRN_CONFIG true
TRILINOS_TARGET_BRANCH develop
TRILINOS_TARGET_REPO https://github.com/trilinos/Trilinos
TRILINOS_TARGET_SHA 9c30d79

Build Information

Test Name: Trilinos_PR_python3

  • Build Num: 2729
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
FORCE_CLEAN true
GENCONFIG_BUILD_NAME rhel7_sems-gnu-7.2.0-anaconda3-serial_debug_shared_no-kokkos-arch_no-asan_no-complex_no-fpic_no-mpi_no-pt_no-rdc_no-uvm_deprecated-on_pr-framework
PR_LABELS pkg: MueLu
PULLREQUESTNUM 12351
PULLREQUEST_CDASH_TRACK Pull Request
TEST_REPO_ALIAS TRILINOS
TRILINOS_NODE_LABEL ascic
TRILINOS_SOURCE_REPO https://github.com/GrahamBenHarper/Trilinos
TRILINOS_SOURCE_SHA 9fc7f46
TRILINOS_SRN_CONFIG true
TRILINOS_TARGET_BRANCH develop
TRILINOS_TARGET_REPO https://github.com/trilinos/Trilinos
TRILINOS_TARGET_SHA 9c30d79

Build Information

Test Name: Trilinos_PR_cuda-11.4.2-uvm-off

  • Build Num: 2517
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
FORCE_CLEAN true
GENCONFIG_BUILD_NAME rhel7_sems-cuda-11.4.2-sems-gnu-10.1.0-sems-openmpi-4.0.5_release_static_Volta70_no-asan_complex_no-fpic_mpi_pt_no-rdc_no-uvm_deprecated-on_no-package-enables
PR_LABELS pkg: MueLu
PULLREQUESTNUM 12351
PULLREQUEST_CDASH_TRACK Pull Request
TEST_REPO_ALIAS TRILINOS
TRILINOS_NODE_LABEL GPU
TRILINOS_SOURCE_REPO https://github.com/GrahamBenHarper/Trilinos
TRILINOS_SOURCE_SHA 9fc7f46
TRILINOS_SRN_CONFIG true
TRILINOS_TARGET_BRANCH develop
TRILINOS_TARGET_REPO https://github.com/trilinos/Trilinos
TRILINOS_TARGET_SHA 9c30d79

Build Information

Test Name: Trilinos_PR_intel-2021.3

  • Build Num: 1156
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
FORCE_CLEAN true
GENCONFIG_BUILD_NAME rhel7_sems-intel-2021.3-sems-openmpi-4.0.5_release-debug_shared_no-kokkos-arch_no-asan_no-complex_fpic_mpi_no-pt_no-rdc_no-uvm_deprecated-off_no-package-enables
PR_LABELS pkg: MueLu
PULLREQUESTNUM 12351
PULLREQUEST_CDASH_TRACK Pull Request
TEST_REPO_ALIAS TRILINOS
TRILINOS_NODE_LABEL trilinos-any
TRILINOS_SOURCE_REPO https://github.com/GrahamBenHarper/Trilinos
TRILINOS_SOURCE_SHA 9fc7f46
TRILINOS_SRN_CONFIG true
TRILINOS_TARGET_BRANCH develop
TRILINOS_TARGET_REPO https://github.com/trilinos/Trilinos
TRILINOS_TARGET_SHA 9c30d79

Using Repos:

Repo: TRILINOS (GrahamBenHarper/Trilinos)
  • Branch: muelu-deterministic-agg
  • SHA: 9fc7f46
  • Mode: TEST_REPO

Pull Request Author: GrahamBenHarper

@trilinos-autotester
Copy link
Contributor

Status Flag 'Pull Request AutoTester' - Jenkins Testing: 1 or more Jobs FAILED

Note: Testing will normally be attempted again in approx. 2 Hrs 30 Mins. If a change to the PR source branch occurs, the testing will be attempted again on next available autotester run.

Pull Request Auto Testing has FAILED (click to expand)

Build Information

Test Name: Trilinos_PR_gcc-8.3.0

  • Build Num: 3026
  • Status: FAILED

Jenkins Parameters

Parameter Name Value
FORCE_CLEAN true
GENCONFIG_BUILD_NAME rhel7_sems-gnu-8.3.0-openmpi-1.10.1-openmp_release-debug_static_no-kokkos-arch_no-asan_no-complex_no-fpic_mpi_no-pt_no-rdc_no-uvm_deprecated-on_no-package-enables
PR_LABELS pkg: MueLu
PULLREQUESTNUM 12351
PULLREQUEST_CDASH_TRACK Pull Request
TEST_REPO_ALIAS TRILINOS
TRILINOS_NODE_LABEL trilinos-any
TRILINOS_SOURCE_REPO https://github.com/GrahamBenHarper/Trilinos
TRILINOS_SOURCE_SHA 9fc7f46
TRILINOS_SRN_CONFIG true
TRILINOS_TARGET_BRANCH develop
TRILINOS_TARGET_REPO https://github.com/trilinos/Trilinos
TRILINOS_TARGET_SHA 9c30d79

Build Information

Test Name: Trilinos_PR_gcc-8.3.0-serial

  • Build Num: 1525
  • Status: PASSED

Jenkins Parameters

Parameter Name Value
FORCE_CLEAN true
GENCONFIG_BUILD_NAME rhel7_sems-v2-gnu-8.3.0-serial_release-debug_shared_no-kokkos-arch_no-asan_no-complex_no-fpic_no-mpi_no-pt_no-rdc_no-uvm_deprecated-on_no-package-enables
PR_LABELS pkg: MueLu
PULLREQUESTNUM 12351
PULLREQUEST_CDASH_TRACK Pull Request
TEST_REPO_ALIAS TRILINOS
TRILINOS_NODE_LABEL trilinos-any
TRILINOS_SOURCE_REPO https://github.com/GrahamBenHarper/Trilinos
TRILINOS_SOURCE_SHA 9fc7f46
TRILINOS_SRN_CONFIG true
TRILINOS_TARGET_BRANCH develop
TRILINOS_TARGET_REPO https://github.com/trilinos/Trilinos
TRILINOS_TARGET_SHA 9c30d79

Build Information

Test Name: Trilinos_PR_gcc-8.3.0-debug

  • Build Num: 1515
  • Status: PASSED

Jenkins Parameters

Parameter Name Value
FORCE_CLEAN true
GENCONFIG_BUILD_NAME rhel7_sems-gnu-8.3.0-openmpi-1.10.1-serial_debug_shared_no-kokkos-arch_no-asan_no-complex_no-fpic_mpi_no-pt_no-rdc_no-uvm_deprecated-on_no-package-enables
PR_LABELS pkg: MueLu
PULLREQUESTNUM 12351
PULLREQUEST_CDASH_TRACK Pull Request
TEST_REPO_ALIAS TRILINOS
TRILINOS_NODE_LABEL trilinos-any
TRILINOS_SOURCE_REPO https://github.com/GrahamBenHarper/Trilinos
TRILINOS_SOURCE_SHA 9fc7f46
TRILINOS_SRN_CONFIG true
TRILINOS_TARGET_BRANCH develop
TRILINOS_TARGET_REPO https://github.com/trilinos/Trilinos
TRILINOS_TARGET_SHA 9c30d79

Build Information

Test Name: Trilinos_PR_clang-11.0.1

  • Build Num: 1514
  • Status: PASSED

Jenkins Parameters

Parameter Name Value
FORCE_CLEAN true
GENCONFIG_BUILD_NAME rhel7_sems-clang-11.0.1-openmpi-1.10.1-serial_release-debug_shared_no-kokkos-arch_no-asan_no-complex_no-fpic_mpi_no-pt_no-rdc_no-uvm_deprecated-on_no-package-enables
PR_LABELS pkg: MueLu
PULLREQUESTNUM 12351
PULLREQUEST_CDASH_TRACK Pull Request
TEST_REPO_ALIAS TRILINOS
TRILINOS_NODE_LABEL trilinos-any
TRILINOS_SOURCE_REPO https://github.com/GrahamBenHarper/Trilinos
TRILINOS_SOURCE_SHA 9fc7f46
TRILINOS_SRN_CONFIG true
TRILINOS_TARGET_BRANCH develop
TRILINOS_TARGET_REPO https://github.com/trilinos/Trilinos
TRILINOS_TARGET_SHA 9c30d79

Build Information

Test Name: Trilinos_PR_python3

  • Build Num: 2729
  • Status: PASSED

Jenkins Parameters

Parameter Name Value
FORCE_CLEAN true
GENCONFIG_BUILD_NAME rhel7_sems-gnu-7.2.0-anaconda3-serial_debug_shared_no-kokkos-arch_no-asan_no-complex_no-fpic_no-mpi_no-pt_no-rdc_no-uvm_deprecated-on_pr-framework
PR_LABELS pkg: MueLu
PULLREQUESTNUM 12351
PULLREQUEST_CDASH_TRACK Pull Request
TEST_REPO_ALIAS TRILINOS
TRILINOS_NODE_LABEL ascic
TRILINOS_SOURCE_REPO https://github.com/GrahamBenHarper/Trilinos
TRILINOS_SOURCE_SHA 9fc7f46
TRILINOS_SRN_CONFIG true
TRILINOS_TARGET_BRANCH develop
TRILINOS_TARGET_REPO https://github.com/trilinos/Trilinos
TRILINOS_TARGET_SHA 9c30d79

Build Information

Test Name: Trilinos_PR_cuda-11.4.2-uvm-off

  • Build Num: 2517
  • Status: PASSED

Jenkins Parameters

Parameter Name Value
FORCE_CLEAN true
GENCONFIG_BUILD_NAME rhel7_sems-cuda-11.4.2-sems-gnu-10.1.0-sems-openmpi-4.0.5_release_static_Volta70_no-asan_complex_no-fpic_mpi_pt_no-rdc_no-uvm_deprecated-on_no-package-enables
PR_LABELS pkg: MueLu
PULLREQUESTNUM 12351
PULLREQUEST_CDASH_TRACK Pull Request
TEST_REPO_ALIAS TRILINOS
TRILINOS_NODE_LABEL GPU
TRILINOS_SOURCE_REPO https://github.com/GrahamBenHarper/Trilinos
TRILINOS_SOURCE_SHA 9fc7f46
TRILINOS_SRN_CONFIG true
TRILINOS_TARGET_BRANCH develop
TRILINOS_TARGET_REPO https://github.com/trilinos/Trilinos
TRILINOS_TARGET_SHA 9c30d79

Build Information

Test Name: Trilinos_PR_intel-2021.3

  • Build Num: 1156
  • Status: PASSED

Jenkins Parameters

Parameter Name Value
FORCE_CLEAN true
GENCONFIG_BUILD_NAME rhel7_sems-intel-2021.3-sems-openmpi-4.0.5_release-debug_shared_no-kokkos-arch_no-asan_no-complex_fpic_mpi_no-pt_no-rdc_no-uvm_deprecated-off_no-package-enables
PR_LABELS pkg: MueLu
PULLREQUESTNUM 12351
PULLREQUEST_CDASH_TRACK Pull Request
TEST_REPO_ALIAS TRILINOS
TRILINOS_NODE_LABEL trilinos-any
TRILINOS_SOURCE_REPO https://github.com/GrahamBenHarper/Trilinos
TRILINOS_SOURCE_SHA 9fc7f46
TRILINOS_SRN_CONFIG true
TRILINOS_TARGET_BRANCH develop
TRILINOS_TARGET_REPO https://github.com/trilinos/Trilinos
TRILINOS_TARGET_SHA 9c30d79


CDash Test Results for PR# 12351.


Wiki: How to Reproduce PR Testing Builds and Errors.

@GrahamBenHarper
Copy link
Contributor Author

@cgcgcg @jhux2
Here are some performance studies from our ATS2 testbed Vortex using MueLu_Driver --matrixType=Laplace3D --xml=scaling.xml.

Problem size: 1000000 (100x100x100)
Driver: 2 - MueLu Setup                                                                                       0.3937 (1)                 
MueLu: AggregationPhase1Algorithm_kokkos: BuildAggregatesDeterministic (total)                                0.003601 (3)               
MueLu: AggregationPhase2aAlgorithm_kokkos: BuildAggregatesDeterministic (total)                               0.03939 (3)                
MueLu: AggregationPhase2bAlgorithm_kokkos: BuildAggregatesDeterministic (total)                               0.05134 (3)                
MueLu: AggregationPhase3Algorithm_kokkos: BuildAggregatesDeterministic (total)                                0.01928 (3)                
MueLu: PreserveDirichletAggregationAlgorithm_kokkos: BuildAggregates (total)                                  0.0004438 (3)              

Problem size: 1000000 (100x100x100)
Driver: 2 - MueLu Setup                                                                                       0.3061 (1)                 
MueLu: AggregationPhase1Algorithm_kokkos: BuildAggregatesRandom (total)                                       0.001741 (3)               
MueLu: AggregationPhase2aAlgorithm_kokkos: BuildAggregatesRandom (total)                                      0.01619 (3)                
MueLu: AggregationPhase2bAlgorithm_kokkos: BuildAggregatesRandom (total)                                      0.03281 (3)                
MueLu: AggregationPhase3Algorithm_kokkos: BuildAggregatesRandom (total)                                       0.0176 (3)                 
MueLu: PreserveDirichletAggregationAlgorithm_kokkos: BuildAggregates (total)                                  0.0004288 (3)              

Problem size: 8000000 (200x200x200)
Driver: 2 - MueLu Setup                                                                                       1.287 (1)                  
MueLu: AggregationPhase1Algorithm_kokkos: BuildAggregatesDeterministic (total)                                0.008021 (3)               
MueLu: AggregationPhase2aAlgorithm_kokkos: BuildAggregatesDeterministic (total)                               0.05038 (3)                
MueLu: AggregationPhase2bAlgorithm_kokkos: BuildAggregatesDeterministic (total)                               0.06513 (3)                
MueLu: AggregationPhase3Algorithm_kokkos: BuildAggregatesDeterministic (total)                                0.02173 (3)                
MueLu: PreserveDirichletAggregationAlgorithm_kokkos: BuildAggregates (total)                                  0.0007424 (3)              

Problem size: 8000000 (200x200x200)
Driver: 2 - MueLu Setup                                                                                       1.083 (1)                  
MueLu: AggregationPhase1Algorithm_kokkos: BuildAggregatesRandom (total)                                       0.01185 (3)                
MueLu: AggregationPhase2aAlgorithm_kokkos: BuildAggregatesRandom (total)                                      0.01868 (3)                
MueLu: AggregationPhase2bAlgorithm_kokkos: BuildAggregatesRandom (total)                                      0.04664 (3)                
MueLu: AggregationPhase3Algorithm_kokkos: BuildAggregatesRandom (total)                                       0.01846 (3)                
MueLu: PreserveDirichletAggregationAlgorithm_kokkos: BuildAggregates (total)                                  0.0006285 (3) 

We see a slowdown in setup of 28% on the 100x100x100 problem and a slowdown in setup of 18% on the 200x200x200 problem. Does that seem reasonable, or is that too significant?

@cgcgcg
Copy link
Contributor

cgcgcg commented Oct 9, 2023

While you're at it, can you also benchmark the coloring on device?

@csiefer2
Copy link
Member

@GrahamBenHarper Can we replicate 100x, please? Numbers that small are generally noisy.

@GrahamBenHarper
Copy link
Contributor Author

GrahamBenHarper commented Oct 14, 2023

@cgcgcg @csiefer2 @rppawlo @jhux2
Based on feedback, I ran a more comprehensive suite of tests on one GPU of Vortex using a Laplace3d matrix. I wanted to run the full four GPUs, but I kept getting strange errors about TCP connections when I tried more than one MPI rank on Vortex. There's probably some setting or configuration I'm missing, but this should still roughly capture the performance differences between the various algorithms. Each run uses this input deck with a minor modification depending on the specific case.

scaling.xml
<ParameterList name="MueLu">

  <!--
    For a generic symmetric scalar problem, these are the recommended settings for MueLu.
  -->

  <!-- ===========  GENERAL ================ -->
    <Parameter        name="verbosity"                            type="string"   value="high"/>
    <Parameter        name="coarse: max size"                     type="int"      value="1000"/>
    <Parameter        name="multigrid algorithm"                  type="string"   value="sa"/>

    <!-- reduces setup cost for symmetric problems -->
    <Parameter        name="transpose: use implicit"              type="bool"     value="true"/>
    <Parameter        name="transfers: half precision"            type="bool"     value="false"/>

    <!-- start of default values for general options (can be omitted) -->
    <Parameter        name="max levels"                	          type="int"      value="10"/>
    <Parameter        name="number of equations"                  type="int"      value="1"/>
    <Parameter        name="sa: use filtered matrix"              type="bool"     value="true"/>
    <!-- end of default values -->

  <!-- ===========  AGGREGATION  =========== -->
    <Parameter        name="aggregation: type"                    type="string"   value="uncoupled"/>
    <Parameter        name="aggregation: drop scheme"             type="string"   value="classical"/>
    <Parameter        name="aggregation: deterministic"           type="bool"     value="false"/>
    <Parameter        name="aggregation: coloring algorithm"      type="string"   value="mis2 coarsening"/>
    <!-- Uncomment the next line to enable dropping of weak connections, which can help AMG convergence
         for anisotropic problems.  The exact value is problem dependent. -->
    <!-- <Parameter        name="aggregation: drop tol"                type="double"   value="0.02"/> -->

  <!-- ===========  SMOOTHING  =========== -->
    <Parameter        name="smoother: type"                       type="string"   value="CHEBYSHEV"/>
    <ParameterList    name="smoother: params">
      <Parameter      name="chebyshev: degree"                    type="int"      value="2"/>
      <Parameter      name="chebyshev: ratio eigenvalue"          type="double"   value="7"/>
      <Parameter      name="chebyshev: min eigenvalue"            type="double"   value="1.0"/>
      <Parameter      name="chebyshev: zero starting solution"    type="bool"     value="true"/>
    </ParameterList>

</ParameterList>

I did 100 runs on three grid sizes: 60x60x60 (2 levels, for comparison with EMPIRE), 100x100x100 (3 levels), and 200x200x200 (3 levels), with each of the following aggregation scenarios:

mis2 coarsening
./output3d_mis2c_n60_np1.txt
 Problem size: 216000 (60x60x60)
Driver: 2 - MueLu Setup                                                                        9.64 (1)                   
MueLu: Laplace3D: TentativePFactory_kokkos: AverageCoords (sub, total)                         0.007753 (100)             
MueLu: Laplace3D: TentativePFactory_kokkos: Build                                              0.4895 (200)               
MueLu: Laplace3D: TentativePFactory_kokkos: Build (level=1)                                    0.3552 (100)               
MueLu: Laplace3D: TentativePFactory_kokkos: Build (level=2)                                    0.1359 (100)               
MueLu: Laplace3D: TentativePFactory_kokkos: Build (total)                                      1.154 (200)                
MueLu: Laplace3D: TentativePFactory_kokkos: Calc AggSizes (sub, total)                         0.01569 (200)              
MueLu: Laplace3D: TentativePFactory_kokkos: Check good map (sub, total)                        0.007654 (200)             
MueLu: Laplace3D: TentativePFactory_kokkos: Create Agg2RowMap (sub, total)                     0.01988 (200)              
MueLu: Laplace3D: TentativePFactory_kokkos: Get Aggregates graph (sub, total)                  0.02715 (200)              
MueLu: Laplace3D: TentativePFactory_kokkos: Stage 0 (InitViews) (sub, total)                   0.00836 (200)              
MueLu: Laplace3D: TentativePFactory_kokkos: Stage 1 (LocalQR) (sub, total)                     0.02915 (200)              
MueLu: Laplace3D: TentativePFactory_kokkos: Stage 3 (LocalMatrix+FillComplete) (sub, total)    0.01653 (200)              
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'MIS2' (sub, total)                 0.5617 (200)               
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build                                    0.6538 (200)               
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build (level=0)                          0.4054 (100)               
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build (level=1)                          0.2471 (100)               
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build (total)                            0.6548 (200)               
./output3d_mis2c_n100_np1.txt
 Problem size: 1000000 (100x100x100)
Driver: 2 - MueLu Setup                                                                        19.44 (1)                  
MueLu: Laplace3D: TentativePFactory_kokkos: AverageCoords (sub, total)                         0.01773 (100)              
MueLu: Laplace3D: TentativePFactory_kokkos: Build                                              1.068 (300)                
MueLu: Laplace3D: TentativePFactory_kokkos: Build (level=1)                                    0.6938 (100)               
MueLu: Laplace3D: TentativePFactory_kokkos: Build (level=2)                                    0.2243 (100)               
MueLu: Laplace3D: TentativePFactory_kokkos: Build (level=3)                                    0.1535 (100)               
MueLu: Laplace3D: TentativePFactory_kokkos: Build (total)                                      2.72 (300)                 
MueLu: Laplace3D: TentativePFactory_kokkos: Calc AggSizes (sub, total)                         0.0293 (300)               
MueLu: Laplace3D: TentativePFactory_kokkos: Check good map (sub, total)                        0.01368 (300)              
MueLu: Laplace3D: TentativePFactory_kokkos: Create Agg2RowMap (sub, total)                     0.03865 (300)              
MueLu: Laplace3D: TentativePFactory_kokkos: Get Aggregates graph (sub, total)                  0.06061 (300)              
MueLu: Laplace3D: TentativePFactory_kokkos: Stage 0 (InitViews) (sub, total)                   0.01539 (300)              
MueLu: Laplace3D: TentativePFactory_kokkos: Stage 1 (LocalQR) (sub, total)                     0.06864 (300)              
MueLu: Laplace3D: TentativePFactory_kokkos: Stage 3 (LocalMatrix+FillComplete) (sub, total)    0.02967 (300)              
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'MIS2' (sub, total)                 1.363 (300)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build                                    1.63 (300)                 
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build (level=0)                          0.9794 (100)               
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build (level=1)                          0.4162 (100)               
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build (level=2)                          0.232 (100)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build (total)                            1.632 (300)                
./output3d_mis2c_n200_np1.txt
 Problem size: 8000000 (200x200x200)
Driver: 2 - MueLu Setup                                                                        64.75 (1)                  
MueLu: Laplace3D: TentativePFactory_kokkos: AverageCoords (sub, total)                         0.1475 (100)               
MueLu: Laplace3D: TentativePFactory_kokkos: Build                                              3.115 (300)                
MueLu: Laplace3D: TentativePFactory_kokkos: Build (level=1)                                    2.495 (100)                
MueLu: Laplace3D: TentativePFactory_kokkos: Build (level=2)                                    0.4591 (100)               
MueLu: Laplace3D: TentativePFactory_kokkos: Build (level=3)                                    0.1653 (100)               
MueLu: Laplace3D: TentativePFactory_kokkos: Build (total)                                      8.463 (300)                
MueLu: Laplace3D: TentativePFactory_kokkos: Calc AggSizes (sub, total)                         0.06833 (300)              
MueLu: Laplace3D: TentativePFactory_kokkos: Check good map (sub, total)                        0.0245 (300)               
MueLu: Laplace3D: TentativePFactory_kokkos: Create Agg2RowMap (sub, total)                     0.1245 (300)               
MueLu: Laplace3D: TentativePFactory_kokkos: Get Aggregates graph (sub, total)                  0.1025 (300)               
MueLu: Laplace3D: TentativePFactory_kokkos: Stage 0 (InitViews) (sub, total)                   0.03523 (300)              
MueLu: Laplace3D: TentativePFactory_kokkos: Stage 1 (LocalQR) (sub, total)                     0.2664 (300)               
MueLu: Laplace3D: TentativePFactory_kokkos: Stage 3 (LocalMatrix+FillComplete) (sub, total)    0.04379 (300)              
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'MIS2' (sub, total)                 4.603 (300)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build                                    5.319 (300)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build (level=0)                          3.521 (100)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build (level=1)                          1.525 (100)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build (level=2)                          0.2694 (100)               
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build (total)                            5.321 (300)                
mis2 aggregation
./output3d_mis2a_n60_np1.txt
 Problem size: 216000 (60x60x60)
Driver: 2 - MueLu Setup                                                                        10.33 (1)                  
MueLu: Laplace3D: TentativePFactory_kokkos: AverageCoords (sub, total)                         0.007499 (100)             
MueLu: Laplace3D: TentativePFactory_kokkos: Build                                              0.4967 (200)               
MueLu: Laplace3D: TentativePFactory_kokkos: Build (level=1)                                    0.362 (100)                
MueLu: Laplace3D: TentativePFactory_kokkos: Build (level=2)                                    0.1364 (100)               
MueLu: Laplace3D: TentativePFactory_kokkos: Build (total)                                      1.665 (200)                
MueLu: Laplace3D: TentativePFactory_kokkos: Calc AggSizes (sub, total)                         0.01567 (200)              
MueLu: Laplace3D: TentativePFactory_kokkos: Check good map (sub, total)                        0.007584 (200)             
MueLu: Laplace3D: TentativePFactory_kokkos: Create Agg2RowMap (sub, total)                     0.01997 (200)              
MueLu: Laplace3D: TentativePFactory_kokkos: Get Aggregates graph (sub, total)                  0.02766 (200)              
MueLu: Laplace3D: TentativePFactory_kokkos: Stage 0 (InitViews) (sub, total)                   0.008395 (200)             
MueLu: Laplace3D: TentativePFactory_kokkos: Stage 1 (LocalQR) (sub, total)                     0.02908 (200)              
MueLu: Laplace3D: TentativePFactory_kokkos: Stage 3 (LocalMatrix+FillComplete) (sub, total)    0.01617 (200)              
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'MIS2' (sub, total)                 1.059 (200)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build                                    1.157 (200)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build (level=0)                          0.6975 (100)               
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build (level=1)                          0.4581 (100)               
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build (total)                            1.158 (200)                
./output3d_mis2a_n100_np1.txt
 Problem size: 1000000 (100x100x100)
Driver: 2 - MueLu Setup                                                                        24.42 (1)                  
MueLu: Laplace3D: TentativePFactory_kokkos: AverageCoords (sub, total)                         0.02154 (100)              
MueLu: Laplace3D: TentativePFactory_kokkos: Build                                              1.337 (300)                
MueLu: Laplace3D: TentativePFactory_kokkos: Build (level=1)                                    0.9002 (100)               
MueLu: Laplace3D: TentativePFactory_kokkos: Build (level=2)                                    0.2782 (100)               
MueLu: Laplace3D: TentativePFactory_kokkos: Build (level=3)                                    0.1635 (100)               
MueLu: Laplace3D: TentativePFactory_kokkos: Build (total)                                      4.295 (300)                
MueLu: Laplace3D: TentativePFactory_kokkos: Calc AggSizes (sub, total)                         0.03115 (300)              
MueLu: Laplace3D: TentativePFactory_kokkos: Check good map (sub, total)                        0.01638 (300)              
MueLu: Laplace3D: TentativePFactory_kokkos: Create Agg2RowMap (sub, total)                     0.04173 (300)              
MueLu: Laplace3D: TentativePFactory_kokkos: Get Aggregates graph (sub, total)                  0.0624 (300)               
MueLu: Laplace3D: TentativePFactory_kokkos: Stage 0 (InitViews) (sub, total)                   0.01857 (300)              
MueLu: Laplace3D: TentativePFactory_kokkos: Stage 1 (LocalQR) (sub, total)                     0.07443 (300)              
MueLu: Laplace3D: TentativePFactory_kokkos: Stage 3 (LocalMatrix+FillComplete) (sub, total)    0.03224 (300)              
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'MIS2' (sub, total)                 2.648 (300)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build                                    2.931 (300)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build (level=0)                          1.642 (100)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build (level=1)                          0.8719 (100)               
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build (level=2)                          0.4135 (100)               
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build (total)                            2.933 (300)                
./output3d_mis2a_n200_np1.txt
 Problem size: 8000000 (200x200x200)
Driver: 2 - MueLu Setup                                                                        86.15 (1)                  
MueLu: Laplace3D: TentativePFactory_kokkos: AverageCoords (sub, total)                         0.1605 (100)               
MueLu: Laplace3D: TentativePFactory_kokkos: Build                                              3.576 (300)                
MueLu: Laplace3D: TentativePFactory_kokkos: Build (level=1)                                    2.886 (100)                
MueLu: Laplace3D: TentativePFactory_kokkos: Build (level=2)                                    0.5292 (100)               
MueLu: Laplace3D: TentativePFactory_kokkos: Build (level=3)                                    0.1657 (100)               
MueLu: Laplace3D: TentativePFactory_kokkos: Build (total)                                      13.08 (300)                
MueLu: Laplace3D: TentativePFactory_kokkos: Calc AggSizes (sub, total)                         0.07185 (300)              
MueLu: Laplace3D: TentativePFactory_kokkos: Check good map (sub, total)                        0.02421 (300)              
MueLu: Laplace3D: TentativePFactory_kokkos: Create Agg2RowMap (sub, total)                     0.1318 (300)               
MueLu: Laplace3D: TentativePFactory_kokkos: Get Aggregates graph (sub, total)                  0.1098 (300)               
MueLu: Laplace3D: TentativePFactory_kokkos: Stage 0 (InitViews) (sub, total)                   0.03574 (300)              
MueLu: Laplace3D: TentativePFactory_kokkos: Stage 1 (LocalQR) (sub, total)                     0.3478 (300)               
MueLu: Laplace3D: TentativePFactory_kokkos: Stage 3 (LocalMatrix+FillComplete) (sub, total)    0.04379 (300)              
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'MIS2' (sub, total)                 8.743 (300)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build                                    9.476 (300)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build (level=0)                          5.577 (100)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build (level=1)                          3.366 (100)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build (level=2)                          0.5301 (100)               
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build (total)                            9.478 (300)                
vertex based
./output3d_vb_n60_np1.txt
 Problem size: 216000 (60x60x60)
Driver: 2 - MueLu Setup                                                                                       24.26 (1)                  
MueLu: AggregationPhase1Algorithm_kokkos: BuildAggregatesRandom (total)                                       0.06959 (200)              
MueLu: AggregationPhase2aAlgorithm_kokkos: BuildAggregatesRandom (total)                                      0.5142 (200)               
MueLu: AggregationPhase2bAlgorithm_kokkos: BuildAggregatesRandom (total)                                      0.9527 (200)               
MueLu: AggregationPhase3Algorithm_kokkos: BuildAggregatesRandom (total)                                       0.7056 (200)               
MueLu: Laplace3D: TentativePFactory_kokkos: AverageCoords (sub, total)                                        0.009984 (100)             
MueLu: Laplace3D: TentativePFactory_kokkos: Build                                                             0.5698 (200)               
MueLu: Laplace3D: TentativePFactory_kokkos: Build (level=1)                                                   0.4029 (100)               
MueLu: Laplace3D: TentativePFactory_kokkos: Build (level=2)                                                   0.1697 (100)               
MueLu: Laplace3D: TentativePFactory_kokkos: Build (total)                                                     14.98 (200)                
MueLu: Laplace3D: TentativePFactory_kokkos: Calc AggSizes (sub, total)                                        0.02046 (200)              
MueLu: Laplace3D: TentativePFactory_kokkos: Check good map (sub, total)                                       0.0107 (200)               
MueLu: Laplace3D: TentativePFactory_kokkos: Create Agg2RowMap (sub, total)                                    0.02597 (200)              
MueLu: Laplace3D: TentativePFactory_kokkos: Get Aggregates graph (sub, total)                                 0.03187 (200)              
MueLu: Laplace3D: TentativePFactory_kokkos: Stage 0 (InitViews) (sub, total)                                  0.01168 (200)              
MueLu: Laplace3D: TentativePFactory_kokkos: Stage 1 (LocalQR) (sub, total)                                    0.03367 (200)              
MueLu: Laplace3D: TentativePFactory_kokkos: Stage 3 (LocalMatrix+FillComplete) (sub, total)                   0.02099 (200)              
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Graph Coloring' (sub, total)                      11.95 (200)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Graph Coloring': KokkosGraph Call (sub, total)    11.93 (200)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Phase - (Dirichlet)' (sub, total)                 0.04336 (200)              
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Phase 1 (main)' (sub, total)                      0.08139 (200)              
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Phase 2a (secondary)' (sub, total)                0.5254 (200)               
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Phase 2b (expansion)' (sub, total)                0.964 (200)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Phase 3 (cleanup)' (sub, total)                   0.717 (200)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build                                                   14.4 (200)                 
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build (level=0)                                         1.018 (100)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build (level=1)                                         13.38 (100)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build (total)                                           14.4 (200)                 
./output3d_vb_n100_np1.txt
 Problem size: 1000000 (100x100x100)
Driver: 2 - MueLu Setup                                                                                       49.86 (1)                  
MueLu: AggregationPhase1Algorithm_kokkos: BuildAggregatesRandom (total)                                       0.2024 (300)               
MueLu: AggregationPhase2aAlgorithm_kokkos: BuildAggregatesRandom (total)                                      1.974 (300)                
MueLu: AggregationPhase2bAlgorithm_kokkos: BuildAggregatesRandom (total)                                      4.072 (300)                
MueLu: AggregationPhase3Algorithm_kokkos: BuildAggregatesRandom (total)                                       2.292 (300)                
MueLu: Laplace3D: TentativePFactory_kokkos: AverageCoords (sub, total)                                        0.01956 (100)              
MueLu: Laplace3D: TentativePFactory_kokkos: Build                                                             1.157 (300)                
MueLu: Laplace3D: TentativePFactory_kokkos: Build (level=1)                                                   0.7511 (100)               
MueLu: Laplace3D: TentativePFactory_kokkos: Build (level=2)                                                   0.238 (100)                
MueLu: Laplace3D: TentativePFactory_kokkos: Build (level=3)                                                   0.1725 (100)               
MueLu: Laplace3D: TentativePFactory_kokkos: Build (total)                                                     31.23 (300)                
MueLu: Laplace3D: TentativePFactory_kokkos: Calc AggSizes (sub, total)                                        0.03164 (300)              
MueLu: Laplace3D: TentativePFactory_kokkos: Check good map (sub, total)                                       0.01652 (300)              
MueLu: Laplace3D: TentativePFactory_kokkos: Create Agg2RowMap (sub, total)                                    0.04291 (300)              
MueLu: Laplace3D: TentativePFactory_kokkos: Get Aggregates graph (sub, total)                                 0.06468 (300)              
MueLu: Laplace3D: TentativePFactory_kokkos: Stage 0 (InitViews) (sub, total)                                  0.01875 (300)              
MueLu: Laplace3D: TentativePFactory_kokkos: Stage 1 (LocalQR) (sub, total)                                    0.08172 (300)              
MueLu: Laplace3D: TentativePFactory_kokkos: Stage 3 (LocalMatrix+FillComplete) (sub, total)                   0.03295 (300)              
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Graph Coloring' (sub, total)                      21.05 (300)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Graph Coloring': KokkosGraph Call (sub, total)    21.03 (300)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Phase - (Dirichlet)' (sub, total)                 0.06805 (300)              
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Phase 1 (main)' (sub, total)                      0.22 (300)                 
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Phase 2a (secondary)' (sub, total)                1.991 (300)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Phase 2b (expansion)' (sub, total)                4.09 (300)                 
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Phase 3 (cleanup)' (sub, total)                   2.31 (300)                 
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build                                                   30.04 (300)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build (level=0)                                         3.217 (100)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build (level=1)                                         14.04 (100)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build (level=2)                                         12.79 (100)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build (total)                                           30.05 (300)                
./output3d_vb_n200_np1.txt
 Problem size: 8000000 (200x200x200)
Driver: 2 - MueLu Setup                                                                                       346.2 (1)                  
MueLu: AggregationPhase1Algorithm_kokkos: BuildAggregatesRandom (total)                                       2.036 (300)                
MueLu: AggregationPhase2aAlgorithm_kokkos: BuildAggregatesRandom (total)                                      3.403 (300)                
MueLu: AggregationPhase2bAlgorithm_kokkos: BuildAggregatesRandom (total)                                      8.283 (300)                
MueLu: AggregationPhase3Algorithm_kokkos: BuildAggregatesRandom (total)                                       3.055 (300)                
MueLu: Laplace3D: TentativePFactory_kokkos: AverageCoords (sub, total)                                        0.1698 (100)               
MueLu: Laplace3D: TentativePFactory_kokkos: Build                                                             3.426 (300)                
MueLu: Laplace3D: TentativePFactory_kokkos: Build (level=1)                                                   2.767 (100)                
MueLu: Laplace3D: TentativePFactory_kokkos: Build (level=2)                                                   0.496 (100)                
MueLu: Laplace3D: TentativePFactory_kokkos: Build (level=3)                                                   0.1673 (100)               
MueLu: Laplace3D: TentativePFactory_kokkos: Build (total)                                                     276.9 (300)                
MueLu: Laplace3D: TentativePFactory_kokkos: Calc AggSizes (sub, total)                                        0.06627 (300)              
MueLu: Laplace3D: TentativePFactory_kokkos: Check good map (sub, total)                                       0.02388 (300)              
MueLu: Laplace3D: TentativePFactory_kokkos: Create Agg2RowMap (sub, total)                                    0.1263 (300)               
MueLu: Laplace3D: TentativePFactory_kokkos: Get Aggregates graph (sub, total)                                 0.1046 (300)               
MueLu: Laplace3D: TentativePFactory_kokkos: Stage 0 (InitViews) (sub, total)                                  0.03466 (300)              
MueLu: Laplace3D: TentativePFactory_kokkos: Stage 1 (LocalQR) (sub, total)                                    0.3788 (300)               
MueLu: Laplace3D: TentativePFactory_kokkos: Stage 3 (LocalMatrix+FillComplete) (sub, total)                   0.04347 (300)              
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Graph Coloring' (sub, total)                      255.8 (300)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Graph Coloring': KokkosGraph Call (sub, total)    255.8 (300)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Phase - (Dirichlet)' (sub, total)                 0.0743 (300)               
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Phase 1 (main)' (sub, total)                      2.053 (300)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Phase 2a (secondary)' (sub, total)                3.419 (300)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Phase 2b (expansion)' (sub, total)                8.301 (300)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Phase 3 (cleanup)' (sub, total)                   3.073 (300)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build                                                   273.5 (300)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build (level=0)                                         27.13 (100)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build (level=1)                                         101.6 (100)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build (level=2)                                         144.7 (100)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build (total)                                           273.5 (300)                
vertex based bit set
./output3d_vbbs_n60_np1.txt
 Problem size: 216000 (60x60x60)
Driver: 2 - MueLu Setup                                                                                       28.98 (1)                  
MueLu: AggregationPhase1Algorithm_kokkos: BuildAggregatesRandom (total)                                       0.06033 (200)              
MueLu: AggregationPhase2aAlgorithm_kokkos: BuildAggregatesRandom (total)                                      0.465 (200)                
MueLu: AggregationPhase2bAlgorithm_kokkos: BuildAggregatesRandom (total)                                      0.8773 (200)               
MueLu: AggregationPhase3Algorithm_kokkos: BuildAggregatesRandom (total)                                       0.6378 (200)               
MueLu: Laplace3D: TentativePFactory_kokkos: AverageCoords (sub, total)                                        0.008049 (100)             
MueLu: Laplace3D: TentativePFactory_kokkos: Build                                                             0.4911 (200)               
MueLu: Laplace3D: TentativePFactory_kokkos: Build (level=1)                                                   0.3511 (100)               
MueLu: Laplace3D: TentativePFactory_kokkos: Build (level=2)                                                   0.1418 (100)               
MueLu: Laplace3D: TentativePFactory_kokkos: Build (total)                                                     20.54 (200)                
MueLu: Laplace3D: TentativePFactory_kokkos: Calc AggSizes (sub, total)                                        0.01587 (200)              
MueLu: Laplace3D: TentativePFactory_kokkos: Check good map (sub, total)                                       0.007847 (200)             
MueLu: Laplace3D: TentativePFactory_kokkos: Create Agg2RowMap (sub, total)                                    0.02082 (200)              
MueLu: Laplace3D: TentativePFactory_kokkos: Get Aggregates graph (sub, total)                                 0.0282 (200)               
MueLu: Laplace3D: TentativePFactory_kokkos: Stage 0 (InitViews) (sub, total)                                  0.008562 (200)             
MueLu: Laplace3D: TentativePFactory_kokkos: Stage 1 (LocalQR) (sub, total)                                    0.02967 (200)              
MueLu: Laplace3D: TentativePFactory_kokkos: Stage 3 (LocalMatrix+FillComplete) (sub, total)                   0.017 (200)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Graph Coloring' (sub, total)                      17.83 (200)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Graph Coloring': KokkosGraph Call (sub, total)    17.82 (200)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Phase - (Dirichlet)' (sub, total)                 0.03533 (200)              
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Phase 1 (main)' (sub, total)                      0.06803 (200)              
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Phase 2a (secondary)' (sub, total)                0.4719 (200)               
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Phase 2b (expansion)' (sub, total)                0.8842 (200)               
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Phase 3 (cleanup)' (sub, total)                   0.6448 (200)               
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build                                                   20.04 (200)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build (level=0)                                         0.8818 (100)               
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build (level=1)                                         19.15 (100)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build (total)                                           20.04 (200)                
./output3d_vbbs_n100_np1.txt
 Problem size: 1000000 (100x100x100)
Driver: 2 - MueLu Setup                                                                                       40.25 (1)                  
MueLu: AggregationPhase1Algorithm_kokkos: BuildAggregatesRandom (total)                                       0.1858 (300)               
MueLu: AggregationPhase2aAlgorithm_kokkos: BuildAggregatesRandom (total)                                      1.784 (300)                
MueLu: AggregationPhase2bAlgorithm_kokkos: BuildAggregatesRandom (total)                                      3.666 (300)                
MueLu: AggregationPhase3Algorithm_kokkos: BuildAggregatesRandom (total)                                       1.939 (300)                
MueLu: Laplace3D: TentativePFactory_kokkos: AverageCoords (sub, total)                                        0.01785 (100)              
MueLu: Laplace3D: TentativePFactory_kokkos: Build                                                             0.9682 (300)               
MueLu: Laplace3D: TentativePFactory_kokkos: Build (level=1)                                                   0.6337 (100)               
MueLu: Laplace3D: TentativePFactory_kokkos: Build (level=2)                                                   0.1966 (100)               
MueLu: Laplace3D: TentativePFactory_kokkos: Build (level=3)                                                   0.1409 (100)               
MueLu: Laplace3D: TentativePFactory_kokkos: Build (total)                                                     23.44 (300)                
MueLu: Laplace3D: TentativePFactory_kokkos: Calc AggSizes (sub, total)                                        0.02419 (300)              
MueLu: Laplace3D: TentativePFactory_kokkos: Check good map (sub, total)                                       0.01251 (300)              
MueLu: Laplace3D: TentativePFactory_kokkos: Create Agg2RowMap (sub, total)                                    0.03456 (300)              
MueLu: Laplace3D: TentativePFactory_kokkos: Get Aggregates graph (sub, total)                                 0.05487 (300)              
MueLu: Laplace3D: TentativePFactory_kokkos: Stage 0 (InitViews) (sub, total)                                  0.01429 (300)              
MueLu: Laplace3D: TentativePFactory_kokkos: Stage 1 (LocalQR) (sub, total)                                    0.07506 (300)              
MueLu: Laplace3D: TentativePFactory_kokkos: Stage 3 (LocalMatrix+FillComplete) (sub, total)                   0.02623 (300)              
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Graph Coloring' (sub, total)                      14.51 (300)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Graph Coloring': KokkosGraph Call (sub, total)    14.49 (300)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Phase - (Dirichlet)' (sub, total)                 0.05048 (300)              
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Phase 1 (main)' (sub, total)                      0.197 (300)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Phase 2a (secondary)' (sub, total)                1.794 (300)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Phase 2b (expansion)' (sub, total)                3.677 (300)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Phase 3 (cleanup)' (sub, total)                   1.95 (300)                 
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build                                                   22.46 (300)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build (level=0)                                         2.817 (100)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build (level=1)                                         8.426 (100)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build (level=2)                                         11.21 (100)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build (total)                                           22.46 (300)                
./output3d_vbbs_n200_np1.txt
 Problem size: 8000000 (200x200x200)
Driver: 2 - MueLu Setup                                                                                       352.7 (1)                  
MueLu: AggregationPhase1Algorithm_kokkos: BuildAggregatesRandom (total)                                       2.036 (300)                
MueLu: AggregationPhase2aAlgorithm_kokkos: BuildAggregatesRandom (total)                                      3.288 (300)                
MueLu: AggregationPhase2bAlgorithm_kokkos: BuildAggregatesRandom (total)                                      8.042 (300)                
MueLu: AggregationPhase3Algorithm_kokkos: BuildAggregatesRandom (total)                                       2.805 (300)                
MueLu: Laplace3D: TentativePFactory_kokkos: AverageCoords (sub, total)                                        0.1678 (100)               
MueLu: Laplace3D: TentativePFactory_kokkos: Build                                                             3.254 (300)                
MueLu: Laplace3D: TentativePFactory_kokkos: Build (level=1)                                                   2.654 (100)                
MueLu: Laplace3D: TentativePFactory_kokkos: Build (level=2)                                                   0.4573 (100)               
MueLu: Laplace3D: TentativePFactory_kokkos: Build (level=3)                                                   0.1455 (100)               
MueLu: Laplace3D: TentativePFactory_kokkos: Build (total)                                                     285.5 (300)                
MueLu: Laplace3D: TentativePFactory_kokkos: Calc AggSizes (sub, total)                                        0.05895 (300)              
MueLu: Laplace3D: TentativePFactory_kokkos: Check good map (sub, total)                                       0.02097 (300)              
MueLu: Laplace3D: TentativePFactory_kokkos: Create Agg2RowMap (sub, total)                                    0.1169 (300)               
MueLu: Laplace3D: TentativePFactory_kokkos: Get Aggregates graph (sub, total)                                 0.0943 (300)               
MueLu: Laplace3D: TentativePFactory_kokkos: Stage 0 (InitViews) (sub, total)                                  0.0317 (300)               
MueLu: Laplace3D: TentativePFactory_kokkos: Stage 1 (LocalQR) (sub, total)                                    0.3739 (300)               
MueLu: Laplace3D: TentativePFactory_kokkos: Stage 3 (LocalMatrix+FillComplete) (sub, total)                   0.03818 (300)              
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Graph Coloring' (sub, total)                      265.3 (300)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Graph Coloring': KokkosGraph Call (sub, total)    265.3 (300)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Phase - (Dirichlet)' (sub, total)                 0.06423 (300)              
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Phase 1 (main)' (sub, total)                      2.048 (300)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Phase 2a (secondary)' (sub, total)                3.299 (300)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Phase 2b (expansion)' (sub, total)                8.054 (300)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Phase 3 (cleanup)' (sub, total)                   2.818 (300)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build                                                   282.2 (300)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build (level=0)                                         25.15 (100)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build (level=1)                                         65.63 (100)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build (level=2)                                         191.5 (100)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build (total)                                           282.2 (300)                
edge filtering
./output3d_ef_n60_np1.txt
 Problem size: 216000 (60x60x60)
Driver: 2 - MueLu Setup                                                                                       39.68 (1)                  
MueLu: AggregationPhase1Algorithm_kokkos: BuildAggregatesRandom (total)                                       0.05979 (200)              
MueLu: AggregationPhase2aAlgorithm_kokkos: BuildAggregatesRandom (total)                                      0.4568 (200)               
MueLu: AggregationPhase2bAlgorithm_kokkos: BuildAggregatesRandom (total)                                      0.8597 (200)               
MueLu: AggregationPhase3Algorithm_kokkos: BuildAggregatesRandom (total)                                       0.6178 (200)               
MueLu: Laplace3D: TentativePFactory_kokkos: AverageCoords (sub, total)                                        0.007941 (100)             
MueLu: Laplace3D: TentativePFactory_kokkos: Build                                                             0.48 (200)                 
MueLu: Laplace3D: TentativePFactory_kokkos: Build (level=1)                                                   0.3408 (100)               
MueLu: Laplace3D: TentativePFactory_kokkos: Build (level=2)                                                   0.141 (100)                
MueLu: Laplace3D: TentativePFactory_kokkos: Build (total)                                                     31.51 (200)                
MueLu: Laplace3D: TentativePFactory_kokkos: Calc AggSizes (sub, total)                                        0.01553 (200)              
MueLu: Laplace3D: TentativePFactory_kokkos: Check good map (sub, total)                                       0.007724 (200)             
MueLu: Laplace3D: TentativePFactory_kokkos: Create Agg2RowMap (sub, total)                                    0.02038 (200)              
MueLu: Laplace3D: TentativePFactory_kokkos: Get Aggregates graph (sub, total)                                 0.02794 (200)              
MueLu: Laplace3D: TentativePFactory_kokkos: Stage 0 (InitViews) (sub, total)                                  0.008367 (200)             
MueLu: Laplace3D: TentativePFactory_kokkos: Stage 1 (LocalQR) (sub, total)                                    0.02923 (200)              
MueLu: Laplace3D: TentativePFactory_kokkos: Stage 3 (LocalMatrix+FillComplete) (sub, total)                   0.01701 (200)              
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Graph Coloring' (sub, total)                      28.86 (200)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Graph Coloring': KokkosGraph Call (sub, total)    28.85 (200)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Phase - (Dirichlet)' (sub, total)                 0.03548 (200)              
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Phase 1 (main)' (sub, total)                      0.06771 (200)              
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Phase 2a (secondary)' (sub, total)                0.4636 (200)               
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Phase 2b (expansion)' (sub, total)                0.8665 (200)               
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Phase 3 (cleanup)' (sub, total)                   0.6247 (200)               
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build                                                   31.02 (200)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build (level=0)                                         1.665 (100)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build (level=1)                                         29.35 (100)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build (total)                                           31.02 (200)                
./output3d_ef_n100_np1.txt
 Problem size: 1000000 (100x100x100)
Driver: 2 - MueLu Setup                                                                                       44.3 (1)                   
MueLu: AggregationPhase1Algorithm_kokkos: BuildAggregatesRandom (total)                                       0.1868 (300)               
MueLu: AggregationPhase2aAlgorithm_kokkos: BuildAggregatesRandom (total)                                      1.785 (300)                
MueLu: AggregationPhase2bAlgorithm_kokkos: BuildAggregatesRandom (total)                                      3.686 (300)                
MueLu: AggregationPhase3Algorithm_kokkos: BuildAggregatesRandom (total)                                       1.947 (300)                
MueLu: Laplace3D: TentativePFactory_kokkos: AverageCoords (sub, total)                                        0.01807 (100)              
MueLu: Laplace3D: TentativePFactory_kokkos: Build                                                             0.9839 (300)               
MueLu: Laplace3D: TentativePFactory_kokkos: Build (level=1)                                                   0.6427 (100)               
MueLu: Laplace3D: TentativePFactory_kokkos: Build (level=2)                                                   0.2035 (100)               
MueLu: Laplace3D: TentativePFactory_kokkos: Build (level=3)                                                   0.1409 (100)               
MueLu: Laplace3D: TentativePFactory_kokkos: Build (total)                                                     27.54 (300)                
MueLu: Laplace3D: TentativePFactory_kokkos: Calc AggSizes (sub, total)                                        0.02441 (300)              
MueLu: Laplace3D: TentativePFactory_kokkos: Check good map (sub, total)                                       0.0127 (300)               
MueLu: Laplace3D: TentativePFactory_kokkos: Create Agg2RowMap (sub, total)                                    0.03538 (300)              
MueLu: Laplace3D: TentativePFactory_kokkos: Get Aggregates graph (sub, total)                                 0.05542 (300)              
MueLu: Laplace3D: TentativePFactory_kokkos: Stage 0 (InitViews) (sub, total)                                  0.01427 (300)              
MueLu: Laplace3D: TentativePFactory_kokkos: Stage 1 (LocalQR) (sub, total)                                    0.07471 (300)              
MueLu: Laplace3D: TentativePFactory_kokkos: Stage 3 (LocalMatrix+FillComplete) (sub, total)                   0.02696 (300)              
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Graph Coloring' (sub, total)                      18.57 (300)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Graph Coloring': KokkosGraph Call (sub, total)    18.56 (300)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Phase - (Dirichlet)' (sub, total)                 0.0557 (300)               
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Phase 1 (main)' (sub, total)                      0.1986 (300)               
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Phase 2a (secondary)' (sub, total)                1.796 (300)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Phase 2b (expansion)' (sub, total)                3.697 (300)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Phase 3 (cleanup)' (sub, total)                   1.959 (300)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build                                                   26.54 (300)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build (level=0)                                         4.405 (100)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build (level=1)                                         10.78 (100)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build (level=2)                                         11.35 (100)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build (total)                                           26.54 (300)                
./output3d_ef_n200_np1.txt
 Problem size: 8000000 (200x200x200)
Driver: 2 - MueLu Setup                                                                                       392.5 (1)                  
MueLu: AggregationPhase1Algorithm_kokkos: BuildAggregatesRandom (total)                                       2.03 (300)                 
MueLu: AggregationPhase2aAlgorithm_kokkos: BuildAggregatesRandom (total)                                      3.292 (300)                
MueLu: AggregationPhase2bAlgorithm_kokkos: BuildAggregatesRandom (total)                                      8.051 (300)                
MueLu: AggregationPhase3Algorithm_kokkos: BuildAggregatesRandom (total)                                       2.814 (300)                
MueLu: Laplace3D: TentativePFactory_kokkos: AverageCoords (sub, total)                                        0.1682 (100)               
MueLu: Laplace3D: TentativePFactory_kokkos: Build                                                             3.232 (300)                
MueLu: Laplace3D: TentativePFactory_kokkos: Build (level=1)                                                   2.628 (100)                
MueLu: Laplace3D: TentativePFactory_kokkos: Build (level=2)                                                   0.4602 (100)               
MueLu: Laplace3D: TentativePFactory_kokkos: Build (level=3)                                                   0.1469 (100)               
MueLu: Laplace3D: TentativePFactory_kokkos: Build (total)                                                     324.7 (300)                
MueLu: Laplace3D: TentativePFactory_kokkos: Calc AggSizes (sub, total)                                        0.05938 (300)              
MueLu: Laplace3D: TentativePFactory_kokkos: Check good map (sub, total)                                       0.02102 (300)              
MueLu: Laplace3D: TentativePFactory_kokkos: Create Agg2RowMap (sub, total)                                    0.1173 (300)               
MueLu: Laplace3D: TentativePFactory_kokkos: Get Aggregates graph (sub, total)                                 0.09602 (300)              
MueLu: Laplace3D: TentativePFactory_kokkos: Stage 0 (InitViews) (sub, total)                                  0.03175 (300)              
MueLu: Laplace3D: TentativePFactory_kokkos: Stage 1 (LocalQR) (sub, total)                                    0.3748 (300)               
MueLu: Laplace3D: TentativePFactory_kokkos: Stage 3 (LocalMatrix+FillComplete) (sub, total)                   0.0396 (300)               
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Graph Coloring' (sub, total)                      304.4 (300)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Graph Coloring': KokkosGraph Call (sub, total)    304.4 (300)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Phase - (Dirichlet)' (sub, total)                 0.07127 (300)              
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Phase 1 (main)' (sub, total)                      2.043 (300)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Phase 2a (secondary)' (sub, total)                3.303 (300)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Phase 2b (expansion)' (sub, total)                8.062 (300)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Phase 3 (cleanup)' (sub, total)                   2.826 (300)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build                                                   321.4 (300)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build (level=0)                                         33.22 (100)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build (level=1)                                         76.08 (100)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build (level=2)                                         212.1 (100)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build (total)                                           321.4 (300)                
net based bit set
./output3d_nbbs_n60_np1.txt
 Problem size: 216000 (60x60x60)
Driver: 2 - MueLu Setup                                                                                       14.41 (1)                  
MueLu: AggregationPhase1Algorithm_kokkos: BuildAggregatesRandom (total)                                       0.05834 (200)              
MueLu: AggregationPhase2aAlgorithm_kokkos: BuildAggregatesRandom (total)                                      0.4614 (200)               
MueLu: AggregationPhase2bAlgorithm_kokkos: BuildAggregatesRandom (total)                                      0.8754 (200)               
MueLu: AggregationPhase3Algorithm_kokkos: BuildAggregatesRandom (total)                                       0.6111 (200)               
MueLu: Laplace3D: TentativePFactory_kokkos: AverageCoords (sub, total)                                        0.007936 (100)             
MueLu: Laplace3D: TentativePFactory_kokkos: Build                                                             0.4837 (200)               
MueLu: Laplace3D: TentativePFactory_kokkos: Build (level=1)                                                   0.348 (100)                
MueLu: Laplace3D: TentativePFactory_kokkos: Build (level=2)                                                   0.1375 (100)               
MueLu: Laplace3D: TentativePFactory_kokkos: Build (total)                                                     5.457 (200)                
MueLu: Laplace3D: TentativePFactory_kokkos: Calc AggSizes (sub, total)                                        0.01531 (200)              
MueLu: Laplace3D: TentativePFactory_kokkos: Check good map (sub, total)                                       0.007526 (200)             
MueLu: Laplace3D: TentativePFactory_kokkos: Create Agg2RowMap (sub, total)                                    0.02015 (200)              
MueLu: Laplace3D: TentativePFactory_kokkos: Get Aggregates graph (sub, total)                                 0.0275 (200)               
MueLu: Laplace3D: TentativePFactory_kokkos: Stage 0 (InitViews) (sub, total)                                  0.008323 (200)             
MueLu: Laplace3D: TentativePFactory_kokkos: Stage 1 (LocalQR) (sub, total)                                    0.03032 (200)              
MueLu: Laplace3D: TentativePFactory_kokkos: Stage 3 (LocalMatrix+FillComplete) (sub, total)                   0.01641 (200)              
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Graph Coloring' (sub, total)                      2.801 (200)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Graph Coloring': KokkosGraph Call (sub, total)    2.791 (200)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Phase - (Dirichlet)' (sub, total)                 0.0332 (200)               
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Phase 1 (main)' (sub, total)                      0.06565 (200)              
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Phase 2a (secondary)' (sub, total)                0.4682 (200)               
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Phase 2b (expansion)' (sub, total)                0.8821 (200)               
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Phase 3 (cleanup)' (sub, total)                   0.6181 (200)               
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build                                                   4.962 (200)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build (level=0)                                         2.024 (100)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build (level=1)                                         2.937 (100)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build (total)                                           4.963 (200)                
./output3d_nbbs_n100_np1.txt
 Problem size: 1000000 (100x100x100)
Driver: 2 - MueLu Setup                                                                                       44.88 (1)                  
MueLu: AggregationPhase1Algorithm_kokkos: BuildAggregatesRandom (total)                                       0.1816 (300)               
MueLu: AggregationPhase2aAlgorithm_kokkos: BuildAggregatesRandom (total)                                      1.827 (300)                
MueLu: AggregationPhase2bAlgorithm_kokkos: BuildAggregatesRandom (total)                                      3.747 (300)                
MueLu: AggregationPhase3Algorithm_kokkos: BuildAggregatesRandom (total)                                       2.026 (300)                
MueLu: Laplace3D: TentativePFactory_kokkos: AverageCoords (sub, total)                                        0.01851 (100)              
MueLu: Laplace3D: TentativePFactory_kokkos: Build                                                             1.058 (300)                
MueLu: Laplace3D: TentativePFactory_kokkos: Build (level=1)                                                   0.6822 (100)               
MueLu: Laplace3D: TentativePFactory_kokkos: Build (level=2)                                                   0.2311 (100)               
MueLu: Laplace3D: TentativePFactory_kokkos: Build (level=3)                                                   0.1476 (100)               
MueLu: Laplace3D: TentativePFactory_kokkos: Build (total)                                                     27.79 (300)                
MueLu: Laplace3D: TentativePFactory_kokkos: Calc AggSizes (sub, total)                                        0.02646 (300)              
MueLu: Laplace3D: TentativePFactory_kokkos: Check good map (sub, total)                                       0.01357 (300)              
MueLu: Laplace3D: TentativePFactory_kokkos: Create Agg2RowMap (sub, total)                                    0.03693 (300)              
MueLu: Laplace3D: TentativePFactory_kokkos: Get Aggregates graph (sub, total)                                 0.05748 (300)              
MueLu: Laplace3D: TentativePFactory_kokkos: Stage 0 (InitViews) (sub, total)                                  0.01555 (300)              
MueLu: Laplace3D: TentativePFactory_kokkos: Stage 1 (LocalQR) (sub, total)                                    0.08616 (300)              
MueLu: Laplace3D: TentativePFactory_kokkos: Stage 3 (LocalMatrix+FillComplete) (sub, total)                   0.02802 (300)              
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Graph Coloring' (sub, total)                      18.55 (300)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Graph Coloring': KokkosGraph Call (sub, total)    18.53 (300)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Phase - (Dirichlet)' (sub, total)                 0.05481 (300)              
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Phase 1 (main)' (sub, total)                      0.194 (300)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Phase 2a (secondary)' (sub, total)                1.839 (300)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Phase 2b (expansion)' (sub, total)                3.759 (300)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Phase 3 (cleanup)' (sub, total)                   2.039 (300)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build                                                   26.71 (300)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build (level=0)                                         8.75 (100)                 
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build (level=1)                                         9.945 (100)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build (level=2)                                         8.014 (100)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build (total)                                           26.71 (300)                
./output3d_nbbs_n200_np1.txt
 Problem size: 8000000 (200x200x200)
Driver: 2 - MueLu Setup                                                                                       248.3 (1)                  
MueLu: AggregationPhase1Algorithm_kokkos: BuildAggregatesRandom (total)                                       2.019 (300)                
MueLu: AggregationPhase2aAlgorithm_kokkos: BuildAggregatesRandom (total)                                      3.343 (300)                
MueLu: AggregationPhase2bAlgorithm_kokkos: BuildAggregatesRandom (total)                                      8.3 (300)                  
MueLu: AggregationPhase3Algorithm_kokkos: BuildAggregatesRandom (total)                                       2.743 (300)                
MueLu: Laplace3D: TentativePFactory_kokkos: AverageCoords (sub, total)                                        0.1667 (100)               
MueLu: Laplace3D: TentativePFactory_kokkos: Build                                                             3.23 (300)                 
MueLu: Laplace3D: TentativePFactory_kokkos: Build (level=1)                                                   2.636 (100)                
MueLu: Laplace3D: TentativePFactory_kokkos: Build (level=2)                                                   0.4505 (100)               
MueLu: Laplace3D: TentativePFactory_kokkos: Build (level=3)                                                   0.1474 (100)               
MueLu: Laplace3D: TentativePFactory_kokkos: Build (total)                                                     178.6 (300)                
MueLu: Laplace3D: TentativePFactory_kokkos: Calc AggSizes (sub, total)                                        0.05868 (300)              
MueLu: Laplace3D: TentativePFactory_kokkos: Check good map (sub, total)                                       0.0207 (300)               
MueLu: Laplace3D: TentativePFactory_kokkos: Create Agg2RowMap (sub, total)                                    0.1165 (300)               
MueLu: Laplace3D: TentativePFactory_kokkos: Get Aggregates graph (sub, total)                                 0.09472 (300)              
MueLu: Laplace3D: TentativePFactory_kokkos: Stage 0 (InitViews) (sub, total)                                  0.03157 (300)              
MueLu: Laplace3D: TentativePFactory_kokkos: Stage 1 (LocalQR) (sub, total)                                    0.3833 (300)               
MueLu: Laplace3D: TentativePFactory_kokkos: Stage 3 (LocalMatrix+FillComplete) (sub, total)                   0.03787 (300)              
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Graph Coloring' (sub, total)                      158.2 (300)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Graph Coloring': KokkosGraph Call (sub, total)    158.2 (300)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Phase - (Dirichlet)' (sub, total)                 0.0628 (300)               
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Phase 1 (main)' (sub, total)                      2.03 (300)                 
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Phase 2a (secondary)' (sub, total)                3.353 (300)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Phase 2b (expansion)' (sub, total)                8.312 (300)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Phase 3 (cleanup)' (sub, total)                   2.755 (300)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build                                                   175.4 (300)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build (level=0)                                         73.57 (100)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build (level=1)                                         90.27 (100)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build (level=2)                                         11.55 (100)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build (total)                                           175.4 (300)                
deterministic (serial coloring)
./output3d_deterministic_n60_np1.txt
 Problem size: 216000 (60x60x60)
Driver: 2 - MueLu Setup                                                                                       14.08 (1)                  
MueLu: AggregationPhase1Algorithm_kokkos: BuildAggregatesDeterministic (total)                                0.1388 (200)               
MueLu: AggregationPhase2aAlgorithm_kokkos: BuildAggregatesDeterministic (total)                               1.151 (200)                
MueLu: AggregationPhase2bAlgorithm_kokkos: BuildAggregatesDeterministic (total)                               1.085 (200)                
MueLu: AggregationPhase3Algorithm_kokkos: BuildAggregatesDeterministic (total)                                0.5199 (200)               
MueLu: Laplace3D: TentativePFactory_kokkos: AverageCoords (sub, total)                                        0.007883 (100)             
MueLu: Laplace3D: TentativePFactory_kokkos: Build                                                             0.5258 (200)               
MueLu: Laplace3D: TentativePFactory_kokkos: Build (level=1)                                                   0.3637 (100)               
MueLu: Laplace3D: TentativePFactory_kokkos: Build (level=2)                                                   0.164 (100)                
MueLu: Laplace3D: TentativePFactory_kokkos: Build (total)                                                     4.562 (200)                
MueLu: Laplace3D: TentativePFactory_kokkos: Calc AggSizes (sub, total)                                        0.01609 (200)              
MueLu: Laplace3D: TentativePFactory_kokkos: Check good map (sub, total)                                       0.00793 (200)              
MueLu: Laplace3D: TentativePFactory_kokkos: Create Agg2RowMap (sub, total)                                    0.02077 (200)              
MueLu: Laplace3D: TentativePFactory_kokkos: Get Aggregates graph (sub, total)                                 0.02843 (200)              
MueLu: Laplace3D: TentativePFactory_kokkos: Stage 0 (InitViews) (sub, total)                                  0.008696 (200)             
MueLu: Laplace3D: TentativePFactory_kokkos: Stage 1 (LocalQR) (sub, total)                                    0.03216 (200)              
MueLu: Laplace3D: TentativePFactory_kokkos: Stage 2 (CompressCols) (sub, total)                               0.01155 (100)              
MueLu: Laplace3D: TentativePFactory_kokkos: Stage 2 (CompressRows) (sub, total)                               0.004483 (100)             
MueLu: Laplace3D: TentativePFactory_kokkos: Stage 3 (LocalMatrix+FillComplete) (sub, total)                   0.01765 (200)              
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Graph Coloring' (sub, total)                      0.9656 (200)               
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Graph Coloring': KokkosGraph Call (sub, total)    0.955 (200)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Phase - (Dirichlet)' (sub, total)                 0.03583 (200)              
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Phase 1 (main)' (sub, total)                      0.1467 (200)               
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Phase 2a (secondary)' (sub, total)                1.159 (200)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Phase 2b (expansion)' (sub, total)                1.093 (200)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Phase 3 (cleanup)' (sub, total)                   0.5281 (200)               
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build                                                   4.024 (200)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build (level=0)                                         1.18 (100)                 
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build (level=1)                                         2.843 (100)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build (total)                                           4.025 (200)                
./output3d_deterministic_n100_np1.txt
 Problem size: 1000000 (100x100x100)
Driver: 2 - MueLu Setup                                                                                       37.05 (1)                  
MueLu: AggregationPhase1Algorithm_kokkos: BuildAggregatesDeterministic (total)                                0.2853 (300)               
MueLu: AggregationPhase2aAlgorithm_kokkos: BuildAggregatesDeterministic (total)                               3.985 (300)                
MueLu: AggregationPhase2bAlgorithm_kokkos: BuildAggregatesDeterministic (total)                               4.853 (300)                
MueLu: AggregationPhase3Algorithm_kokkos: BuildAggregatesDeterministic (total)                                1.941 (300)                
MueLu: Laplace3D: TentativePFactory_kokkos: AverageCoords (sub, total)                                        0.01529 (100)              
MueLu: Laplace3D: TentativePFactory_kokkos: Build                                                             1.026 (300)                
MueLu: Laplace3D: TentativePFactory_kokkos: Build (level=1)                                                   0.6583 (100)               
MueLu: Laplace3D: TentativePFactory_kokkos: Build (level=2)                                                   0.214 (100)                
MueLu: Laplace3D: TentativePFactory_kokkos: Build (level=3)                                                   0.1568 (100)               
MueLu: Laplace3D: TentativePFactory_kokkos: Build (total)                                                     16.77 (300)                
MueLu: Laplace3D: TentativePFactory_kokkos: Calc AggSizes (sub, total)                                        0.02399 (300)              
MueLu: Laplace3D: TentativePFactory_kokkos: Check good map (sub, total)                                       0.0125 (300)               
MueLu: Laplace3D: TentativePFactory_kokkos: Create Agg2RowMap (sub, total)                                    0.03364 (300)              
MueLu: Laplace3D: TentativePFactory_kokkos: Get Aggregates graph (sub, total)                                 0.05484 (300)              
MueLu: Laplace3D: TentativePFactory_kokkos: Stage 0 (InitViews) (sub, total)                                  0.01423 (300)              
MueLu: Laplace3D: TentativePFactory_kokkos: Stage 1 (LocalQR) (sub, total)                                    0.07461 (300)              
MueLu: Laplace3D: TentativePFactory_kokkos: Stage 2 (CompressCols) (sub, total)                               0.04294 (200)              
MueLu: Laplace3D: TentativePFactory_kokkos: Stage 2 (CompressRows) (sub, total)                               0.009422 (200)             
MueLu: Laplace3D: TentativePFactory_kokkos: Stage 3 (LocalMatrix+FillComplete) (sub, total)                   0.02664 (300)              
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Graph Coloring' (sub, total)                      4.302 (300)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Graph Coloring': KokkosGraph Call (sub, total)    4.285 (300)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Phase - (Dirichlet)' (sub, total)                 0.05606 (300)              
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Phase 1 (main)' (sub, total)                      0.2968 (300)               
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Phase 2a (secondary)' (sub, total)                3.996 (300)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Phase 2b (expansion)' (sub, total)                4.866 (300)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Phase 3 (cleanup)' (sub, total)                   1.954 (300)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build                                                   15.73 (300)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build (level=0)                                         3.62 (100)                 
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build (level=1)                                         4.605 (100)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build (level=2)                                         7.499 (100)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build (total)                                           15.73 (300)                
./output3d_deterministic_n200_np1.txt
 Problem size: 8000000 (200x200x200)
Driver: 2 - MueLu Setup                                                                                       121.6 (1)                  
MueLu: AggregationPhase1Algorithm_kokkos: BuildAggregatesDeterministic (total)                                0.7531 (300)               
MueLu: AggregationPhase2aAlgorithm_kokkos: BuildAggregatesDeterministic (total)                               4.87 (300)                 
MueLu: AggregationPhase2bAlgorithm_kokkos: BuildAggregatesDeterministic (total)                               6.405 (300)                
MueLu: AggregationPhase3Algorithm_kokkos: BuildAggregatesDeterministic (total)                                2.102 (300)                
MueLu: Laplace3D: TentativePFactory_kokkos: AverageCoords (sub, total)                                        0.07388 (100)              
MueLu: Laplace3D: TentativePFactory_kokkos: Build                                                             3.236 (300)                
MueLu: Laplace3D: TentativePFactory_kokkos: Build (level=1)                                                   2.501 (100)                
MueLu: Laplace3D: TentativePFactory_kokkos: Build (level=2)                                                   0.5607 (100)               
MueLu: Laplace3D: TentativePFactory_kokkos: Build (level=3)                                                   0.1774 (100)               
MueLu: Laplace3D: TentativePFactory_kokkos: Build (total)                                                     53.7 (300)                 
MueLu: Laplace3D: TentativePFactory_kokkos: Calc AggSizes (sub, total)                                        0.05965 (300)              
MueLu: Laplace3D: TentativePFactory_kokkos: Check good map (sub, total)                                       0.02134 (300)              
MueLu: Laplace3D: TentativePFactory_kokkos: Create Agg2RowMap (sub, total)                                    0.1065 (300)               
MueLu: Laplace3D: TentativePFactory_kokkos: Get Aggregates graph (sub, total)                                 0.09454 (300)              
MueLu: Laplace3D: TentativePFactory_kokkos: Stage 0 (InitViews) (sub, total)                                  0.03223 (300)              
MueLu: Laplace3D: TentativePFactory_kokkos: Stage 1 (LocalQR) (sub, total)                                    0.294 (300)                
MueLu: Laplace3D: TentativePFactory_kokkos: Stage 2 (CompressCols) (sub, total)                               0.09813 (200)              
MueLu: Laplace3D: TentativePFactory_kokkos: Stage 2 (CompressRows) (sub, total)                               0.01564 (200)              
MueLu: Laplace3D: TentativePFactory_kokkos: Stage 3 (LocalMatrix+FillComplete) (sub, total)                   0.04011 (300)              
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Graph Coloring' (sub, total)                      35.53 (300)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Graph Coloring': KokkosGraph Call (sub, total)    35.5 (300)                 
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Phase - (Dirichlet)' (sub, total)                 0.07887 (300)              
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Phase 1 (main)' (sub, total)                      0.7659 (300)               
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Phase 2a (secondary)' (sub, total)                4.881 (300)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Phase 2b (expansion)' (sub, total)                6.418 (300)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Algo 'Phase 3 (cleanup)' (sub, total)                   2.114 (300)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build                                                   50.44 (300)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build (level=0)                                         24.51 (100)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build (level=1)                                         17.76 (100)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build (level=2)                                         8.175 (100)                
MueLu: Laplace3D: UncoupledAggregationFactory_kokkos: Build (total)                                           50.44 (300)                

Summary

serial coloring (which is the route that deterministic aggregation must take) is faster than all of the algorithms by a wide margin except for the two mis2 algorithms, which beat serial coloring by a lot sadly.
Mis2 coarsening is faster than serial by 30%-50%
Mis2 aggregation is faster than serial by 30%-35%

Mis2 coarsening being 50% faster on extremely large (8M DOFs/GPU) problems is pretty tragic but maybe not too surprising, but for EMPIRE-scale problems, the numbers are closer to 30%. 30% still seems awfully large to merit changing the default aggregation to deterministic, but I want to know what everybody else thinks.

@brian-kelley
Copy link
Contributor

serial coloring (which is the route that deterministic aggregation must take)

Just to be clear, the mis2 methods are also deterministic. The only downside really is you can't set the min/max agg sizes with it, which might make it less appealing as the default.

@jhux2
Copy link
Member

jhux2 commented Oct 16, 2023

@GrahamBenHarper Thanks for doing these experiments. Here's a table summarizing the timer "Driver: 2 - MueLu Setup":

mis2 coarsening                          9.64       19.44        64.75
mis2 aggregation                        10.33       24.42        86.15
vertex based                            24.26       49.86       346.2
vertex based bit set                    28.98       40.25       352.7
edge filtering                          39.68       44.3        392.5
net based bit set                       14.41       44.88       248.3
deterministic (serial coloring)         14.08       37.05       121.6

@GrahamBenHarper
Copy link
Contributor Author

@brian-kelley that's right. I've realized is that "aggregation: deterministic" is a slight misnomer because it forces a serial coloring as well as deterministic aggregates. Really, it seems like the deterministic nature of coloring and aggregation are slightly different. That is, one can have deterministic coloring but still have nondeterministic aggregation. Maybe we should stop forcing the serial coloring when "aggregation: deterministic" is true?
Mis2 is great because it's deterministic and fast; however, I think the only reason mis2 is not default is because it has slightly less control and would make MueLu act less like ML.

@jhux2 also, it might be important to note that setup data uses --rebuild=99 in the driver.

@jhux2
Copy link
Member

jhux2 commented Oct 17, 2023

also, it might be important to note that setup data uses --rebuild=99 in the driver.

Sure, but I assume the main differentiator is the aggregation.

@trilinos-autotester
Copy link
Contributor

Status Flag 'Pre-Test Inspection' - Auto Inspected - Inspection is Not Necessary for this Pull Request.

@trilinos-autotester
Copy link
Contributor

Status Flag 'Pull Request AutoTester' - Testing Jenkins Projects:

Pull Request Auto Testing STARTING (click to expand)

Build Information

Test Name: PR_gcc-openmpi-openmp

  • Build Num: 782
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
FORCE_CLEAN true
GENCONFIG_BUILD_NAME rhel8_sems-gnu-8.5.0-openmpi-4.1.6-openmp_release-debug_static_no-kokkos-arch_no-asan_no-complex_no-fpic_mpi_no-pt_no-rdc_no-uvm_deprecated-on_no-package-enables
PR_LABELS pkg: MueLu
PULLREQUESTNUM 12351
PULLREQUEST_CDASH_TRACK Pull Request
TEST_REPO_ALIAS TRILINOS
TRILINOS_NODE_LABEL rhel8
TRILINOS_SOURCE_REPO https://github.com/GrahamBenHarper/Trilinos
TRILINOS_SOURCE_SHA 9fc7f46
TRILINOS_SRN_CONFIG true
TRILINOS_TARGET_BRANCH develop
TRILINOS_TARGET_REPO https://github.com/trilinos/Trilinos
TRILINOS_TARGET_SHA 9c30d79

Build Information

Test Name: PR_gcc

  • Build Num: 832
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
FORCE_CLEAN true
GENCONFIG_BUILD_NAME rhel8_sems-gnu-8.5.0-serial_release-debug_shared_no-kokkos-arch_no-asan_no-complex_no-fpic_no-mpi_no-pt_no-rdc_no-uvm_deprecated-on_no-package-enables
PR_LABELS pkg: MueLu
PULLREQUESTNUM 12351
PULLREQUEST_CDASH_TRACK Pull Request
TEST_REPO_ALIAS TRILINOS
TRILINOS_NODE_LABEL rhel8
TRILINOS_SOURCE_REPO https://github.com/GrahamBenHarper/Trilinos
TRILINOS_SOURCE_SHA 9fc7f46
TRILINOS_SRN_CONFIG true
TRILINOS_TARGET_BRANCH develop
TRILINOS_TARGET_REPO https://github.com/trilinos/Trilinos
TRILINOS_TARGET_SHA 9c30d79

Build Information

Test Name: PR_gcc-openmpi_debug

  • Build Num: 833
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
FORCE_CLEAN true
GENCONFIG_BUILD_NAME rhel8_sems-gnu-8.5.0-openmpi-4.1.6-serial_debug_shared_no-kokkos-arch_no-asan_no-complex_no-fpic_mpi_no-pt_no-rdc_no-uvm_deprecated-on_no-package-enables
PR_LABELS pkg: MueLu
PULLREQUESTNUM 12351
PULLREQUEST_CDASH_TRACK Pull Request
TEST_REPO_ALIAS TRILINOS
TRILINOS_NODE_LABEL rhel8
TRILINOS_SOURCE_REPO https://github.com/GrahamBenHarper/Trilinos
TRILINOS_SOURCE_SHA 9fc7f46
TRILINOS_SRN_CONFIG true
TRILINOS_TARGET_BRANCH develop
TRILINOS_TARGET_REPO https://github.com/trilinos/Trilinos
TRILINOS_TARGET_SHA 9c30d79

Build Information

Test Name: PR_clang

  • Build Num: 831
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
FORCE_CLEAN true
GENCONFIG_BUILD_NAME rhel8_sems-clang-11.0.1-openmpi-4.0.5-serial_release-debug_shared_no-kokkos-arch_no-asan_no-complex_no-fpic_mpi_no-pt_no-rdc_no-uvm_deprecated-on_no-package-enables
PR_LABELS pkg: MueLu
PULLREQUESTNUM 12351
PULLREQUEST_CDASH_TRACK Pull Request
TEST_REPO_ALIAS TRILINOS
TRILINOS_NODE_LABEL rhel8
TRILINOS_SOURCE_REPO https://github.com/GrahamBenHarper/Trilinos
TRILINOS_SOURCE_SHA 9fc7f46
TRILINOS_SRN_CONFIG true
TRILINOS_TARGET_BRANCH develop
TRILINOS_TARGET_REPO https://github.com/trilinos/Trilinos
TRILINOS_TARGET_SHA 9c30d79

Build Information

Test Name: Trilinos_PR_python3

  • Build Num: 4790
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
BLOCKING_BUILD false
FORCE_CLEAN true
GENCONFIG_BUILD_NAME rhel8_aue-gnu-12.1.0-anaconda3-serial_debug_shared_no-kokkos-arch_no-asan_no-complex_no-fpic_no-mpi_no-pt_no-rdc_no-uvm_deprecated-on_pr-framework
PR_LABELS pkg: MueLu
PULLREQUESTNUM 12351
PULLREQUEST_CDASH_TRACK Pull Request
TEST_REPO_ALIAS TRILINOS
TRILINOS_NODE_LABEL rhel8
TRILINOS_SOURCE_REPO https://github.com/GrahamBenHarper/Trilinos
TRILINOS_SOURCE_SHA 9fc7f46
TRILINOS_SRN_CONFIG true
TRILINOS_TARGET_BRANCH develop
TRILINOS_TARGET_REPO https://github.com/trilinos/Trilinos
TRILINOS_TARGET_SHA 9c30d79

Build Information

Test Name: PR_cuda

  • Build Num: 830
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
FORCE_CLEAN true
GENCONFIG_BUILD_NAME rhel8_sems-cuda-11.4.2-gnu-10.1.0-openmpi-4.1.6_release_static_Volta70_no-asan_complex_no-fpic_mpi_pt_no-rdc_no-uvm_deprecated-on_no-package-enables
PR_LABELS pkg: MueLu
PULLREQUESTNUM 12351
PULLREQUEST_CDASH_TRACK Pull Request
TEST_REPO_ALIAS TRILINOS
TRILINOS_NODE_LABEL rhel8-gpu
TRILINOS_SOURCE_REPO https://github.com/GrahamBenHarper/Trilinos
TRILINOS_SOURCE_SHA 9fc7f46
TRILINOS_SRN_CONFIG true
TRILINOS_TARGET_BRANCH develop
TRILINOS_TARGET_REPO https://github.com/trilinos/Trilinos
TRILINOS_TARGET_SHA 9c30d79

Build Information

Test Name: PR_intel

  • Build Num: 751
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
FORCE_CLEAN true
GENCONFIG_BUILD_NAME rhel8_sems-intel-2021.3-sems-openmpi-4.1.6_release-debug_shared_no-kokkos-arch_no-asan_no-complex_fpic_mpi_no-pt_no-rdc_no-uvm_deprecated-on_no-package-enables
PR_LABELS pkg: MueLu
PULLREQUESTNUM 12351
PULLREQUEST_CDASH_TRACK Pull Request
TEST_REPO_ALIAS TRILINOS
TRILINOS_NODE_LABEL rhel8
TRILINOS_SOURCE_REPO https://github.com/GrahamBenHarper/Trilinos
TRILINOS_SOURCE_SHA 9fc7f46
TRILINOS_SRN_CONFIG true
TRILINOS_TARGET_BRANCH develop
TRILINOS_TARGET_REPO https://github.com/trilinos/Trilinos
TRILINOS_TARGET_SHA 9c30d79

Build Information

Test Name: PR_cuda-uvm

  • Build Num: 830
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
FORCE_CLEAN true
GENCONFIG_BUILD_NAME rhel8_sems-cuda-11.4.2-gnu-10.1.0-openmpi-4.1.6_release_static_Volta70_no-asan_complex_no-fpic_mpi_pt_no-rdc_uvm_deprecated-on_no-package-enables
PR_LABELS pkg: MueLu
PULLREQUESTNUM 12351
PULLREQUEST_CDASH_TRACK Pull Request
TEST_REPO_ALIAS TRILINOS
TRILINOS_NODE_LABEL rhel8
TRILINOS_SOURCE_REPO https://github.com/GrahamBenHarper/Trilinos
TRILINOS_SOURCE_SHA 9fc7f46
TRILINOS_SRN_CONFIG true
TRILINOS_TARGET_BRANCH develop
TRILINOS_TARGET_REPO https://github.com/trilinos/Trilinos
TRILINOS_TARGET_SHA 9c30d79

Using Repos:

Repo: TRILINOS (GrahamBenHarper/Trilinos)
  • Branch: muelu-deterministic-agg
  • SHA: 9fc7f46
  • Mode: TEST_REPO

Pull Request Author: GrahamBenHarper

@trilinos-autotester
Copy link
Contributor

Status Flag 'Pull Request AutoTester' - Jenkins Testing: 1 or more Jobs FAILED

Note: Testing will normally be attempted again in approx. 2 Hrs 30 Mins. If a change to the PR source branch occurs, the testing will be attempted again on next available autotester run.

Pull Request Auto Testing has FAILED (click to expand)

Build Information

Test Name: PR_gcc-openmpi-openmp

  • Build Num: 782
  • Status: FAILED

Jenkins Parameters

Parameter Name Value
FORCE_CLEAN true
GENCONFIG_BUILD_NAME rhel8_sems-gnu-8.5.0-openmpi-4.1.6-openmp_release-debug_static_no-kokkos-arch_no-asan_no-complex_no-fpic_mpi_no-pt_no-rdc_no-uvm_deprecated-on_no-package-enables
PR_LABELS pkg: MueLu
PULLREQUESTNUM 12351
PULLREQUEST_CDASH_TRACK Pull Request
TEST_REPO_ALIAS TRILINOS
TRILINOS_NODE_LABEL rhel8
TRILINOS_SOURCE_REPO https://github.com/GrahamBenHarper/Trilinos
TRILINOS_SOURCE_SHA 9fc7f46
TRILINOS_SRN_CONFIG true
TRILINOS_TARGET_BRANCH develop
TRILINOS_TARGET_REPO https://github.com/trilinos/Trilinos
TRILINOS_TARGET_SHA 9c30d79

Build Information

Test Name: PR_gcc

  • Build Num: 832
  • Status: PASSED

Jenkins Parameters

Parameter Name Value
FORCE_CLEAN true
GENCONFIG_BUILD_NAME rhel8_sems-gnu-8.5.0-serial_release-debug_shared_no-kokkos-arch_no-asan_no-complex_no-fpic_no-mpi_no-pt_no-rdc_no-uvm_deprecated-on_no-package-enables
PR_LABELS pkg: MueLu
PULLREQUESTNUM 12351
PULLREQUEST_CDASH_TRACK Pull Request
TEST_REPO_ALIAS TRILINOS
TRILINOS_NODE_LABEL rhel8
TRILINOS_SOURCE_REPO https://github.com/GrahamBenHarper/Trilinos
TRILINOS_SOURCE_SHA 9fc7f46
TRILINOS_SRN_CONFIG true
TRILINOS_TARGET_BRANCH develop
TRILINOS_TARGET_REPO https://github.com/trilinos/Trilinos
TRILINOS_TARGET_SHA 9c30d79

Build Information

Test Name: PR_gcc-openmpi_debug

  • Build Num: 833
  • Status: PASSED

Jenkins Parameters

Parameter Name Value
FORCE_CLEAN true
GENCONFIG_BUILD_NAME rhel8_sems-gnu-8.5.0-openmpi-4.1.6-serial_debug_shared_no-kokkos-arch_no-asan_no-complex_no-fpic_mpi_no-pt_no-rdc_no-uvm_deprecated-on_no-package-enables
PR_LABELS pkg: MueLu
PULLREQUESTNUM 12351
PULLREQUEST_CDASH_TRACK Pull Request
TEST_REPO_ALIAS TRILINOS
TRILINOS_NODE_LABEL rhel8
TRILINOS_SOURCE_REPO https://github.com/GrahamBenHarper/Trilinos
TRILINOS_SOURCE_SHA 9fc7f46
TRILINOS_SRN_CONFIG true
TRILINOS_TARGET_BRANCH develop
TRILINOS_TARGET_REPO https://github.com/trilinos/Trilinos
TRILINOS_TARGET_SHA 9c30d79

Build Information

Test Name: PR_clang

  • Build Num: 831
  • Status: PASSED

Jenkins Parameters

Parameter Name Value
FORCE_CLEAN true
GENCONFIG_BUILD_NAME rhel8_sems-clang-11.0.1-openmpi-4.0.5-serial_release-debug_shared_no-kokkos-arch_no-asan_no-complex_no-fpic_mpi_no-pt_no-rdc_no-uvm_deprecated-on_no-package-enables
PR_LABELS pkg: MueLu
PULLREQUESTNUM 12351
PULLREQUEST_CDASH_TRACK Pull Request
TEST_REPO_ALIAS TRILINOS
TRILINOS_NODE_LABEL rhel8
TRILINOS_SOURCE_REPO https://github.com/GrahamBenHarper/Trilinos
TRILINOS_SOURCE_SHA 9fc7f46
TRILINOS_SRN_CONFIG true
TRILINOS_TARGET_BRANCH develop
TRILINOS_TARGET_REPO https://github.com/trilinos/Trilinos
TRILINOS_TARGET_SHA 9c30d79

Build Information

Test Name: Trilinos_PR_python3

  • Build Num: 4790
  • Status: FAILED

Jenkins Parameters

Parameter Name Value
BLOCKING_BUILD false
FORCE_CLEAN true
GENCONFIG_BUILD_NAME rhel8_aue-gnu-12.1.0-anaconda3-serial_debug_shared_no-kokkos-arch_no-asan_no-complex_no-fpic_no-mpi_no-pt_no-rdc_no-uvm_deprecated-on_pr-framework
PR_LABELS pkg: MueLu
PULLREQUESTNUM 12351
PULLREQUEST_CDASH_TRACK Pull Request
TEST_REPO_ALIAS TRILINOS
TRILINOS_NODE_LABEL rhel8
TRILINOS_SOURCE_REPO https://github.com/GrahamBenHarper/Trilinos
TRILINOS_SOURCE_SHA 9fc7f46
TRILINOS_SRN_CONFIG true
TRILINOS_TARGET_BRANCH develop
TRILINOS_TARGET_REPO https://github.com/trilinos/Trilinos
TRILINOS_TARGET_SHA 9c30d79

Build Information

Test Name: PR_cuda

  • Build Num: 830
  • Status: FAILED

Jenkins Parameters

Parameter Name Value
FORCE_CLEAN true
GENCONFIG_BUILD_NAME rhel8_sems-cuda-11.4.2-gnu-10.1.0-openmpi-4.1.6_release_static_Volta70_no-asan_complex_no-fpic_mpi_pt_no-rdc_no-uvm_deprecated-on_no-package-enables
PR_LABELS pkg: MueLu
PULLREQUESTNUM 12351
PULLREQUEST_CDASH_TRACK Pull Request
TEST_REPO_ALIAS TRILINOS
TRILINOS_NODE_LABEL rhel8-gpu
TRILINOS_SOURCE_REPO https://github.com/GrahamBenHarper/Trilinos
TRILINOS_SOURCE_SHA 9fc7f46
TRILINOS_SRN_CONFIG true
TRILINOS_TARGET_BRANCH develop
TRILINOS_TARGET_REPO https://github.com/trilinos/Trilinos
TRILINOS_TARGET_SHA 9c30d79

Build Information

Test Name: PR_intel

  • Build Num: 751
  • Status: PASSED

Jenkins Parameters

Parameter Name Value
FORCE_CLEAN true
GENCONFIG_BUILD_NAME rhel8_sems-intel-2021.3-sems-openmpi-4.1.6_release-debug_shared_no-kokkos-arch_no-asan_no-complex_fpic_mpi_no-pt_no-rdc_no-uvm_deprecated-on_no-package-enables
PR_LABELS pkg: MueLu
PULLREQUESTNUM 12351
PULLREQUEST_CDASH_TRACK Pull Request
TEST_REPO_ALIAS TRILINOS
TRILINOS_NODE_LABEL rhel8
TRILINOS_SOURCE_REPO https://github.com/GrahamBenHarper/Trilinos
TRILINOS_SOURCE_SHA 9fc7f46
TRILINOS_SRN_CONFIG true
TRILINOS_TARGET_BRANCH develop
TRILINOS_TARGET_REPO https://github.com/trilinos/Trilinos
TRILINOS_TARGET_SHA 9c30d79

Build Information

Test Name: PR_cuda-uvm

  • Build Num: 830
  • Status: PASSED

Jenkins Parameters

Parameter Name Value
FORCE_CLEAN true
GENCONFIG_BUILD_NAME rhel8_sems-cuda-11.4.2-gnu-10.1.0-openmpi-4.1.6_release_static_Volta70_no-asan_complex_no-fpic_mpi_pt_no-rdc_uvm_deprecated-on_no-package-enables
PR_LABELS pkg: MueLu
PULLREQUESTNUM 12351
PULLREQUEST_CDASH_TRACK Pull Request
TEST_REPO_ALIAS TRILINOS
TRILINOS_NODE_LABEL rhel8
TRILINOS_SOURCE_REPO https://github.com/GrahamBenHarper/Trilinos
TRILINOS_SOURCE_SHA 9fc7f46
TRILINOS_SRN_CONFIG true
TRILINOS_TARGET_BRANCH develop
TRILINOS_TARGET_REPO https://github.com/trilinos/Trilinos
TRILINOS_TARGET_SHA 9c30d79


CDash Test Results for PR# 12351.


Wiki: How to Reproduce PR Testing Builds and Errors.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

6 participants