Rocprofiler-Compute version: 3.7.0
Profiler choice: rocprofiler-sdk
Output directory: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/no_roof/MI200
Target: MI210
Command: ./tests/vcopy -n 1048576 -b 256 -i 3
Kernel Selection: None
Dispatch Selection: None
Filtered sections: All

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Collecting Performance Counters
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Generating native tool project using command: cmake -S /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib -B /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build
-- Checking for module 'libdw'
--   Package 'libdw', required by 'virtual:world', not found
-- Could NOT find libdw (missing: libdw_LIBRARY libdw_INCLUDE_DIR)
-- {fmt} version: 12.1.0
-- Build type:
-- Configuring done (0.1s)
-- Generating done (0.0s)
-- Build files have been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build
Building native tool using command: cmake --build /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build --parallel
[ 11%] Built target gsl_assert
[ 33%] Built target fmt
[100%] Built target rocprofiler-compute-tool
Searching /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src by lib/_build/lib/librocprofiler-compute-tool.so for native collector
Using native collector: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build/lib/librocprofiler-compute-tool.so
Using native counter collection tool: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build/lib/librocprofiler-compute-tool.so
[profiling] Iteration multiplexing: Disabled
[Run 1/13][Approximate profiling time left: pending first measurement...]
[profiling] Current input file: tests/workloads/no_roof/MI200/perfmon/pmc_perf_0.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 19:07:27.600455 126817350860608 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.191332 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 19:07:27.601082 126817350860608 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:07:27.795164 126817350860608 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 19:07:27.885508 126817350860608 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.284426 sec
   |-> [rocprofiler-sdk] W20260526 19:07:27.907951 126817350860608 generateRocpd.cpp:583] writing SQL database for process 2525969 on node 2976770398
   |-> [rocprofiler-sdk] E20260526 19:07:27.908715 126817350860608 generateRocpd.cpp:606] Opened result file: tests/workloads/no_roof/MI200/out/pmc_1/smc4124-25-mi210-3c48/2525969_results.db (UUID=0001fa7b-e70d-770d-9132-140445ea5820)
   |-> [rocprofiler-sdk] W20260526 19:07:27.991970 126817350860608 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008074 sec
   |-> [rocprofiler-sdk] W20260526 19:07:27.993156 126817350860608 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001166 sec
   |-> [rocprofiler-sdk] W20260526 19:07:27.994797 126817350860608 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001626 sec
   |-> [rocprofiler-sdk] W20260526 19:07:28.005126 126817350860608 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008251 sec
   |-> [rocprofiler-sdk] W20260526 19:07:28.330939 126817350860608 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.325798 sec
   |-> [rocprofiler-sdk] W20260526 19:07:28.333204 126817350860608 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002245 sec
   |-> [rocprofiler-sdk] W20260526 19:07:28.333222 126817350860608 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] W20260526 19:07:28.342406 126817350860608 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009176 sec
   |-> [rocprofiler-sdk] W20260526 19:07:28.342420 126817350860608 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:07:28.342426 126817350860608 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:07:28.342433 126817350860608 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] W20260526 19:07:28.342539 126817350860608 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000098 sec
   |-> [rocprofiler-sdk] W20260526 19:07:28.342748 126817350860608 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.434798 sec
   |-> [rocprofiler-sdk] W20260526 19:07:28.345685 126817350860608 simple_timer.cpp:55] [rocprofv3] output generation ::     0.458524 sec
   |-> [rocprofiler-sdk] W20260526 19:07:28.345776 126817350860608 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.460229 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/no_roof/MI200/out/pmc_1/2525969_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 2/13][Approximate profiling time left: 25 seconds]...
[profiling] Current input file: tests/workloads/no_roof/MI200/perfmon/pmc_perf_1.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 19:07:29.894717 126589032742720 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.190766 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 19:07:29.895373 126589032742720 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000001 sec
   |-> [rocprofiler-sdk] W20260526 19:07:30.091881 126589032742720 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 19:07:30.173258 126589032742720 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.277886 sec
   |-> [rocprofiler-sdk] W20260526 19:07:30.195926 126589032742720 generateRocpd.cpp:583] writing SQL database for process 2525978 on node 2976770398
   |-> [rocprofiler-sdk] E20260526 19:07:30.196737 126589032742720 generateRocpd.cpp:606] Opened result file: tests/workloads/no_roof/MI200/out/pmc_1/smc4124-25-mi210-3c48/2525978_results.db (UUID=0001fa7b-f004-7004-94f1-a2eecf0160fb)
   |-> [rocprofiler-sdk] W20260526 19:07:30.278235 126589032742720 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007884 sec
   |-> [rocprofiler-sdk] W20260526 19:07:30.279370 126589032742720 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001118 sec
   |-> [rocprofiler-sdk] W20260526 19:07:30.280940 126589032742720 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001555 sec
   |-> [rocprofiler-sdk] W20260526 19:07:30.291016 126589032742720 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008125 sec
   |-> [rocprofiler-sdk] W20260526 19:07:30.604083 126589032742720 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.313042 sec
   |-> [rocprofiler-sdk] W20260526 19:07:30.607168 126589032742720 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.003068 sec
   |-> [rocprofiler-sdk] W20260526 19:07:30.607186 126589032742720 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] W20260526 19:07:30.616821 126589032742720 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009628 sec
   |-> [rocprofiler-sdk] W20260526 19:07:30.616835 126589032742720 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:07:30.616842 126589032742720 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:07:30.616849 126589032742720 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] W20260526 19:07:30.616989 126589032742720 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000102 sec
   |-> [rocprofiler-sdk] W20260526 19:07:30.617211 126589032742720 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.421285 sec
   |-> [rocprofiler-sdk] W20260526 19:07:30.620266 126589032742720 simple_timer.cpp:55] [rocprofv3] output generation ::     0.445257 sec
   |-> [rocprofiler-sdk] W20260526 19:07:30.620346 126589032742720 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.447039 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/no_roof/MI200/out/pmc_1/2525978_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 3/13][Approximate profiling time left: 23 seconds]...
[profiling] Current input file: tests/workloads/no_roof/MI200/perfmon/pmc_perf_2.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 19:07:32.171068 134636768804672 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.189082 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 19:07:32.171673 134636768804672 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:07:32.363418 134636768804672 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 19:07:32.447358 134636768804672 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.275685 sec
   |-> [rocprofiler-sdk] W20260526 19:07:32.469624 134636768804672 generateRocpd.cpp:583] writing SQL database for process 2525986 on node 2976770398
   |-> [rocprofiler-sdk] E20260526 19:07:32.470425 134636768804672 generateRocpd.cpp:606] Opened result file: tests/workloads/no_roof/MI200/out/pmc_1/smc4124-25-mi210-3c48/2525986_results.db (UUID=0001fa7b-f8ea-78ea-a906-e27979345f5d)
   |-> [rocprofiler-sdk] W20260526 19:07:32.551408 134636768804672 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007891 sec
   |-> [rocprofiler-sdk] W20260526 19:07:32.552556 134636768804672 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001132 sec
   |-> [rocprofiler-sdk] W20260526 19:07:32.554151 134636768804672 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001580 sec
   |-> [rocprofiler-sdk] W20260526 19:07:32.564336 134636768804672 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008252 sec
   |-> [rocprofiler-sdk] W20260526 19:07:32.863981 134636768804672 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.299630 sec
   |-> [rocprofiler-sdk] W20260526 19:07:32.866183 134636768804672 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002181 sec
   |-> [rocprofiler-sdk] W20260526 19:07:32.866200 134636768804672 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] W20260526 19:07:32.876470 134636768804672 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.010263 sec
   |-> [rocprofiler-sdk] W20260526 19:07:32.876485 134636768804672 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:07:32.876491 134636768804672 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:07:32.876498 134636768804672 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] W20260526 19:07:32.876659 134636768804672 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000126 sec
   |-> [rocprofiler-sdk] W20260526 19:07:32.876912 134636768804672 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.407288 sec
   |-> [rocprofiler-sdk] W20260526 19:07:32.879933 134636768804672 simple_timer.cpp:55] [rocprofv3] output generation ::     0.430882 sec
   |-> [rocprofiler-sdk] W20260526 19:07:32.880041 134636768804672 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.432633 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/no_roof/MI200/out/pmc_1/2525986_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 4/13][Approximate profiling time left: 20 seconds]...
[profiling] Current input file: tests/workloads/no_roof/MI200/perfmon/pmc_perf_3.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 19:07:34.435233 130960086605632 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.190680 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 19:07:34.435823 130960086605632 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:07:34.628822 130960086605632 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 19:07:34.712567 130960086605632 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.276744 sec
   |-> [rocprofiler-sdk] W20260526 19:07:34.735072 130960086605632 generateRocpd.cpp:583] writing SQL database for process 2525994 on node 2976770398
   |-> [rocprofiler-sdk] E20260526 19:07:34.735876 130960086605632 generateRocpd.cpp:606] Opened result file: tests/workloads/no_roof/MI200/out/pmc_1/smc4124-25-mi210-3c48/2525994_results.db (UUID=0001fa7c-01c1-71c1-9d0a-5266cf4d1e4f)
   |-> [rocprofiler-sdk] W20260526 19:07:34.817973 130960086605632 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007896 sec
   |-> [rocprofiler-sdk] W20260526 19:07:34.819115 130960086605632 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001122 sec
   |-> [rocprofiler-sdk] W20260526 19:07:34.821008 130960086605632 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001877 sec
   |-> [rocprofiler-sdk] W20260526 19:07:34.831184 130960086605632 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008210 sec
   |-> [rocprofiler-sdk] W20260526 19:07:35.118204 130960086605632 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.287006 sec
   |-> [rocprofiler-sdk] W20260526 19:07:35.120404 130960086605632 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002181 sec
   |-> [rocprofiler-sdk] W20260526 19:07:35.120421 130960086605632 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] W20260526 19:07:35.129699 130960086605632 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009271 sec
   |-> [rocprofiler-sdk] W20260526 19:07:35.129714 130960086605632 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:07:35.129720 130960086605632 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:07:35.129727 130960086605632 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] W20260526 19:07:35.129850 130960086605632 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000115 sec
   |-> [rocprofiler-sdk] W20260526 19:07:35.130108 130960086605632 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.395036 sec
   |-> [rocprofiler-sdk] W20260526 19:07:35.133133 130960086605632 simple_timer.cpp:55] [rocprofv3] output generation ::     0.418972 sec
   |-> [rocprofiler-sdk] W20260526 19:07:35.133221 130960086605632 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.420607 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/no_roof/MI200/out/pmc_1/2525994_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 5/13][Approximate profiling time left: 18 seconds]...
[profiling] Current input file: tests/workloads/no_roof/MI200/perfmon/pmc_perf_4.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 19:07:36.685716 134819971866432 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.187932 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 19:07:36.686319 134819971866432 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:07:36.880235 134819971866432 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 19:07:36.965469 134819971866432 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.279150 sec
   |-> [rocprofiler-sdk] W20260526 19:07:36.988082 134819971866432 generateRocpd.cpp:583] writing SQL database for process 2526002 on node 2976770398
   |-> [rocprofiler-sdk] E20260526 19:07:36.988919 134819971866432 generateRocpd.cpp:606] Opened result file: tests/workloads/no_roof/MI200/out/pmc_1/smc4124-25-mi210-3c48/2526002_results.db (UUID=0001fa7c-0a8e-7a8e-9169-9d2ce481af0c)
   |-> [rocprofiler-sdk] W20260526 19:07:37.072716 134819971866432 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007962 sec
   |-> [rocprofiler-sdk] W20260526 19:07:37.073913 134819971866432 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001179 sec
   |-> [rocprofiler-sdk] W20260526 19:07:37.075535 134819971866432 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001606 sec
   |-> [rocprofiler-sdk] W20260526 19:07:37.085789 134819971866432 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008256 sec
   |-> [rocprofiler-sdk] W20260526 19:07:37.367255 134819971866432 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.281449 sec
   |-> [rocprofiler-sdk] W20260526 19:07:37.369499 134819971866432 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002223 sec
   |-> [rocprofiler-sdk] W20260526 19:07:37.369518 134819971866432 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] W20260526 19:07:37.379503 134819971866432 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009974 sec
   |-> [rocprofiler-sdk] W20260526 19:07:37.379520 134819971866432 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:07:37.379533 134819971866432 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:07:37.379543 134819971866432 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] W20260526 19:07:37.379675 134819971866432 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000100 sec
   |-> [rocprofiler-sdk] W20260526 19:07:37.379890 134819971866432 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.391808 sec
   |-> [rocprofiler-sdk] W20260526 19:07:37.382953 134819971866432 simple_timer.cpp:55] [rocprofv3] output generation ::     0.415853 sec
   |-> [rocprofiler-sdk] W20260526 19:07:37.383046 134819971866432 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.417529 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/no_roof/MI200/out/pmc_1/2526002_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 6/13][Approximate profiling time left: 15 seconds]...
[profiling] Current input file: tests/workloads/no_roof/MI200/perfmon/pmc_perf_5.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 19:07:38.903959 126449225187136 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.181653 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 19:07:38.904554 126449225187136 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000001 sec
   |-> [rocprofiler-sdk] W20260526 19:07:39.097655 126449225187136 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 19:07:39.178418 126449225187136 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.273864 sec
   |-> [rocprofiler-sdk] W20260526 19:07:39.200635 126449225187136 generateRocpd.cpp:583] writing SQL database for process 2526010 on node 2976770398
   |-> [rocprofiler-sdk] E20260526 19:07:39.201427 126449225187136 generateRocpd.cpp:606] Opened result file: tests/workloads/no_roof/MI200/out/pmc_1/smc4124-25-mi210-3c48/2526010_results.db (UUID=0001fa7c-133f-733f-b6c6-0bd31b09d0f1)
   |-> [rocprofiler-sdk] W20260526 19:07:39.283023 126449225187136 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007632 sec
   |-> [rocprofiler-sdk] W20260526 19:07:39.284186 126449225187136 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001138 sec
   |-> [rocprofiler-sdk] W20260526 19:07:39.285840 126449225187136 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001640 sec
   |-> [rocprofiler-sdk] W20260526 19:07:39.296405 126449225187136 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008232 sec
   |-> [rocprofiler-sdk] W20260526 19:07:39.304836 126449225187136 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.008416 sec
   |-> [rocprofiler-sdk] W20260526 19:07:39.306832 126449225187136 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.001982 sec
   |-> [rocprofiler-sdk] W20260526 19:07:39.306850 126449225187136 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] W20260526 19:07:39.316070 126449225187136 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009213 sec
   |-> [rocprofiler-sdk] W20260526 19:07:39.316085 126449225187136 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:07:39.316091 126449225187136 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:07:39.316097 126449225187136 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] W20260526 19:07:39.316203 126449225187136 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000096 sec
   |-> [rocprofiler-sdk] W20260526 19:07:39.316400 126449225187136 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.115765 sec
   |-> [rocprofiler-sdk] W20260526 19:07:39.319563 126449225187136 simple_timer.cpp:55] [rocprofv3] output generation ::     0.139433 sec
   |-> [rocprofiler-sdk] W20260526 19:07:39.319606 126449225187136 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.141145 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/no_roof/MI200/out/pmc_1/2526010_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 7/13][Approximate profiling time left: 13 seconds]...
[profiling] Current input file: tests/workloads/no_roof/MI200/perfmon/pmc_perf_SQC_DCACHE_INFLIGHT_LEVEL_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 19:07:40.844537 135823461072704 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.191136 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 19:07:40.845152 135823461072704 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:07:41.038357 135823461072704 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 19:07:41.125396 135823461072704 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.280245 sec
   |-> [rocprofiler-sdk] W20260526 19:07:41.147928 135823461072704 generateRocpd.cpp:583] writing SQL database for process 2526018 on node 2976770398
   |-> [rocprofiler-sdk] E20260526 19:07:41.148708 135823461072704 generateRocpd.cpp:606] Opened result file: tests/workloads/no_roof/MI200/out/pmc_1/smc4124-25-mi210-3c48/2526018_results.db (UUID=0001fa7c-1aca-7aca-99e5-7aa78de1e31f)
   |-> [rocprofiler-sdk] W20260526 19:07:41.230671 135823461072704 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007982 sec
   |-> [rocprofiler-sdk] W20260526 19:07:41.231852 135823461072704 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001164 sec
   |-> [rocprofiler-sdk] W20260526 19:07:41.233750 135823461072704 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001884 sec
   |-> [rocprofiler-sdk] W20260526 19:07:41.243992 135823461072704 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008335 sec
   |-> [rocprofiler-sdk] W20260526 19:07:41.653051 135823461072704 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.409045 sec
   |-> [rocprofiler-sdk] W20260526 19:07:41.655315 135823461072704 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002249 sec
   |-> [rocprofiler-sdk] W20260526 19:07:41.655333 135823461072704 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] W20260526 19:07:41.664962 135823461072704 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009622 sec
   |-> [rocprofiler-sdk] W20260526 19:07:41.664976 135823461072704 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:07:41.664982 135823461072704 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:07:41.664988 135823461072704 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] W20260526 19:07:41.665102 135823461072704 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000106 sec
   |-> [rocprofiler-sdk] W20260526 19:07:41.665316 135823461072704 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.517388 sec
   |-> [rocprofiler-sdk] W20260526 19:07:41.668320 135823461072704 simple_timer.cpp:55] [rocprofv3] output generation ::     0.541195 sec
   |-> [rocprofiler-sdk] W20260526 19:07:41.668422 135823461072704 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.542987 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/no_roof/MI200/out/pmc_1/2526018_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 8/13][Approximate profiling time left: 11 seconds]...
[profiling] Current input file: tests/workloads/no_roof/MI200/perfmon/pmc_perf_SQC_ICACHE_INFLIGHT_LEVEL_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 19:07:43.208760 127127849250624 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.190556 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 19:07:43.209397 127127849250624 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:07:43.402738 127127849250624 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 19:07:43.487017 127127849250624 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.277620 sec
   |-> [rocprofiler-sdk] W20260526 19:07:43.509324 127127849250624 generateRocpd.cpp:583] writing SQL database for process 2526026 on node 2976770398
   |-> [rocprofiler-sdk] E20260526 19:07:43.510132 127127849250624 generateRocpd.cpp:606] Opened result file: tests/workloads/no_roof/MI200/out/pmc_1/smc4124-25-mi210-3c48/2526026_results.db (UUID=0001fa7c-2407-7407-b986-098f9762e66f)
   |-> [rocprofiler-sdk] W20260526 19:07:43.593745 127127849250624 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008067 sec
   |-> [rocprofiler-sdk] W20260526 19:07:43.594948 127127849250624 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001186 sec
   |-> [rocprofiler-sdk] W20260526 19:07:43.596920 127127849250624 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001958 sec
   |-> [rocprofiler-sdk] W20260526 19:07:43.607071 127127849250624 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008167 sec
   |-> [rocprofiler-sdk] W20260526 19:07:44.008178 127127849250624 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.401093 sec
   |-> [rocprofiler-sdk] W20260526 19:07:44.010518 127127849250624 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002322 sec
   |-> [rocprofiler-sdk] W20260526 19:07:44.010536 127127849250624 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] W20260526 19:07:44.019921 127127849250624 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009377 sec
   |-> [rocprofiler-sdk] W20260526 19:07:44.019936 127127849250624 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:07:44.019942 127127849250624 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:07:44.019949 127127849250624 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] W20260526 19:07:44.020098 127127849250624 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000141 sec
   |-> [rocprofiler-sdk] W20260526 19:07:44.020373 127127849250624 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.511049 sec
   |-> [rocprofiler-sdk] W20260526 19:07:44.023410 127127849250624 simple_timer.cpp:55] [rocprofv3] output generation ::     0.534987 sec
   |-> [rocprofiler-sdk] W20260526 19:07:44.023535 127127849250624 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.536462 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/no_roof/MI200/out/pmc_1/2526026_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 9/13][Approximate profiling time left: 9 seconds]...
[profiling] Current input file: tests/workloads/no_roof/MI200/perfmon/pmc_perf_SQ_IFETCH_LEVEL_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 19:07:45.606355 128829763182400 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.199795 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 19:07:45.606980 128829763182400 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:07:45.804500 128829763182400 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 19:07:45.888280 128829763182400 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.281300 sec
   |-> [rocprofiler-sdk] W20260526 19:07:45.911181 128829763182400 generateRocpd.cpp:583] writing SQL database for process 2526034 on node 2976770398
   |-> [rocprofiler-sdk] E20260526 19:07:45.911971 128829763182400 generateRocpd.cpp:606] Opened result file: tests/workloads/no_roof/MI200/out/pmc_1/smc4124-25-mi210-3c48/2526034_results.db (UUID=0001fa7c-2d5b-7d5b-9dd0-3363ae83f81e)
   |-> [rocprofiler-sdk] W20260526 19:07:45.995241 128829763182400 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008327 sec
   |-> [rocprofiler-sdk] W20260526 19:07:45.996386 128829763182400 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001128 sec
   |-> [rocprofiler-sdk] W20260526 19:07:45.998339 128829763182400 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001939 sec
   |-> [rocprofiler-sdk] W20260526 19:07:46.008552 128829763182400 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008222 sec
   |-> [rocprofiler-sdk] W20260526 19:07:46.594068 128829763182400 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.585501 sec
   |-> [rocprofiler-sdk] W20260526 19:07:46.596298 128829763182400 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002210 sec
   |-> [rocprofiler-sdk] W20260526 19:07:46.596315 128829763182400 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] W20260526 19:07:46.605894 128829763182400 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009571 sec
   |-> [rocprofiler-sdk] W20260526 19:07:46.605908 128829763182400 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:07:46.605914 128829763182400 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:07:46.605922 128829763182400 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] W20260526 19:07:46.606086 128829763182400 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000131 sec
   |-> [rocprofiler-sdk] W20260526 19:07:46.606367 128829763182400 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.695186 sec
   |-> [rocprofiler-sdk] W20260526 19:07:46.609406 128829763182400 simple_timer.cpp:55] [rocprofv3] output generation ::     0.719251 sec
   |-> [rocprofiler-sdk] W20260526 19:07:46.609547 128829763182400 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.721220 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/no_roof/MI200/out/pmc_1/2526034_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 10/13][Approximate profiling time left: 6 seconds]...
[profiling] Current input file: tests/workloads/no_roof/MI200/perfmon/pmc_perf_SQ_INST_LEVEL_LDS_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 19:07:48.165728 131165122412352 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.190102 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 19:07:48.166368 131165122412352 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:07:48.362012 131165122412352 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 19:07:48.459125 131165122412352 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.292757 sec
   |-> [rocprofiler-sdk] W20260526 19:07:48.481535 131165122412352 generateRocpd.cpp:583] writing SQL database for process 2526042 on node 2976770398
   |-> [rocprofiler-sdk] E20260526 19:07:48.482331 131165122412352 generateRocpd.cpp:606] Opened result file: tests/workloads/no_roof/MI200/out/pmc_1/smc4124-25-mi210-3c48/2526042_results.db (UUID=0001fa7c-3764-7764-b352-c8d4ecc256cc)
   |-> [rocprofiler-sdk] W20260526 19:07:48.564815 131165122412352 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008121 sec
   |-> [rocprofiler-sdk] W20260526 19:07:48.566043 131165122412352 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001213 sec
   |-> [rocprofiler-sdk] W20260526 19:07:48.568169 131165122412352 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002111 sec
   |-> [rocprofiler-sdk] W20260526 19:07:48.578502 131165122412352 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008251 sec
   |-> [rocprofiler-sdk] W20260526 19:07:48.924727 131165122412352 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.346211 sec
   |-> [rocprofiler-sdk] W20260526 19:07:48.927047 131165122412352 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002304 sec
   |-> [rocprofiler-sdk] W20260526 19:07:48.927065 131165122412352 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] W20260526 19:07:48.936179 131165122412352 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009106 sec
   |-> [rocprofiler-sdk] W20260526 19:07:48.936194 131165122412352 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:07:48.936200 131165122412352 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:07:48.936206 131165122412352 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] W20260526 19:07:48.936319 131165122412352 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000104 sec
   |-> [rocprofiler-sdk] W20260526 19:07:48.936516 131165122412352 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.454981 sec
   |-> [rocprofiler-sdk] W20260526 19:07:48.939495 131165122412352 simple_timer.cpp:55] [rocprofv3] output generation ::     0.478786 sec
   |-> [rocprofiler-sdk] W20260526 19:07:48.939592 131165122412352 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.480419 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/no_roof/MI200/out/pmc_1/2526042_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 11/13][Approximate profiling time left: 4 seconds]...
[profiling] Current input file: tests/workloads/no_roof/MI200/perfmon/pmc_perf_SQ_INST_LEVEL_SMEM_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 19:07:50.470113 133290531786560 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.188942 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 19:07:50.470741 133290531786560 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000001 sec
   |-> [rocprofiler-sdk] W20260526 19:07:50.664350 133290531786560 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 19:07:50.751133 133290531786560 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.280392 sec
   |-> [rocprofiler-sdk] W20260526 19:07:50.773431 133290531786560 generateRocpd.cpp:583] writing SQL database for process 2526053 on node 2976770398
   |-> [rocprofiler-sdk] E20260526 19:07:50.774235 133290531786560 generateRocpd.cpp:606] Opened result file: tests/workloads/no_roof/MI200/out/pmc_1/smc4124-25-mi210-3c48/2526053_results.db (UUID=0001fa7c-4066-7066-8acb-9e0d411cb1aa)
   |-> [rocprofiler-sdk] W20260526 19:07:50.856877 133290531786560 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008001 sec
   |-> [rocprofiler-sdk] W20260526 19:07:50.858081 133290531786560 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001188 sec
   |-> [rocprofiler-sdk] W20260526 19:07:50.860190 133290531786560 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002094 sec
   |-> [rocprofiler-sdk] W20260526 19:07:50.870594 133290531786560 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008293 sec
   |-> [rocprofiler-sdk] W20260526 19:07:51.202608 133290531786560 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.331999 sec
   |-> [rocprofiler-sdk] W20260526 19:07:51.204879 133290531786560 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002252 sec
   |-> [rocprofiler-sdk] W20260526 19:07:51.204896 133290531786560 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] W20260526 19:07:51.214089 133290531786560 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009186 sec
   |-> [rocprofiler-sdk] W20260526 19:07:51.214104 133290531786560 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:07:51.214110 133290531786560 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:07:51.214117 133290531786560 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] W20260526 19:07:51.214239 133290531786560 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000115 sec
   |-> [rocprofiler-sdk] W20260526 19:07:51.214488 133290531786560 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.441057 sec
   |-> [rocprofiler-sdk] W20260526 19:07:51.217532 133290531786560 simple_timer.cpp:55] [rocprofv3] output generation ::     0.465017 sec
   |-> [rocprofiler-sdk] W20260526 19:07:51.217637 133290531786560 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.466454 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/no_roof/MI200/out/pmc_1/2526053_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 12/13][Approximate profiling time left: 2 seconds]...
[profiling] Current input file: tests/workloads/no_roof/MI200/perfmon/pmc_perf_SQ_INST_LEVEL_VMEM_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 19:07:52.784555 126687033777984 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.199640 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 19:07:52.785162 126687033777984 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000001 sec
   |-> [rocprofiler-sdk] W20260526 19:07:52.979138 126687033777984 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 19:07:53.066578 126687033777984 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.281417 sec
   |-> [rocprofiler-sdk] W20260526 19:07:53.089065 126687033777984 generateRocpd.cpp:583] writing SQL database for process 2526061 on node 2976770398
   |-> [rocprofiler-sdk] E20260526 19:07:53.089858 126687033777984 generateRocpd.cpp:606] Opened result file: tests/workloads/no_roof/MI200/out/pmc_1/smc4124-25-mi210-3c48/2526061_results.db (UUID=0001fa7c-4965-7965-ab07-3bec31c50698)
   |-> [rocprofiler-sdk] W20260526 19:07:53.171829 126687033777984 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008188 sec
   |-> [rocprofiler-sdk] W20260526 19:07:53.173024 126687033777984 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001180 sec
   |-> [rocprofiler-sdk] W20260526 19:07:53.174936 126687033777984 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001886 sec
   |-> [rocprofiler-sdk] W20260526 19:07:53.185295 126687033777984 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008337 sec
   |-> [rocprofiler-sdk] W20260526 19:07:53.703156 126687033777984 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.517846 sec
   |-> [rocprofiler-sdk] W20260526 19:07:53.705360 126687033777984 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002177 sec
   |-> [rocprofiler-sdk] W20260526 19:07:53.705377 126687033777984 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] W20260526 19:07:53.714148 126687033777984 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008763 sec
   |-> [rocprofiler-sdk] W20260526 19:07:53.714163 126687033777984 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:07:53.714169 126687033777984 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:07:53.714176 126687033777984 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] W20260526 19:07:53.714282 126687033777984 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000097 sec
   |-> [rocprofiler-sdk] W20260526 19:07:53.714514 126687033777984 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.625449 sec
   |-> [rocprofiler-sdk] W20260526 19:07:53.717593 126687033777984 simple_timer.cpp:55] [rocprofv3] output generation ::     0.649278 sec
   |-> [rocprofiler-sdk] W20260526 19:07:53.717715 126687033777984 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.651090 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/no_roof/MI200/out/pmc_1/2526061_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 13/13][Approximate profiling time left: 0 seconds]...
[profiling] Current input file: tests/workloads/no_roof/MI200/perfmon/pmc_perf_SQ_LEVEL_WAVES_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 19:07:55.256860 128534496927552 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.189022 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 19:07:55.257454 128534496927552 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:07:55.453397 128534496927552 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:07:55.547573 128534496927552 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.290120 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:07:55.570094 128534496927552 generateRocpd.cpp:583] writing SQL database for process 2526069 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:07:55.570886 128534496927552 generateRocpd.cpp:606] Opened result file: tests/workloads/no_roof/MI200/out/pmc_1/smc4124-25-mi210-3c48/2526069_results.db (UUID=0001fa7c-5318-7318-bcae-3a164534111e)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:07:55.653186 128534496927552 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008051 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:07:55.654373 128534496927552 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001171 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:07:55.656077 128534496927552 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001689 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:07:55.666703 128534496927552 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008436 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:07:55.986645 128534496927552 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.319927 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:07:55.988965 128534496927552 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002288 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:07:55.988982 128534496927552 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:07:55.998758 128534496927552 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009769 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:07:55.998773 128534496927552 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:07:55.998779 128534496927552 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:07:55.998786 128534496927552 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:07:55.998893 128534496927552 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000100 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:07:55.999106 128534496927552 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.429012 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:07:56.002169 128534496927552 simple_timer.cpp:55] [rocprofv3] output generation ::     0.452952 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:07:56.002273 128534496927552 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.454650 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/no_roof/MI200/out/pmc_1/2526069_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
PC sampling data collection skipped as block 21 is not specified.
[roofline] Skipping roofline
