Rocprofiler-Compute version: 3.7.0
Profiler choice: rocprofiler-sdk
Output directory: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/dispatch_7/MI200
Target: MI210
Command: ./tests/vcopy -n 1048576 -b 256 -i 3
Kernel Selection: None
Dispatch Selection: ['7']
Filtered sections: All

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Collecting Performance Counters
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Generating native tool project using command: cmake -S /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib -B /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build
-- Checking for module 'libdw'
--   Package 'libdw', required by 'virtual:world', not found
-- Could NOT find libdw (missing: libdw_LIBRARY libdw_INCLUDE_DIR)
-- {fmt} version: 12.1.0
-- Build type:
-- Configuring done (0.1s)
-- Generating done (0.0s)
-- Build files have been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build
Building native tool using command: cmake --build /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build --parallel
[  0%] Built target gsl_assert
[ 33%] Built target fmt
[100%] Built target rocprofiler-compute-tool
Searching /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src by lib/_build/lib/librocprofiler-compute-tool.so for native collector
Using native collector: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build/lib/librocprofiler-compute-tool.so
Using native counter collection tool: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build/lib/librocprofiler-compute-tool.so
[profiling] Iteration multiplexing: Disabled
[Run 1/13][Approximate profiling time left: pending first measurement...]
[profiling] Current input file: tests/workloads/dispatch_7/MI200/perfmon/pmc_perf_0.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:56:01.524795 127253361549120 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.193508 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:56:01.525408 127253361549120 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:01.720217 127253361549120 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:56:01.814190 127253361549120 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.288783 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:01.836889 127253361549120 generateRocpd.cpp:583] writing SQL database for process 2522853 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:56:01.837691 127253361549120 generateRocpd.cpp:606] Opened result file: tests/workloads/dispatch_7/MI200/out/pmc_1/smc4124-25-mi210-3c48/2522853_results.db (UUID=0001fa71-6f10-7f10-8d97-2895597543e6)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:01.918047 127253361549120 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007982 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:01.919144 127253361549120 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001082 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:01.920702 127253361549120 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001543 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:01.930776 127253361549120 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008177 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:02.255300 127253361549120 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.324510 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:02.257353 127253361549120 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002034 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:02.257370 127253361549120 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:02.266219 127253361549120 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008842 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:02.266233 127253361549120 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:02.266240 127253361549120 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:02.266246 127253361549120 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:02.266362 127253361549120 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000109 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:02.266615 127253361549120 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.429727 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:02.269609 127253361549120 simple_timer.cpp:55] [rocprofv3] output generation ::     0.453536 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:02.269712 127253361549120 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.455463 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
No GPU kernel data collected. The workload may not have dispatched any GPU kernels.
[Run 2/13][Approximate profiling time left: 25 seconds]...
[profiling] Current input file: tests/workloads/dispatch_7/MI200/perfmon/pmc_perf_1.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:56:03.813577 132661591498560 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.190710 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:56:03.814296 132661591498560 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:04.009315 132661591498560 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:56:04.099187 132661591498560 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.284891 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:04.121791 132661591498560 generateRocpd.cpp:583] writing SQL database for process 2522862 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:56:04.122607 132661591498560 generateRocpd.cpp:606] Opened result file: tests/workloads/dispatch_7/MI200/out/pmc_1/smc4124-25-mi210-3c48/2522862_results.db (UUID=0001fa71-7803-7803-83db-84426edfb71c)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:04.205630 132661591498560 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008011 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:04.206936 132661591498560 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001289 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:04.209092 132661591498560 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002139 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:04.219710 132661591498560 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008503 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:04.535753 132661591498560 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.316028 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:04.538105 132661591498560 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002326 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:04.538123 132661591498560 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000005 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:04.546830 132661591498560 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008699 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:04.546844 132661591498560 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:04.546850 132661591498560 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:04.546857 132661591498560 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:04.546969 132661591498560 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000104 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:04.547186 132661591498560 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.425396 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:04.550244 132661591498560 simple_timer.cpp:55] [rocprofv3] output generation ::     0.449252 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:04.550345 132661591498560 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.451101 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
No GPU kernel data collected. The workload may not have dispatched any GPU kernels.
[Run 3/13][Approximate profiling time left: 22 seconds]...
[profiling] Current input file: tests/workloads/dispatch_7/MI200/perfmon/pmc_perf_2.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:56:06.101495 140009223843648 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.191412 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:56:06.102075 140009223843648 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:06.296722 140009223843648 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:56:06.386666 140009223843648 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.284591 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:06.408906 140009223843648 generateRocpd.cpp:583] writing SQL database for process 2522872 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:56:06.409724 140009223843648 generateRocpd.cpp:606] Opened result file: tests/workloads/dispatch_7/MI200/out/pmc_1/smc4124-25-mi210-3c48/2522872_results.db (UUID=0001fa71-80f2-70f2-b47e-b59e81e17d88)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:06.486569 140009223843648 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007752 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:06.487628 140009223843648 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001044 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:06.489154 140009223843648 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001511 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:06.499194 140009223843648 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008205 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:06.794471 140009223843648 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.295261 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:06.796429 140009223843648 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.001932 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:06.796446 140009223843648 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:06.805767 140009223843648 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009313 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:06.805782 140009223843648 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:06.805788 140009223843648 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:06.805795 140009223843648 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:06.805916 140009223843648 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000114 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:06.806148 140009223843648 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.397242 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:06.809065 140009223843648 simple_timer.cpp:55] [rocprofv3] output generation ::     0.420861 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:06.809165 140009223843648 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.422457 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
No GPU kernel data collected. The workload may not have dispatched any GPU kernels.
[Run 4/13][Approximate profiling time left: 20 seconds]...
[profiling] Current input file: tests/workloads/dispatch_7/MI200/perfmon/pmc_perf_3.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:56:08.327156 131753378148160 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.189118 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:56:08.327743 131753378148160 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:08.520961 131753378148160 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:56:08.607360 131753378148160 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.279617 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:08.629734 131753378148160 generateRocpd.cpp:583] writing SQL database for process 2522880 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:56:08.630510 131753378148160 generateRocpd.cpp:606] Opened result file: tests/workloads/dispatch_7/MI200/out/pmc_1/smc4124-25-mi210-3c48/2522880_results.db (UUID=0001fa71-89a7-79a7-a090-05216f3e922a)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:08.710457 131753378148160 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007918 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:08.711559 131753378148160 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001083 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:08.713545 131753378148160 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001972 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:08.723802 131753378148160 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008204 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:09.010316 131753378148160 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.286499 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:09.012505 131753378148160 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002150 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:09.012522 131753378148160 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:09.021938 131753378148160 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009408 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:09.021952 131753378148160 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:09.021958 131753378148160 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:09.021965 131753378148160 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:09.022093 131753378148160 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000122 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:09.022338 131753378148160 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.392605 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:09.025289 131753378148160 simple_timer.cpp:55] [rocprofv3] output generation ::     0.416224 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:09.025382 131753378148160 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.417981 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
No GPU kernel data collected. The workload may not have dispatched any GPU kernels.
[Run 5/13][Approximate profiling time left: 18 seconds]...
[profiling] Current input file: tests/workloads/dispatch_7/MI200/perfmon/pmc_perf_4.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:56:10.554244 140581016059712 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.189099 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:56:10.554869 140581016059712 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:10.748218 140581016059712 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:56:10.827212 140581016059712 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.272343 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:10.849929 140581016059712 generateRocpd.cpp:583] writing SQL database for process 2522889 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:56:10.850742 140581016059712 generateRocpd.cpp:606] Opened result file: tests/workloads/dispatch_7/MI200/out/pmc_1/smc4124-25-mi210-3c48/2522889_results.db (UUID=0001fa71-925a-725a-9853-5b10b551b8dc)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:10.931112 140581016059712 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007895 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:10.932216 140581016059712 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001089 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:10.933802 140581016059712 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001571 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:10.944083 140581016059712 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008284 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:11.223150 140581016059712 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.279052 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:11.225179 140581016059712 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002013 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:11.225196 140581016059712 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:11.234123 140581016059712 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008919 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:11.234138 140581016059712 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:11.234144 140581016059712 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:11.234151 140581016059712 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:11.234255 140581016059712 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000097 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:11.234462 140581016059712 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.384534 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:11.237390 140581016059712 simple_timer.cpp:55] [rocprofv3] output generation ::     0.408329 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:11.237475 140581016059712 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.410209 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
No GPU kernel data collected. The workload may not have dispatched any GPU kernels.
[Run 6/13][Approximate profiling time left: 15 seconds]...
[profiling] Current input file: tests/workloads/dispatch_7/MI200/perfmon/pmc_perf_5.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:56:12.728941 134959073943360 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.183149 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:56:12.729524 134959073943360 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:12.924060 134959073943360 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:56:13.003671 134959073943360 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.274147 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:13.026432 134959073943360 generateRocpd.cpp:583] writing SQL database for process 2522897 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:56:13.027241 134959073943360 generateRocpd.cpp:606] Opened result file: tests/workloads/dispatch_7/MI200/out/pmc_1/smc4124-25-mi210-3c48/2522897_results.db (UUID=0001fa71-9ade-7ade-b475-8438cc9267fe)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:13.107499 134959073943360 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007615 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:13.108640 134959073943360 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001125 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:13.110216 134959073943360 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001562 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:13.120380 134959073943360 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008236 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:13.128717 134959073943360 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.008322 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:13.130638 134959073943360 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.001907 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:13.130655 134959073943360 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:13.139977 134959073943360 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009314 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:13.139994 134959073943360 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:13.140003 134959073943360 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:13.140012 134959073943360 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:13.140119 134959073943360 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000101 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:13.140325 134959073943360 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.113894 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:13.143483 134959073943360 simple_timer.cpp:55] [rocprofv3] output generation ::     0.138165 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:13.143531 134959073943360 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.139795 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
No GPU kernel data collected. The workload may not have dispatched any GPU kernels.
[Run 7/13][Approximate profiling time left: 13 seconds]...
[profiling] Current input file: tests/workloads/dispatch_7/MI200/perfmon/pmc_perf_SQC_DCACHE_INFLIGHT_LEVEL_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:56:14.676665 138403769753408 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.190810 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:56:14.677270 138403769753408 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:14.868556 138403769753408 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:56:14.946860 138403769753408 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.269590 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:14.968954 138403769753408 generateRocpd.cpp:583] writing SQL database for process 2522907 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:56:14.969750 138403769753408 generateRocpd.cpp:606] Opened result file: tests/workloads/dispatch_7/MI200/out/pmc_1/smc4124-25-mi210-3c48/2522907_results.db (UUID=0001fa71-a272-7272-b22e-01cf4f1bf085)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:15.050727 138403769753408 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008067 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:15.051830 138403769753408 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001087 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:15.053774 138403769753408 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001929 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:15.064049 138403769753408 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008317 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:15.473309 138403769753408 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.409246 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:15.475405 138403769753408 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002076 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:15.475422 138403769753408 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:15.484686 138403769753408 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009257 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:15.484701 138403769753408 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:15.484707 138403769753408 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:15.484714 138403769753408 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:15.484818 138403769753408 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000097 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:15.485025 138403769753408 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.516071 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:15.487952 138403769753408 simple_timer.cpp:55] [rocprofv3] output generation ::     0.539651 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:15.488072 138403769753408 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.541159 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
No GPU kernel data collected. The workload may not have dispatched any GPU kernels.
[Run 8/13][Approximate profiling time left: 11 seconds]...
[profiling] Current input file: tests/workloads/dispatch_7/MI200/perfmon/pmc_perf_SQC_ICACHE_INFLIGHT_LEVEL_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:56:17.022087 136379450335040 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.192614 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:56:17.022700 136379450335040 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:17.215707 136379450335040 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:56:17.303281 136379450335040 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.280581 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:17.325710 136379450335040 generateRocpd.cpp:583] writing SQL database for process 2522915 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:56:17.326519 136379450335040 generateRocpd.cpp:606] Opened result file: tests/workloads/dispatch_7/MI200/out/pmc_1/smc4124-25-mi210-3c48/2522915_results.db (UUID=0001fa71-ab9a-7b9a-b692-6a53337e3ee6)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:17.408804 136379450335040 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008378 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:17.409927 136379450335040 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001100 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:17.411910 136379450335040 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001968 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:17.422222 136379450335040 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008329 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:17.829393 136379450335040 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.407155 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:17.831467 136379450335040 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002055 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:17.831486 136379450335040 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:17.840307 136379450335040 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008814 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:17.840322 136379450335040 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:17.840328 136379450335040 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:17.840335 136379450335040 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:17.840472 136379450335040 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000106 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:17.840719 136379450335040 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.515009 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:17.843799 136379450335040 simple_timer.cpp:55] [rocprofv3] output generation ::     0.538756 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:17.843914 136379450335040 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.540582 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
No GPU kernel data collected. The workload may not have dispatched any GPU kernels.
[Run 9/13][Approximate profiling time left: 8 seconds]...
[profiling] Current input file: tests/workloads/dispatch_7/MI200/perfmon/pmc_perf_SQ_IFETCH_LEVEL_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:56:19.410587 131447944978240 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.196521 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:56:19.411218 131447944978240 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:19.604682 131447944978240 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:56:19.690922 131447944978240 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.279704 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:19.713179 131447944978240 generateRocpd.cpp:583] writing SQL database for process 2522924 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:56:19.713998 131447944978240 generateRocpd.cpp:606] Opened result file: tests/workloads/dispatch_7/MI200/out/pmc_1/smc4124-25-mi210-3c48/2522924_results.db (UUID=0001fa71-b4eb-74eb-8d96-47b9d7147ef0)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:19.796600 131447944978240 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008107 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:19.797707 131447944978240 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001090 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:19.799702 131447944978240 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001979 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:19.810089 131447944978240 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008403 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:20.392746 131447944978240 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.582642 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:20.394933 131447944978240 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002159 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:20.394951 131447944978240 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:20.403419 131447944978240 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008461 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:20.403433 131447944978240 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:20.403439 131447944978240 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:20.403446 131447944978240 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:20.403562 131447944978240 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000109 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:20.403804 131447944978240 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.690625 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:20.406740 131447944978240 simple_timer.cpp:55] [rocprofv3] output generation ::     0.714372 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:20.406866 131447944978240 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.715887 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
No GPU kernel data collected. The workload may not have dispatched any GPU kernels.
[Run 10/13][Approximate profiling time left: 6 seconds]...
[profiling] Current input file: tests/workloads/dispatch_7/MI200/perfmon/pmc_perf_SQ_INST_LEVEL_LDS_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:56:21.955792 129818264428352 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.189519 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:56:21.956364 129818264428352 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:22.149243 129818264428352 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:56:22.234689 129818264428352 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.278325 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:22.256859 129818264428352 generateRocpd.cpp:583] writing SQL database for process 2522932 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:56:22.257662 129818264428352 generateRocpd.cpp:606] Opened result file: tests/workloads/dispatch_7/MI200/out/pmc_1/smc4124-25-mi210-3c48/2522932_results.db (UUID=0001fa71-bee3-7ee3-ad84-e960cbf895a5)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:22.338163 129818264428352 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008007 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:22.339288 129818264428352 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001108 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:22.341252 129818264428352 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001949 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:22.351491 129818264428352 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008277 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:22.694637 129818264428352 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.343132 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:22.696783 129818264428352 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002127 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:22.696800 129818264428352 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:22.705199 129818264428352 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008393 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:22.705213 129818264428352 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:22.705219 129818264428352 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:22.705226 129818264428352 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:22.705329 129818264428352 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000096 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:22.705530 129818264428352 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.448671 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:22.708480 129818264428352 simple_timer.cpp:55] [rocprofv3] output generation ::     0.472150 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:22.708579 129818264428352 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.473845 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
No GPU kernel data collected. The workload may not have dispatched any GPU kernels.
[Run 11/13][Approximate profiling time left: 4 seconds]...
[profiling] Current input file: tests/workloads/dispatch_7/MI200/perfmon/pmc_perf_SQ_INST_LEVEL_SMEM_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:56:24.242488 137152388546368 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.190782 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:56:24.243066 137152388546368 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:24.434145 137152388546368 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:56:24.515585 137152388546368 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.272519 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:24.537832 137152388546368 generateRocpd.cpp:583] writing SQL database for process 2522940 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:56:24.538631 137152388546368 generateRocpd.cpp:606] Opened result file: tests/workloads/dispatch_7/MI200/out/pmc_1/smc4124-25-mi210-3c48/2522940_results.db (UUID=0001fa71-c7d0-77d0-b524-1a21b7c1b73a)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:24.618927 137152388546368 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007889 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:24.620044 137152388546368 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001098 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:24.621967 137152388546368 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001909 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:24.632174 137152388546368 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008251 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:24.964791 137152388546368 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.332602 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:24.966905 137152388546368 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002092 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:24.966923 137152388546368 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:24.975270 137152388546368 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008340 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:24.975286 137152388546368 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:24.975292 137152388546368 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:24.975299 137152388546368 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:24.975415 137152388546368 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000107 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:24.975636 137152388546368 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.437804 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:24.978575 137152388546368 simple_timer.cpp:55] [rocprofv3] output generation ::     0.461363 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:24.978680 137152388546368 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.463049 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
No GPU kernel data collected. The workload may not have dispatched any GPU kernels.
[Run 12/13][Approximate profiling time left: 2 seconds]...
[profiling] Current input file: tests/workloads/dispatch_7/MI200/perfmon/pmc_perf_SQ_INST_LEVEL_VMEM_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:56:26.530233 133056825466688 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.202517 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:56:26.530848 133056825466688 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:26.724816 133056825466688 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:56:26.802087 133056825466688 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.271239 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:26.823902 133056825466688 generateRocpd.cpp:583] writing SQL database for process 2522949 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:56:26.824648 133056825466688 generateRocpd.cpp:606] Opened result file: tests/workloads/dispatch_7/MI200/out/pmc_1/smc4124-25-mi210-3c48/2522949_results.db (UUID=0001fa71-d0b4-70b4-b19b-3e8e0dab82a7)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:26.905229 133056825466688 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008060 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:26.906319 133056825466688 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001073 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:26.908248 133056825466688 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001915 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:26.918609 133056825466688 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008402 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:27.436822 133056825466688 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.518198 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:27.438866 133056825466688 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002012 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:27.438884 133056825466688 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:27.447079 133056825466688 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008188 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:27.447093 133056825466688 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:27.447099 133056825466688 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:27.447106 133056825466688 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:27.447208 133056825466688 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000095 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:27.447421 133056825466688 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.623519 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:27.450292 133056825466688 simple_timer.cpp:55] [rocprofv3] output generation ::     0.646837 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:27.450381 133056825466688 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.648255 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
No GPU kernel data collected. The workload may not have dispatched any GPU kernels.
[Run 13/13][Approximate profiling time left: 0 seconds]...
[profiling] Current input file: tests/workloads/dispatch_7/MI200/perfmon/pmc_perf_SQ_LEVEL_WAVES_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:56:28.978469 137139724558144 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.187615 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:56:28.979062 137139724558144 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:29.172328 137139724558144 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:56:29.253363 137139724558144 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.274301 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:29.275650 137139724558144 generateRocpd.cpp:583] writing SQL database for process 2522957 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:56:29.276437 137139724558144 generateRocpd.cpp:606] Opened result file: tests/workloads/dispatch_7/MI200/out/pmc_1/smc4124-25-mi210-3c48/2522957_results.db (UUID=0001fa71-da53-7a53-8e66-db9931e0ba24)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:29.357126 137139724558144 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007938 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:29.358224 137139724558144 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001082 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:29.359807 137139724558144 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001568 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:29.370084 137139724558144 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008304 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:29.691600 137139724558144 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.321500 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:29.693744 137139724558144 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002114 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:29.693761 137139724558144 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:29.703240 137139724558144 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009472 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:29.703255 137139724558144 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:29.703261 137139724558144 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:29.703268 137139724558144 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:29.703386 137139724558144 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000106 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:29.703598 137139724558144 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.427948 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:29.706551 137139724558144 simple_timer.cpp:55] [rocprofv3] output generation ::     0.451440 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:29.706656 137139724558144 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.453249 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
No GPU kernel data collected. The workload may not have dispatched any GPU kernels.
PC sampling data collection skipped as block 21 is not specified.
[roofline] Checking for roofline.csv in tests/workloads/dispatch_7/MI200
[roofline] Benchmark execution failed: 'L1'. Skipping roofline.
