Rocprofiler-Compute version: 3.7.0
Profiler choice: rocprofiler-sdk
Output directory: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/dispatch_inv/MI200
Target: MI210
Command: ./tests/vcopy -n 1048576 -b 256 -i 3
Kernel Selection: None
Dispatch Selection: None
Filtered sections: All

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Collecting Performance Counters
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Generating native tool project using command: cmake -S /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib -B /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build
-- Checking for module 'libdw'
--   Package 'libdw', required by 'virtual:world', not found
-- Could NOT find libdw (missing: libdw_LIBRARY libdw_INCLUDE_DIR)
-- {fmt} version: 12.1.0
-- Build type:
-- Configuring done (0.1s)
-- Generating done (0.0s)
-- Build files have been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build
Building native tool using command: cmake --build /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build --parallel
[ 33%] Built target fmt
[ 33%] Built target gsl_assert
[100%] Built target rocprofiler-compute-tool
Searching /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src by lib/_build/lib/librocprofiler-compute-tool.so for native collector
Using native collector: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build/lib/librocprofiler-compute-tool.so
Using native counter collection tool: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build/lib/librocprofiler-compute-tool.so
[profiling] Iteration multiplexing: Disabled
[Run 1/13][Approximate profiling time left: pending first measurement...]
[profiling] Current input file: tests/workloads/dispatch_inv/MI200/perfmon/pmc_perf_0.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 19:09:11.234457 139183706750784 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.191973 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 19:09:11.235100 139183706750784 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:09:11.427471 139183706750784 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 19:09:11.510090 139183706750784 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.274990 sec
   |-> [rocprofiler-sdk] W20260526 19:09:11.533185 139183706750784 generateRocpd.cpp:583] writing SQL database for process 2526399 on node 2976770398
   |-> [rocprofiler-sdk] E20260526 19:09:11.533985 139183706750784 generateRocpd.cpp:606] Opened result file: tests/workloads/dispatch_inv/MI200/out/pmc_1/smc4124-25-mi210-3c48/2526399_results.db (UUID=0001fa7d-7bdf-7bdf-8727-d56b821c259a)
   |-> [rocprofiler-sdk] W20260526 19:09:11.618188 139183706750784 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007979 sec
   |-> [rocprofiler-sdk] W20260526 19:09:11.619298 139183706750784 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001092 sec
   |-> [rocprofiler-sdk] W20260526 19:09:11.620875 139183706750784 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001562 sec
   |-> [rocprofiler-sdk] W20260526 19:09:11.631075 139183706750784 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008187 sec
   |-> [rocprofiler-sdk] W20260526 19:09:11.958530 139183706750784 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.327441 sec
   |-> [rocprofiler-sdk] W20260526 19:09:11.960676 139183706750784 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002128 sec
   |-> [rocprofiler-sdk] W20260526 19:09:11.960693 139183706750784 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] W20260526 19:09:11.970149 139183706750784 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009449 sec
   |-> [rocprofiler-sdk] W20260526 19:09:11.970163 139183706750784 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:09:11.970169 139183706750784 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:09:11.970175 139183706750784 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] W20260526 19:09:11.970285 139183706750784 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000098 sec
   |-> [rocprofiler-sdk] W20260526 19:09:11.970474 139183706750784 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.437288 sec
   |-> [rocprofiler-sdk] W20260526 19:09:11.973512 139183706750784 simple_timer.cpp:55] [rocprofv3] output generation ::     0.461810 sec
   |-> [rocprofiler-sdk] W20260526 19:09:11.973614 139183706750784 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.463475 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/dispatch_inv/MI200/out/pmc_1/2526399_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 2/13][Approximate profiling time left: 25 seconds]...
[profiling] Current input file: tests/workloads/dispatch_inv/MI200/perfmon/pmc_perf_1.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 19:09:13.510993 125227540430656 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.190593 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 19:09:13.511577 125227540430656 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:09:13.704797 125227540430656 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 19:09:13.798407 125227540430656 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.286831 sec
   |-> [rocprofiler-sdk] W20260526 19:09:13.820618 125227540430656 generateRocpd.cpp:583] writing SQL database for process 2526408 on node 2976770398
   |-> [rocprofiler-sdk] E20260526 19:09:13.821412 125227540430656 generateRocpd.cpp:606] Opened result file: tests/workloads/dispatch_inv/MI200/out/pmc_1/smc4124-25-mi210-3c48/2526408_results.db (UUID=0001fa7d-84c5-74c5-8778-2584f14c8051)
   |-> [rocprofiler-sdk] W20260526 19:09:13.903915 125227540430656 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007939 sec
   |-> [rocprofiler-sdk] W20260526 19:09:13.905045 125227540430656 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001114 sec
   |-> [rocprofiler-sdk] W20260526 19:09:13.906632 125227540430656 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001572 sec
   |-> [rocprofiler-sdk] W20260526 19:09:13.916822 125227540430656 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008215 sec
   |-> [rocprofiler-sdk] W20260526 19:09:14.227285 125227540430656 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.310449 sec
   |-> [rocprofiler-sdk] W20260526 19:09:14.229345 125227540430656 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002041 sec
   |-> [rocprofiler-sdk] W20260526 19:09:14.229362 125227540430656 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] W20260526 19:09:14.238405 125227540430656 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009036 sec
   |-> [rocprofiler-sdk] W20260526 19:09:14.238420 125227540430656 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:09:14.238426 125227540430656 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:09:14.238432 125227540430656 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] W20260526 19:09:14.238540 125227540430656 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000101 sec
   |-> [rocprofiler-sdk] W20260526 19:09:14.238749 125227540430656 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.418131 sec
   |-> [rocprofiler-sdk] W20260526 19:09:14.241708 125227540430656 simple_timer.cpp:55] [rocprofv3] output generation ::     0.441845 sec
   |-> [rocprofiler-sdk] W20260526 19:09:14.241808 125227540430656 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.443351 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/dispatch_inv/MI200/out/pmc_1/2526408_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 3/13][Approximate profiling time left: 22 seconds]...
[profiling] Current input file: tests/workloads/dispatch_inv/MI200/perfmon/pmc_perf_2.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 19:09:15.799805 135489609568064 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.192333 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 19:09:15.800436 135489609568064 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:09:15.994128 135489609568064 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 19:09:16.082062 135489609568064 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.281626 sec
   |-> [rocprofiler-sdk] W20260526 19:09:16.104790 135489609568064 generateRocpd.cpp:583] writing SQL database for process 2526417 on node 2976770398
   |-> [rocprofiler-sdk] E20260526 19:09:16.105589 135489609568064 generateRocpd.cpp:606] Opened result file: tests/workloads/dispatch_inv/MI200/out/pmc_1/smc4124-25-mi210-3c48/2526417_results.db (UUID=0001fa7d-8db4-7db4-b3cc-f11a2df878a9)
   |-> [rocprofiler-sdk] W20260526 19:09:16.189385 135489609568064 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007897 sec
   |-> [rocprofiler-sdk] W20260526 19:09:16.190601 135489609568064 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001199 sec
   |-> [rocprofiler-sdk] W20260526 19:09:16.192193 135489609568064 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001577 sec
   |-> [rocprofiler-sdk] W20260526 19:09:16.202464 135489609568064 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008274 sec
   |-> [rocprofiler-sdk] W20260526 19:09:16.501392 135489609568064 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.298914 sec
   |-> [rocprofiler-sdk] W20260526 19:09:16.503623 135489609568064 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002215 sec
   |-> [rocprofiler-sdk] W20260526 19:09:16.503641 135489609568064 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] W20260526 19:09:16.512371 135489609568064 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008722 sec
   |-> [rocprofiler-sdk] W20260526 19:09:16.512385 135489609568064 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:09:16.512391 135489609568064 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:09:16.512397 135489609568064 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] W20260526 19:09:16.512502 135489609568064 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000098 sec
   |-> [rocprofiler-sdk] W20260526 19:09:16.512719 135489609568064 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.407929 sec
   |-> [rocprofiler-sdk] W20260526 19:09:16.515781 135489609568064 simple_timer.cpp:55] [rocprofv3] output generation ::     0.431995 sec
   |-> [rocprofiler-sdk] W20260526 19:09:16.515877 135489609568064 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.433767 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/dispatch_inv/MI200/out/pmc_1/2526417_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 4/13][Approximate profiling time left: 20 seconds]...
[profiling] Current input file: tests/workloads/dispatch_inv/MI200/perfmon/pmc_perf_3.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 19:09:18.057837 129377764208448 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.196761 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 19:09:18.058498 129377764208448 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:09:18.251252 129377764208448 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 19:09:18.354636 129377764208448 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.296138 sec
   |-> [rocprofiler-sdk] W20260526 19:09:18.376967 129377764208448 generateRocpd.cpp:583] writing SQL database for process 2526425 on node 2976770398
   |-> [rocprofiler-sdk] E20260526 19:09:18.377770 129377764208448 generateRocpd.cpp:606] Opened result file: tests/workloads/dispatch_inv/MI200/out/pmc_1/smc4124-25-mi210-3c48/2526425_results.db (UUID=0001fa7d-9681-7681-90e9-d501eef10570)
   |-> [rocprofiler-sdk] W20260526 19:09:18.462254 129377764208448 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008072 sec
   |-> [rocprofiler-sdk] W20260526 19:09:18.463461 129377764208448 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001191 sec
   |-> [rocprofiler-sdk] W20260526 19:09:18.465591 129377764208448 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002115 sec
   |-> [rocprofiler-sdk] W20260526 19:09:18.476081 129377764208448 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008334 sec
   |-> [rocprofiler-sdk] W20260526 19:09:18.762846 129377764208448 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.286751 sec
   |-> [rocprofiler-sdk] W20260526 19:09:18.765198 129377764208448 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002329 sec
   |-> [rocprofiler-sdk] W20260526 19:09:18.765216 129377764208448 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] W20260526 19:09:18.774007 129377764208448 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008784 sec
   |-> [rocprofiler-sdk] W20260526 19:09:18.774022 129377764208448 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:09:18.774028 129377764208448 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:09:18.774043 129377764208448 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] W20260526 19:09:18.774147 129377764208448 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000098 sec
   |-> [rocprofiler-sdk] W20260526 19:09:18.774370 129377764208448 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.397404 sec
   |-> [rocprofiler-sdk] W20260526 19:09:18.777725 129377764208448 simple_timer.cpp:55] [rocprofv3] output generation ::     0.421729 sec
   |-> [rocprofiler-sdk] W20260526 19:09:18.777831 129377764208448 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.423144 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/dispatch_inv/MI200/out/pmc_1/2526425_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 5/13][Approximate profiling time left: 18 seconds]...
[profiling] Current input file: tests/workloads/dispatch_inv/MI200/perfmon/pmc_perf_4.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 19:09:20.329801 123718198509376 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.188877 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 19:09:20.330413 123718198509376 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:09:20.522206 123718198509376 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 19:09:20.609513 123718198509376 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.279100 sec
   |-> [rocprofiler-sdk] W20260526 19:09:20.631618 123718198509376 generateRocpd.cpp:583] writing SQL database for process 2526433 on node 2976770398
   |-> [rocprofiler-sdk] E20260526 19:09:20.632367 123718198509376 generateRocpd.cpp:606] Opened result file: tests/workloads/dispatch_inv/MI200/out/pmc_1/smc4124-25-mi210-3c48/2526433_results.db (UUID=0001fa7d-9f69-7f69-a765-5192ae23787a)
   |-> [rocprofiler-sdk] W20260526 19:09:20.716538 123718198509376 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007976 sec
   |-> [rocprofiler-sdk] W20260526 19:09:20.717677 123718198509376 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001122 sec
   |-> [rocprofiler-sdk] W20260526 19:09:20.719280 123718198509376 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001585 sec
   |-> [rocprofiler-sdk] W20260526 19:09:20.729479 123718198509376 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008164 sec
   |-> [rocprofiler-sdk] W20260526 19:09:21.010053 123718198509376 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.280560 sec
   |-> [rocprofiler-sdk] W20260526 19:09:21.012165 123718198509376 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002088 sec
   |-> [rocprofiler-sdk] W20260526 19:09:21.012182 123718198509376 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] W20260526 19:09:21.021088 123718198509376 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008899 sec
   |-> [rocprofiler-sdk] W20260526 19:09:21.021102 123718198509376 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:09:21.021108 123718198509376 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:09:21.021115 123718198509376 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] W20260526 19:09:21.021234 123718198509376 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000111 sec
   |-> [rocprofiler-sdk] W20260526 19:09:21.021476 123718198509376 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.389858 sec
   |-> [rocprofiler-sdk] W20260526 19:09:21.024516 123718198509376 simple_timer.cpp:55] [rocprofv3] output generation ::     0.413553 sec
   |-> [rocprofiler-sdk] W20260526 19:09:21.024618 123718198509376 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.415069 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/dispatch_inv/MI200/out/pmc_1/2526433_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 6/13][Approximate profiling time left: 15 seconds]...
[profiling] Current input file: tests/workloads/dispatch_inv/MI200/perfmon/pmc_perf_5.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 19:09:22.522320 135364218380096 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.185379 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 19:09:22.522943 135364218380096 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000001 sec
   |-> [rocprofiler-sdk] W20260526 19:09:22.717149 135364218380096 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 19:09:22.809497 135364218380096 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.286555 sec
   |-> [rocprofiler-sdk] W20260526 19:09:22.831644 135364218380096 generateRocpd.cpp:583] writing SQL database for process 2526441 on node 2976770398
   |-> [rocprofiler-sdk] E20260526 19:09:22.832441 135364218380096 generateRocpd.cpp:606] Opened result file: tests/workloads/dispatch_inv/MI200/out/pmc_1/smc4124-25-mi210-3c48/2526441_results.db (UUID=0001fa7d-a7fd-77fd-b892-afef3828211f)
   |-> [rocprofiler-sdk] W20260526 19:09:22.916258 135364218380096 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007727 sec
   |-> [rocprofiler-sdk] W20260526 19:09:22.917374 135364218380096 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001100 sec
   |-> [rocprofiler-sdk] W20260526 19:09:22.918975 135364218380096 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001586 sec
   |-> [rocprofiler-sdk] W20260526 19:09:22.929342 135364218380096 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008299 sec
   |-> [rocprofiler-sdk] W20260526 19:09:22.937960 135364218380096 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.008603 sec
   |-> [rocprofiler-sdk] W20260526 19:09:22.940084 135364218380096 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002107 sec
   |-> [rocprofiler-sdk] W20260526 19:09:22.940102 135364218380096 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] W20260526 19:09:22.948891 135364218380096 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008782 sec
   |-> [rocprofiler-sdk] W20260526 19:09:22.948905 135364218380096 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:09:22.948912 135364218380096 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:09:22.948918 135364218380096 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] W20260526 19:09:22.949024 135364218380096 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000097 sec
   |-> [rocprofiler-sdk] W20260526 19:09:22.949226 135364218380096 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.117582 sec
   |-> [rocprofiler-sdk] W20260526 19:09:22.952169 135364218380096 simple_timer.cpp:55] [rocprofv3] output generation ::     0.141170 sec
   |-> [rocprofiler-sdk] W20260526 19:09:22.952222 135364218380096 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.142683 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/dispatch_inv/MI200/out/pmc_1/2526441_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 7/13][Approximate profiling time left: 13 seconds]...
[profiling] Current input file: tests/workloads/dispatch_inv/MI200/perfmon/pmc_perf_SQC_DCACHE_INFLIGHT_LEVEL_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:09:24.469648 130753197596480 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.190515 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:09:24.470254 130753197596480 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:24.664140 130753197596480 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:09:24.754708 130753197596480 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.284454 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:24.777162 130753197596480 generateRocpd.cpp:583] writing SQL database for process 2526449 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:09:24.777973 130753197596480 generateRocpd.cpp:606] Opened result file: tests/workloads/dispatch_inv/MI200/out/pmc_1/smc4124-25-mi210-3c48/2526449_results.db (UUID=0001fa7d-af94-7f94-ba09-af57a8637672)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:24.861872 130753197596480 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008096 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:24.862988 130753197596480 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001098 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:24.864979 130753197596480 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001977 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:24.875439 130753197596480 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008425 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:25.284116 130753197596480 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.408663 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:25.286304 130753197596480 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002168 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:25.286321 130753197596480 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:25.295014 130753197596480 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008685 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:25.295029 130753197596480 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:25.295040 130753197596480 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:25.295053 130753197596480 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:25.295189 130753197596480 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000126 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:25.295448 130753197596480 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.518287 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:25.298438 130753197596480 simple_timer.cpp:55] [rocprofv3] output generation ::     0.542382 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:25.298559 130753197596480 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.543802 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/dispatch_inv/MI200/out/pmc_1/2526449_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 8/13][Approximate profiling time left: 11 seconds]...
[profiling] Current input file: tests/workloads/dispatch_inv/MI200/perfmon/pmc_perf_SQC_ICACHE_INFLIGHT_LEVEL_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:09:26.839813 125277133938496 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.190513 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:09:26.840423 125277133938496 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:27.033216 125277133938496 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:09:27.114627 125277133938496 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.274203 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:27.136930 125277133938496 generateRocpd.cpp:583] writing SQL database for process 2526458 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:09:27.137720 125277133938496 generateRocpd.cpp:606] Opened result file: tests/workloads/dispatch_inv/MI200/out/pmc_1/smc4124-25-mi210-3c48/2526458_results.db (UUID=0001fa7d-b8d6-78d6-860a-94f37c475021)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:27.221721 125277133938496 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008107 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:27.222848 125277133938496 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001111 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:27.224832 125277133938496 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001967 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:27.235005 125277133938496 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008181 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:27.667598 125277133938496 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.432579 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:27.669590 125277133938496 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.001968 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:27.669607 125277133938496 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:27.679422 125277133938496 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009807 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:27.679438 125277133938496 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:27.679444 125277133938496 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:27.679450 125277133938496 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:27.679563 125277133938496 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000104 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:27.679778 125277133938496 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.542848 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:27.682835 125277133938496 simple_timer.cpp:55] [rocprofv3] output generation ::     0.566825 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:27.682942 125277133938496 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.568265 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/dispatch_inv/MI200/out/pmc_1/2526458_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 9/13][Approximate profiling time left: 8 seconds]...
[profiling] Current input file: tests/workloads/dispatch_inv/MI200/perfmon/pmc_perf_SQ_IFETCH_LEVEL_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:09:29.257148 136721593802560 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.197875 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:09:29.257752 136721593802560 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:29.451960 136721593802560 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:09:29.538302 136721593802560 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.280549 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:29.561061 136721593802560 generateRocpd.cpp:583] writing SQL database for process 2526468 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:09:29.561859 136721593802560 generateRocpd.cpp:606] Opened result file: tests/workloads/dispatch_inv/MI200/out/pmc_1/smc4124-25-mi210-3c48/2526468_results.db (UUID=0001fa7d-c240-7240-886b-7ecb3c3b9946)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:29.648395 136721593802560 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008382 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:29.649617 136721593802560 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001206 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:29.651769 136721593802560 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002137 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:29.662370 136721593802560 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008456 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:30.245884 136721593802560 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.583498 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:30.249140 136721593802560 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.003228 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:30.249158 136721593802560 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:30.258573 136721593802560 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009408 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:30.258588 136721593802560 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:30.258595 136721593802560 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:30.258602 136721593802560 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:30.258769 136721593802560 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000130 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:30.259062 136721593802560 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.698001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:30.262123 136721593802560 simple_timer.cpp:55] [rocprofv3] output generation ::     0.722194 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:30.262271 136721593802560 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.723919 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/dispatch_inv/MI200/out/pmc_1/2526468_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 10/13][Approximate profiling time left: 6 seconds]...
[profiling] Current input file: tests/workloads/dispatch_inv/MI200/perfmon/pmc_perf_SQ_INST_LEVEL_LDS_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:09:31.804017 126715691540288 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.191786 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:09:31.804637 126715691540288 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:31.999265 126715691540288 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:09:32.095435 126715691540288 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.290798 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:32.118520 126715691540288 generateRocpd.cpp:583] writing SQL database for process 2526476 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:09:32.119324 126715691540288 generateRocpd.cpp:606] Opened result file: tests/workloads/dispatch_inv/MI200/out/pmc_1/smc4124-25-mi210-3c48/2526476_results.db (UUID=0001fa7d-cc39-7c39-b011-545e05affbe6)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:32.204131 126715691540288 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008136 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:32.205235 126715691540288 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001088 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:32.207212 126715691540288 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001962 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:32.217404 126715691540288 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008209 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:32.561531 126715691540288 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.344112 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:32.563617 126715691540288 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002061 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:32.563635 126715691540288 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:32.572412 126715691540288 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008770 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:32.572427 126715691540288 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:32.572434 126715691540288 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:32.572440 126715691540288 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:32.572586 126715691540288 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000112 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:32.572823 126715691540288 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.454303 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:32.575859 126715691540288 simple_timer.cpp:55] [rocprofv3] output generation ::     0.478586 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:32.575967 126715691540288 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.480482 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/dispatch_inv/MI200/out/pmc_1/2526476_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 11/13][Approximate profiling time left: 4 seconds]...
[profiling] Current input file: tests/workloads/dispatch_inv/MI200/perfmon/pmc_perf_SQ_INST_LEVEL_SMEM_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:09:34.118441 136295431835456 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.191920 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:09:34.119068 136295431835456 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:34.314275 136295431835456 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:09:34.401348 136295431835456 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.282281 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:34.423728 136295431835456 generateRocpd.cpp:583] writing SQL database for process 2526484 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:09:34.424490 136295431835456 generateRocpd.cpp:606] Opened result file: tests/workloads/dispatch_inv/MI200/out/pmc_1/smc4124-25-mi210-3c48/2526484_results.db (UUID=0001fa7d-d543-7543-a97e-eca1538b29a1)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:34.509083 136295431835456 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008068 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:34.510324 136295431835456 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001225 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:34.512300 136295431835456 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001961 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:34.522641 136295431835456 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008318 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:34.853276 136295431835456 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.330617 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:34.855598 136295431835456 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002301 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:34.855618 136295431835456 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:34.864930 136295431835456 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009297 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:34.864945 136295431835456 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:34.864952 136295431835456 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:34.864959 136295431835456 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:34.865132 136295431835456 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000136 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:34.865384 136295431835456 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.441656 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:34.868817 136295431835456 simple_timer.cpp:55] [rocprofv3] output generation ::     0.465934 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:34.868935 136295431835456 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.467553 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/dispatch_inv/MI200/out/pmc_1/2526484_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 12/13][Approximate profiling time left: 2 seconds]...
[profiling] Current input file: tests/workloads/dispatch_inv/MI200/perfmon/pmc_perf_SQ_INST_LEVEL_VMEM_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:09:36.429389 130832552501056 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.199720 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:09:36.430016 130832552501056 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:36.623809 130832552501056 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:09:36.717698 130832552501056 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.287683 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:36.740099 130832552501056 generateRocpd.cpp:583] writing SQL database for process 2526499 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:09:36.740896 130832552501056 generateRocpd.cpp:606] Opened result file: tests/workloads/dispatch_inv/MI200/out/pmc_1/smc4124-25-mi210-3c48/2526499_results.db (UUID=0001fa7d-de42-7e42-ae72-224f26a738ec)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:36.825161 130832552501056 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008196 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:36.826300 130832552501056 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001119 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:36.828262 130832552501056 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001947 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:36.838490 130832552501056 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008215 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:37.358144 130832552501056 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.519639 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:37.360266 130832552501056 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002098 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:37.360284 130832552501056 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:37.370414 130832552501056 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.010123 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:37.370430 130832552501056 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:37.370436 130832552501056 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:37.370443 130832552501056 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:37.370562 130832552501056 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000109 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:37.370775 130832552501056 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.630676 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:37.373904 130832552501056 simple_timer.cpp:55] [rocprofv3] output generation ::     0.654789 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:37.374042 130832552501056 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.656296 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/dispatch_inv/MI200/out/pmc_1/2526499_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 13/13][Approximate profiling time left: 0 seconds]...
[profiling] Current input file: tests/workloads/dispatch_inv/MI200/perfmon/pmc_perf_SQ_LEVEL_WAVES_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:09:38.913870 139655746510656 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.191135 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:09:38.914501 139655746510656 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:39.108748 139655746510656 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:09:39.193438 139655746510656 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.278937 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:39.215547 139655746510656 generateRocpd.cpp:583] writing SQL database for process 2526507 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:09:39.216345 139655746510656 generateRocpd.cpp:606] Opened result file: tests/workloads/dispatch_inv/MI200/out/pmc_1/smc4124-25-mi210-3c48/2526507_results.db (UUID=0001fa7d-e7ff-77ff-b707-8d944ca1e291)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:39.299304 139655746510656 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008024 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:39.300467 139655746510656 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001146 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:39.302146 139655746510656 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001665 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:39.312495 139655746510656 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008297 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:39.631480 139655746510656 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.318970 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:39.633782 139655746510656 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002286 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:39.633800 139655746510656 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:39.643025 139655746510656 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009218 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:39.643047 139655746510656 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:39.643053 139655746510656 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:39.643059 139655746510656 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:39.643185 139655746510656 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000093 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:39.643364 139655746510656 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.427817 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:39.646367 139655746510656 simple_timer.cpp:55] [rocprofv3] output generation ::     0.451624 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:09:39.646462 139655746510656 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.452977 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/dispatch_inv/MI200/out/pmc_1/2526507_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
PC sampling data collection skipped as block 21 is not specified.
[roofline] Checking for roofline.csv in tests/workloads/dispatch_inv/MI200
[roofline] Benchmark execution failed: 'L1'. Skipping roofline.
