alias: cu_ins, block id: 10
alias: cu_pipe, block id: 11
alias: tatd, block id: 15
Rocprofiler-Compute version: 3.7.0
Profiler choice: rocprofiler-sdk
Output directory: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_SQ_TA/MI200
Target: MI210
Command: ./tests/vcopy -n 1048576 -b 256 -i 3
Kernel Selection: None
Dispatch Selection: None
Filtered sections: ['cu_ins', 'cu_pipe', 'tatd']

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Collecting Performance Counters
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Generating native tool project using command: cmake -S /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib -B /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build
-- Checking for module 'libdw'
--   Package 'libdw', required by 'virtual:world', not found
-- Could NOT find libdw (missing: libdw_LIBRARY libdw_INCLUDE_DIR)
-- {fmt} version: 12.1.0
-- Build type:
-- Configuring done (0.1s)
-- Generating done (0.0s)
-- Build files have been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build
Building native tool using command: cmake --build /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build --parallel
[  0%] Built target gsl_assert
[ 33%] Built target fmt
[100%] Built target rocprofiler-compute-tool
Searching /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src by lib/_build/lib/librocprofiler-compute-tool.so for native collector
Using native collector: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build/lib/librocprofiler-compute-tool.so
Using native counter collection tool: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build/lib/librocprofiler-compute-tool.so
[profiling] Iteration multiplexing: Disabled
[Run 1/8][Approximate profiling time left: pending first measurement...]
[profiling] Current input file: tests/workloads/ipblocks_SQ_TA/MI200/perfmon/pmc_perf_0.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:50:47.813127 136825358786368 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.184645 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:50:47.813739 136825358786368 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:48.005300 136825358786368 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:50:48.097022 136825358786368 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.283283 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:48.119024 136825358786368 generateRocpd.cpp:583] writing SQL database for process 2521403 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:50:48.119798 136825358786368 generateRocpd.cpp:606] Opened result file: tests/workloads/ipblocks_SQ_TA/MI200/out/pmc_1/smc4124-25-mi210-3c48/2521403_results.db (UUID=0001fa6c-a5a9-75a9-9764-fcfc55f5e162)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:48.203463 136825358786368 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007817 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:48.204634 136825358786368 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001153 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:48.206234 136825358786368 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001586 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:48.216583 136825358786368 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008390 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:48.302950 136825358786368 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.086352 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:48.305214 136825358786368 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002249 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:48.305232 136825358786368 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:48.313944 136825358786368 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008705 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:48.313959 136825358786368 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:48.313965 136825358786368 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:48.313972 136825358786368 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:48.314082 136825358786368 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000102 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:48.314317 136825358786368 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.195293 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:48.317264 136825358786368 simple_timer.cpp:55] [rocprofv3] output generation ::     0.218769 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:48.317319 136825358786368 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.220234 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/ipblocks_SQ_TA/MI200/out/pmc_1/2521403_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 2/8][Approximate profiling time left: 12 seconds]...
[profiling] Current input file: tests/workloads/ipblocks_SQ_TA/MI200/perfmon/pmc_perf_1.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:50:49.807197 136173960896320 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.182966 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:50:49.807804 136173960896320 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:49.998949 136173960896320 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:50:50.096661 136173960896320 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.288857 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:50.119132 136173960896320 generateRocpd.cpp:583] writing SQL database for process 2521414 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:50:50.119921 136173960896320 generateRocpd.cpp:606] Opened result file: tests/workloads/ipblocks_SQ_TA/MI200/out/pmc_1/smc4124-25-mi210-3c48/2521414_results.db (UUID=0001fa6c-ad75-7d75-b33d-f266f27e9b32)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:50.201840 136173960896320 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007746 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:50.202986 136173960896320 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001129 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:50.204546 136173960896320 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001545 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:50.214701 136173960896320 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008237 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:50.292339 136173960896320 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.077622 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:50.294598 136173960896320 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002242 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:50.294616 136173960896320 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:50.303157 136173960896320 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008535 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:50.303171 136173960896320 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:50.303177 136173960896320 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:50.303184 136173960896320 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:50.303291 136173960896320 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000099 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:50.303492 136173960896320 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.184360 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:50.306395 136173960896320 simple_timer.cpp:55] [rocprofv3] output generation ::     0.208022 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:50.306461 136173960896320 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.209757 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/ipblocks_SQ_TA/MI200/out/pmc_1/2521414_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 3/8][Approximate profiling time left: 10 seconds]...
[profiling] Current input file: tests/workloads/ipblocks_SQ_TA/MI200/perfmon/pmc_perf_2.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:50:51.824958 135554362916672 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.186085 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:50:51.825582 135554362916672 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:52.019596 135554362916672 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:50:52.119235 135554362916672 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.293653 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:52.141807 135554362916672 generateRocpd.cpp:583] writing SQL database for process 2521426 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:50:52.142584 135554362916672 generateRocpd.cpp:606] Opened result file: tests/workloads/ipblocks_SQ_TA/MI200/out/pmc_1/smc4124-25-mi210-3c48/2521426_results.db (UUID=0001fa6c-b553-7553-9603-58dd725ba6a5)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:52.225777 135554362916672 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007912 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:52.226969 135554362916672 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001174 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:52.228568 135554362916672 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001585 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:52.239138 135554362916672 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008557 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:52.310766 135554362916672 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.071613 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:52.313073 135554362916672 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002292 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:52.313090 135554362916672 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:52.321784 135554362916672 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008686 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:52.321799 135554362916672 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:52.321805 135554362916672 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:52.321812 135554362916672 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:52.321919 135554362916672 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000099 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:52.322137 135554362916672 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.180330 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:52.325028 135554362916672 simple_timer.cpp:55] [rocprofv3] output generation ::     0.203950 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:52.325101 135554362916672 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.205829 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/ipblocks_SQ_TA/MI200/out/pmc_1/2521426_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 4/8][Approximate profiling time left: 8 seconds]...
[profiling] Current input file: tests/workloads/ipblocks_SQ_TA/MI200/perfmon/pmc_perf_3.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:50:53.804089 130316111454016 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.182697 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:50:53.804692 130316111454016 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:53.999629 130316111454016 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:50:54.086538 130316111454016 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.281846 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:54.109120 130316111454016 generateRocpd.cpp:583] writing SQL database for process 2521436 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:50:54.109924 130316111454016 generateRocpd.cpp:606] Opened result file: tests/workloads/ipblocks_SQ_TA/MI200/out/pmc_1/smc4124-25-mi210-3c48/2521436_results.db (UUID=0001fa6c-bd12-7d12-8a30-67dd9475b410)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:54.192577 130316111454016 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007791 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:54.193756 130316111454016 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001163 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:54.195346 130316111454016 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001575 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:54.205651 130316111454016 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008381 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:54.275696 130316111454016 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.070030 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:54.278358 130316111454016 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002646 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:54.278376 130316111454016 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:54.287096 130316111454016 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008712 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:54.287110 130316111454016 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:54.287117 130316111454016 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:54.287124 130316111454016 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:54.287237 130316111454016 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000104 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:54.287449 130316111454016 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.178329 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:54.290429 130316111454016 simple_timer.cpp:55] [rocprofv3] output generation ::     0.202114 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:54.290495 130316111454016 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.203910 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/ipblocks_SQ_TA/MI200/out/pmc_1/2521436_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 5/8][Approximate profiling time left: 5 seconds]...
[profiling] Current input file: tests/workloads/ipblocks_SQ_TA/MI200/perfmon/pmc_perf_4.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:50:55.790371 138138989313856 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.181833 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:50:55.790972 138138989313856 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:55.983901 138138989313856 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:50:56.066196 138138989313856 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.275224 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:56.088224 138138989313856 generateRocpd.cpp:583] writing SQL database for process 2521446 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:50:56.089015 138138989313856 generateRocpd.cpp:606] Opened result file: tests/workloads/ipblocks_SQ_TA/MI200/out/pmc_1/smc4124-25-mi210-3c48/2521446_results.db (UUID=0001fa6c-c4d5-74d5-bc69-b4b3537305ca)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:56.173391 138138989313856 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007773 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:56.174587 138138989313856 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001173 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:56.176262 138138989313856 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001660 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:56.187083 138138989313856 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008655 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:56.237994 138138989313856 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.050896 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:56.240182 138138989313856 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002172 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:56.240199 138138989313856 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:56.248444 138138989313856 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008237 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:56.248458 138138989313856 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:56.248464 138138989313856 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:56.248471 138138989313856 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:56.248573 138138989313856 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000094 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:56.248796 138138989313856 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.160573 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:56.251752 138138989313856 simple_timer.cpp:55] [rocprofv3] output generation ::     0.184142 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:56.251805 138138989313856 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.185563 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/ipblocks_SQ_TA/MI200/out/pmc_1/2521446_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 6/8][Approximate profiling time left: 3 seconds]...
[profiling] Current input file: tests/workloads/ipblocks_SQ_TA/MI200/perfmon/pmc_perf_5.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:50:57.747533 124463165951808 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.185792 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:50:57.748112 124463165951808 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:57.938537 124463165951808 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:50:58.021089 124463165951808 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.272976 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:58.043076 124463165951808 generateRocpd.cpp:583] writing SQL database for process 2521456 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:50:58.043853 124463165951808 generateRocpd.cpp:606] Opened result file: tests/workloads/ipblocks_SQ_TA/MI200/out/pmc_1/smc4124-25-mi210-3c48/2521456_results.db (UUID=0001fa6c-cc76-7c76-a5cf-0b495fd7f572)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:58.127672 124463165951808 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007771 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:58.128953 124463165951808 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001264 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:58.130553 124463165951808 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001585 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:58.141114 124463165951808 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008555 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:58.156834 124463165951808 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.015705 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:58.159000 124463165951808 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002150 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:58.159018 124463165951808 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:58.167528 124463165951808 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008503 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:58.167543 124463165951808 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:58.167549 124463165951808 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:58.167555 124463165951808 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:58.167668 124463165951808 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000105 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:58.167847 124463165951808 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.124772 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:58.170651 124463165951808 simple_timer.cpp:55] [rocprofv3] output generation ::     0.148149 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:58.170707 124463165951808 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.149577 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/ipblocks_SQ_TA/MI200/out/pmc_1/2521456_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 7/8][Approximate profiling time left: 1 second]...
[profiling] Current input file: tests/workloads/ipblocks_SQ_TA/MI200/perfmon/pmc_perf_SQ_INST_LEVEL_SMEM_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:50:59.638821 134485394681664 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.181728 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:50:59.639432 134485394681664 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:59.835837 134485394681664 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:50:59.918402 134485394681664 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.278970 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:50:59.941059 134485394681664 generateRocpd.cpp:583] writing SQL database for process 2521465 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:50:59.941832 134485394681664 generateRocpd.cpp:606] Opened result file: tests/workloads/ipblocks_SQ_TA/MI200/out/pmc_1/smc4124-25-mi210-3c48/2521465_results.db (UUID=0001fa6c-d3de-73de-9034-72acfd0aa20f)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:51:00.024308 134485394681664 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007768 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:51:00.025522 134485394681664 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001197 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:51:00.027104 134485394681664 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001568 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:51:00.037650 134485394681664 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008561 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:51:00.120641 134485394681664 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.082977 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:51:00.123007 134485394681664 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002350 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:51:00.123024 134485394681664 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:51:00.131576 134485394681664 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008538 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:51:00.131590 134485394681664 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:51:00.131596 134485394681664 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:51:00.131602 134485394681664 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:51:00.131704 134485394681664 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000094 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:51:00.131930 134485394681664 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.190871 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:51:00.134774 134485394681664 simple_timer.cpp:55] [rocprofv3] output generation ::     0.214602 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:51:00.134838 134485394681664 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.216395 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/ipblocks_SQ_TA/MI200/out/pmc_1/2521465_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 8/8][Approximate profiling time left: 0 seconds]...
[profiling] Current input file: tests/workloads/ipblocks_SQ_TA/MI200/perfmon/pmc_perf_SQ_INST_LEVEL_VMEM_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:51:01.642436 130242686201664 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.184261 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:51:01.643064 130242686201664 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:51:01.836407 130242686201664 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:51:01.917987 130242686201664 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.274923 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:51:01.940397 130242686201664 generateRocpd.cpp:583] writing SQL database for process 2521474 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:51:01.941214 130242686201664 generateRocpd.cpp:606] Opened result file: tests/workloads/ipblocks_SQ_TA/MI200/out/pmc_1/smc4124-25-mi210-3c48/2521474_results.db (UUID=0001fa6c-dbaf-7baf-999b-1b406b5fe685)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:51:02.024694 130242686201664 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007798 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:51:02.025977 130242686201664 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001266 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:51:02.027570 130242686201664 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001578 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:51:02.037960 130242686201664 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008377 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:51:02.115185 130242686201664 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.077210 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:51:02.117552 130242686201664 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002352 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:51:02.117570 130242686201664 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:51:02.126093 130242686201664 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008516 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:51:02.126108 130242686201664 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:51:02.126114 130242686201664 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:51:02.126121 130242686201664 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:51:02.126218 130242686201664 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000089 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:51:02.126446 130242686201664 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.186049 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:51:02.129361 130242686201664 simple_timer.cpp:55] [rocprofv3] output generation ::     0.209777 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:51:02.129417 130242686201664 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.211150 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/ipblocks_SQ_TA/MI200/out/pmc_1/2521474_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
PC sampling data collection skipped as block 21 is not specified.
[roofline] Skipping roofline
