alias: vl1d, block id: 16
Rocprofiler-Compute version: 3.7.0
Profiler choice: rocprofiler-sdk
Output directory: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TCP/MI200
Target: MI210
Command: ./tests/vcopy -n 1048576 -b 256 -i 3
Kernel Selection: None
Dispatch Selection: None
Filtered sections: ['vl1d']

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Collecting Performance Counters
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Generating native tool project using command: cmake -S /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib -B /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build
-- Checking for module 'libdw'
--   Package 'libdw', required by 'virtual:world', not found
-- Could NOT find libdw (missing: libdw_LIBRARY libdw_INCLUDE_DIR)
-- {fmt} version: 12.1.0
-- Build type:
-- Configuring done (0.1s)
-- Generating done (0.0s)
-- Build files have been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build
Building native tool using command: cmake --build /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build --parallel
[  0%] Built target gsl_assert
[ 33%] Built target fmt
[100%] Built target rocprofiler-compute-tool
Searching /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src by lib/_build/lib/librocprofiler-compute-tool.so for native collector
Using native collector: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build/lib/librocprofiler-compute-tool.so
Using native counter collection tool: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build/lib/librocprofiler-compute-tool.so
[profiling] Iteration multiplexing: Disabled
[Run 1/10][Approximate profiling time left: pending first measurement...]
[profiling] Current input file: tests/workloads/ipblocks_TCP/MI200/perfmon/pmc_perf_0.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 18:49:28.757178 125662634008384 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.183736 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 18:49:28.757773 125662634008384 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 18:49:28.952322 125662634008384 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 18:49:29.039419 125662634008384 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.281646 sec
   |-> [rocprofiler-sdk] W20260526 18:49:29.061645 125662634008384 generateRocpd.cpp:583] writing SQL database for process 2521074 on node 2976770398
   |-> [rocprofiler-sdk] E20260526 18:49:29.062450 125662634008384 generateRocpd.cpp:606] Opened result file: tests/workloads/ipblocks_TCP/MI200/out/pmc_1/smc4124-25-mi210-3c48/2521074_results.db (UUID=0001fa6b-70da-70da-bc13-29ca4c351a01)
   |-> [rocprofiler-sdk] W20260526 18:49:29.144617 125662634008384 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007787 sec
   |-> [rocprofiler-sdk] W20260526 18:49:29.145825 125662634008384 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001191 sec
   |-> [rocprofiler-sdk] W20260526 18:49:29.147528 125662634008384 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001688 sec
   |-> [rocprofiler-sdk] W20260526 18:49:29.157931 125662634008384 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008239 sec
   |-> [rocprofiler-sdk] W20260526 18:49:29.207707 125662634008384 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.049761 sec
   |-> [rocprofiler-sdk] W20260526 18:49:29.209970 125662634008384 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002241 sec
   |-> [rocprofiler-sdk] W20260526 18:49:29.209988 125662634008384 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] W20260526 18:49:29.218653 125662634008384 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008659 sec
   |-> [rocprofiler-sdk] W20260526 18:49:29.218669 125662634008384 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 18:49:29.218676 125662634008384 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 18:49:29.218682 125662634008384 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] W20260526 18:49:29.218791 125662634008384 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000101 sec
   |-> [rocprofiler-sdk] W20260526 18:49:29.219004 125662634008384 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.157360 sec
   |-> [rocprofiler-sdk] W20260526 18:49:29.221967 125662634008384 simple_timer.cpp:55] [rocprofv3] output generation ::     0.181039 sec
   |-> [rocprofiler-sdk] W20260526 18:49:29.222028 125662634008384 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.182568 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/ipblocks_TCP/MI200/out/pmc_1/2521074_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 2/10][Approximate profiling time left: 15 seconds]...
[profiling] Current input file: tests/workloads/ipblocks_TCP/MI200/perfmon/pmc_perf_1.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 18:49:30.715049 137372866662208 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.182832 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 18:49:30.715642 137372866662208 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000001 sec
   |-> [rocprofiler-sdk] W20260526 18:49:30.909395 137372866662208 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 18:49:30.993102 137372866662208 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.277461 sec
   |-> [rocprofiler-sdk] W20260526 18:49:31.015589 137372866662208 generateRocpd.cpp:583] writing SQL database for process 2521093 on node 2976770398
   |-> [rocprofiler-sdk] E20260526 18:49:31.016407 137372866662208 generateRocpd.cpp:606] Opened result file: tests/workloads/ipblocks_TCP/MI200/out/pmc_1/smc4124-25-mi210-3c48/2521093_results.db (UUID=0001fa6b-7881-7881-9aec-f155c7ebdaf0)
   |-> [rocprofiler-sdk] W20260526 18:49:31.100300 137372866662208 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007789 sec
   |-> [rocprofiler-sdk] W20260526 18:49:31.101520 137372866662208 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001204 sec
   |-> [rocprofiler-sdk] W20260526 18:49:31.103237 137372866662208 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001702 sec
   |-> [rocprofiler-sdk] W20260526 18:49:31.113732 137372866662208 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008298 sec
   |-> [rocprofiler-sdk] W20260526 18:49:31.142950 137372866662208 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.029203 sec
   |-> [rocprofiler-sdk] W20260526 18:49:31.145353 137372866662208 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002381 sec
   |-> [rocprofiler-sdk] W20260526 18:49:31.145370 137372866662208 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] W20260526 18:49:31.153741 137372866662208 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008365 sec
   |-> [rocprofiler-sdk] W20260526 18:49:31.153755 137372866662208 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 18:49:31.153762 137372866662208 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 18:49:31.153768 137372866662208 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] W20260526 18:49:31.153866 137372866662208 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000091 sec
   |-> [rocprofiler-sdk] W20260526 18:49:31.154075 137372866662208 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.138487 sec
   |-> [rocprofiler-sdk] W20260526 18:49:31.156982 137372866662208 simple_timer.cpp:55] [rocprofv3] output generation ::     0.162187 sec
   |-> [rocprofiler-sdk] W20260526 18:49:31.157048 137372866662208 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.163899 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/ipblocks_TCP/MI200/out/pmc_1/2521093_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 3/10][Approximate profiling time left: 13 seconds]...
[profiling] Current input file: tests/workloads/ipblocks_TCP/MI200/perfmon/pmc_perf_2.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 18:49:32.634524 136084001165120 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.182596 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 18:49:32.635116 136084001165120 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 18:49:32.829513 136084001165120 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 18:49:32.917317 136084001165120 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.282202 sec
   |-> [rocprofiler-sdk] W20260526 18:49:32.939579 136084001165120 generateRocpd.cpp:583] writing SQL database for process 2521103 on node 2976770398
   |-> [rocprofiler-sdk] E20260526 18:49:32.940404 136084001165120 generateRocpd.cpp:606] Opened result file: tests/workloads/ipblocks_TCP/MI200/out/pmc_1/smc4124-25-mi210-3c48/2521103_results.db (UUID=0001fa6b-8000-7000-b71e-6d61a50622c3)
   |-> [rocprofiler-sdk] W20260526 18:49:33.022569 136084001165120 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007786 sec
   |-> [rocprofiler-sdk] W20260526 18:49:33.023810 136084001165120 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001224 sec
   |-> [rocprofiler-sdk] W20260526 18:49:33.025438 136084001165120 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001612 sec
   |-> [rocprofiler-sdk] W20260526 18:49:33.035604 136084001165120 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008160 sec
   |-> [rocprofiler-sdk] W20260526 18:49:33.064877 136084001165120 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.029257 sec
   |-> [rocprofiler-sdk] W20260526 18:49:33.067279 136084001165120 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002385 sec
   |-> [rocprofiler-sdk] W20260526 18:49:33.067297 136084001165120 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000003 sec
   |-> [rocprofiler-sdk] W20260526 18:49:33.075871 136084001165120 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008559 sec
   |-> [rocprofiler-sdk] W20260526 18:49:33.075885 136084001165120 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 18:49:33.075892 136084001165120 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 18:49:33.075898 136084001165120 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] W20260526 18:49:33.076011 136084001165120 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000105 sec
   |-> [rocprofiler-sdk] W20260526 18:49:33.076233 136084001165120 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.136655 sec
   |-> [rocprofiler-sdk] W20260526 18:49:33.079191 136084001165120 simple_timer.cpp:55] [rocprofv3] output generation ::     0.160300 sec
   |-> [rocprofiler-sdk] W20260526 18:49:33.079249 136084001165120 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.161872 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/ipblocks_TCP/MI200/out/pmc_1/2521103_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 4/10][Approximate profiling time left: 11 seconds]...
[profiling] Current input file: tests/workloads/ipblocks_TCP/MI200/perfmon/pmc_perf_3.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 18:49:34.565264 124178396364608 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.185046 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 18:49:34.565884 124178396364608 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 18:49:34.759261 124178396364608 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 18:49:34.840473 124178396364608 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.274589 sec
   |-> [rocprofiler-sdk] W20260526 18:49:34.862662 124178396364608 generateRocpd.cpp:583] writing SQL database for process 2521114 on node 2976770398
   |-> [rocprofiler-sdk] E20260526 18:49:34.863478 124178396364608 generateRocpd.cpp:606] Opened result file: tests/workloads/ipblocks_TCP/MI200/out/pmc_1/smc4124-25-mi210-3c48/2521114_results.db (UUID=0001fa6b-8789-7789-9459-26033b78def4)
   |-> [rocprofiler-sdk] W20260526 18:49:34.946422 124178396364608 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007898 sec
   |-> [rocprofiler-sdk] W20260526 18:49:34.947639 124178396364608 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001197 sec
   |-> [rocprofiler-sdk] W20260526 18:49:34.949340 124178396364608 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001686 sec
   |-> [rocprofiler-sdk] W20260526 18:49:34.959931 124178396364608 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008417 sec
   |-> [rocprofiler-sdk] W20260526 18:49:34.989565 124178396364608 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.029620 sec
   |-> [rocprofiler-sdk] W20260526 18:49:34.991798 124178396364608 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002218 sec
   |-> [rocprofiler-sdk] W20260526 18:49:34.991816 124178396364608 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] W20260526 18:49:35.000520 124178396364608 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008697 sec
   |-> [rocprofiler-sdk] W20260526 18:49:35.000535 124178396364608 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000001 sec
   |-> [rocprofiler-sdk] W20260526 18:49:35.000541 124178396364608 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 18:49:35.000547 124178396364608 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] W20260526 18:49:35.000651 124178396364608 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000096 sec
   |-> [rocprofiler-sdk] W20260526 18:49:35.000848 124178396364608 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.138187 sec
   |-> [rocprofiler-sdk] W20260526 18:49:35.003825 124178396364608 simple_timer.cpp:55] [rocprofv3] output generation ::     0.161888 sec
   |-> [rocprofiler-sdk] W20260526 18:49:35.003882 124178396364608 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.163360 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/ipblocks_TCP/MI200/out/pmc_1/2521114_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 5/10][Approximate profiling time left: 9 seconds]...
[profiling] Current input file: tests/workloads/ipblocks_TCP/MI200/perfmon/pmc_perf_4.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 18:49:36.494468 129804430171968 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.183701 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 18:49:36.495062 129804430171968 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000001 sec
   |-> [rocprofiler-sdk] W20260526 18:49:36.690821 129804430171968 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 18:49:36.773636 129804430171968 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.278574 sec
   |-> [rocprofiler-sdk] W20260526 18:49:36.795883 129804430171968 generateRocpd.cpp:583] writing SQL database for process 2521124 on node 2976770398
   |-> [rocprofiler-sdk] E20260526 18:49:36.796681 129804430171968 generateRocpd.cpp:606] Opened result file: tests/workloads/ipblocks_TCP/MI200/out/pmc_1/smc4124-25-mi210-3c48/2521124_results.db (UUID=0001fa6b-8f13-7f13-9f98-52caa81fa87b)
   |-> [rocprofiler-sdk] W20260526 18:49:36.879663 129804430171968 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007891 sec
   |-> [rocprofiler-sdk] W20260526 18:49:36.880841 129804430171968 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001161 sec
   |-> [rocprofiler-sdk] W20260526 18:49:36.882437 129804430171968 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001582 sec
   |-> [rocprofiler-sdk] W20260526 18:49:36.892650 129804430171968 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008193 sec
   |-> [rocprofiler-sdk] W20260526 18:49:36.922022 129804430171968 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.029358 sec
   |-> [rocprofiler-sdk] W20260526 18:49:36.924450 129804430171968 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002402 sec
   |-> [rocprofiler-sdk] W20260526 18:49:36.924468 129804430171968 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] W20260526 18:49:36.933023 129804430171968 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008548 sec
   |-> [rocprofiler-sdk] W20260526 18:49:36.933045 129804430171968 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 18:49:36.933051 129804430171968 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 18:49:36.933057 129804430171968 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] W20260526 18:49:36.933158 129804430171968 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000092 sec
   |-> [rocprofiler-sdk] W20260526 18:49:36.933345 129804430171968 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.137462 sec
   |-> [rocprofiler-sdk] W20260526 18:49:36.936174 129804430171968 simple_timer.cpp:55] [rocprofv3] output generation ::     0.161046 sec
   |-> [rocprofiler-sdk] W20260526 18:49:36.936226 129804430171968 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.162549 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/ipblocks_TCP/MI200/out/pmc_1/2521124_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 6/10][Approximate profiling time left: 7 seconds]...
[profiling] Current input file: tests/workloads/ipblocks_TCP/MI200/perfmon/pmc_perf_5.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 18:49:38.411229 129863076523840 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.181402 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 18:49:38.411794 129863076523840 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 18:49:38.611532 129863076523840 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 18:49:38.697764 129863076523840 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.285970 sec
   |-> [rocprofiler-sdk] W20260526 18:49:38.719723 129863076523840 generateRocpd.cpp:583] writing SQL database for process 2521133 on node 2976770398
   |-> [rocprofiler-sdk] E20260526 18:49:38.720534 129863076523840 generateRocpd.cpp:606] Opened result file: tests/workloads/ipblocks_TCP/MI200/out/pmc_1/smc4124-25-mi210-3c48/2521133_results.db (UUID=0001fa6b-9692-7692-bf61-f0f1b6881077)
   |-> [rocprofiler-sdk] W20260526 18:49:38.802296 129863076523840 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007778 sec
   |-> [rocprofiler-sdk] W20260526 18:49:38.803483 129863076523840 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001171 sec
   |-> [rocprofiler-sdk] W20260526 18:49:38.805078 129863076523840 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001581 sec
   |-> [rocprofiler-sdk] W20260526 18:49:38.815244 129863076523840 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008168 sec
   |-> [rocprofiler-sdk] W20260526 18:49:38.844843 129863076523840 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.029584 sec
   |-> [rocprofiler-sdk] W20260526 18:49:38.847003 129863076523840 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002144 sec
   |-> [rocprofiler-sdk] W20260526 18:49:38.847020 129863076523840 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] W20260526 18:49:38.855869 129863076523840 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008842 sec
   |-> [rocprofiler-sdk] W20260526 18:49:38.855884 129863076523840 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 18:49:38.855890 129863076523840 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 18:49:38.855897 129863076523840 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] W20260526 18:49:38.855998 129863076523840 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000094 sec
   |-> [rocprofiler-sdk] W20260526 18:49:38.856211 129863076523840 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.136489 sec
   |-> [rocprofiler-sdk] W20260526 18:49:38.859273 129863076523840 simple_timer.cpp:55] [rocprofv3] output generation ::     0.160170 sec
   |-> [rocprofiler-sdk] W20260526 18:49:38.859329 129863076523840 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.161519 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/ipblocks_TCP/MI200/out/pmc_1/2521133_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 7/10][Approximate profiling time left: 5 seconds]...
[profiling] Current input file: tests/workloads/ipblocks_TCP/MI200/perfmon/pmc_perf_6.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 18:49:40.345098 140264277761856 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.182648 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 18:49:40.345675 140264277761856 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 18:49:40.539018 140264277761856 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 18:49:40.622957 140264277761856 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.277283 sec
   |-> [rocprofiler-sdk] W20260526 18:49:40.645606 140264277761856 generateRocpd.cpp:583] writing SQL database for process 2521142 on node 2976770398
   |-> [rocprofiler-sdk] E20260526 18:49:40.646383 140264277761856 generateRocpd.cpp:606] Opened result file: tests/workloads/ipblocks_TCP/MI200/out/pmc_1/smc4124-25-mi210-3c48/2521142_results.db (UUID=0001fa6b-9e1f-7e1f-8599-cc8253af9826)
   |-> [rocprofiler-sdk] W20260526 18:49:40.728615 140264277761856 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007862 sec
   |-> [rocprofiler-sdk] W20260526 18:49:40.729821 140264277761856 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001190 sec
   |-> [rocprofiler-sdk] W20260526 18:49:40.731550 140264277761856 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001715 sec
   |-> [rocprofiler-sdk] W20260526 18:49:40.741805 140264277761856 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008268 sec
   |-> [rocprofiler-sdk] W20260526 18:49:40.771190 140264277761856 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.029370 sec
   |-> [rocprofiler-sdk] W20260526 18:49:40.773419 140264277761856 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002214 sec
   |-> [rocprofiler-sdk] W20260526 18:49:40.773436 140264277761856 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] W20260526 18:49:40.782171 140264277761856 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008728 sec
   |-> [rocprofiler-sdk] W20260526 18:49:40.782186 140264277761856 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 18:49:40.782193 140264277761856 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 18:49:40.782199 140264277761856 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] W20260526 18:49:40.782301 140264277761856 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000094 sec
   |-> [rocprofiler-sdk] W20260526 18:49:40.782520 140264277761856 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.136914 sec
   |-> [rocprofiler-sdk] W20260526 18:49:40.785313 140264277761856 simple_timer.cpp:55] [rocprofv3] output generation ::     0.160470 sec
   |-> [rocprofiler-sdk] W20260526 18:49:40.785362 140264277761856 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.162367 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/ipblocks_TCP/MI200/out/pmc_1/2521142_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 8/10][Approximate profiling time left: 3 seconds]...
[profiling] Current input file: tests/workloads/ipblocks_TCP/MI200/perfmon/pmc_perf_7.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 18:49:42.275766 136006977208128 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.183799 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 18:49:42.276351 136006977208128 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000001 sec
   |-> [rocprofiler-sdk] W20260526 18:49:42.472464 136006977208128 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 18:49:42.555556 136006977208128 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.279206 sec
   |-> [rocprofiler-sdk] W20260526 18:49:42.577608 136006977208128 generateRocpd.cpp:583] writing SQL database for process 2521151 on node 2976770398
   |-> [rocprofiler-sdk] E20260526 18:49:42.578379 136006977208128 generateRocpd.cpp:606] Opened result file: tests/workloads/ipblocks_TCP/MI200/out/pmc_1/smc4124-25-mi210-3c48/2521151_results.db (UUID=0001fa6b-a5a8-75a8-8bf0-0c8e530af0d6)
   |-> [rocprofiler-sdk] W20260526 18:49:42.661171 136006977208128 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007722 sec
   |-> [rocprofiler-sdk] W20260526 18:49:42.662367 136006977208128 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001179 sec
   |-> [rocprofiler-sdk] W20260526 18:49:42.663955 136006977208128 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001572 sec
   |-> [rocprofiler-sdk] W20260526 18:49:42.674188 136006977208128 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008192 sec
   |-> [rocprofiler-sdk] W20260526 18:49:42.703690 136006977208128 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.029487 sec
   |-> [rocprofiler-sdk] W20260526 18:49:42.705925 136006977208128 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002220 sec
   |-> [rocprofiler-sdk] W20260526 18:49:42.705943 136006977208128 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] W20260526 18:49:42.714526 136006977208128 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008576 sec
   |-> [rocprofiler-sdk] W20260526 18:49:42.714540 136006977208128 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 18:49:42.714546 136006977208128 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 18:49:42.714553 136006977208128 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] W20260526 18:49:42.714653 136006977208128 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000093 sec
   |-> [rocprofiler-sdk] W20260526 18:49:42.714876 136006977208128 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.137268 sec
   |-> [rocprofiler-sdk] W20260526 18:49:42.717771 136006977208128 simple_timer.cpp:55] [rocprofv3] output generation ::     0.160800 sec
   |-> [rocprofiler-sdk] W20260526 18:49:42.717819 136006977208128 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.162224 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/ipblocks_TCP/MI200/out/pmc_1/2521151_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 9/10][Approximate profiling time left: 1 second]...
[profiling] Current input file: tests/workloads/ipblocks_TCP/MI200/perfmon/pmc_perf_8.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 18:49:44.206177 135210234871616 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.182661 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 18:49:44.206785 135210234871616 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 18:49:44.399894 135210234871616 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 18:49:44.489460 135210234871616 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.282675 sec
   |-> [rocprofiler-sdk] W20260526 18:49:44.511474 135210234871616 generateRocpd.cpp:583] writing SQL database for process 2521161 on node 2976770398
   |-> [rocprofiler-sdk] E20260526 18:49:44.512274 135210234871616 generateRocpd.cpp:606] Opened result file: tests/workloads/ipblocks_TCP/MI200/out/pmc_1/smc4124-25-mi210-3c48/2521161_results.db (UUID=0001fa6b-ad34-7d34-b046-a27b6e56a6e0)
   |-> [rocprofiler-sdk] W20260526 18:49:44.593851 135210234871616 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007666 sec
   |-> [rocprofiler-sdk] W20260526 18:49:44.595045 135210234871616 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001179 sec
   |-> [rocprofiler-sdk] W20260526 18:49:44.596728 135210234871616 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001668 sec
   |-> [rocprofiler-sdk] W20260526 18:49:44.606982 135210234871616 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008272 sec
   |-> [rocprofiler-sdk] W20260526 18:49:44.636259 135210234871616 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.029262 sec
   |-> [rocprofiler-sdk] W20260526 18:49:44.638616 135210234871616 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002341 sec
   |-> [rocprofiler-sdk] W20260526 18:49:44.638633 135210234871616 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] W20260526 18:49:44.647122 135210234871616 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008481 sec
   |-> [rocprofiler-sdk] W20260526 18:49:44.647137 135210234871616 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 18:49:44.647143 135210234871616 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 18:49:44.647149 135210234871616 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] W20260526 18:49:44.647252 135210234871616 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000095 sec
   |-> [rocprofiler-sdk] W20260526 18:49:44.647458 135210234871616 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.135985 sec
   |-> [rocprofiler-sdk] W20260526 18:49:44.650357 135210234871616 simple_timer.cpp:55] [rocprofv3] output generation ::     0.159387 sec
   |-> [rocprofiler-sdk] W20260526 18:49:44.650415 135210234871616 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.160911 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/ipblocks_TCP/MI200/out/pmc_1/2521161_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 10/10][Approximate profiling time left: 0 seconds]...
[profiling] Current input file: tests/workloads/ipblocks_TCP/MI200/perfmon/pmc_perf_9.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 18:49:46.123266 133340620222272 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.184632 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 18:49:46.123833 133340620222272 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 18:49:46.316499 133340620222272 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 18:49:46.399019 133340620222272 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.275186 sec
   |-> [rocprofiler-sdk] W20260526 18:49:46.420892 133340620222272 generateRocpd.cpp:583] writing SQL database for process 2521169 on node 2976770398
   |-> [rocprofiler-sdk] E20260526 18:49:46.421702 133340620222272 generateRocpd.cpp:606] Opened result file: tests/workloads/ipblocks_TCP/MI200/out/pmc_1/smc4124-25-mi210-3c48/2521169_results.db (UUID=0001fa6b-b4af-74af-b60e-981852e77453)
   |-> [rocprofiler-sdk] W20260526 18:49:46.503131 133340620222272 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008029 sec
   |-> [rocprofiler-sdk] W20260526 18:49:46.504313 133340620222272 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001164 sec
   |-> [rocprofiler-sdk] W20260526 18:49:46.505859 133340620222272 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001531 sec
   |-> [rocprofiler-sdk] W20260526 18:49:46.516182 133340620222272 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008380 sec
   |-> [rocprofiler-sdk] W20260526 18:49:46.531786 133340620222272 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.015589 sec
   |-> [rocprofiler-sdk] W20260526 18:49:46.533966 133340620222272 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002165 sec
   |-> [rocprofiler-sdk] W20260526 18:49:46.533983 133340620222272 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] W20260526 18:49:46.542732 133340620222272 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008742 sec
   |-> [rocprofiler-sdk] W20260526 18:49:46.542747 133340620222272 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 18:49:46.542754 133340620222272 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 18:49:46.542760 133340620222272 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] W20260526 18:49:46.542869 133340620222272 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000097 sec
   |-> [rocprofiler-sdk] W20260526 18:49:46.543073 133340620222272 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.122181 sec
   |-> [rocprofiler-sdk] W20260526 18:49:46.546021 133340620222272 simple_timer.cpp:55] [rocprofv3] output generation ::     0.145255 sec
   |-> [rocprofiler-sdk] W20260526 18:49:46.546086 133340620222272 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.146997 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/ipblocks_TCP/MI200/out/pmc_1/2521169_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
PC sampling data collection skipped as block 21 is not specified.
[roofline] Skipping roofline
