alias: cu_ins, block id: 10
alias: cu_pipe, block id: 11
alias: ins_cache, block id: 13
alias: sl1d, block id: 14
alias: vl1d, block id: 16
alias: cpc, block id: 5
Rocprofiler-Compute version: 3.7.0
Profiler choice: rocprofiler-sdk
Output directory: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_SQ_SQC_TCP_CPC/MI200
Target: MI210
Command: ./tests/vcopy -n 1048576 -b 256 -i 3
Kernel Selection: None
Dispatch Selection: None
Filtered sections: ['cu_ins', 'cu_pipe', 'ins_cache', 'sl1d', 'vl1d', 'cpc']

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Collecting Performance Counters
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Generating native tool project using command: cmake -S /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib -B /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build
-- Checking for module 'libdw'
--   Package 'libdw', required by 'virtual:world', not found
-- Could NOT find libdw (missing: libdw_LIBRARY libdw_INCLUDE_DIR)
-- {fmt} version: 12.1.0
-- Build type:
-- Configuring done (0.1s)
-- Generating done (0.0s)
-- Build files have been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build
Building native tool using command: cmake --build /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build --parallel
[  0%] Built target gsl_assert
[ 33%] Built target fmt
[100%] Built target rocprofiler-compute-tool
Searching /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src by lib/_build/lib/librocprofiler-compute-tool.so for native collector
Using native collector: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build/lib/librocprofiler-compute-tool.so
Using native counter collection tool: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build/lib/librocprofiler-compute-tool.so
[profiling] Iteration multiplexing: Disabled
[Run 1/10][Approximate profiling time left: pending first measurement...]
[profiling] Current input file: tests/workloads/ipblocks_SQ_SQC_TCP_CPC/MI200/perfmon/pmc_perf_0.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:52:33.357784 136693458493248 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.186061 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:52:33.358430 136693458493248 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:33.552799 136693458493248 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:52:33.638629 136693458493248 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.280200 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:33.660825 136693458493248 generateRocpd.cpp:583] writing SQL database for process 2521849 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:52:33.661621 136693458493248 generateRocpd.cpp:606] Opened result file: tests/workloads/ipblocks_SQ_SQC_TCP_CPC/MI200/out/pmc_1/smc4124-25-mi210-3c48/2521849_results.db (UUID=0001fa6e-41f0-71f0-96db-830c85afd79f)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:33.744956 136693458493248 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007873 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:33.746174 136693458493248 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001203 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:33.747875 136693458493248 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001685 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:33.758424 136693458493248 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008443 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:33.875621 136693458493248 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.117181 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:33.877953 136693458493248 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002313 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:33.877970 136693458493248 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:33.886937 136693458493248 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008960 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:33.886950 136693458493248 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:33.886957 136693458493248 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:33.886963 136693458493248 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:33.887080 136693458493248 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000107 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:33.887287 136693458493248 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.226462 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:33.890199 136693458493248 simple_timer.cpp:55] [rocprofv3] output generation ::     0.250099 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:33.890270 136693458493248 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.251599 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/ipblocks_SQ_SQC_TCP_CPC/MI200/out/pmc_1/2521849_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 2/10][Approximate profiling time left: 16 seconds]...
[profiling] Current input file: tests/workloads/ipblocks_SQ_SQC_TCP_CPC/MI200/perfmon/pmc_perf_1.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:52:35.385434 130184374656832 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.180839 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:52:35.386047 130184374656832 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:35.578223 130184374656832 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:52:35.663684 130184374656832 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.277637 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:35.685771 130184374656832 generateRocpd.cpp:583] writing SQL database for process 2521860 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:52:35.686571 130184374656832 generateRocpd.cpp:606] Opened result file: tests/workloads/ipblocks_SQ_SQC_TCP_CPC/MI200/out/pmc_1/smc4124-25-mi210-3c48/2521860_results.db (UUID=0001fa6e-49e1-79e1-bcfc-1f321694d3d0)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:35.767129 130184374656832 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007868 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:35.768320 130184374656832 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001175 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:35.769972 130184374656832 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001637 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:35.780509 130184374656832 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008362 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:35.882384 130184374656832 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.101860 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:35.884714 130184374656832 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002301 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:35.884732 130184374656832 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:35.893732 130184374656832 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008993 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:35.893746 130184374656832 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:35.893752 130184374656832 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:35.893759 130184374656832 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:35.893861 130184374656832 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000094 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:35.894068 130184374656832 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.208297 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:35.896949 130184374656832 simple_timer.cpp:55] [rocprofv3] output generation ::     0.231906 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:35.897020 130184374656832 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.233295 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/ipblocks_SQ_SQC_TCP_CPC/MI200/out/pmc_1/2521860_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 3/10][Approximate profiling time left: 14 seconds]...
[profiling] Current input file: tests/workloads/ipblocks_SQ_SQC_TCP_CPC/MI200/perfmon/pmc_perf_2.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:52:37.386514 135425646026560 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.181736 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:52:37.387093 135425646026560 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:37.581930 135425646026560 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:52:37.665981 135425646026560 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.278888 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:37.688502 135425646026560 generateRocpd.cpp:583] writing SQL database for process 2521871 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:52:37.689305 135425646026560 generateRocpd.cpp:606] Opened result file: tests/workloads/ipblocks_SQ_SQC_TCP_CPC/MI200/out/pmc_1/smc4124-25-mi210-3c48/2521871_results.db (UUID=0001fa6e-51b1-71b1-83fe-a3210f68db83)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:37.772253 135425646026560 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007846 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:37.773470 135425646026560 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001201 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:37.775154 135425646026560 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001670 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:37.785977 135425646026560 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008653 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:37.868873 135425646026560 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.082881 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:37.871258 135425646026560 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002369 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:37.871275 135425646026560 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:37.880551 135425646026560 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009269 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:37.880565 135425646026560 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:37.880571 135425646026560 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:37.880578 135425646026560 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:37.880681 135425646026560 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000096 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:37.880911 135425646026560 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.192409 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:37.883771 135425646026560 simple_timer.cpp:55] [rocprofv3] output generation ::     0.215907 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:37.883835 135425646026560 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.217779 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/ipblocks_SQ_SQC_TCP_CPC/MI200/out/pmc_1/2521871_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 4/10][Approximate profiling time left: 12 seconds]...
[profiling] Current input file: tests/workloads/ipblocks_SQ_SQC_TCP_CPC/MI200/perfmon/pmc_perf_3.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:52:39.347524 140182181846848 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.182055 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:52:39.348100 140182181846848 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:39.541012 140182181846848 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:52:39.625423 140182181846848 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.277324 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:39.648053 140182181846848 generateRocpd.cpp:583] writing SQL database for process 2521880 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:52:39.648848 140182181846848 generateRocpd.cpp:606] Opened result file: tests/workloads/ipblocks_SQ_SQC_TCP_CPC/MI200/out/pmc_1/smc4124-25-mi210-3c48/2521880_results.db (UUID=0001fa6e-595a-795a-b3ee-6879d08fe2f4)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:39.731966 140182181846848 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007769 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:39.733125 140182181846848 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001142 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:39.734736 140182181846848 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001597 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:39.745074 140182181846848 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008330 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:39.827125 140182181846848 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.082037 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:39.829412 140182181846848 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002268 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:39.829430 140182181846848 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:39.838648 140182181846848 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009211 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:39.838662 140182181846848 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:39.838669 140182181846848 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:39.838675 140182181846848 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:39.838780 140182181846848 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000097 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:39.838987 140182181846848 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.190934 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:39.841878 140182181846848 simple_timer.cpp:55] [rocprofv3] output generation ::     0.214637 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:39.841945 140182181846848 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.216475 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/ipblocks_SQ_SQC_TCP_CPC/MI200/out/pmc_1/2521880_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 5/10][Approximate profiling time left: 9 seconds]...
[profiling] Current input file: tests/workloads/ipblocks_SQ_SQC_TCP_CPC/MI200/perfmon/pmc_perf_4.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:52:41.333249 135214325317440 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.184475 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:52:41.333811 135214325317440 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:41.530451 135214325317440 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:52:41.614566 135214325317440 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.280755 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:41.636768 135214325317440 generateRocpd.cpp:583] writing SQL database for process 2521896 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:52:41.637556 135214325317440 generateRocpd.cpp:606] Opened result file: tests/workloads/ipblocks_SQ_SQC_TCP_CPC/MI200/out/pmc_1/smc4124-25-mi210-3c48/2521896_results.db (UUID=0001fa6e-6119-7119-b7fd-7d17cbc2ae1e)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:41.720203 135214325317440 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007764 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:41.721315 135214325317440 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001096 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:41.722923 135214325317440 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001593 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:41.733314 135214325317440 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008332 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:41.816780 135214325317440 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.083451 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:41.819026 135214325317440 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002228 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:41.819051 135214325317440 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:41.828105 135214325317440 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009047 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:41.828120 135214325317440 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:41.828126 135214325317440 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:41.828133 135214325317440 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:41.828244 135214325317440 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000103 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:41.828453 135214325317440 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.191686 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:41.831338 135214325317440 simple_timer.cpp:55] [rocprofv3] output generation ::     0.215339 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:41.831402 135214325317440 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.216794 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/ipblocks_SQ_SQC_TCP_CPC/MI200/out/pmc_1/2521896_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 6/10][Approximate profiling time left: 7 seconds]...
[profiling] Current input file: tests/workloads/ipblocks_SQ_SQC_TCP_CPC/MI200/perfmon/pmc_perf_5.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:52:43.337785 126716778659648 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.183929 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:52:43.338410 126716778659648 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:43.533677 126716778659648 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:52:43.618261 126716778659648 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.279851 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:43.640880 126716778659648 generateRocpd.cpp:583] writing SQL database for process 2521992 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:52:43.641692 126716778659648 generateRocpd.cpp:606] Opened result file: tests/workloads/ipblocks_SQ_SQC_TCP_CPC/MI200/out/pmc_1/smc4124-25-mi210-3c48/2521992_results.db (UUID=0001fa6e-68ee-78ee-a488-64945cc85434)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:43.724796 126716778659648 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007799 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:43.725991 126716778659648 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001178 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:43.727602 126716778659648 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001597 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:43.737972 126716778659648 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008394 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:43.820187 126716778659648 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.082200 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:43.822504 126716778659648 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002301 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:43.822522 126716778659648 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000005 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:43.831699 126716778659648 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009170 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:43.831714 126716778659648 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:43.831720 126716778659648 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:43.831727 126716778659648 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:43.831832 126716778659648 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000094 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:43.832045 126716778659648 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.191165 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:43.834939 126716778659648 simple_timer.cpp:55] [rocprofv3] output generation ::     0.215095 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:43.835006 126716778659648 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.216701 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/ipblocks_SQ_SQC_TCP_CPC/MI200/out/pmc_1/2521992_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 7/10][Approximate profiling time left: 5 seconds]...
[profiling] Current input file: tests/workloads/ipblocks_SQ_SQC_TCP_CPC/MI200/perfmon/pmc_perf_6.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:52:45.322429 128837045264192 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.185483 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:52:45.323065 128837045264192 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:45.516610 128837045264192 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:52:45.596691 128837045264192 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.273626 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:45.618915 128837045264192 generateRocpd.cpp:583] writing SQL database for process 2522053 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:52:45.619742 128837045264192 generateRocpd.cpp:606] Opened result file: tests/workloads/ipblocks_SQ_SQC_TCP_CPC/MI200/out/pmc_1/smc4124-25-mi210-3c48/2522053_results.db (UUID=0001fa6e-70ad-70ad-8d8f-2545aced9b59)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:45.702980 128837045264192 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007788 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:45.704220 128837045264192 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001223 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:45.705831 128837045264192 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001596 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:45.716299 128837045264192 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008478 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:45.732237 128837045264192 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.015923 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:45.734468 128837045264192 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002217 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:45.734485 128837045264192 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:45.743415 128837045264192 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008922 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:45.743429 128837045264192 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:45.743436 128837045264192 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:45.743442 128837045264192 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:45.743550 128837045264192 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000097 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:45.743749 128837045264192 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.124835 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:45.746571 128837045264192 simple_timer.cpp:55] [rocprofv3] output generation ::     0.148335 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:45.746623 128837045264192 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.149881 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/ipblocks_SQ_SQC_TCP_CPC/MI200/out/pmc_1/2522053_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 8/10][Approximate profiling time left: 3 seconds]...
[profiling] Current input file: tests/workloads/ipblocks_SQ_SQC_TCP_CPC/MI200/perfmon/pmc_perf_SQ_IFETCH_LEVEL_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:52:47.243653 126107395661632 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.184712 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:52:47.244290 126107395661632 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:47.444011 126107395661632 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:52:47.539484 126107395661632 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.295194 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:47.563162 126107395661632 generateRocpd.cpp:583] writing SQL database for process 2522100 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:52:47.563961 126107395661632 generateRocpd.cpp:606] Opened result file: tests/workloads/ipblocks_SQ_SQC_TCP_CPC/MI200/out/pmc_1/smc4124-25-mi210-3c48/2522100_results.db (UUID=0001fa6e-782f-782f-899a-ac75d1f5dc80)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:47.644756 126107395661632 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007760 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:47.645949 126107395661632 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001177 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:47.647527 126107395661632 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001564 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:47.657569 126107395661632 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008148 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:47.779618 126107395661632 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.122035 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:47.782719 126107395661632 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.003075 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:47.782784 126107395661632 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000005 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:47.791979 126107395661632 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009186 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:47.791995 126107395661632 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:47.792002 126107395661632 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:47.792009 126107395661632 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:47.792180 126107395661632 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000160 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:47.792792 126107395661632 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.229630 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:47.795863 126107395661632 simple_timer.cpp:55] [rocprofv3] output generation ::     0.254860 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:47.795959 126107395661632 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.256404 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/ipblocks_SQ_SQC_TCP_CPC/MI200/out/pmc_1/2522100_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 9/10][Approximate profiling time left: 1 second]...
[profiling] Current input file: tests/workloads/ipblocks_SQ_SQC_TCP_CPC/MI200/perfmon/pmc_perf_SQ_INST_LEVEL_SMEM_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:52:49.308537 134925729881920 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.185900 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:52:49.309150 134925729881920 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:49.506341 134925729881920 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:52:49.600390 134925729881920 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.291240 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:49.622931 134925729881920 generateRocpd.cpp:583] writing SQL database for process 2522152 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:52:49.623743 134925729881920 generateRocpd.cpp:606] Opened result file: tests/workloads/ipblocks_SQ_SQC_TCP_CPC/MI200/out/pmc_1/smc4124-25-mi210-3c48/2522152_results.db (UUID=0001fa6e-803f-703f-a1be-df987f312b3f)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:49.704378 134925729881920 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007910 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:49.705592 134925729881920 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001198 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:49.707160 134925729881920 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001552 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:49.717308 134925729881920 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008203 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:49.834401 134925729881920 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.117079 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:49.836649 134925729881920 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002231 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:49.836666 134925729881920 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:49.845263 134925729881920 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008589 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:49.845278 134925729881920 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:49.845285 134925729881920 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:49.845292 134925729881920 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:49.845397 134925729881920 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000097 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:49.845604 134925729881920 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.222674 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:49.848538 134925729881920 simple_timer.cpp:55] [rocprofv3] output generation ::     0.246469 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:49.848615 134925729881920 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.248175 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/ipblocks_SQ_SQC_TCP_CPC/MI200/out/pmc_1/2522152_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 10/10][Approximate profiling time left: 0 seconds]...
[profiling] Current input file: tests/workloads/ipblocks_SQ_SQC_TCP_CPC/MI200/perfmon/pmc_perf_SQ_INST_LEVEL_VMEM_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:52:51.337123 124219333910336 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.186147 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:52:51.337741 124219333910336 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:51.533139 124219333910336 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:52:51.624327 124219333910336 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.286586 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:51.646731 124219333910336 generateRocpd.cpp:583] writing SQL database for process 2522171 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:52:51.647526 124219333910336 generateRocpd.cpp:606] Opened result file: tests/workloads/ipblocks_SQ_SQC_TCP_CPC/MI200/out/pmc_1/smc4124-25-mi210-3c48/2522171_results.db (UUID=0001fa6e-882b-782b-af0c-41cba17d167d)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:51.727826 124219333910336 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007693 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:51.728915 124219333910336 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001073 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:51.730513 124219333910336 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001583 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:51.740548 124219333910336 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008052 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:51.856509 124219333910336 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.115946 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:51.858564 124219333910336 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002040 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:51.858581 124219333910336 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:51.867065 124219333910336 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008477 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:51.867081 124219333910336 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:51.867088 124219333910336 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:51.867094 124219333910336 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:51.867199 124219333910336 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000097 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:51.867416 124219333910336 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.220685 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:51.870446 124219333910336 simple_timer.cpp:55] [rocprofv3] output generation ::     0.244439 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:51.870518 124219333910336 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.246149 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/ipblocks_SQ_SQC_TCP_CPC/MI200/out/pmc_1/2522171_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
PC sampling data collection skipped as block 21 is not specified.
[roofline] Skipping roofline
