alias: cpc, block id: 5
Rocprofiler-Compute version: 3.7.0
Profiler choice: rocprofiler-sdk
Output directory: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_CPF/MI200
Target: MI210
Command: ./tests/vcopy -n 1048576 -b 256 -i 3
Kernel Selection: None
Dispatch Selection: None
Filtered sections: ['cpc']

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Collecting Performance Counters
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Generating native tool project using command: cmake -S /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib -B /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build
-- Checking for module 'libdw'
--   Package 'libdw', required by 'virtual:world', not found
-- Could NOT find libdw (missing: libdw_LIBRARY libdw_INCLUDE_DIR)
-- {fmt} version: 12.1.0
-- Build type:
-- Configuring done (0.1s)
-- Generating done (0.0s)
-- Build files have been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build
Building native tool using command: cmake --build /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build --parallel
[  0%] Built target gsl_assert
[ 33%] Built target fmt
[100%] Built target rocprofiler-compute-tool
Searching /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src by lib/_build/lib/librocprofiler-compute-tool.so for native collector
Using native collector: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build/lib/librocprofiler-compute-tool.so
Using native counter collection tool: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build/lib/librocprofiler-compute-tool.so
[profiling] Iteration multiplexing: Disabled
[Run 1/5][Approximate profiling time left: pending first measurement...]
[profiling] Current input file: tests/workloads/ipblocks_CPF/MI200/perfmon/pmc_perf_0.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:00:27.789911 132930018565952 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.184058 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:00:27.790534 132930018565952 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:27.983878 132930018565952 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:00:28.066412 132930018565952 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.275877 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:28.089088 132930018565952 generateRocpd.cpp:583] writing SQL database for process 2523718 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:00:28.089874 132930018565952 generateRocpd.cpp:606] Opened result file: tests/workloads/ipblocks_CPF/MI200/out/pmc_1/smc4124-25-mi210-3c48/2523718_results.db (UUID=0001fa75-7f32-7f32-9068-b95698560003)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:28.174005 132930018565952 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008174 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:28.175121 132930018565952 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001095 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:28.176692 132930018565952 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001556 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:28.187090 132930018565952 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008419 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:28.229853 132930018565952 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.042748 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:28.232117 132930018565952 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002249 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:28.232134 132930018565952 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:28.241102 132930018565952 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008961 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:28.241116 132930018565952 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:28.241122 132930018565952 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:28.241129 132930018565952 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:28.241227 132930018565952 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000090 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:28.241451 132930018565952 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.152363 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:28.244323 132930018565952 simple_timer.cpp:55] [rocprofv3] output generation ::     0.176486 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:28.244378 132930018565952 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.177921 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/ipblocks_CPF/MI200/out/pmc_1/2523718_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 2/5][Approximate profiling time left: 5 seconds]...
[profiling] Current input file: tests/workloads/ipblocks_CPF/MI200/perfmon/pmc_perf_1.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:00:29.729789 126921195634496 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.182034 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:00:29.730369 126921195634496 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:29.923118 126921195634496 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:00:30.003446 126921195634496 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.273077 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:30.025868 126921195634496 generateRocpd.cpp:583] writing SQL database for process 2523726 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:00:30.026654 126921195634496 generateRocpd.cpp:606] Opened result file: tests/workloads/ipblocks_CPF/MI200/out/pmc_1/smc4124-25-mi210-3c48/2523726_results.db (UUID=0001fa75-86c8-76c8-84ce-b8adf3151802)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:30.109120 126921195634496 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007837 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:30.110317 126921195634496 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001180 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:30.111982 126921195634496 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001651 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:30.122616 126921195634496 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008470 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:30.151920 126921195634496 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.029288 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:30.154163 126921195634496 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002229 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:30.154180 126921195634496 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:30.163675 126921195634496 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009487 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:30.163689 126921195634496 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:30.163695 126921195634496 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:30.163702 126921195634496 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:30.163806 126921195634496 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000097 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:30.164009 126921195634496 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.138141 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:30.167063 126921195634496 simple_timer.cpp:55] [rocprofv3] output generation ::     0.162201 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:30.167111 126921195634496 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.163625 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/ipblocks_CPF/MI200/out/pmc_1/2523726_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 3/5][Approximate profiling time left: 3 seconds]...
[profiling] Current input file: tests/workloads/ipblocks_CPF/MI200/perfmon/pmc_perf_2.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:00:31.649340 137170615099200 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.185266 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:00:31.649902 137170615099200 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:31.843730 137170615099200 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:00:31.925143 137170615099200 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.275242 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:31.947469 137170615099200 generateRocpd.cpp:583] writing SQL database for process 2523736 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:00:31.948255 137170615099200 generateRocpd.cpp:606] Opened result file: tests/workloads/ipblocks_CPF/MI200/out/pmc_1/smc4124-25-mi210-3c48/2523736_results.db (UUID=0001fa75-8e44-7e44-819a-116905f78a0f)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:32.030790 137170615099200 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007814 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:32.032007 137170615099200 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001200 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:32.033693 137170615099200 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001672 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:32.044463 137170615099200 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008577 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:32.073524 137170615099200 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.029046 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:32.075879 137170615099200 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002339 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:32.075896 137170615099200 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:32.084764 137170615099200 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008861 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:32.084778 137170615099200 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:32.084785 137170615099200 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:32.084791 137170615099200 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:32.084894 137170615099200 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000094 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:32.085104 137170615099200 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.137635 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:32.087989 137170615099200 simple_timer.cpp:55] [rocprofv3] output generation ::     0.161245 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:32.088043 137170615099200 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.162860 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/ipblocks_CPF/MI200/out/pmc_1/2523736_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 4/5][Approximate profiling time left: 1 second]...
[profiling] Current input file: tests/workloads/ipblocks_CPF/MI200/perfmon/pmc_perf_3.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:00:33.566961 134483003457344 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.183789 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:00:33.567586 134483003457344 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:33.759723 134483003457344 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:00:33.851572 134483003457344 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.283987 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:33.873662 134483003457344 generateRocpd.cpp:583] writing SQL database for process 2523744 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:00:33.874463 134483003457344 generateRocpd.cpp:606] Opened result file: tests/workloads/ipblocks_CPF/MI200/out/pmc_1/smc4124-25-mi210-3c48/2523744_results.db (UUID=0001fa75-95c4-75c4-96c1-511d5ee75e55)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:33.956556 134483003457344 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007687 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:33.957708 134483003457344 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001134 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:33.959328 134483003457344 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001603 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:33.969694 134483003457344 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008378 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:33.991693 134483003457344 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.021983 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:33.993777 134483003457344 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002069 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:33.993795 134483003457344 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:34.002485 134483003457344 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008681 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:34.002500 134483003457344 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:34.002510 134483003457344 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:34.002517 134483003457344 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:34.002615 134483003457344 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000092 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:34.002798 134483003457344 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.129136 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:34.005610 134483003457344 simple_timer.cpp:55] [rocprofv3] output generation ::     0.152674 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:34.005654 134483003457344 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.154035 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/ipblocks_CPF/MI200/out/pmc_1/2523744_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 5/5][Approximate profiling time left: 0 seconds]...
[profiling] Current input file: tests/workloads/ipblocks_CPF/MI200/perfmon/pmc_perf_4.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:00:35.491881 130316539273024 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.185639 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:00:35.492498 130316539273024 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:35.685679 130316539273024 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:00:35.788988 130316539273024 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.296491 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:35.811391 130316539273024 generateRocpd.cpp:583] writing SQL database for process 2523754 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:00:35.812189 130316539273024 generateRocpd.cpp:606] Opened result file: tests/workloads/ipblocks_CPF/MI200/out/pmc_1/smc4124-25-mi210-3c48/2523754_results.db (UUID=0001fa75-9d47-7d47-be2f-231ad570e9c7)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:35.894522 130316539273024 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007760 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:35.895732 130316539273024 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001193 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:35.897423 130316539273024 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001677 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:35.908152 130316539273024 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008550 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:35.923851 130316539273024 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.015684 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:35.926261 130316539273024 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002391 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:35.926278 130316539273024 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:35.935114 130316539273024 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008829 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:35.935128 130316539273024 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:35.935134 130316539273024 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:35.935140 130316539273024 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:35.935244 130316539273024 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000095 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:35.935445 130316539273024 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.124054 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:35.938512 130316539273024 simple_timer.cpp:55] [rocprofv3] output generation ::     0.147836 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:35.938562 130316539273024 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.149501 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/ipblocks_CPF/MI200/out/pmc_1/2523754_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
PC sampling data collection skipped as block 21 is not specified.
[roofline] Skipping roofline
