alias: cpc, block id: 5
Rocprofiler-Compute version: 3.7.0
Profiler choice: rocprofiler-sdk
Output directory: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_CPC/MI100
Target: MI100
Command: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3
Kernel Selection: None
Dispatch Selection: None
Filtered sections: ['cpc']

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Collecting Performance Counters
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Generating native tool project using command: cmake -S /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib -B /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build
-- Checking for module 'libdw'
--   Package 'libdw', required by 'virtual:world', not found
-- Could NOT find libdw (missing: libdw_LIBRARY libdw_INCLUDE_DIR)
-- {fmt} version: 12.1.0
-- Build type:
-- Configuring done (0.2s)
-- Generating done (0.0s)
-- Build files have been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build
Building native tool using command: cmake --build /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build --parallel
[  0%] Built target gsl_assert
[ 33%] Built target fmt
[100%] Built target rocprofiler-compute-tool
Searching /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src by lib/_build/lib/librocprofiler-compute-tool.so for native collector
Using native collector: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build/lib/librocprofiler-compute-tool.so
Using native counter collection tool: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build/lib/librocprofiler-compute-tool.so
[profiling] Iteration multiplexing: Disabled
[Run 1/5][Approximate profiling time left: pending first measurement...]
[profiling] Current input file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_CPC/MI100/perfmon/pmc_perf_0.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 16:54:03.253248 124401433853760 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.298068 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 16:54:03.261804 124401433853760 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:03.472480 124401433853760 tool.cpp:2422] HSA version 8.20.1 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 16:54:03.601286 124401433853760 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.339483 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:03.639882 124401433853760 generateRocpd.cpp:582] writing SQL database for process 2385396 on node 2710291163
   |-> [rocprofiler-sdk] [m[0;31mE20260526 16:54:03.641241 124401433853760 generateRocpd.cpp:605] Opened result file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_CPC/MI100/out/pmc_1/dl385-20-mi100-3c48/2385396_results.db (UUID=00004319-0c73-7c73-ade0-8a45a805dff3)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:03.730193 124401433853760 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.013837 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:03.731292 124401433853760 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001069 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:03.733424 124401433853760 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002102 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:03.738452 124401433853760 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.003123 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:03.748901 124401433853760 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.010420 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:03.751354 124401433853760 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002424 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:03.751383 124401433853760 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:03.767062 124401433853760 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.015663 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:03.767097 124401433853760 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:03.767109 124401433853760 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:03.767121 124401433853760 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:03.767339 124401433853760 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000203 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:03.767767 124401433853760 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.127885 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:03.773644 124401433853760 simple_timer.cpp:55] [rocprofv3] output generation ::     0.169852 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:03.773724 124401433853760 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.172386 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_CPC/MI100/out/pmc_1/2385396_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 2/5][Approximate profiling time left: 9 seconds]...
[profiling] Current input file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_CPC/MI100/perfmon/pmc_perf_1.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 16:54:06.232927 128016827903808 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.297969 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 16:54:06.243109 128016827903808 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:06.455594 128016827903808 tool.cpp:2422] HSA version 8.20.1 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 16:54:06.580902 128016827903808 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.337794 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:06.619396 128016827903808 generateRocpd.cpp:582] writing SQL database for process 2385408 on node 2710291163
   |-> [rocprofiler-sdk] [m[0;31mE20260526 16:54:06.620722 128016827903808 generateRocpd.cpp:605] Opened result file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_CPC/MI100/out/pmc_1/dl385-20-mi100-3c48/2385408_results.db (UUID=00004319-1817-7817-82d0-cb06e033ace7)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:06.709621 128016827903808 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.013853 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:06.710730 128016827903808 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001077 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:06.712907 128016827903808 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002148 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:06.717954 128016827903808 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.003104 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:06.725388 128016827903808 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.007397 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:06.727801 128016827903808 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002385 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:06.727830 128016827903808 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000003 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:06.743723 128016827903808 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.015878 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:06.743753 128016827903808 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:06.743765 128016827903808 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:06.743777 128016827903808 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:06.743998 128016827903808 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000207 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:06.744417 128016827903808 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.125022 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:06.750344 128016827903808 simple_timer.cpp:55] [rocprofv3] output generation ::     0.166966 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:06.750425 128016827903808 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.169471 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_CPC/MI100/out/pmc_1/2385408_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 3/5][Approximate profiling time left: 6 seconds]...
[profiling] Current input file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_CPC/MI100/perfmon/pmc_perf_2.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 16:54:08.968308 136280015523648 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.297552 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 16:54:08.976756 136280015523648 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:09.186636 136280015523648 tool.cpp:2422] HSA version 8.20.1 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 16:54:09.318950 136280015523648 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.342195 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:09.358307 136280015523648 generateRocpd.cpp:582] writing SQL database for process 2385418 on node 2710291163
   |-> [rocprofiler-sdk] [m[0;31mE20260526 16:54:09.359579 136280015523648 generateRocpd.cpp:605] Opened result file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_CPC/MI100/out/pmc_1/dl385-20-mi100-3c48/2385418_results.db (UUID=00004319-22c7-72c7-8bb6-2c7bd57528ca)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:09.450114 136280015523648 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.013978 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:09.451267 136280015523648 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001123 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:09.453426 136280015523648 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002130 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:09.458521 136280015523648 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.003147 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:09.466028 136280015523648 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.007478 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:09.468483 136280015523648 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002427 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:09.468512 136280015523648 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:09.484039 136280015523648 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.015513 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:09.484066 136280015523648 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:09.484078 136280015523648 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:09.484090 136280015523648 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:09.484283 136280015523648 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000178 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:09.484634 136280015523648 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.126328 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:09.490412 136280015523648 simple_timer.cpp:55] [rocprofv3] output generation ::     0.168987 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:09.490484 136280015523648 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.171475 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_CPC/MI100/out/pmc_1/2385418_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 4/5][Approximate profiling time left: 2 seconds]...
[profiling] Current input file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_CPC/MI100/perfmon/pmc_perf_3.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 16:54:11.727561 128996287737664 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.296780 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 16:54:11.737191 128996287737664 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:11.947848 128996287737664 tool.cpp:2422] HSA version 8.20.1 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 16:54:12.076398 128996287737664 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.339208 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:12.115508 128996287737664 generateRocpd.cpp:582] writing SQL database for process 2385429 on node 2710291163
   |-> [rocprofiler-sdk] [m[0;31mE20260526 16:54:12.116780 128996287737664 generateRocpd.cpp:605] Opened result file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_CPC/MI100/out/pmc_1/dl385-20-mi100-3c48/2385429_results.db (UUID=00004319-2d8f-7d8f-8662-187483cfea69)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:12.206713 128996287737664 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.013943 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:12.207856 128996287737664 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001111 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:12.210026 128996287737664 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002142 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:12.215215 128996287737664 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.003254 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:12.221293 128996287737664 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.006048 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:12.223741 128996287737664 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002419 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:12.223772 128996287737664 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:12.239521 128996287737664 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.015735 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:12.239553 128996287737664 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:12.239565 128996287737664 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:12.239577 128996287737664 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:12.239802 128996287737664 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000208 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:12.240206 128996287737664 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.124698 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:12.246338 128996287737664 simple_timer.cpp:55] [rocprofv3] output generation ::     0.167475 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:12.246419 128996287737664 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.169972 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_CPC/MI100/out/pmc_1/2385429_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 5/5][Approximate profiling time left: 0 seconds]...
[profiling] Current input file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_CPC/MI100/perfmon/pmc_perf_4.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 16:54:14.446146 130446665600832 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.298322 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 16:54:14.456480 130446665600832 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:14.667936 130446665600832 tool.cpp:2422] HSA version 8.20.1 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 16:54:14.798644 130446665600832 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.342164 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:14.837851 130446665600832 generateRocpd.cpp:582] writing SQL database for process 2385439 on node 2710291163
   |-> [rocprofiler-sdk] [m[0;31mE20260526 16:54:14.839144 130446665600832 generateRocpd.cpp:605] Opened result file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_CPC/MI100/out/pmc_1/dl385-20-mi100-3c48/2385439_results.db (UUID=00004319-382c-782c-8463-7ddeef766548)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:14.929337 130446665600832 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.013810 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:14.930455 130446665600832 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001082 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:14.932627 130446665600832 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002144 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:14.937764 130446665600832 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.003161 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:14.942204 130446665600832 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.004410 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:14.944741 130446665600832 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002508 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:14.944770 130446665600832 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:14.960775 130446665600832 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.015990 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:14.960804 130446665600832 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:14.960817 130446665600832 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:14.960829 130446665600832 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:14.961046 130446665600832 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000203 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:14.961443 130446665600832 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.123593 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:14.967325 130446665600832 simple_timer.cpp:55] [rocprofv3] output generation ::     0.166203 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:14.967404 130446665600832 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.168711 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_CPC/MI100/out/pmc_1/2385439_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
PC sampling data collection skipped as block 21 is not specified.
[roofline] Skipping roofline
