alias: cu_ins, block id: 10
alias: cu_pipe, block id: 11
alias: cpc, block id: 5
Rocprofiler-Compute version: 3.7.0
Profiler choice: rocprofiler-sdk
Output directory: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_SQ_CPC/MI100
Target: MI100
Command: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3
Kernel Selection: None
Dispatch Selection: None
Filtered sections: ['cu_ins', 'cu_pipe', 'cpc']

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Collecting Performance Counters
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Generating native tool project using command: cmake -S /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib -B /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build
-- Checking for module 'libdw'
--   Package 'libdw', required by 'virtual:world', not found
-- Could NOT find libdw (missing: libdw_LIBRARY libdw_INCLUDE_DIR)
-- {fmt} version: 12.1.0
-- Build type:
-- Configuring done (0.2s)
-- Generating done (0.0s)
-- Build files have been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build
Building native tool using command: cmake --build /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build --parallel
[  0%] Built target gsl_assert
[ 33%] Built target fmt
[100%] Built target rocprofiler-compute-tool
Searching /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src by lib/_build/lib/librocprofiler-compute-tool.so for native collector
Using native collector: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build/lib/librocprofiler-compute-tool.so
Using native counter collection tool: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build/lib/librocprofiler-compute-tool.so
[profiling] Iteration multiplexing: Disabled
[Run 1/5][Approximate profiling time left: pending first measurement...]
[profiling] Current input file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_SQ_CPC/MI100/perfmon/pmc_perf_0.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 16:54:39.204289 135147349696320 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.297111 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 16:54:39.214147 135147349696320 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:39.425718 135147349696320 tool.cpp:2422] HSA version 8.20.1 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 16:54:39.555447 135147349696320 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.341300 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:39.594126 135147349696320 generateRocpd.cpp:582] writing SQL database for process 2385607 on node 2710291163
   |-> [rocprofiler-sdk] [m[0;31mE20260526 16:54:39.595415 135147349696320 generateRocpd.cpp:605] Opened result file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_SQ_CPC/MI100/out/pmc_1/dl385-20-mi100-3c48/2385607_results.db (UUID=00004319-98e3-78e3-b0ac-e13fe95bdf5d)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:39.685584 135147349696320 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.014087 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:39.686730 135147349696320 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001115 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:39.688968 135147349696320 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002210 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:39.694134 135147349696320 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.003198 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:39.718868 135147349696320 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.024706 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:39.721520 135147349696320 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002622 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:39.721549 135147349696320 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:39.737367 135147349696320 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.015803 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:39.737394 135147349696320 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:39.737406 135147349696320 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:39.737418 135147349696320 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:39.737622 135147349696320 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000184 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:39.738010 135147349696320 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.143884 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:39.743784 135147349696320 simple_timer.cpp:55] [rocprofv3] output generation ::     0.185825 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:39.743858 135147349696320 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.188361 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_SQ_CPC/MI100/out/pmc_1/2385607_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 2/5][Approximate profiling time left: 9 seconds]...
[profiling] Current input file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_SQ_CPC/MI100/perfmon/pmc_perf_1.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 16:54:41.950631 132889585131328 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.299374 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 16:54:41.960411 132889585131328 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:42.171210 132889585131328 tool.cpp:2422] HSA version 8.20.1 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 16:54:42.300632 132889585131328 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.340220 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:42.339535 132889585131328 generateRocpd.cpp:582] writing SQL database for process 2385618 on node 2710291163
   |-> [rocprofiler-sdk] [m[0;31mE20260526 16:54:42.340797 132889585131328 generateRocpd.cpp:605] Opened result file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_SQ_CPC/MI100/out/pmc_1/dl385-20-mi100-3c48/2385618_results.db (UUID=00004319-a39b-739b-97bd-2e0103e1f762)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:42.431401 132889585131328 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.014024 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:42.432522 132889585131328 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001090 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:42.434682 132889585131328 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002131 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:42.439812 132889585131328 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.003180 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:42.461214 132889585131328 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.021374 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:42.463790 132889585131328 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002547 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:42.463818 132889585131328 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:42.479676 132889585131328 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.015843 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:42.479703 132889585131328 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:42.479715 132889585131328 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:42.479727 132889585131328 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:42.479915 132889585131328 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000174 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:42.480334 132889585131328 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.140800 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:42.485982 132889585131328 simple_timer.cpp:55] [rocprofv3] output generation ::     0.182779 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:42.486053 132889585131328 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.185361 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_SQ_CPC/MI100/out/pmc_1/2385618_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 3/5][Approximate profiling time left: 5 seconds]...
[profiling] Current input file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_SQ_CPC/MI100/perfmon/pmc_perf_2.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 16:54:44.729407 140437916753728 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.297385 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 16:54:44.739170 140437916753728 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:44.950359 140437916753728 tool.cpp:2422] HSA version 8.20.1 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 16:54:45.079893 140437916753728 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.340724 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:45.118956 140437916753728 generateRocpd.cpp:582] writing SQL database for process 2385630 on node 2710291163
   |-> [rocprofiler-sdk] [m[0;31mE20260526 16:54:45.120299 140437916753728 generateRocpd.cpp:605] Opened result file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_SQ_CPC/MI100/out/pmc_1/dl385-20-mi100-3c48/2385630_results.db (UUID=00004319-ae78-7e78-959d-65be84b71ec4)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:45.212001 140437916753728 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.013955 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:45.213133 140437916753728 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001102 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:45.215355 140437916753728 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002193 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:45.220488 140437916753728 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.003160 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:45.231236 140437916753728 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.010720 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:45.233737 140437916753728 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002472 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:45.233766 140437916753728 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000003 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:45.249903 140437916753728 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.016122 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:45.249934 140437916753728 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:45.249947 140437916753728 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:45.249959 140437916753728 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:45.250183 140437916753728 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000198 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:45.250649 140437916753728 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.131694 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:45.256690 140437916753728 simple_timer.cpp:55] [rocprofv3] output generation ::     0.174276 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:45.256773 140437916753728 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.176827 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_SQ_CPC/MI100/out/pmc_1/2385630_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 4/5][Approximate profiling time left: 2 seconds]...
[profiling] Current input file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_SQ_CPC/MI100/perfmon/pmc_perf_3.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 16:54:47.505738 138151257792320 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.307261 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 16:54:47.515654 138151257792320 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:47.730755 138151257792320 tool.cpp:2422] HSA version 8.20.1 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 16:54:47.864782 138151257792320 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.349128 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:47.904566 138151257792320 generateRocpd.cpp:582] writing SQL database for process 2385640 on node 2710291163
   |-> [rocprofiler-sdk] [m[0;31mE20260526 16:54:47.905865 138151257792320 generateRocpd.cpp:605] Opened result file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_SQ_CPC/MI100/out/pmc_1/dl385-20-mi100-3c48/2385640_results.db (UUID=00004319-b946-7946-994e-66afedc9d491)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:47.991970 138151257792320 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.014072 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:47.993097 138151257792320 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001081 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:47.995288 138151257792320 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002163 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:48.000485 138151257792320 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.003330 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:48.009718 138151257792320 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.009205 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:48.012262 138151257792320 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002515 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:48.012291 138151257792320 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:48.027995 138151257792320 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.015689 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:48.028023 138151257792320 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:48.028036 138151257792320 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:48.028048 138151257792320 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:48.028259 138151257792320 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000192 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:48.028649 138151257792320 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.124083 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:48.034634 138151257792320 simple_timer.cpp:55] [rocprofv3] output generation ::     0.167353 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:48.034716 138151257792320 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.169882 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_SQ_CPC/MI100/out/pmc_1/2385640_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 5/5][Approximate profiling time left: 0 seconds]...
[profiling] Current input file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_SQ_CPC/MI100/perfmon/pmc_perf_4.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 16:54:50.292090 135498069331776 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.299704 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 16:54:50.300884 135498069331776 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:50.515422 135498069331776 tool.cpp:2422] HSA version 8.20.1 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 16:54:50.645805 135498069331776 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.344921 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:50.677427 135498069331776 generateRocpd.cpp:582] writing SQL database for process 2385651 on node 2710291163
   |-> [rocprofiler-sdk] [m[0;31mE20260526 16:54:50.678453 135498069331776 generateRocpd.cpp:605] Opened result file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_SQ_CPC/MI100/out/pmc_1/dl385-20-mi100-3c48/2385651_results.db (UUID=00004319-c430-7430-b746-7909fbff92a8)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:50.752861 135498069331776 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.010486 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:50.753871 135498069331776 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.000986 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:50.755646 135498069331776 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001753 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:50.759838 135498069331776 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.002472 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:50.763397 135498069331776 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.003537 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:50.765359 135498069331776 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.001940 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:50.765381 135498069331776 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:50.777167 135498069331776 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.011775 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:50.777188 135498069331776 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:50.777197 135498069331776 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:50.777206 135498069331776 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:50.777351 135498069331776 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000134 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:50.777645 135498069331776 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.100218 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:50.781869 135498069331776 simple_timer.cpp:55] [rocprofv3] output generation ::     0.133416 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:54:50.781925 135498069331776 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.136069 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_SQ_CPC/MI100/out/pmc_1/2385651_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
PC sampling data collection skipped as block 21 is not specified.
[roofline] Skipping roofline
