alias: cpc, block id: 5
Rocprofiler-Compute version: 3.7.0
Profiler choice: rocprofiler-sdk
Output directory: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_CPF/MI100
Target: MI100
Command: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3
Kernel Selection: None
Dispatch Selection: None
Filtered sections: ['cpc']

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Collecting Performance Counters
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Generating native tool project using command: cmake -S /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib -B /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build
-- Checking for module 'libdw'
--   Package 'libdw', required by 'virtual:world', not found
-- Could NOT find libdw (missing: libdw_LIBRARY libdw_INCLUDE_DIR)
-- {fmt} version: 12.1.0
-- Build type:
-- Configuring done (0.2s)
-- Generating done (0.0s)
-- Build files have been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build
Building native tool using command: cmake --build /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build --parallel
[ 33%] Built target fmt
[ 33%] Built target gsl_assert
[100%] Built target rocprofiler-compute-tool
Searching /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src by lib/_build/lib/librocprofiler-compute-tool.so for native collector
Using native collector: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build/lib/librocprofiler-compute-tool.so
Using native counter collection tool: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build/lib/librocprofiler-compute-tool.so
[profiling] Iteration multiplexing: Disabled
[Run 1/5][Approximate profiling time left: pending first measurement...]
[profiling] Current input file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_CPF/MI100/perfmon/pmc_perf_0.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 16:54:21.196386 133604122705728 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.296848 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 16:54:21.206415 133604122705728 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 16:54:21.415151 133604122705728 tool.cpp:2422] HSA version 8.20.1 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 16:54:21.544278 133604122705728 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.337863 sec
   |-> [rocprofiler-sdk] W20260526 16:54:21.576717 133604122705728 generateRocpd.cpp:582] writing SQL database for process 2385497 on node 2710291163
   |-> [rocprofiler-sdk] E20260526 16:54:21.577731 133604122705728 generateRocpd.cpp:605] Opened result file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_CPF/MI100/out/pmc_1/dl385-20-mi100-3c48/2385497_results.db (UUID=00004319-528c-728c-8921-8124922d141f)
   |-> [rocprofiler-sdk] W20260526 16:54:21.652255 133604122705728 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.010538 sec
   |-> [rocprofiler-sdk] W20260526 16:54:21.653226 133604122705728 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.000944 sec
   |-> [rocprofiler-sdk] W20260526 16:54:21.654958 133604122705728 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001708 sec
   |-> [rocprofiler-sdk] W20260526 16:54:21.659298 133604122705728 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.002609 sec
   |-> [rocprofiler-sdk] W20260526 16:54:21.667473 133604122705728 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.008150 sec
   |-> [rocprofiler-sdk] W20260526 16:54:21.669526 133604122705728 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002029 sec
   |-> [rocprofiler-sdk] W20260526 16:54:21.669553 133604122705728 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000003 sec
   |-> [rocprofiler-sdk] W20260526 16:54:21.681653 133604122705728 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.012080 sec
   |-> [rocprofiler-sdk] W20260526 16:54:21.681675 133604122705728 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 16:54:21.681684 133604122705728 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000001 sec
   |-> [rocprofiler-sdk] W20260526 16:54:21.681693 133604122705728 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] W20260526 16:54:21.681848 133604122705728 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000135 sec
   |-> [rocprofiler-sdk] W20260526 16:54:21.682124 133604122705728 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.105407 sec
   |-> [rocprofiler-sdk] W20260526 16:54:21.686427 133604122705728 simple_timer.cpp:55] [rocprofv3] output generation ::     0.139433 sec
   |-> [rocprofiler-sdk] W20260526 16:54:21.686494 133604122705728 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.142143 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_CPF/MI100/out/pmc_1/2385497_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 2/5][Approximate profiling time left: 9 seconds]...
[profiling] Current input file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_CPF/MI100/perfmon/pmc_perf_1.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 16:54:24.144507 123994538422080 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.298467 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 16:54:24.154493 123994538422080 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 16:54:24.365552 123994538422080 tool.cpp:2422] HSA version 8.20.1 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 16:54:24.495265 123994538422080 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.340772 sec
   |-> [rocprofiler-sdk] W20260526 16:54:24.535085 123994538422080 generateRocpd.cpp:582] writing SQL database for process 2385507 on node 2710291163
   |-> [rocprofiler-sdk] E20260526 16:54:24.536423 123994538422080 generateRocpd.cpp:605] Opened result file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_CPF/MI100/out/pmc_1/dl385-20-mi100-3c48/2385507_results.db (UUID=00004319-5e0e-7e0e-88f3-ab6e82362628)
   |-> [rocprofiler-sdk] W20260526 16:54:24.626962 123994538422080 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.014022 sec
   |-> [rocprofiler-sdk] W20260526 16:54:24.628116 123994538422080 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001099 sec
   |-> [rocprofiler-sdk] W20260526 16:54:24.630310 123994538422080 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002165 sec
   |-> [rocprofiler-sdk] W20260526 16:54:24.635394 123994538422080 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.003154 sec
   |-> [rocprofiler-sdk] W20260526 16:54:24.642969 123994538422080 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.007547 sec
   |-> [rocprofiler-sdk] W20260526 16:54:24.645445 123994538422080 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002426 sec
   |-> [rocprofiler-sdk] W20260526 16:54:24.645474 123994538422080 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000002 sec
   |-> [rocprofiler-sdk] W20260526 16:54:24.661091 123994538422080 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.015603 sec
   |-> [rocprofiler-sdk] W20260526 16:54:24.661119 123994538422080 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 16:54:24.661131 123994538422080 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 16:54:24.661143 123994538422080 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] W20260526 16:54:24.661348 123994538422080 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000184 sec
   |-> [rocprofiler-sdk] W20260526 16:54:24.661713 123994538422080 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.126629 sec
   |-> [rocprofiler-sdk] W20260526 16:54:24.667614 123994538422080 simple_timer.cpp:55] [rocprofv3] output generation ::     0.169880 sec
   |-> [rocprofiler-sdk] W20260526 16:54:24.667684 123994538422080 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.172371 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_CPF/MI100/out/pmc_1/2385507_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 3/5][Approximate profiling time left: 6 seconds]...
[profiling] Current input file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_CPF/MI100/perfmon/pmc_perf_2.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 16:54:26.877126 132673175846720 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.298721 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 16:54:26.885695 132673175846720 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 16:54:27.097559 132673175846720 tool.cpp:2422] HSA version 8.20.1 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 16:54:27.226763 132673175846720 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.341068 sec
   |-> [rocprofiler-sdk] W20260526 16:54:27.266183 132673175846720 generateRocpd.cpp:582] writing SQL database for process 2385529 on node 2710291163
   |-> [rocprofiler-sdk] E20260526 16:54:27.267463 132673175846720 generateRocpd.cpp:605] Opened result file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_CPF/MI100/out/pmc_1/dl385-20-mi100-3c48/2385529_results.db (UUID=00004319-68bb-78bb-8e01-2f848a0439c8)
   |-> [rocprofiler-sdk] W20260526 16:54:27.357833 132673175846720 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.013878 sec
   |-> [rocprofiler-sdk] W20260526 16:54:27.358993 132673175846720 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001129 sec
   |-> [rocprofiler-sdk] W20260526 16:54:27.361184 132673175846720 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002162 sec
   |-> [rocprofiler-sdk] W20260526 16:54:27.366316 132673175846720 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.003186 sec
   |-> [rocprofiler-sdk] W20260526 16:54:27.373770 132673175846720 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.007425 sec
   |-> [rocprofiler-sdk] W20260526 16:54:27.376230 132673175846720 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002432 sec
   |-> [rocprofiler-sdk] W20260526 16:54:27.376260 132673175846720 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000002 sec
   |-> [rocprofiler-sdk] W20260526 16:54:27.392140 132673175846720 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.015866 sec
   |-> [rocprofiler-sdk] W20260526 16:54:27.392168 132673175846720 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 16:54:27.392180 132673175846720 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 16:54:27.392191 132673175846720 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] W20260526 16:54:27.392397 132673175846720 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000184 sec
   |-> [rocprofiler-sdk] W20260526 16:54:27.392755 132673175846720 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.126572 sec
   |-> [rocprofiler-sdk] W20260526 16:54:27.398538 132673175846720 simple_timer.cpp:55] [rocprofv3] output generation ::     0.169309 sec
   |-> [rocprofiler-sdk] W20260526 16:54:27.398611 132673175846720 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.171797 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_CPF/MI100/out/pmc_1/2385529_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 4/5][Approximate profiling time left: 2 seconds]...
[profiling] Current input file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_CPF/MI100/perfmon/pmc_perf_3.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 16:54:29.621170 134614235197248 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.301137 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 16:54:29.630812 134614235197248 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 16:54:29.845850 134614235197248 tool.cpp:2422] HSA version 8.20.1 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 16:54:29.976195 134614235197248 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.345383 sec
   |-> [rocprofiler-sdk] W20260526 16:54:30.015568 134614235197248 generateRocpd.cpp:582] writing SQL database for process 2385539 on node 2710291163
   |-> [rocprofiler-sdk] E20260526 16:54:30.016896 134614235197248 generateRocpd.cpp:605] Opened result file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_CPF/MI100/out/pmc_1/dl385-20-mi100-3c48/2385539_results.db (UUID=00004319-7370-7370-96db-eaf29eb7e8e7)
   |-> [rocprofiler-sdk] W20260526 16:54:30.108696 134614235197248 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.014245 sec
   |-> [rocprofiler-sdk] W20260526 16:54:30.109854 134614235197248 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001127 sec
   |-> [rocprofiler-sdk] W20260526 16:54:30.112042 134614235197248 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002159 sec
   |-> [rocprofiler-sdk] W20260526 16:54:30.117210 134614235197248 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.003169 sec
   |-> [rocprofiler-sdk] W20260526 16:54:30.123221 134614235197248 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.005983 sec
   |-> [rocprofiler-sdk] W20260526 16:54:30.125720 134614235197248 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002470 sec
   |-> [rocprofiler-sdk] W20260526 16:54:30.125749 134614235197248 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000002 sec
   |-> [rocprofiler-sdk] W20260526 16:54:30.141598 134614235197248 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.015834 sec
   |-> [rocprofiler-sdk] W20260526 16:54:30.141630 134614235197248 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 16:54:30.141642 134614235197248 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 16:54:30.141654 134614235197248 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] W20260526 16:54:30.141873 134614235197248 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000199 sec
   |-> [rocprofiler-sdk] W20260526 16:54:30.142316 134614235197248 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.126749 sec
   |-> [rocprofiler-sdk] W20260526 16:54:30.148205 134614235197248 simple_timer.cpp:55] [rocprofv3] output generation ::     0.169324 sec
   |-> [rocprofiler-sdk] W20260526 16:54:30.148284 134614235197248 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.172039 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_CPF/MI100/out/pmc_1/2385539_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 5/5][Approximate profiling time left: 0 seconds]...
[profiling] Current input file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_CPF/MI100/perfmon/pmc_perf_4.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 16:54:32.357773 127220943843136 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.296784 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 16:54:32.367904 127220943843136 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 16:54:32.579502 127220943843136 tool.cpp:2422] HSA version 8.20.1 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 16:54:32.707178 127220943843136 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.339274 sec
   |-> [rocprofiler-sdk] W20260526 16:54:32.746409 127220943843136 generateRocpd.cpp:582] writing SQL database for process 2385549 on node 2710291163
   |-> [rocprofiler-sdk] E20260526 16:54:32.747689 127220943843136 generateRocpd.cpp:605] Opened result file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_CPF/MI100/out/pmc_1/dl385-20-mi100-3c48/2385549_results.db (UUID=00004319-7e25-7e25-b87a-7909c671e0f8)
   |-> [rocprofiler-sdk] W20260526 16:54:32.837719 127220943843136 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.013779 sec
   |-> [rocprofiler-sdk] W20260526 16:54:32.838887 127220943843136 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001137 sec
   |-> [rocprofiler-sdk] W20260526 16:54:32.841084 127220943843136 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002167 sec
   |-> [rocprofiler-sdk] W20260526 16:54:32.846274 127220943843136 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.003225 sec
   |-> [rocprofiler-sdk] W20260526 16:54:32.850768 127220943843136 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.004466 sec
   |-> [rocprofiler-sdk] W20260526 16:54:32.853253 127220943843136 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002456 sec
   |-> [rocprofiler-sdk] W20260526 16:54:32.853282 127220943843136 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000002 sec
   |-> [rocprofiler-sdk] W20260526 16:54:32.868841 127220943843136 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.015544 sec
   |-> [rocprofiler-sdk] W20260526 16:54:32.868868 127220943843136 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 16:54:32.868880 127220943843136 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 16:54:32.868892 127220943843136 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] W20260526 16:54:32.869106 127220943843136 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000198 sec
   |-> [rocprofiler-sdk] W20260526 16:54:32.869458 127220943843136 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.123051 sec
   |-> [rocprofiler-sdk] W20260526 16:54:32.875150 127220943843136 simple_timer.cpp:55] [rocprofv3] output generation ::     0.165507 sec
   |-> [rocprofiler-sdk] W20260526 16:54:32.875225 127220943843136 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.167994 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_CPF/MI100/out/pmc_1/2385549_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
PC sampling data collection skipped as block 21 is not specified.
[roofline] Skipping roofline
