alias: tatd, block id: 15
Rocprofiler-Compute version: 3.7.0
Profiler choice: rocprofiler-sdk
Output directory: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TD/MI100
Target: MI100
Command: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3
Kernel Selection: None
Dispatch Selection: None
Filtered sections: ['tatd']

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Collecting Performance Counters
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Generating native tool project using command: cmake -S /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib -B /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build
-- Checking for module 'libdw'
--   Package 'libdw', required by 'virtual:world', not found
-- Could NOT find libdw (missing: libdw_LIBRARY libdw_INCLUDE_DIR)
-- {fmt} version: 12.1.0
-- Build type:
-- Configuring done (0.2s)
-- Generating done (0.0s)
-- Build files have been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build
Building native tool using command: cmake --build /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build --parallel
[  0%] Built target gsl_assert
[ 33%] Built target fmt
[100%] Built target rocprofiler-compute-tool
Searching /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src by lib/_build/lib/librocprofiler-compute-tool.so for native collector
Using native collector: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build/lib/librocprofiler-compute-tool.so
Using native counter collection tool: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build/lib/librocprofiler-compute-tool.so
[profiling] Iteration multiplexing: Disabled
[Run 1/8][Approximate profiling time left: pending first measurement...]
[profiling] Current input file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TD/MI100/perfmon/pmc_perf_0.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 16:52:13.937420 134244726947648 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.307939 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 16:52:13.947420 134244726947648 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:14.162752 134244726947648 tool.cpp:2422] HSA version 8.20.1 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 16:52:14.294266 134244726947648 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.346846 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:14.334129 134244726947648 generateRocpd.cpp:582] writing SQL database for process 2383486 on node 2710291163
   |-> [rocprofiler-sdk] [m[0;31mE20260526 16:52:14.335431 134244726947648 generateRocpd.cpp:605] Opened result file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TD/MI100/out/pmc_1/dl385-20-mi100-3c48/2383486_results.db (UUID=00004317-6165-7165-acce-eb7fa8b19189)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:14.422481 134244726947648 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.013931 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:14.423637 134244726947648 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001121 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:14.425868 134244726947648 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002198 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:14.430914 134244726947648 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.003077 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:14.441736 134244726947648 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.010789 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:14.444188 134244726947648 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002417 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:14.444223 134244726947648 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000003 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:14.459543 134244726947648 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.015293 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:14.459571 134244726947648 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:14.459583 134244726947648 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:14.459596 134244726947648 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:14.459808 134244726947648 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000192 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:14.460193 134244726947648 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.126064 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:14.466084 134244726947648 simple_timer.cpp:55] [rocprofv3] output generation ::     0.169306 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:14.466158 134244726947648 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.171832 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TD/MI100/out/pmc_1/2383486_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 2/8][Approximate profiling time left: 19 seconds]...
[profiling] Current input file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TD/MI100/perfmon/pmc_perf_1.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 16:52:16.694482 132865522433856 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.300248 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 16:52:16.704107 132865522433856 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:16.916253 132865522433856 tool.cpp:2422] HSA version 8.20.1 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 16:52:17.047764 132865522433856 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.343657 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:17.080109 132865522433856 generateRocpd.cpp:582] writing SQL database for process 2383496 on node 2710291163
   |-> [rocprofiler-sdk] [m[0;31mE20260526 16:52:17.081131 132865522433856 generateRocpd.cpp:605] Opened result file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TD/MI100/out/pmc_1/dl385-20-mi100-3c48/2383496_results.db (UUID=00004317-6c32-7c32-ada5-d2c8234f2fde)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:17.153736 132865522433856 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.010629 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:17.154678 132865522433856 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.000919 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:17.156398 132865522433856 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001699 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:17.160419 132865522433856 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.002454 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:17.166325 132865522433856 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.005884 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:17.168362 132865522433856 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002016 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:17.168384 132865522433856 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:17.180794 132865522433856 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.012399 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:17.180814 132865522433856 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:17.180823 132865522433856 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:17.180832 132865522433856 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:17.180994 132865522433856 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000147 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:17.181263 132865522433856 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.101154 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:17.185815 132865522433856 simple_timer.cpp:55] [rocprofv3] output generation ::     0.135478 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:17.185870 132865522433856 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.138053 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TD/MI100/out/pmc_1/2383496_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 3/8][Approximate profiling time left: 14 seconds]...
[profiling] Current input file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TD/MI100/perfmon/pmc_perf_2.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 16:52:19.628569 134143455674176 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.297700 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 16:52:19.638659 134143455674176 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:19.847649 134143455674176 tool.cpp:2422] HSA version 8.20.1 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 16:52:19.977043 134143455674176 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.338385 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:20.015929 134143455674176 generateRocpd.cpp:582] writing SQL database for process 2383507 on node 2710291163
   |-> [rocprofiler-sdk] [m[0;31mE20260526 16:52:20.017217 134143455674176 generateRocpd.cpp:605] Opened result file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TD/MI100/out/pmc_1/dl385-20-mi100-3c48/2383507_results.db (UUID=00004317-77ab-77ab-89a5-cb91c8b95681)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:20.103316 134143455674176 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.013566 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:20.104481 134143455674176 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001135 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:20.106688 134143455674176 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002178 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:20.111867 134143455674176 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.003222 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:20.119443 134143455674176 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.007547 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:20.121878 134143455674176 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002402 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:20.121906 134143455674176 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:20.137575 134143455674176 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.015655 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:20.137604 134143455674176 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:20.137616 134143455674176 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:20.137631 134143455674176 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:20.137832 134143455674176 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000189 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:20.138222 134143455674176 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.122294 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:20.144096 134143455674176 simple_timer.cpp:55] [rocprofv3] output generation ::     0.164608 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:20.144167 134143455674176 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.167075 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TD/MI100/out/pmc_1/2383507_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 4/8][Approximate profiling time left: 11 seconds]...
[profiling] Current input file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TD/MI100/perfmon/pmc_perf_3.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 16:52:22.372223 125458799017792 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.297582 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 16:52:22.382113 125458799017792 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:22.592630 125458799017792 tool.cpp:2422] HSA version 8.20.1 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 16:52:22.721753 125458799017792 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.339640 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:22.761471 125458799017792 generateRocpd.cpp:582] writing SQL database for process 2383517 on node 2710291163
   |-> [rocprofiler-sdk] [m[0;31mE20260526 16:52:22.762746 125458799017792 generateRocpd.cpp:605] Opened result file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TD/MI100/out/pmc_1/dl385-20-mi100-3c48/2383517_results.db (UUID=00004317-8263-7263-8d7f-ca8743199190)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:22.852136 125458799017792 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.013831 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:22.853262 125458799017792 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001095 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:22.855425 125458799017792 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002135 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:22.860432 125458799017792 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.003083 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:22.864838 125458799017792 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.004378 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:22.867372 125458799017792 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002505 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:22.867401 125458799017792 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000003 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:22.883285 125458799017792 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.015870 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:22.883313 125458799017792 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:22.883325 125458799017792 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:22.883336 125458799017792 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:22.883541 125458799017792 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000187 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:22.883888 125458799017792 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.122417 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:22.889886 125458799017792 simple_timer.cpp:55] [rocprofv3] output generation ::     0.165643 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:22.889960 125458799017792 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.168157 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TD/MI100/out/pmc_1/2383517_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 5/8][Approximate profiling time left: 8 seconds]...
[profiling] Current input file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TD/MI100/perfmon/pmc_perf_4.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 16:52:25.109723 135118852026176 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.299897 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 16:52:25.119795 135118852026176 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:25.329988 135118852026176 tool.cpp:2422] HSA version 8.20.1 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 16:52:25.460312 135118852026176 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.340517 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:25.499633 135118852026176 generateRocpd.cpp:582] writing SQL database for process 2383529 on node 2710291163
   |-> [rocprofiler-sdk] [m[0;31mE20260526 16:52:25.500923 135118852026176 generateRocpd.cpp:605] Opened result file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TD/MI100/out/pmc_1/dl385-20-mi100-3c48/2383529_results.db (UUID=00004317-8d12-7d12-a50a-a109601f7e84)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:25.591440 135118852026176 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.013626 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:25.592562 135118852026176 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001092 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:25.594725 135118852026176 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002134 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:25.599858 135118852026176 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.003193 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:25.604370 135118852026176 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.004483 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:25.606855 135118852026176 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002457 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:25.606884 135118852026176 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:25.622986 135118852026176 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.016087 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:25.623013 135118852026176 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:25.623025 135118852026176 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:25.623037 135118852026176 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:25.623245 135118852026176 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000188 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:25.623652 135118852026176 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.124020 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:25.629241 135118852026176 simple_timer.cpp:55] [rocprofv3] output generation ::     0.166501 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:25.629324 135118852026176 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.168962 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TD/MI100/out/pmc_1/2383529_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 6/8][Approximate profiling time left: 5 seconds]...
[profiling] Current input file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TD/MI100/perfmon/pmc_perf_5.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 16:52:27.873333 134775682461504 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.306682 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 16:52:27.882891 134775682461504 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:28.098283 134775682461504 tool.cpp:2422] HSA version 8.20.1 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 16:52:28.231317 134775682461504 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.348426 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:28.270721 134775682461504 generateRocpd.cpp:582] writing SQL database for process 2383552 on node 2710291163
   |-> [rocprofiler-sdk] [m[0;31mE20260526 16:52:28.272079 134775682461504 generateRocpd.cpp:605] Opened result file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TD/MI100/out/pmc_1/dl385-20-mi100-3c48/2383552_results.db (UUID=00004317-97d6-77d6-a5c0-452866050200)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:28.357922 134775682461504 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.013613 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:28.359038 134775682461504 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001084 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:28.361217 134775682461504 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002151 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:28.366263 134775682461504 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.003184 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:28.370812 134775682461504 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.004520 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:28.373222 134775682461504 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002381 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:28.373251 134775682461504 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:28.388451 134775682461504 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.015181 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:28.388479 134775682461504 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:28.388491 134775682461504 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:28.388503 134775682461504 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:28.388712 134775682461504 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000188 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:28.389082 134775682461504 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.118362 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:28.394983 134775682461504 simple_timer.cpp:55] [rocprofv3] output generation ::     0.161189 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:28.395051 134775682461504 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.163683 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TD/MI100/out/pmc_1/2383552_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 7/8][Approximate profiling time left: 2 seconds]...
[profiling] Current input file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TD/MI100/perfmon/pmc_perf_6.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 16:52:30.879735 127641567133504 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.300653 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 16:52:30.889803 127641567133504 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:31.099495 127641567133504 tool.cpp:2422] HSA version 8.20.1 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 16:52:31.229866 127641567133504 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.340063 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:31.268923 127641567133504 generateRocpd.cpp:582] writing SQL database for process 2383573 on node 2710291163
   |-> [rocprofiler-sdk] [m[0;31mE20260526 16:52:31.270230 127641567133504 generateRocpd.cpp:605] Opened result file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TD/MI100/out/pmc_1/dl385-20-mi100-3c48/2383573_results.db (UUID=00004317-a39b-739b-9d3d-262f8572ad0f)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:31.360354 127641567133504 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.014128 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:31.361479 127641567133504 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001094 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:31.363656 127641567133504 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002149 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:31.368750 127641567133504 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.003165 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:31.373189 127641567133504 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.004411 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:31.375615 127641567133504 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002398 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:31.375644 127641567133504 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:31.391441 127641567133504 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.015783 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:31.391468 127641567133504 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:31.391480 127641567133504 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:31.391492 127641567133504 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:31.391687 127641567133504 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000175 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:31.392049 127641567133504 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.123127 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:31.397877 127641567133504 simple_timer.cpp:55] [rocprofv3] output generation ::     0.165535 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:31.397946 127641567133504 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.168030 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TD/MI100/out/pmc_1/2383573_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 8/8][Approximate profiling time left: 0 seconds]...
[profiling] Current input file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TD/MI100/perfmon/pmc_perf_7.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 16:52:33.628180 132604259626816 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.300185 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 16:52:33.637899 132604259626816 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:33.851327 132604259626816 tool.cpp:2422] HSA version 8.20.1 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 16:52:33.983228 132604259626816 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.345329 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:34.022404 132604259626816 generateRocpd.cpp:582] writing SQL database for process 2383676 on node 2710291163
   |-> [rocprofiler-sdk] [m[0;31mE20260526 16:52:34.023704 132604259626816 generateRocpd.cpp:605] Opened result file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TD/MI100/out/pmc_1/dl385-20-mi100-3c48/2383676_results.db (UUID=00004317-ae58-7e58-a908-5b9e7c9e7cc0)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:34.109815 132604259626816 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.013925 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:34.110928 132604259626816 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001083 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:34.113064 132604259626816 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002108 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:34.118101 132604259626816 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.003133 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:34.122517 132604259626816 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.004383 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:34.125013 132604259626816 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002468 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:34.125043 132604259626816 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:34.141373 132604259626816 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.016315 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:34.141403 132604259626816 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:34.141415 132604259626816 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:34.141427 132604259626816 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:34.141632 132604259626816 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000191 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:34.142111 132604259626816 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.119707 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:34.148102 132604259626816 simple_timer.cpp:55] [rocprofv3] output generation ::     0.162410 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:52:34.148183 132604259626816 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.164905 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TD/MI100/out/pmc_1/2383676_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
PC sampling data collection skipped as block 21 is not specified.
[roofline] Skipping roofline
