alias: cu_ins, block id: 10
alias: cu_pipe, block id: 11
alias: ins_cache, block id: 13
alias: sl1d, block id: 14
alias: vl1d, block id: 16
alias: cpc, block id: 5
Rocprofiler-Compute version: 3.7.0
Profiler choice: rocprofiler-sdk
Output directory: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_SQ_SQC_TCP_CPC/MI100
Target: MI100
Command: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3
Kernel Selection: None
Dispatch Selection: None
Filtered sections: ['cu_ins', 'cu_pipe', 'ins_cache', 'sl1d', 'vl1d', 'cpc']

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Collecting Performance Counters
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Generating native tool project using command: cmake -S /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib -B /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build
-- Checking for module 'libdw'
--   Package 'libdw', required by 'virtual:world', not found
-- Could NOT find libdw (missing: libdw_LIBRARY libdw_INCLUDE_DIR)
-- {fmt} version: 12.1.0
-- Build type:
-- Configuring done (0.2s)
-- Generating done (0.0s)
-- Build files have been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build
Building native tool using command: cmake --build /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build --parallel
[  0%] Built target gsl_assert
[ 33%] Built target fmt
[100%] Built target rocprofiler-compute-tool
Searching /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src by lib/_build/lib/librocprofiler-compute-tool.so for native collector
Using native collector: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build/lib/librocprofiler-compute-tool.so
Using native counter collection tool: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build/lib/librocprofiler-compute-tool.so
[profiling] Iteration multiplexing: Disabled
[Run 1/10][Approximate profiling time left: pending first measurement...]
[profiling] Current input file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_SQ_SQC_TCP_CPC/MI100/perfmon/pmc_perf_0.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 16:55:38.256850 133487745806144 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.299580 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 16:55:38.266298 133487745806144 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:38.478072 133487745806144 tool.cpp:2422] HSA version 8.20.1 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 16:55:38.608253 133487745806144 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.341955 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:38.647599 133487745806144 generateRocpd.cpp:582] writing SQL database for process 2385949 on node 2710291163
   |-> [rocprofiler-sdk] [m[0;31mE20260526 16:55:38.648911 133487745806144 generateRocpd.cpp:605] Opened result file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_SQ_SQC_TCP_CPC/MI100/out/pmc_1/dl385-20-mi100-3c48/2385949_results.db (UUID=0000431a-7f8e-7f8e-935f-61f67796d5ac)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:38.737808 133487745806144 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.014004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:38.738906 133487745806144 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001068 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:38.741076 133487745806144 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002143 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:38.746120 133487745806144 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.003079 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:38.774752 133487745806144 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.028603 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:38.777420 133487745806144 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002639 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:38.777449 133487745806144 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:38.793428 133487745806144 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.015964 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:38.793455 133487745806144 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:38.793478 133487745806144 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:38.793491 133487745806144 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:38.793688 133487745806144 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000185 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:38.794081 133487745806144 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.146482 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:38.799833 133487745806144 simple_timer.cpp:55] [rocprofv3] output generation ::     0.189016 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:38.799909 133487745806144 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.191603 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_SQ_SQC_TCP_CPC/MI100/out/pmc_1/2385949_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 2/10][Approximate profiling time left: 26 seconds]...
[profiling] Current input file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_SQ_SQC_TCP_CPC/MI100/perfmon/pmc_perf_1.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 16:55:41.028905 128306861317952 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.299136 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 16:55:41.039789 128306861317952 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:41.251620 128306861317952 tool.cpp:2422] HSA version 8.20.1 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 16:55:41.382036 128306861317952 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.342247 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:41.421246 128306861317952 generateRocpd.cpp:582] writing SQL database for process 2385960 on node 2710291163
   |-> [rocprofiler-sdk] [m[0;31mE20260526 16:55:41.422530 128306861317952 generateRocpd.cpp:605] Opened result file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_SQ_SQC_TCP_CPC/MI100/out/pmc_1/dl385-20-mi100-3c48/2385960_results.db (UUID=0000431a-8a62-7a62-ad0b-7ee2e6550000)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:41.498292 128306861317952 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.010673 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:41.499227 128306861317952 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.000912 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:41.500936 128306861317952 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001687 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:41.505121 128306861317952 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.002555 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:41.528357 128306861317952 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.023214 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:41.530577 128306861317952 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002198 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:41.530599 128306861317952 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:41.543000 128306861317952 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.012389 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:41.543026 128306861317952 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:41.543035 128306861317952 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:41.543044 128306861317952 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:41.543209 128306861317952 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000155 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:41.543586 128306861317952 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.122341 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:41.548343 128306861317952 simple_timer.cpp:55] [rocprofv3] output generation ::     0.163675 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:41.548410 128306861317952 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.166324 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_SQ_SQC_TCP_CPC/MI100/out/pmc_1/2385960_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 3/10][Approximate profiling time left: 20 seconds]...
[profiling] Current input file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_SQ_SQC_TCP_CPC/MI100/perfmon/pmc_perf_2.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 16:55:43.772990 129023774965568 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.299209 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 16:55:43.782384 129023774965568 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:43.994096 129023774965568 tool.cpp:2422] HSA version 8.20.1 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 16:55:44.128180 129023774965568 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.345797 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:44.167412 129023774965568 generateRocpd.cpp:582] writing SQL database for process 2385971 on node 2710291163
   |-> [rocprofiler-sdk] [m[0;31mE20260526 16:55:44.168728 129023774965568 generateRocpd.cpp:605] Opened result file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_SQ_SQC_TCP_CPC/MI100/out/pmc_1/dl385-20-mi100-3c48/2385971_results.db (UUID=0000431a-951a-751a-87f8-5e692cc5e2e1)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:44.256714 129023774965568 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.014061 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:44.257848 129023774965568 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001102 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:44.260041 129023774965568 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002165 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:44.265040 129023774965568 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.003105 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:44.292238 129023774965568 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.027170 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:44.294919 129023774965568 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002652 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:44.294948 129023774965568 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:44.311014 129023774965568 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.016050 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:44.311047 129023774965568 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:44.311059 129023774965568 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:44.311071 129023774965568 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:44.311287 129023774965568 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000199 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:44.311848 129023774965568 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.144437 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:44.317835 129023774965568 simple_timer.cpp:55] [rocprofv3] output generation ::     0.187163 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:44.317922 129023774965568 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.189690 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_SQ_SQC_TCP_CPC/MI100/out/pmc_1/2385971_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 4/10][Approximate profiling time left: 17 seconds]...
[profiling] Current input file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_SQ_SQC_TCP_CPC/MI100/perfmon/pmc_perf_3.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 16:55:46.554237 127070563090240 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.297969 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 16:55:46.564108 127070563090240 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:46.775639 127070563090240 tool.cpp:2422] HSA version 8.20.1 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 16:55:46.903123 127070563090240 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.339015 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:46.942299 127070563090240 generateRocpd.cpp:582] writing SQL database for process 2385981 on node 2710291163
   |-> [rocprofiler-sdk] [m[0;31mE20260526 16:55:46.943594 127070563090240 generateRocpd.cpp:605] Opened result file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_SQ_SQC_TCP_CPC/MI100/out/pmc_1/dl385-20-mi100-3c48/2385981_results.db (UUID=0000431a-9ff8-7ff8-ae01-c7192cf3fbb2)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:47.031443 127070563090240 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.014092 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:47.032576 127070563090240 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001102 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:47.034773 127070563090240 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002168 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:47.039842 127070563090240 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.003104 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:47.059613 127070563090240 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.019742 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:47.062135 127070563090240 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002493 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:47.062164 127070563090240 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:47.078055 127070563090240 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.015876 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:47.078088 127070563090240 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:47.078100 127070563090240 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:47.078112 127070563090240 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:47.078323 127070563090240 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000196 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:47.078847 127070563090240 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.136549 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:47.084812 127070563090240 simple_timer.cpp:55] [rocprofv3] output generation ::     0.179166 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:47.084897 127070563090240 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.181724 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_SQ_SQC_TCP_CPC/MI100/out/pmc_1/2385981_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 5/10][Approximate profiling time left: 14 seconds]...
[profiling] Current input file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_SQ_SQC_TCP_CPC/MI100/perfmon/pmc_perf_4.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 16:55:49.319854 128067427516224 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.299237 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 16:55:49.330132 128067427516224 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:49.540847 128067427516224 tool.cpp:2422] HSA version 8.20.1 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 16:55:49.669187 128067427516224 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.339056 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:49.708220 128067427516224 generateRocpd.cpp:582] writing SQL database for process 2385991 on node 2710291163
   |-> [rocprofiler-sdk] [m[0;31mE20260526 16:55:49.709496 128067427516224 generateRocpd.cpp:605] Opened result file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_SQ_SQC_TCP_CPC/MI100/out/pmc_1/dl385-20-mi100-3c48/2385991_results.db (UUID=0000431a-aac5-7ac5-8801-719a4b4bb522)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:49.797655 128067427516224 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.013854 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:49.798748 128067427516224 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001060 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:49.800952 128067427516224 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002175 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:49.806068 128067427516224 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.003136 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:49.813548 128067427516224 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.007447 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:49.816024 128067427516224 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002447 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:49.816054 128067427516224 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:49.831407 128067427516224 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.015339 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:49.831435 128067427516224 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:49.831447 128067427516224 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:49.831459 128067427516224 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:49.831657 128067427516224 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000182 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:49.832059 128067427516224 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.123839 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:49.837819 128067427516224 simple_timer.cpp:55] [rocprofv3] output generation ::     0.166111 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:49.837893 128067427516224 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.168652 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_SQ_SQC_TCP_CPC/MI100/out/pmc_1/2385991_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 6/10][Approximate profiling time left: 11 seconds]...
[profiling] Current input file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_SQ_SQC_TCP_CPC/MI100/perfmon/pmc_perf_5.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 16:55:52.071959 133185409888064 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.297756 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 16:55:52.080728 133185409888064 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:52.291355 133185409888064 tool.cpp:2422] HSA version 8.20.1 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 16:55:52.419594 133185409888064 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.338866 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:52.458700 133185409888064 generateRocpd.cpp:582] writing SQL database for process 2386002 on node 2710291163
   |-> [rocprofiler-sdk] [m[0;31mE20260526 16:55:52.459971 133185409888064 generateRocpd.cpp:605] Opened result file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_SQ_SQC_TCP_CPC/MI100/out/pmc_1/dl385-20-mi100-3c48/2386002_results.db (UUID=0000431a-b586-7586-9457-ccea1b43854c)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:52.548081 133185409888064 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.013940 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:52.549185 133185409888064 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001073 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:52.551353 133185409888064 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002140 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:52.556456 133185409888064 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.003058 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:52.563935 133185409888064 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.007450 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:52.566336 133185409888064 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002373 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:52.566365 133185409888064 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000003 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:52.581702 133185409888064 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.015323 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:52.581729 133185409888064 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:52.581741 133185409888064 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:52.581752 133185409888064 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:52.581959 133185409888064 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000189 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:52.582333 133185409888064 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.123634 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:52.588022 133185409888064 simple_timer.cpp:55] [rocprofv3] output generation ::     0.165912 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:52.588096 133185409888064 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.168447 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_SQ_SQC_TCP_CPC/MI100/out/pmc_1/2386002_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 7/10][Approximate profiling time left: 8 seconds]...
[profiling] Current input file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_SQ_SQC_TCP_CPC/MI100/perfmon/pmc_perf_6.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 16:55:54.827168 129600104263488 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.295461 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 16:55:54.837373 129600104263488 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:55.048503 129600104263488 tool.cpp:2422] HSA version 8.20.1 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 16:55:55.175959 129600104263488 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.338586 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:55.214679 129600104263488 generateRocpd.cpp:582] writing SQL database for process 2386012 on node 2710291163
   |-> [rocprofiler-sdk] [m[0;31mE20260526 16:55:55.215953 129600104263488 generateRocpd.cpp:605] Opened result file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_SQ_SQC_TCP_CPC/MI100/out/pmc_1/dl385-20-mi100-3c48/2386012_results.db (UUID=0000431a-c04c-704c-86ed-6bb593a78eb3)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:55.305760 129600104263488 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.013860 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:55.306920 129600104263488 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001129 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:55.309133 129600104263488 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002186 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:55.314186 129600104263488 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.003134 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:55.321734 129600104263488 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.007519 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:55.324183 129600104263488 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002421 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:55.324211 129600104263488 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:55.339745 129600104263488 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.015519 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:55.339772 129600104263488 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:55.339785 129600104263488 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:55.339797 129600104263488 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:55.340008 129600104263488 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000194 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:55.340362 129600104263488 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.125683 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:55.346173 129600104263488 simple_timer.cpp:55] [rocprofv3] output generation ::     0.167781 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:55.346252 129600104263488 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.170233 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_SQ_SQC_TCP_CPC/MI100/out/pmc_1/2386012_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 8/10][Approximate profiling time left: 5 seconds]...
[profiling] Current input file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_SQ_SQC_TCP_CPC/MI100/perfmon/pmc_perf_7.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 16:55:57.817167 125737166528320 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.298509 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 16:55:57.827354 125737166528320 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:58.038390 125737166528320 tool.cpp:2422] HSA version 8.20.1 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 16:55:58.168869 125737166528320 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.341515 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:58.208070 125737166528320 generateRocpd.cpp:582] writing SQL database for process 2386022 on node 2710291163
   |-> [rocprofiler-sdk] [m[0;31mE20260526 16:55:58.209365 125737166528320 generateRocpd.cpp:605] Opened result file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_SQ_SQC_TCP_CPC/MI100/out/pmc_1/dl385-20-mi100-3c48/2386022_results.db (UUID=0000431a-cbf7-7bf7-995b-f08d688936c5)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:58.299397 125737166528320 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.014003 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:58.300502 125737166528320 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001072 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:58.302659 125737166528320 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002129 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:58.307829 125737166528320 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.003181 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:58.315432 125737166528320 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.007574 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:58.317853 125737166528320 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002393 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:58.317882 125737166528320 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:58.333475 125737166528320 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.015578 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:58.333508 125737166528320 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:58.333520 125737166528320 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:58.333532 125737166528320 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:58.333727 125737166528320 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000181 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:58.334170 125737166528320 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.126101 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:58.340110 125737166528320 simple_timer.cpp:55] [rocprofv3] output generation ::     0.168775 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:55:58.340195 125737166528320 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.171275 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_SQ_SQC_TCP_CPC/MI100/out/pmc_1/2386022_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 9/10][Approximate profiling time left: 2 seconds]...
[profiling] Current input file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_SQ_SQC_TCP_CPC/MI100/perfmon/pmc_perf_8.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 16:56:00.554036 135171477958464 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.297162 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 16:56:00.562672 135171477958464 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:56:00.773817 135171477958464 tool.cpp:2422] HSA version 8.20.1 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 16:56:00.904089 135171477958464 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.341417 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:56:00.943147 135171477958464 generateRocpd.cpp:582] writing SQL database for process 2386033 on node 2710291163
   |-> [rocprofiler-sdk] [m[0;31mE20260526 16:56:00.944463 135171477958464 generateRocpd.cpp:605] Opened result file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_SQ_SQC_TCP_CPC/MI100/out/pmc_1/dl385-20-mi100-3c48/2386033_results.db (UUID=0000431a-d6a9-76a9-a395-b5718a660c2d)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:56:01.035202 135171477958464 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.014078 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:56:01.036371 135171477958464 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001139 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:56:01.038582 135171477958464 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002182 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:56:01.043777 135171477958464 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.003211 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:56:01.048220 135171477958464 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.004414 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:56:01.050710 135171477958464 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002461 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:56:01.050740 135171477958464 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000003 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:56:01.066577 135171477958464 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.015823 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:56:01.066605 135171477958464 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:56:01.066617 135171477958464 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:56:01.066629 135171477958464 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:56:01.066825 135171477958464 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000179 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:56:01.067203 135171477958464 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.124056 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:56:01.073014 135171477958464 simple_timer.cpp:55] [rocprofv3] output generation ::     0.166417 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:56:01.073093 135171477958464 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.168953 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_SQ_SQC_TCP_CPC/MI100/out/pmc_1/2386033_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 10/10][Approximate profiling time left: 0 seconds]...
[profiling] Current input file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_SQ_SQC_TCP_CPC/MI100/perfmon/pmc_perf_SQ_IFETCH_LEVEL_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 16:56:03.287095 132279785897792 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.297735 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 16:56:03.295742 132279785897792 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:56:03.511210 132279785897792 tool.cpp:2422] HSA version 8.20.1 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 16:56:03.641000 132279785897792 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.345259 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:56:03.680474 132279785897792 generateRocpd.cpp:582] writing SQL database for process 2386043 on node 2710291163
   |-> [rocprofiler-sdk] [m[0;31mE20260526 16:56:03.681752 132279785897792 generateRocpd.cpp:605] Opened result file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_SQ_SQC_TCP_CPC/MI100/out/pmc_1/dl385-20-mi100-3c48/2386043_results.db (UUID=0000431a-e155-7155-a7c3-7bac9584ab48)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:56:03.773343 132279785897792 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.014182 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:56:03.774509 132279785897792 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001134 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:56:03.776759 132279785897792 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002222 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:56:03.781996 132279785897792 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.003167 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:56:03.811064 132279785897792 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.029039 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:56:03.813912 132279785897792 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002813 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:56:03.813942 132279785897792 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:56:03.830005 132279785897792 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.016048 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:56:03.830037 132279785897792 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:56:03.830049 132279785897792 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:56:03.830062 132279785897792 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:56:03.830280 132279785897792 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000200 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:56:03.830700 132279785897792 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.150227 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:56:03.836579 132279785897792 simple_timer.cpp:55] [rocprofv3] output generation ::     0.193067 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:56:03.836664 132279785897792 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.195611 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_SQ_SQC_TCP_CPC/MI100/out/pmc_1/2386043_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
PC sampling data collection skipped as block 21 is not specified.
[roofline] Skipping roofline
