alias: vl1d, block id: 16
Rocprofiler-Compute version: 3.7.0
Profiler choice: rocprofiler-sdk
Output directory: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TCP/MI100
Target: MI100
Command: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3
Kernel Selection: None
Dispatch Selection: None
Filtered sections: ['vl1d']

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Collecting Performance Counters
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Generating native tool project using command: cmake -S /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib -B /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build
-- Checking for module 'libdw'
--   Package 'libdw', required by 'virtual:world', not found
-- Could NOT find libdw (missing: libdw_LIBRARY libdw_INCLUDE_DIR)
-- {fmt} version: 12.1.0
-- Build type:
-- Configuring done (0.1s)
-- Generating done (0.0s)
-- Build files have been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build
Building native tool using command: cmake --build /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build --parallel
[  0%] Built target gsl_assert
[ 33%] Built target fmt
[100%] Built target rocprofiler-compute-tool
Searching /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src by lib/_build/lib/librocprofiler-compute-tool.so for native collector
Using native collector: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build/lib/librocprofiler-compute-tool.so
Using native counter collection tool: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build/lib/librocprofiler-compute-tool.so
[profiling] Iteration multiplexing: Disabled
[Run 1/10][Approximate profiling time left: pending first measurement...]
[profiling] Current input file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TCP/MI100/perfmon/pmc_perf_0.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 16:52:40.316105 126978915393344 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.299516 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 16:52:40.323676 126978915393344 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 16:52:40.532555 126978915393344 tool.cpp:2422] HSA version 8.20.1 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 16:52:40.661866 126978915393344 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.338190 sec
   |-> [rocprofiler-sdk] W20260526 16:52:40.701151 126978915393344 generateRocpd.cpp:582] writing SQL database for process 2384915 on node 2710291163
   |-> [rocprofiler-sdk] E20260526 16:52:40.702461 126978915393344 generateRocpd.cpp:605] Opened result file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TCP/MI100/out/pmc_1/dl385-20-mi100-3c48/2384915_results.db (UUID=00004317-c879-7879-bb8e-693849b2b50d)
   |-> [rocprofiler-sdk] W20260526 16:52:40.790543 126978915393344 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.013815 sec
   |-> [rocprofiler-sdk] W20260526 16:52:40.791622 126978915393344 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001049 sec
   |-> [rocprofiler-sdk] W20260526 16:52:40.793707 126978915393344 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002058 sec
   |-> [rocprofiler-sdk] W20260526 16:52:40.798570 126978915393344 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.003001 sec
   |-> [rocprofiler-sdk] W20260526 16:52:40.810452 126978915393344 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.011855 sec
   |-> [rocprofiler-sdk] W20260526 16:52:40.812797 126978915393344 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002317 sec
   |-> [rocprofiler-sdk] W20260526 16:52:40.812825 126978915393344 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000002 sec
   |-> [rocprofiler-sdk] W20260526 16:52:40.827691 126978915393344 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.014852 sec
   |-> [rocprofiler-sdk] W20260526 16:52:40.827718 126978915393344 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 16:52:40.827731 126978915393344 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 16:52:40.827743 126978915393344 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] W20260526 16:52:40.827948 126978915393344 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000186 sec
   |-> [rocprofiler-sdk] W20260526 16:52:40.828313 126978915393344 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.127162 sec
   |-> [rocprofiler-sdk] W20260526 16:52:40.834114 126978915393344 simple_timer.cpp:55] [rocprofv3] output generation ::     0.169746 sec
   |-> [rocprofiler-sdk] W20260526 16:52:40.834185 126978915393344 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.172268 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TCP/MI100/out/pmc_1/2384915_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 2/10][Approximate profiling time left: 25 seconds]...
[profiling] Current input file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TCP/MI100/perfmon/pmc_perf_1.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 16:52:43.293882 131528418635584 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.297791 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 16:52:43.302776 131528418635584 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 16:52:43.519449 131528418635584 tool.cpp:2422] HSA version 8.20.1 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 16:52:43.648699 131528418635584 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.345923 sec
   |-> [rocprofiler-sdk] W20260526 16:52:43.687620 131528418635584 generateRocpd.cpp:582] writing SQL database for process 2384981 on node 2710291163
   |-> [rocprofiler-sdk] E20260526 16:52:43.688956 131528418635584 generateRocpd.cpp:605] Opened result file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TCP/MI100/out/pmc_1/dl385-20-mi100-3c48/2384981_results.db (UUID=00004317-d41c-741c-b8c5-d09d17dc9457)
   |-> [rocprofiler-sdk] W20260526 16:52:43.778961 131528418635584 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.014018 sec
   |-> [rocprofiler-sdk] W20260526 16:52:43.780111 131528418635584 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001104 sec
   |-> [rocprofiler-sdk] W20260526 16:52:43.782271 131528418635584 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002132 sec
   |-> [rocprofiler-sdk] W20260526 16:52:43.787353 131528418635584 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.003105 sec
   |-> [rocprofiler-sdk] W20260526 16:52:43.794934 131528418635584 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.007547 sec
   |-> [rocprofiler-sdk] W20260526 16:52:43.797358 131528418635584 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002395 sec
   |-> [rocprofiler-sdk] W20260526 16:52:43.797387 131528418635584 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000002 sec
   |-> [rocprofiler-sdk] W20260526 16:52:43.813455 131528418635584 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.016053 sec
   |-> [rocprofiler-sdk] W20260526 16:52:43.813489 131528418635584 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 16:52:43.813501 131528418635584 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 16:52:43.813513 131528418635584 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] W20260526 16:52:43.813732 131528418635584 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000197 sec
   |-> [rocprofiler-sdk] W20260526 16:52:43.814194 131528418635584 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.126574 sec
   |-> [rocprofiler-sdk] W20260526 16:52:43.820257 131528418635584 simple_timer.cpp:55] [rocprofv3] output generation ::     0.169074 sec
   |-> [rocprofiler-sdk] W20260526 16:52:43.820341 131528418635584 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.171590 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TCP/MI100/out/pmc_1/2384981_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 3/10][Approximate profiling time left: 21 seconds]...
[profiling] Current input file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TCP/MI100/perfmon/pmc_perf_2.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 16:52:46.038491 139901324762944 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.296488 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 16:52:46.048279 139901324762944 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 16:52:46.263297 139901324762944 tool.cpp:2422] HSA version 8.20.1 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 16:52:46.393687 139901324762944 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.345409 sec
   |-> [rocprofiler-sdk] W20260526 16:52:46.432764 139901324762944 generateRocpd.cpp:582] writing SQL database for process 2384995 on node 2710291163
   |-> [rocprofiler-sdk] E20260526 16:52:46.434037 139901324762944 generateRocpd.cpp:605] Opened result file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TCP/MI100/out/pmc_1/dl385-20-mi100-3c48/2384995_results.db (UUID=00004317-ded6-7ed6-9cf6-9d080fe30587)
   |-> [rocprofiler-sdk] W20260526 16:52:46.524127 139901324762944 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.014068 sec
   |-> [rocprofiler-sdk] W20260526 16:52:46.525285 139901324762944 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001127 sec
   |-> [rocprofiler-sdk] W20260526 16:52:46.527476 139901324762944 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002163 sec
   |-> [rocprofiler-sdk] W20260526 16:52:46.532563 139901324762944 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.003148 sec
   |-> [rocprofiler-sdk] W20260526 16:52:46.540116 139901324762944 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.007524 sec
   |-> [rocprofiler-sdk] W20260526 16:52:46.542586 139901324762944 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002442 sec
   |-> [rocprofiler-sdk] W20260526 16:52:46.542614 139901324762944 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000002 sec
   |-> [rocprofiler-sdk] W20260526 16:52:46.558331 139901324762944 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.015702 sec
   |-> [rocprofiler-sdk] W20260526 16:52:46.558366 139901324762944 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 16:52:46.558378 139901324762944 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 16:52:46.558390 139901324762944 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] W20260526 16:52:46.558599 139901324762944 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000195 sec
   |-> [rocprofiler-sdk] W20260526 16:52:46.559034 139901324762944 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.126271 sec
   |-> [rocprofiler-sdk] W20260526 16:52:46.564858 139901324762944 simple_timer.cpp:55] [rocprofv3] output generation ::     0.168643 sec
   |-> [rocprofiler-sdk] W20260526 16:52:46.564940 139901324762944 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.171202 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TCP/MI100/out/pmc_1/2384995_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 4/10][Approximate profiling time left: 17 seconds]...
[profiling] Current input file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TCP/MI100/perfmon/pmc_perf_3.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 16:52:49.044732 129326962327360 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.300193 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 16:52:49.054672 129326962327360 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 16:52:49.269883 129326962327360 tool.cpp:2422] HSA version 8.20.1 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 16:52:49.399449 129326962327360 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.344778 sec
   |-> [rocprofiler-sdk] W20260526 16:52:49.439007 129326962327360 generateRocpd.cpp:582] writing SQL database for process 2385005 on node 2710291163
   |-> [rocprofiler-sdk] E20260526 16:52:49.440322 129326962327360 generateRocpd.cpp:605] Opened result file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TCP/MI100/out/pmc_1/dl385-20-mi100-3c48/2385005_results.db (UUID=00004317-ea91-7a91-a695-c069d7a31150)
   |-> [rocprofiler-sdk] W20260526 16:52:49.530248 129326962327360 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.013886 sec
   |-> [rocprofiler-sdk] W20260526 16:52:49.531374 129326962327360 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001095 sec
   |-> [rocprofiler-sdk] W20260526 16:52:49.533540 129326962327360 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002138 sec
   |-> [rocprofiler-sdk] W20260526 16:52:49.538558 129326962327360 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.003117 sec
   |-> [rocprofiler-sdk] W20260526 16:52:49.546114 129326962327360 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.007528 sec
   |-> [rocprofiler-sdk] W20260526 16:52:49.548550 129326962327360 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002408 sec
   |-> [rocprofiler-sdk] W20260526 16:52:49.548580 129326962327360 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000003 sec
   |-> [rocprofiler-sdk] W20260526 16:52:49.564624 129326962327360 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.016029 sec
   |-> [rocprofiler-sdk] W20260526 16:52:49.564652 129326962327360 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 16:52:49.564664 129326962327360 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 16:52:49.564675 129326962327360 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] W20260526 16:52:49.564881 129326962327360 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000184 sec
   |-> [rocprofiler-sdk] W20260526 16:52:49.565267 129326962327360 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.126260 sec
   |-> [rocprofiler-sdk] W20260526 16:52:49.571137 129326962327360 simple_timer.cpp:55] [rocprofv3] output generation ::     0.169180 sec
   |-> [rocprofiler-sdk] W20260526 16:52:49.571216 129326962327360 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.171717 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TCP/MI100/out/pmc_1/2385005_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 5/10][Approximate profiling time left: 14 seconds]...
[profiling] Current input file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TCP/MI100/perfmon/pmc_perf_4.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 16:52:51.807678 138515195785024 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.300150 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 16:52:51.817284 138515195785024 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 16:52:52.031099 138515195785024 tool.cpp:2422] HSA version 8.20.1 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 16:52:52.161546 138515195785024 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.344262 sec
   |-> [rocprofiler-sdk] W20260526 16:52:52.200647 138515195785024 generateRocpd.cpp:582] writing SQL database for process 2385015 on node 2710291163
   |-> [rocprofiler-sdk] E20260526 16:52:52.201931 138515195785024 generateRocpd.cpp:605] Opened result file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TCP/MI100/out/pmc_1/dl385-20-mi100-3c48/2385015_results.db (UUID=00004317-f55c-755c-82d1-2a4147dced12)
   |-> [rocprofiler-sdk] W20260526 16:52:52.292381 138515195785024 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.013987 sec
   |-> [rocprofiler-sdk] W20260526 16:52:52.293488 138515195785024 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001075 sec
   |-> [rocprofiler-sdk] W20260526 16:52:52.295661 138515195785024 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002145 sec
   |-> [rocprofiler-sdk] W20260526 16:52:52.300727 138515195785024 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.003111 sec
   |-> [rocprofiler-sdk] W20260526 16:52:52.308302 138515195785024 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.007546 sec
   |-> [rocprofiler-sdk] W20260526 16:52:52.310712 138515195785024 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002381 sec
   |-> [rocprofiler-sdk] W20260526 16:52:52.310741 138515195785024 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000002 sec
   |-> [rocprofiler-sdk] W20260526 16:52:52.326357 138515195785024 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.015601 sec
   |-> [rocprofiler-sdk] W20260526 16:52:52.326385 138515195785024 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 16:52:52.326397 138515195785024 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 16:52:52.326409 138515195785024 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] W20260526 16:52:52.326615 138515195785024 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000189 sec
   |-> [rocprofiler-sdk] W20260526 16:52:52.326969 138515195785024 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.126322 sec
   |-> [rocprofiler-sdk] W20260526 16:52:52.332779 138515195785024 simple_timer.cpp:55] [rocprofv3] output generation ::     0.168760 sec
   |-> [rocprofiler-sdk] W20260526 16:52:52.332855 138515195785024 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.171258 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TCP/MI100/out/pmc_1/2385015_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 6/10][Approximate profiling time left: 11 seconds]...
[profiling] Current input file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TCP/MI100/perfmon/pmc_perf_5.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 16:52:54.564233 132708398100288 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.296007 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 16:52:54.572784 132708398100288 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 16:52:54.786730 132708398100288 tool.cpp:2422] HSA version 8.20.1 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 16:52:54.914708 132708398100288 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.341924 sec
   |-> [rocprofiler-sdk] W20260526 16:52:54.953586 132708398100288 generateRocpd.cpp:582] writing SQL database for process 2385034 on node 2710291163
   |-> [rocprofiler-sdk] E20260526 16:52:54.954853 132708398100288 generateRocpd.cpp:605] Opened result file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TCP/MI100/out/pmc_1/dl385-20-mi100-3c48/2385034_results.db (UUID=00004318-0024-7024-97a6-0f628b8faff5)
   |-> [rocprofiler-sdk] W20260526 16:52:55.044738 132708398100288 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.014024 sec
   |-> [rocprofiler-sdk] W20260526 16:52:55.045908 132708398100288 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001139 sec
   |-> [rocprofiler-sdk] W20260526 16:52:55.048109 132708398100288 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002173 sec
   |-> [rocprofiler-sdk] W20260526 16:52:55.053213 132708398100288 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.003189 sec
   |-> [rocprofiler-sdk] W20260526 16:52:55.060760 132708398100288 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.007519 sec
   |-> [rocprofiler-sdk] W20260526 16:52:55.063215 132708398100288 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002427 sec
   |-> [rocprofiler-sdk] W20260526 16:52:55.063244 132708398100288 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000002 sec
   |-> [rocprofiler-sdk] W20260526 16:52:55.078848 132708398100288 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.015590 sec
   |-> [rocprofiler-sdk] W20260526 16:52:55.078876 132708398100288 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 16:52:55.078888 132708398100288 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 16:52:55.078900 132708398100288 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] W20260526 16:52:55.079120 132708398100288 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000200 sec
   |-> [rocprofiler-sdk] W20260526 16:52:55.079476 132708398100288 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.125890 sec
   |-> [rocprofiler-sdk] W20260526 16:52:55.085231 132708398100288 simple_timer.cpp:55] [rocprofv3] output generation ::     0.168084 sec
   |-> [rocprofiler-sdk] W20260526 16:52:55.085303 132708398100288 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.170544 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TCP/MI100/out/pmc_1/2385034_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 7/10][Approximate profiling time left: 8 seconds]...
[profiling] Current input file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TCP/MI100/perfmon/pmc_perf_6.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 16:52:57.283848 124634820079424 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.298760 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 16:52:57.293522 124634820079424 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 16:52:57.510604 124634820079424 tool.cpp:2422] HSA version 8.20.1 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 16:52:57.639537 124634820079424 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.346016 sec
   |-> [rocprofiler-sdk] W20260526 16:52:57.678743 124634820079424 generateRocpd.cpp:582] writing SQL database for process 2385044 on node 2710291163
   |-> [rocprofiler-sdk] E20260526 16:52:57.680084 124634820079424 generateRocpd.cpp:605] Opened result file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TCP/MI100/out/pmc_1/dl385-20-mi100-3c48/2385044_results.db (UUID=00004318-0ac1-7ac1-9253-690b2ece3389)
   |-> [rocprofiler-sdk] W20260526 16:52:57.771497 124634820079424 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.014246 sec
   |-> [rocprofiler-sdk] W20260526 16:52:57.772649 124634820079424 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001121 sec
   |-> [rocprofiler-sdk] W20260526 16:52:57.774834 124634820079424 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002156 sec
   |-> [rocprofiler-sdk] W20260526 16:52:57.780062 124634820079424 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.003203 sec
   |-> [rocprofiler-sdk] W20260526 16:52:57.787641 124634820079424 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.007550 sec
   |-> [rocprofiler-sdk] W20260526 16:52:57.790099 124634820079424 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002430 sec
   |-> [rocprofiler-sdk] W20260526 16:52:57.790128 124634820079424 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000002 sec
   |-> [rocprofiler-sdk] W20260526 16:52:57.806416 124634820079424 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.016273 sec
   |-> [rocprofiler-sdk] W20260526 16:52:57.806446 124634820079424 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 16:52:57.806458 124634820079424 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 16:52:57.806470 124634820079424 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] W20260526 16:52:57.806670 124634820079424 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000187 sec
   |-> [rocprofiler-sdk] W20260526 16:52:57.807070 124634820079424 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.128328 sec
   |-> [rocprofiler-sdk] W20260526 16:52:57.812944 124634820079424 simple_timer.cpp:55] [rocprofv3] output generation ::     0.170933 sec
   |-> [rocprofiler-sdk] W20260526 16:52:57.813042 124634820079424 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.173451 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TCP/MI100/out/pmc_1/2385044_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 8/10][Approximate profiling time left: 5 seconds]...
[profiling] Current input file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TCP/MI100/perfmon/pmc_perf_7.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 16:53:00.019306 135963674914624 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.298005 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 16:53:00.029130 135963674914624 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 16:53:00.241379 135963674914624 tool.cpp:2422] HSA version 8.20.1 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 16:53:00.370949 135963674914624 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.341818 sec
   |-> [rocprofiler-sdk] W20260526 16:53:00.410102 135963674914624 generateRocpd.cpp:582] writing SQL database for process 2385054 on node 2710291163
   |-> [rocprofiler-sdk] E20260526 16:53:00.411405 135963674914624 generateRocpd.cpp:605] Opened result file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TCP/MI100/out/pmc_1/dl385-20-mi100-3c48/2385054_results.db (UUID=00004318-1572-7572-ba3e-f3a1a9eef5b7)
   |-> [rocprofiler-sdk] W20260526 16:53:00.501678 135963674914624 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.013892 sec
   |-> [rocprofiler-sdk] W20260526 16:53:00.502824 135963674914624 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001116 sec
   |-> [rocprofiler-sdk] W20260526 16:53:00.505012 135963674914624 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002159 sec
   |-> [rocprofiler-sdk] W20260526 16:53:00.511128 135963674914624 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.003183 sec
   |-> [rocprofiler-sdk] W20260526 16:53:00.518663 135963674914624 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.007506 sec
   |-> [rocprofiler-sdk] W20260526 16:53:00.521063 135963674914624 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002372 sec
   |-> [rocprofiler-sdk] W20260526 16:53:00.521092 135963674914624 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000002 sec
   |-> [rocprofiler-sdk] W20260526 16:53:00.536935 135963674914624 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.015829 sec
   |-> [rocprofiler-sdk] W20260526 16:53:00.536964 135963674914624 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 16:53:00.536984 135963674914624 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 16:53:00.537002 135963674914624 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] W20260526 16:53:00.537203 135963674914624 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000187 sec
   |-> [rocprofiler-sdk] W20260526 16:53:00.537557 135963674914624 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.127455 sec
   |-> [rocprofiler-sdk] W20260526 16:53:00.543297 135963674914624 simple_timer.cpp:55] [rocprofv3] output generation ::     0.169880 sec
   |-> [rocprofiler-sdk] W20260526 16:53:00.543368 135963674914624 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.172358 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TCP/MI100/out/pmc_1/2385054_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 9/10][Approximate profiling time left: 2 seconds]...
[profiling] Current input file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TCP/MI100/perfmon/pmc_perf_8.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 16:53:02.768534 135309575417664 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.302086 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 16:53:02.778697 135309575417664 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 16:53:02.991082 135309575417664 tool.cpp:2422] HSA version 8.20.1 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 16:53:03.125555 135309575417664 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.346859 sec
   |-> [rocprofiler-sdk] W20260526 16:53:03.164713 135309575417664 generateRocpd.cpp:582] writing SQL database for process 2385064 on node 2710291163
   |-> [rocprofiler-sdk] E20260526 16:53:03.166066 135309575417664 generateRocpd.cpp:605] Opened result file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TCP/MI100/out/pmc_1/dl385-20-mi100-3c48/2385064_results.db (UUID=00004318-202b-702b-8c31-becf4c3de28e)
   |-> [rocprofiler-sdk] W20260526 16:53:03.256457 135309575417664 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.013762 sec
   |-> [rocprofiler-sdk] W20260526 16:53:03.257598 135309575417664 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001110 sec
   |-> [rocprofiler-sdk] W20260526 16:53:03.259764 135309575417664 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002137 sec
   |-> [rocprofiler-sdk] W20260526 16:53:03.264885 135309575417664 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.003153 sec
   |-> [rocprofiler-sdk] W20260526 16:53:03.272451 135309575417664 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.007537 sec
   |-> [rocprofiler-sdk] W20260526 16:53:03.274886 135309575417664 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002406 sec
   |-> [rocprofiler-sdk] W20260526 16:53:03.274915 135309575417664 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000002 sec
   |-> [rocprofiler-sdk] W20260526 16:53:03.290812 135309575417664 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.015883 sec
   |-> [rocprofiler-sdk] W20260526 16:53:03.290842 135309575417664 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 16:53:03.290854 135309575417664 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 16:53:03.290866 135309575417664 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] W20260526 16:53:03.291094 135309575417664 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000207 sec
   |-> [rocprofiler-sdk] W20260526 16:53:03.291490 135309575417664 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.126777 sec
   |-> [rocprofiler-sdk] W20260526 16:53:03.297344 135309575417664 simple_timer.cpp:55] [rocprofv3] output generation ::     0.169319 sec
   |-> [rocprofiler-sdk] W20260526 16:53:03.297424 135309575417664 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.171816 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TCP/MI100/out/pmc_1/2385064_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 10/10][Approximate profiling time left: 0 seconds]...
[profiling] Current input file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TCP/MI100/perfmon/pmc_perf_9.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 16:53:05.776774 125493517725504 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.298554 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 16:53:05.786418 125493517725504 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 16:53:05.998501 125493517725504 tool.cpp:2422] HSA version 8.20.1 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 16:53:06.127819 125493517725504 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.341401 sec
   |-> [rocprofiler-sdk] W20260526 16:53:06.167329 125493517725504 generateRocpd.cpp:582] writing SQL database for process 2385074 on node 2710291163
   |-> [rocprofiler-sdk] E20260526 16:53:06.168646 125493517725504 generateRocpd.cpp:605] Opened result file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TCP/MI100/out/pmc_1/dl385-20-mi100-3c48/2385074_results.db (UUID=00004318-2bee-7bee-a1fc-9ed7715d27c1)
   |-> [rocprofiler-sdk] W20260526 16:53:06.258453 125493517725504 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.013941 sec
   |-> [rocprofiler-sdk] W20260526 16:53:06.259584 125493517725504 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001100 sec
   |-> [rocprofiler-sdk] W20260526 16:53:06.261752 125493517725504 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002140 sec
   |-> [rocprofiler-sdk] W20260526 16:53:06.266850 125493517725504 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.003132 sec
   |-> [rocprofiler-sdk] W20260526 16:53:06.271295 125493517725504 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.004416 sec
   |-> [rocprofiler-sdk] W20260526 16:53:06.273720 125493517725504 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002396 sec
   |-> [rocprofiler-sdk] W20260526 16:53:06.273749 125493517725504 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000002 sec
   |-> [rocprofiler-sdk] W20260526 16:53:06.289477 125493517725504 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.015713 sec
   |-> [rocprofiler-sdk] W20260526 16:53:06.289507 125493517725504 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 16:53:06.289519 125493517725504 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 16:53:06.289531 125493517725504 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] W20260526 16:53:06.289739 125493517725504 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000186 sec
   |-> [rocprofiler-sdk] W20260526 16:53:06.290176 125493517725504 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.122848 sec
   |-> [rocprofiler-sdk] W20260526 16:53:06.296077 125493517725504 simple_timer.cpp:55] [rocprofv3] output generation ::     0.165802 sec
   |-> [rocprofiler-sdk] W20260526 16:53:06.296157 125493517725504 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.168289 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TCP/MI100/out/pmc_1/2385074_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
PC sampling data collection skipped as block 21 is not specified.
[roofline] Skipping roofline
