Rocprofiler-Compute version: 3.7.0
Profiler choice: rocprofiler-sdk
Output directory: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/kernel/MI200
Target: MI210
Command: ./tests/vcopy -n 1048576 -b 256 -i 3
Kernel Selection: ['vecCopy']
Dispatch Selection: None
Filtered sections: All

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Collecting Performance Counters
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Generating native tool project using command: cmake -S /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib -B /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build
-- Checking for module 'libdw'
--   Package 'libdw', required by 'virtual:world', not found
-- Could NOT find libdw (missing: libdw_LIBRARY libdw_INCLUDE_DIR)
-- {fmt} version: 12.1.0
-- Build type:
-- Configuring done (0.1s)
-- Generating done (0.0s)
-- Build files have been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build
Building native tool using command: cmake --build /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build --parallel
[  0%] Built target gsl_assert
[ 33%] Built target fmt
[100%] Built target rocprofiler-compute-tool
Searching /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src by lib/_build/lib/librocprofiler-compute-tool.so for native collector
Using native collector: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build/lib/librocprofiler-compute-tool.so
Using native counter collection tool: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build/lib/librocprofiler-compute-tool.so
[profiling] Iteration multiplexing: Disabled
[Run 1/13][Approximate profiling time left: pending first measurement...]
[profiling] Current input file: tests/workloads/kernel/MI200/perfmon/pmc_perf_0.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 19:05:26.086696 140015523688256 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.191501 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 19:05:26.087370 140015523688256 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000001 sec
   |-> [rocprofiler-sdk] W20260526 19:05:26.281683 140015523688256 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 19:05:26.373174 140015523688256 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.285805 sec
   |-> [rocprofiler-sdk] W20260526 19:05:26.395409 140015523688256 generateRocpd.cpp:583] writing SQL database for process 2525551 on node 2976770398
   |-> [rocprofiler-sdk] E20260526 19:05:26.396199 140015523688256 generateRocpd.cpp:606] Opened result file: tests/workloads/kernel/MI200/out/pmc_1/smc4124-25-mi210-3c48/2525551_results.db (UUID=0001fa7a-0c64-7c64-916d-599239ad5a73)
   |-> [rocprofiler-sdk] W20260526 19:05:26.520966 140015523688256 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008089 sec
   |-> [rocprofiler-sdk] W20260526 19:05:26.522205 140015523688256 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001221 sec
   |-> [rocprofiler-sdk] W20260526 19:05:26.523811 140015523688256 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001588 sec
   |-> [rocprofiler-sdk] W20260526 19:05:26.534109 140015523688256 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008255 sec
   |-> [rocprofiler-sdk] W20260526 19:05:26.858479 140015523688256 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.324354 sec
   |-> [rocprofiler-sdk] W20260526 19:05:26.860945 140015523688256 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002448 sec
   |-> [rocprofiler-sdk] W20260526 19:05:26.860962 140015523688256 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] W20260526 19:05:26.870619 140015523688256 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009650 sec
   |-> [rocprofiler-sdk] W20260526 19:05:26.870634 140015523688256 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:05:26.870641 140015523688256 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:05:26.870647 140015523688256 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] W20260526 19:05:26.870772 140015523688256 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000118 sec
   |-> [rocprofiler-sdk] W20260526 19:05:26.871042 140015523688256 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.475634 sec
   |-> [rocprofiler-sdk] W20260526 19:05:26.874736 140015523688256 simple_timer.cpp:55] [rocprofv3] output generation ::     0.500116 sec
   |-> [rocprofiler-sdk] W20260526 19:05:26.874851 140015523688256 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.501635 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/kernel/MI200/out/pmc_1/2525551_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 2/13][Approximate profiling time left: 25 seconds]...
[profiling] Current input file: tests/workloads/kernel/MI200/perfmon/pmc_perf_1.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 19:05:28.404157 137287312817984 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.191853 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 19:05:28.404766 137287312817984 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:05:28.605290 137287312817984 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 19:05:28.701458 137287312817984 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.296692 sec
   |-> [rocprofiler-sdk] W20260526 19:05:28.724638 137287312817984 generateRocpd.cpp:583] writing SQL database for process 2525561 on node 2976770398
   |-> [rocprofiler-sdk] E20260526 19:05:28.725400 137287312817984 generateRocpd.cpp:606] Opened result file: tests/workloads/kernel/MI200/out/pmc_1/smc4124-25-mi210-3c48/2525561_results.db (UUID=0001fa7a-1571-7571-96ec-5e1dd0632fec)
   |-> [rocprofiler-sdk] W20260526 19:05:28.813400 137287312817984 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.009317 sec
   |-> [rocprofiler-sdk] W20260526 19:05:28.814683 137287312817984 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001256 sec
   |-> [rocprofiler-sdk] W20260526 19:05:28.817012 137287312817984 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002311 sec
   |-> [rocprofiler-sdk] W20260526 19:05:28.827767 137287312817984 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008490 sec
   |-> [rocprofiler-sdk] W20260526 19:05:29.141893 137287312817984 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.314106 sec
   |-> [rocprofiler-sdk] W20260526 19:05:29.144337 137287312817984 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002414 sec
   |-> [rocprofiler-sdk] W20260526 19:05:29.144356 137287312817984 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] W20260526 19:05:29.154260 137287312817984 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009897 sec
   |-> [rocprofiler-sdk] W20260526 19:05:29.154282 137287312817984 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:05:29.154288 137287312817984 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000001 sec
   |-> [rocprofiler-sdk] W20260526 19:05:29.154295 137287312817984 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] W20260526 19:05:29.154452 137287312817984 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000150 sec
   |-> [rocprofiler-sdk] W20260526 19:05:29.155049 137287312817984 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.430412 sec
   |-> [rocprofiler-sdk] W20260526 19:05:29.158108 137287312817984 simple_timer.cpp:55] [rocprofv3] output generation ::     0.455254 sec
   |-> [rocprofiler-sdk] W20260526 19:05:29.158355 137287312817984 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.456817 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/kernel/MI200/out/pmc_1/2525561_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 3/13][Approximate profiling time left: 23 seconds]...
[profiling] Current input file: tests/workloads/kernel/MI200/perfmon/pmc_perf_2.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 19:05:30.725226 130129848500032 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.189753 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 19:05:30.725893 130129848500032 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:05:30.923413 130129848500032 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 19:05:31.009925 130129848500032 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.284032 sec
   |-> [rocprofiler-sdk] W20260526 19:05:31.032534 130129848500032 generateRocpd.cpp:583] writing SQL database for process 2525574 on node 2976770398
   |-> [rocprofiler-sdk] E20260526 19:05:31.033352 130129848500032 generateRocpd.cpp:606] Opened result file: tests/workloads/kernel/MI200/out/pmc_1/smc4124-25-mi210-3c48/2525574_results.db (UUID=0001fa7a-1e84-7e84-af8b-748ca5f8ce76)
   |-> [rocprofiler-sdk] W20260526 19:05:31.116636 130129848500032 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007951 sec
   |-> [rocprofiler-sdk] W20260526 19:05:31.117862 130129848500032 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001209 sec
   |-> [rocprofiler-sdk] W20260526 19:05:31.119512 130129848500032 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001634 sec
   |-> [rocprofiler-sdk] W20260526 19:05:31.129870 130129848500032 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008316 sec
   |-> [rocprofiler-sdk] W20260526 19:05:31.430400 130129848500032 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.300515 sec
   |-> [rocprofiler-sdk] W20260526 19:05:31.432730 130129848500032 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002313 sec
   |-> [rocprofiler-sdk] W20260526 19:05:31.432748 130129848500032 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] W20260526 19:05:31.442054 130129848500032 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009300 sec
   |-> [rocprofiler-sdk] W20260526 19:05:31.442068 130129848500032 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:05:31.442075 130129848500032 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000001 sec
   |-> [rocprofiler-sdk] W20260526 19:05:31.442082 130129848500032 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] W20260526 19:05:31.442189 130129848500032 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000100 sec
   |-> [rocprofiler-sdk] W20260526 19:05:31.442417 130129848500032 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.409883 sec
   |-> [rocprofiler-sdk] W20260526 19:05:31.445549 130129848500032 simple_timer.cpp:55] [rocprofv3] output generation ::     0.434054 sec
   |-> [rocprofiler-sdk] W20260526 19:05:31.445646 130129848500032 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.435671 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/kernel/MI200/out/pmc_1/2525574_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 4/13][Approximate profiling time left: 20 seconds]...
[profiling] Current input file: tests/workloads/kernel/MI200/perfmon/pmc_perf_3.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 19:05:32.977466 138711479050048 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.192831 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 19:05:32.978135 138711479050048 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000001 sec
   |-> [rocprofiler-sdk] W20260526 19:05:33.172777 138711479050048 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 19:05:33.272348 138711479050048 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.294213 sec
   |-> [rocprofiler-sdk] W20260526 19:05:33.295292 138711479050048 generateRocpd.cpp:583] writing SQL database for process 2525584 on node 2976770398
   |-> [rocprofiler-sdk] E20260526 19:05:33.296075 138711479050048 generateRocpd.cpp:606] Opened result file: tests/workloads/kernel/MI200/out/pmc_1/smc4124-25-mi210-3c48/2525584_results.db (UUID=0001fa7a-274d-774d-8f7f-a45cd3e008b6)
   |-> [rocprofiler-sdk] W20260526 19:05:33.378242 138711479050048 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008077 sec
   |-> [rocprofiler-sdk] W20260526 19:05:33.379442 138711479050048 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001182 sec
   |-> [rocprofiler-sdk] W20260526 19:05:33.381401 138711479050048 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001944 sec
   |-> [rocprofiler-sdk] W20260526 19:05:33.391579 138711479050048 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008207 sec
   |-> [rocprofiler-sdk] W20260526 19:05:33.688522 138711479050048 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.296929 sec
   |-> [rocprofiler-sdk] W20260526 19:05:33.690858 138711479050048 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002313 sec
   |-> [rocprofiler-sdk] W20260526 19:05:33.690876 138711479050048 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] W20260526 19:05:33.701056 138711479050048 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.010172 sec
   |-> [rocprofiler-sdk] W20260526 19:05:33.701071 138711479050048 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:05:33.701078 138711479050048 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:05:33.701085 138711479050048 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] W20260526 19:05:33.701230 138711479050048 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000137 sec
   |-> [rocprofiler-sdk] W20260526 19:05:33.701540 138711479050048 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.406248 sec
   |-> [rocprofiler-sdk] W20260526 19:05:33.704754 138711479050048 simple_timer.cpp:55] [rocprofv3] output generation ::     0.430007 sec
   |-> [rocprofiler-sdk] W20260526 19:05:33.704867 138711479050048 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.432479 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/kernel/MI200/out/pmc_1/2525584_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 5/13][Approximate profiling time left: 18 seconds]...
[profiling] Current input file: tests/workloads/kernel/MI200/perfmon/pmc_perf_4.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 19:05:35.225768 125059276947264 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.190740 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 19:05:35.226372 125059276947264 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:05:35.418633 125059276947264 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 19:05:35.509859 125059276947264 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.283488 sec
   |-> [rocprofiler-sdk] W20260526 19:05:35.532123 125059276947264 generateRocpd.cpp:583] writing SQL database for process 2525592 on node 2976770398
   |-> [rocprofiler-sdk] E20260526 19:05:35.532955 125059276947264 generateRocpd.cpp:606] Opened result file: tests/workloads/kernel/MI200/out/pmc_1/smc4124-25-mi210-3c48/2525592_results.db (UUID=0001fa7a-3017-7017-a715-eae382851d32)
   |-> [rocprofiler-sdk] W20260526 19:05:35.617854 125059276947264 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008050 sec
   |-> [rocprofiler-sdk] W20260526 19:05:35.619092 125059276947264 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001215 sec
   |-> [rocprofiler-sdk] W20260526 19:05:35.620810 125059276947264 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001703 sec
   |-> [rocprofiler-sdk] W20260526 19:05:35.631403 125059276947264 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008334 sec
   |-> [rocprofiler-sdk] W20260526 19:05:35.911871 125059276947264 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.280452 sec
   |-> [rocprofiler-sdk] W20260526 19:05:35.914330 125059276947264 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002428 sec
   |-> [rocprofiler-sdk] W20260526 19:05:35.914348 125059276947264 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] W20260526 19:05:35.923541 125059276947264 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009186 sec
   |-> [rocprofiler-sdk] W20260526 19:05:35.923556 125059276947264 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000001 sec
   |-> [rocprofiler-sdk] W20260526 19:05:35.923563 125059276947264 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:05:35.923570 125059276947264 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] W20260526 19:05:35.923719 125059276947264 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000116 sec
   |-> [rocprofiler-sdk] W20260526 19:05:35.923948 125059276947264 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.391826 sec
   |-> [rocprofiler-sdk] W20260526 19:05:35.927098 125059276947264 simple_timer.cpp:55] [rocprofv3] output generation ::     0.415998 sec
   |-> [rocprofiler-sdk] W20260526 19:05:35.927204 125059276947264 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.417297 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/kernel/MI200/out/pmc_1/2525592_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 6/13][Approximate profiling time left: 15 seconds]...
[profiling] Current input file: tests/workloads/kernel/MI200/perfmon/pmc_perf_5.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 19:05:37.434759 133279341301568 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.182997 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 19:05:37.435374 133279341301568 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000001 sec
   |-> [rocprofiler-sdk] W20260526 19:05:37.627783 133279341301568 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 19:05:37.722182 133279341301568 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.286809 sec
   |-> [rocprofiler-sdk] W20260526 19:05:37.744250 133279341301568 generateRocpd.cpp:583] writing SQL database for process 2525601 on node 2976770398
   |-> [rocprofiler-sdk] E20260526 19:05:37.745037 133279341301568 generateRocpd.cpp:606] Opened result file: tests/workloads/kernel/MI200/out/pmc_1/smc4124-25-mi210-3c48/2525601_results.db (UUID=0001fa7a-38c0-78c0-8d45-31274ade109c)
   |-> [rocprofiler-sdk] W20260526 19:05:37.829312 133279341301568 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007736 sec
   |-> [rocprofiler-sdk] W20260526 19:05:37.830535 133279341301568 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001206 sec
   |-> [rocprofiler-sdk] W20260526 19:05:37.832150 133279341301568 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001600 sec
   |-> [rocprofiler-sdk] W20260526 19:05:37.842466 133279341301568 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008273 sec
   |-> [rocprofiler-sdk] W20260526 19:05:37.851126 133279341301568 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.008646 sec
   |-> [rocprofiler-sdk] W20260526 19:05:37.853307 133279341301568 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002165 sec
   |-> [rocprofiler-sdk] W20260526 19:05:37.853324 133279341301568 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] W20260526 19:05:37.861909 133279341301568 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008577 sec
   |-> [rocprofiler-sdk] W20260526 19:05:37.861923 133279341301568 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000001 sec
   |-> [rocprofiler-sdk] W20260526 19:05:37.861930 133279341301568 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:05:37.861936 133279341301568 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] W20260526 19:05:37.862068 133279341301568 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000099 sec
   |-> [rocprofiler-sdk] W20260526 19:05:37.862267 133279341301568 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.118017 sec
   |-> [rocprofiler-sdk] W20260526 19:05:37.865161 133279341301568 simple_timer.cpp:55] [rocprofv3] output generation ::     0.141514 sec
   |-> [rocprofiler-sdk] W20260526 19:05:37.865212 133279341301568 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.142988 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/kernel/MI200/out/pmc_1/2525601_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 7/13][Approximate profiling time left: 13 seconds]...
[profiling] Current input file: tests/workloads/kernel/MI200/perfmon/pmc_perf_SQC_DCACHE_INFLIGHT_LEVEL_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 19:05:39.381097 137841229578048 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.191543 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 19:05:39.381679 137841229578048 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:05:39.574141 137841229578048 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 19:05:39.662022 137841229578048 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.280343 sec
   |-> [rocprofiler-sdk] W20260526 19:05:39.684502 137841229578048 generateRocpd.cpp:583] writing SQL database for process 2525610 on node 2976770398
   |-> [rocprofiler-sdk] E20260526 19:05:39.685329 137841229578048 generateRocpd.cpp:606] Opened result file: tests/workloads/kernel/MI200/out/pmc_1/smc4124-25-mi210-3c48/2525610_results.db (UUID=0001fa7a-4052-7052-969e-d5ea99eee48b)
   |-> [rocprofiler-sdk] W20260526 19:05:39.767434 137841229578048 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008106 sec
   |-> [rocprofiler-sdk] W20260526 19:05:39.768626 137841229578048 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001173 sec
   |-> [rocprofiler-sdk] W20260526 19:05:39.770558 137841229578048 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001917 sec
   |-> [rocprofiler-sdk] W20260526 19:05:39.780997 137841229578048 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008485 sec
   |-> [rocprofiler-sdk] W20260526 19:05:40.191013 137841229578048 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.410001 sec
   |-> [rocprofiler-sdk] W20260526 19:05:40.193367 137841229578048 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002324 sec
   |-> [rocprofiler-sdk] W20260526 19:05:40.193385 137841229578048 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] W20260526 19:05:40.201998 137841229578048 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008606 sec
   |-> [rocprofiler-sdk] W20260526 19:05:40.202014 137841229578048 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:05:40.202020 137841229578048 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:05:40.202027 137841229578048 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] W20260526 19:05:40.202180 137841229578048 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000130 sec
   |-> [rocprofiler-sdk] W20260526 19:05:40.202447 137841229578048 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.517945 sec
   |-> [rocprofiler-sdk] W20260526 19:05:40.205533 137841229578048 simple_timer.cpp:55] [rocprofv3] output generation ::     0.541971 sec
   |-> [rocprofiler-sdk] W20260526 19:05:40.205656 137841229578048 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.543579 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/kernel/MI200/out/pmc_1/2525610_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 8/13][Approximate profiling time left: 11 seconds]...
[profiling] Current input file: tests/workloads/kernel/MI200/perfmon/pmc_perf_SQC_ICACHE_INFLIGHT_LEVEL_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 19:05:41.731504 133419695435584 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.189495 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 19:05:41.732100 133419695435584 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:05:41.924778 133419695435584 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 19:05:42.020340 133419695435584 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.288240 sec
   |-> [rocprofiler-sdk] W20260526 19:05:42.042661 133419695435584 generateRocpd.cpp:583] writing SQL database for process 2525618 on node 2976770398
   |-> [rocprofiler-sdk] E20260526 19:05:42.043466 133419695435584 generateRocpd.cpp:606] Opened result file: tests/workloads/kernel/MI200/out/pmc_1/smc4124-25-mi210-3c48/2525618_results.db (UUID=0001fa7a-4983-7983-b631-8ad52720cebb)
   |-> [rocprofiler-sdk] W20260526 19:05:42.131374 133419695435584 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008078 sec
   |-> [rocprofiler-sdk] W20260526 19:05:42.132605 133419695435584 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001215 sec
   |-> [rocprofiler-sdk] W20260526 19:05:42.134751 133419695435584 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002131 sec
   |-> [rocprofiler-sdk] W20260526 19:05:42.145167 133419695435584 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008283 sec
   |-> [rocprofiler-sdk] W20260526 19:05:42.579048 133419695435584 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.433866 sec
   |-> [rocprofiler-sdk] W20260526 19:05:42.581414 133419695435584 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002346 sec
   |-> [rocprofiler-sdk] W20260526 19:05:42.581431 133419695435584 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] W20260526 19:05:42.590099 133419695435584 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008661 sec
   |-> [rocprofiler-sdk] W20260526 19:05:42.590114 133419695435584 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:05:42.590120 133419695435584 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:05:42.590127 133419695435584 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] W20260526 19:05:42.590236 133419695435584 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000102 sec
   |-> [rocprofiler-sdk] W20260526 19:05:42.590461 133419695435584 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.547800 sec
   |-> [rocprofiler-sdk] W20260526 19:05:42.593455 133419695435584 simple_timer.cpp:55] [rocprofv3] output generation ::     0.571713 sec
   |-> [rocprofiler-sdk] W20260526 19:05:42.593575 133419695435584 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.573186 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/kernel/MI200/out/pmc_1/2525618_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 9/13][Approximate profiling time left: 9 seconds]...
[profiling] Current input file: tests/workloads/kernel/MI200/perfmon/pmc_perf_SQ_IFETCH_LEVEL_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 19:05:44.160423 134107636440896 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.198137 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 19:05:44.161081 134107636440896 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:05:44.355670 134107636440896 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 19:05:44.448338 134107636440896 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.287257 sec
   |-> [rocprofiler-sdk] W20260526 19:05:44.470726 134107636440896 generateRocpd.cpp:583] writing SQL database for process 2525628 on node 2976770398
   |-> [rocprofiler-sdk] E20260526 19:05:44.471538 134107636440896 generateRocpd.cpp:606] Opened result file: tests/workloads/kernel/MI200/out/pmc_1/smc4124-25-mi210-3c48/2525628_results.db (UUID=0001fa7a-52f7-72f7-8f89-73f5104d6fa8)
   |-> [rocprofiler-sdk] W20260526 19:05:44.556487 134107636440896 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008463 sec
   |-> [rocprofiler-sdk] W20260526 19:05:44.557708 134107636440896 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001204 sec
   |-> [rocprofiler-sdk] W20260526 19:05:44.559888 134107636440896 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002166 sec
   |-> [rocprofiler-sdk] W20260526 19:05:44.570566 134107636440896 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008224 sec
   |-> [rocprofiler-sdk] W20260526 19:05:45.154152 134107636440896 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.583570 sec
   |-> [rocprofiler-sdk] W20260526 19:05:45.156568 134107636440896 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002387 sec
   |-> [rocprofiler-sdk] W20260526 19:05:45.156585 134107636440896 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] W20260526 19:05:45.165371 134107636440896 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008778 sec
   |-> [rocprofiler-sdk] W20260526 19:05:45.165386 134107636440896 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:05:45.165392 134107636440896 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:05:45.165399 134107636440896 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] W20260526 19:05:45.165538 134107636440896 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000129 sec
   |-> [rocprofiler-sdk] W20260526 19:05:45.165806 134107636440896 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.695081 sec
   |-> [rocprofiler-sdk] W20260526 19:05:45.168829 134107636440896 simple_timer.cpp:55] [rocprofv3] output generation ::     0.719086 sec
   |-> [rocprofiler-sdk] W20260526 19:05:45.168966 134107636440896 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.720580 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/kernel/MI200/out/pmc_1/2525628_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 10/13][Approximate profiling time left: 6 seconds]...
[profiling] Current input file: tests/workloads/kernel/MI200/perfmon/pmc_perf_SQ_INST_LEVEL_LDS_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 19:05:46.732416 125559049887552 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.189749 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 19:05:46.732995 125559049887552 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000001 sec
   |-> [rocprofiler-sdk] W20260526 19:05:46.926722 125559049887552 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 19:05:47.018152 125559049887552 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.285158 sec
   |-> [rocprofiler-sdk] W20260526 19:05:47.040733 125559049887552 generateRocpd.cpp:583] writing SQL database for process 2525636 on node 2976770398
   |-> [rocprofiler-sdk] E20260526 19:05:47.041535 125559049887552 generateRocpd.cpp:606] Opened result file: tests/workloads/kernel/MI200/out/pmc_1/smc4124-25-mi210-3c48/2525636_results.db (UUID=0001fa7a-5d0b-7d0b-9675-db2ef937edee)
   |-> [rocprofiler-sdk] W20260526 19:05:47.124606 125559049887552 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008160 sec
   |-> [rocprofiler-sdk] W20260526 19:05:47.125751 125559049887552 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001129 sec
   |-> [rocprofiler-sdk] W20260526 19:05:47.127783 125559049887552 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002017 sec
   |-> [rocprofiler-sdk] W20260526 19:05:47.138087 125559049887552 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008271 sec
   |-> [rocprofiler-sdk] W20260526 19:05:47.482803 125559049887552 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.344700 sec
   |-> [rocprofiler-sdk] W20260526 19:05:47.485107 125559049887552 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002286 sec
   |-> [rocprofiler-sdk] W20260526 19:05:47.485124 125559049887552 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] W20260526 19:05:47.494778 125559049887552 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009647 sec
   |-> [rocprofiler-sdk] W20260526 19:05:47.494793 125559049887552 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:05:47.494799 125559049887552 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:05:47.494806 125559049887552 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] W20260526 19:05:47.494930 125559049887552 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000116 sec
   |-> [rocprofiler-sdk] W20260526 19:05:47.495191 125559049887552 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.454459 sec
   |-> [rocprofiler-sdk] W20260526 19:05:47.498246 125559049887552 simple_timer.cpp:55] [rocprofv3] output generation ::     0.478565 sec
   |-> [rocprofiler-sdk] W20260526 19:05:47.498357 125559049887552 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.480156 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/kernel/MI200/out/pmc_1/2525636_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 11/13][Approximate profiling time left: 4 seconds]...
[profiling] Current input file: tests/workloads/kernel/MI200/perfmon/pmc_perf_SQ_INST_LEVEL_SMEM_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 19:05:49.052462 127345743191872 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.189222 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 19:05:49.053090 127345743191872 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:05:49.246317 127345743191872 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 19:05:49.330911 127345743191872 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.277821 sec
   |-> [rocprofiler-sdk] W20260526 19:05:49.353121 127345743191872 generateRocpd.cpp:583] writing SQL database for process 2525644 on node 2976770398
   |-> [rocprofiler-sdk] E20260526 19:05:49.353894 127345743191872 generateRocpd.cpp:606] Opened result file: tests/workloads/kernel/MI200/out/pmc_1/smc4124-25-mi210-3c48/2525644_results.db (UUID=0001fa7a-661c-761c-ae06-36f49ef15a35)
   |-> [rocprofiler-sdk] W20260526 19:05:49.438564 127345743191872 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007923 sec
   |-> [rocprofiler-sdk] W20260526 19:05:49.439767 127345743191872 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001186 sec
   |-> [rocprofiler-sdk] W20260526 19:05:49.441777 127345743191872 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001995 sec
   |-> [rocprofiler-sdk] W20260526 19:05:49.452198 127345743191872 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008349 sec
   |-> [rocprofiler-sdk] W20260526 19:05:49.785044 127345743191872 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.332831 sec
   |-> [rocprofiler-sdk] W20260526 19:05:49.787590 127345743191872 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002522 sec
   |-> [rocprofiler-sdk] W20260526 19:05:49.787607 127345743191872 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] W20260526 19:05:49.796233 127345743191872 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008618 sec
   |-> [rocprofiler-sdk] W20260526 19:05:49.796250 127345743191872 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:05:49.796257 127345743191872 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:05:49.796264 127345743191872 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] W20260526 19:05:49.796400 127345743191872 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000125 sec
   |-> [rocprofiler-sdk] W20260526 19:05:49.796639 127345743191872 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.443518 sec
   |-> [rocprofiler-sdk] W20260526 19:05:49.799618 127345743191872 simple_timer.cpp:55] [rocprofv3] output generation ::     0.467029 sec
   |-> [rocprofiler-sdk] W20260526 19:05:49.799724 127345743191872 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.468770 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/kernel/MI200/out/pmc_1/2525644_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 12/13][Approximate profiling time left: 2 seconds]...
[profiling] Current input file: tests/workloads/kernel/MI200/perfmon/pmc_perf_SQ_INST_LEVEL_VMEM_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 19:05:51.345333 129680896778048 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.197678 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 19:05:51.345931 129680896778048 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:05:51.545226 129680896778048 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 19:05:51.640452 129680896778048 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.294521 sec
   |-> [rocprofiler-sdk] W20260526 19:05:51.662342 129680896778048 generateRocpd.cpp:583] writing SQL database for process 2525652 on node 2976770398
   |-> [rocprofiler-sdk] E20260526 19:05:51.663154 129680896778048 generateRocpd.cpp:606] Opened result file: tests/workloads/kernel/MI200/out/pmc_1/smc4124-25-mi210-3c48/2525652_results.db (UUID=0001fa7a-6f08-7f08-8c33-0fd39f8d762d)
   |-> [rocprofiler-sdk] W20260526 19:05:51.747987 129680896778048 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008297 sec
   |-> [rocprofiler-sdk] W20260526 19:05:51.749144 129680896778048 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001141 sec
   |-> [rocprofiler-sdk] W20260526 19:05:51.751161 129680896778048 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002002 sec
   |-> [rocprofiler-sdk] W20260526 19:05:51.761603 129680896778048 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008421 sec
   |-> [rocprofiler-sdk] W20260526 19:05:52.280342 129680896778048 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.518724 sec
   |-> [rocprofiler-sdk] W20260526 19:05:52.282606 129680896778048 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002235 sec
   |-> [rocprofiler-sdk] W20260526 19:05:52.282623 129680896778048 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] W20260526 19:05:52.291359 129680896778048 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008729 sec
   |-> [rocprofiler-sdk] W20260526 19:05:52.291374 129680896778048 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:05:52.291381 129680896778048 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:05:52.291387 129680896778048 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] W20260526 19:05:52.291502 129680896778048 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000107 sec
   |-> [rocprofiler-sdk] W20260526 19:05:52.291727 129680896778048 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.629386 sec
   |-> [rocprofiler-sdk] W20260526 19:05:52.294751 129680896778048 simple_timer.cpp:55] [rocprofv3] output generation ::     0.653120 sec
   |-> [rocprofiler-sdk] W20260526 19:05:52.294872 129680896778048 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.654372 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/kernel/MI200/out/pmc_1/2525652_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 13/13][Approximate profiling time left: 0 seconds]...
[profiling] Current input file: tests/workloads/kernel/MI200/perfmon/pmc_perf_SQ_LEVEL_WAVES_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 19:05:53.839652 135744962576192 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.191198 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 19:05:53.840245 135744962576192 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:05:54.034234 135744962576192 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:05:54.123458 135744962576192 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.283213 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:05:54.145808 135744962576192 generateRocpd.cpp:583] writing SQL database for process 2525661 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:05:54.146613 135744962576192 generateRocpd.cpp:606] Opened result file: tests/workloads/kernel/MI200/out/pmc_1/smc4124-25-mi210-3c48/2525661_results.db (UUID=0001fa7a-78cd-78cd-b5b4-b10e4f800a13)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:05:54.231057 135744962576192 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008003 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:05:54.232279 135744962576192 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001205 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:05:54.233863 135744962576192 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001570 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:05:54.244538 135744962576192 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008454 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:05:54.565077 135744962576192 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.320522 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:05:54.567426 135744962576192 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002332 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:05:54.567444 135744962576192 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:05:54.576372 135744962576192 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008920 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:05:54.576386 135744962576192 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:05:54.576392 135744962576192 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:05:54.576399 135744962576192 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:05:54.576504 135744962576192 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000097 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:05:54.576712 135744962576192 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.430904 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:05:54.579695 135744962576192 simple_timer.cpp:55] [rocprofv3] output generation ::     0.454846 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:05:54.579790 135744962576192 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.456283 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/kernel/MI200/out/pmc_1/2525661_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
PC sampling data collection skipped as block 21 is not specified.
[roofline] Checking for roofline.csv in tests/workloads/kernel/MI200
[roofline] Benchmark execution failed: 'L1'. Skipping roofline.
