Rocprofiler-Compute version: 3.7.0
Profiler choice: rocprofiler-sdk
Output directory: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/path/MI200
Target: MI210
Command: ./tests/vcopy -n 1048576 -b 256 -i 3
Kernel Selection: None
Dispatch Selection: None
Filtered sections: All

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Collecting Performance Counters
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Generating native tool project using command: cmake -S /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib -B /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build
-- Checking for module 'libdw'
--   Package 'libdw', required by 'virtual:world', not found
-- Could NOT find libdw (missing: libdw_LIBRARY libdw_INCLUDE_DIR)
-- {fmt} version: 12.1.0
-- Build type:
-- Configuring done (0.1s)
-- Generating done (0.0s)
-- Build files have been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build
Building native tool using command: cmake --build /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build --parallel
[  0%] Built target gsl_assert
[ 33%] Built target fmt
[100%] Built target rocprofiler-compute-tool
Searching /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src by lib/_build/lib/librocprofiler-compute-tool.so for native collector
Using native collector: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build/lib/librocprofiler-compute-tool.so
Using native counter collection tool: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build/lib/librocprofiler-compute-tool.so
[profiling] Iteration multiplexing: Disabled
[Run 1/13][Approximate profiling time left: pending first measurement...]
[profiling] Current input file: tests/workloads/path/MI200/perfmon/pmc_perf_0.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 18:53:52.440957 130578958081856 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.191282 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 18:53:52.441628 130578958081856 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 18:53:52.638976 130578958081856 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 18:53:52.742409 130578958081856 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.300781 sec
   |-> [rocprofiler-sdk] W20260526 18:53:52.764666 130578958081856 generateRocpd.cpp:583] writing SQL database for process 2522403 on node 2976770398
   |-> [rocprofiler-sdk] E20260526 18:53:52.765464 130578958081856 generateRocpd.cpp:606] Opened result file: tests/workloads/path/MI200/out/pmc_1/smc4124-25-mi210-3c48/2522403_results.db (UUID=0001fa6f-76d6-76d6-b6bd-c1228d6c185e)
   |-> [rocprofiler-sdk] W20260526 18:53:52.847398 130578958081856 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008180 sec
   |-> [rocprofiler-sdk] W20260526 18:53:52.848573 130578958081856 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001159 sec
   |-> [rocprofiler-sdk] W20260526 18:53:52.850140 130578958081856 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001554 sec
   |-> [rocprofiler-sdk] W20260526 18:53:52.860356 130578958081856 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008276 sec
   |-> [rocprofiler-sdk] W20260526 18:53:53.187757 130578958081856 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.327386 sec
   |-> [rocprofiler-sdk] W20260526 18:53:53.190222 130578958081856 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002441 sec
   |-> [rocprofiler-sdk] W20260526 18:53:53.190239 130578958081856 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] W20260526 18:53:53.198778 130578958081856 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008532 sec
   |-> [rocprofiler-sdk] W20260526 18:53:53.198793 130578958081856 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000001 sec
   |-> [rocprofiler-sdk] W20260526 18:53:53.198799 130578958081856 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 18:53:53.198806 130578958081856 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] W20260526 18:53:53.198922 130578958081856 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000106 sec
   |-> [rocprofiler-sdk] W20260526 18:53:53.199143 130578958081856 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.434477 sec
   |-> [rocprofiler-sdk] W20260526 18:53:53.202086 130578958081856 simple_timer.cpp:55] [rocprofv3] output generation ::     0.457890 sec
   |-> [rocprofiler-sdk] W20260526 18:53:53.202189 130578958081856 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.459739 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/path/MI200/out/pmc_1/2522403_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 2/13][Approximate profiling time left: 25 seconds]...
[profiling] Current input file: tests/workloads/path/MI200/perfmon/pmc_perf_1.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 18:53:54.741354 137845384036160 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.188719 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 18:53:54.741985 137845384036160 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 18:53:54.936500 137845384036160 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 18:53:55.022480 137845384036160 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.280495 sec
   |-> [rocprofiler-sdk] W20260526 18:53:55.045022 137845384036160 generateRocpd.cpp:583] writing SQL database for process 2522413 on node 2976770398
   |-> [rocprofiler-sdk] E20260526 18:53:55.045817 137845384036160 generateRocpd.cpp:606] Opened result file: tests/workloads/path/MI200/out/pmc_1/smc4124-25-mi210-3c48/2522413_results.db (UUID=0001fa6f-7fd5-7fd5-a4fe-0debe1ae5671)
   |-> [rocprofiler-sdk] W20260526 18:53:55.129374 137845384036160 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007975 sec
   |-> [rocprofiler-sdk] W20260526 18:53:55.130523 137845384036160 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001134 sec
   |-> [rocprofiler-sdk] W20260526 18:53:55.132122 137845384036160 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001584 sec
   |-> [rocprofiler-sdk] W20260526 18:53:55.142587 137845384036160 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008454 sec
   |-> [rocprofiler-sdk] W20260526 18:53:55.452926 137845384036160 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.310324 sec
   |-> [rocprofiler-sdk] W20260526 18:53:55.455165 137845384036160 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002222 sec
   |-> [rocprofiler-sdk] W20260526 18:53:55.455182 137845384036160 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] W20260526 18:53:55.463884 137845384036160 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008695 sec
   |-> [rocprofiler-sdk] W20260526 18:53:55.463899 137845384036160 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 18:53:55.463905 137845384036160 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000001 sec
   |-> [rocprofiler-sdk] W20260526 18:53:55.463912 137845384036160 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] W20260526 18:53:55.464021 137845384036160 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000101 sec
   |-> [rocprofiler-sdk] W20260526 18:53:55.464239 137845384036160 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.419217 sec
   |-> [rocprofiler-sdk] W20260526 18:53:55.467185 137845384036160 simple_timer.cpp:55] [rocprofv3] output generation ::     0.442862 sec
   |-> [rocprofiler-sdk] W20260526 18:53:55.467282 137845384036160 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.444764 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/path/MI200/out/pmc_1/2522413_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 3/13][Approximate profiling time left: 22 seconds]...
[profiling] Current input file: tests/workloads/path/MI200/perfmon/pmc_perf_2.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 18:53:57.003954 123499023261504 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.192117 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 18:53:57.004556 123499023261504 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 18:53:57.197977 123499023261504 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 18:53:57.279748 123499023261504 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.275192 sec
   |-> [rocprofiler-sdk] W20260526 18:53:57.302172 123499023261504 generateRocpd.cpp:583] writing SQL database for process 2522422 on node 2976770398
   |-> [rocprofiler-sdk] E20260526 18:53:57.302932 123499023261504 generateRocpd.cpp:606] Opened result file: tests/workloads/path/MI200/out/pmc_1/smc4124-25-mi210-3c48/2522422_results.db (UUID=0001fa6f-88a8-78a8-b22f-1bdcba65011c)
   |-> [rocprofiler-sdk] W20260526 18:53:57.386168 123499023261504 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007997 sec
   |-> [rocprofiler-sdk] W20260526 18:53:57.387354 123499023261504 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001170 sec
   |-> [rocprofiler-sdk] W20260526 18:53:57.388952 123499023261504 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001583 sec
   |-> [rocprofiler-sdk] W20260526 18:53:57.399363 123499023261504 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008401 sec
   |-> [rocprofiler-sdk] W20260526 18:53:57.698638 123499023261504 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.299258 sec
   |-> [rocprofiler-sdk] W20260526 18:53:57.701050 123499023261504 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002383 sec
   |-> [rocprofiler-sdk] W20260526 18:53:57.701069 123499023261504 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] W20260526 18:53:57.709667 123499023261504 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008592 sec
   |-> [rocprofiler-sdk] W20260526 18:53:57.709682 123499023261504 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 18:53:57.709689 123499023261504 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000001 sec
   |-> [rocprofiler-sdk] W20260526 18:53:57.709696 123499023261504 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] W20260526 18:53:57.709834 123499023261504 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000101 sec
   |-> [rocprofiler-sdk] W20260526 18:53:57.710050 123499023261504 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.407878 sec
   |-> [rocprofiler-sdk] W20260526 18:53:57.712955 123499023261504 simple_timer.cpp:55] [rocprofv3] output generation ::     0.431525 sec
   |-> [rocprofiler-sdk] W20260526 18:53:57.713056 123499023261504 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.433273 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/path/MI200/out/pmc_1/2522422_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 4/13][Approximate profiling time left: 20 seconds]...
[profiling] Current input file: tests/workloads/path/MI200/perfmon/pmc_perf_3.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 18:53:59.243265 128550227971904 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.191661 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 18:53:59.243864 128550227971904 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 18:53:59.438559 128550227971904 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 18:53:59.519371 128550227971904 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.275508 sec
   |-> [rocprofiler-sdk] W20260526 18:53:59.541287 128550227971904 generateRocpd.cpp:583] writing SQL database for process 2522431 on node 2976770398
   |-> [rocprofiler-sdk] E20260526 18:53:59.542095 128550227971904 generateRocpd.cpp:606] Opened result file: tests/workloads/path/MI200/out/pmc_1/smc4124-25-mi210-3c48/2522431_results.db (UUID=0001fa6f-9168-7168-b39d-bdf4966e3eda)
   |-> [rocprofiler-sdk] W20260526 18:53:59.625211 128550227971904 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008051 sec
   |-> [rocprofiler-sdk] W20260526 18:53:59.626438 128550227971904 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001209 sec
   |-> [rocprofiler-sdk] W20260526 18:53:59.628420 128550227971904 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001968 sec
   |-> [rocprofiler-sdk] W20260526 18:53:59.638816 128550227971904 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008388 sec
   |-> [rocprofiler-sdk] W20260526 18:53:59.924058 128550227971904 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.285227 sec
   |-> [rocprofiler-sdk] W20260526 18:53:59.926417 128550227971904 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002331 sec
   |-> [rocprofiler-sdk] W20260526 18:53:59.926437 128550227971904 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000005 sec
   |-> [rocprofiler-sdk] W20260526 18:53:59.934907 128550227971904 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008456 sec
   |-> [rocprofiler-sdk] W20260526 18:53:59.934923 128550227971904 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 18:53:59.934965 128550227971904 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 18:53:59.934975 128550227971904 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] W20260526 18:53:59.935088 128550227971904 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000106 sec
   |-> [rocprofiler-sdk] W20260526 18:53:59.935297 128550227971904 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.394010 sec
   |-> [rocprofiler-sdk] W20260526 18:53:59.938212 128550227971904 simple_timer.cpp:55] [rocprofv3] output generation ::     0.417664 sec
   |-> [rocprofiler-sdk] W20260526 18:53:59.938312 128550227971904 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.418893 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/path/MI200/out/pmc_1/2522431_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 5/13][Approximate profiling time left: 18 seconds]...
[profiling] Current input file: tests/workloads/path/MI200/perfmon/pmc_perf_4.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 18:54:01.474560 136794023558976 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.189801 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 18:54:01.475176 136794023558976 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 18:54:01.667579 136794023558976 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 18:54:01.751309 136794023558976 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.276133 sec
   |-> [rocprofiler-sdk] W20260526 18:54:01.773739 136794023558976 generateRocpd.cpp:583] writing SQL database for process 2522439 on node 2976770398
   |-> [rocprofiler-sdk] E20260526 18:54:01.774537 136794023558976 generateRocpd.cpp:606] Opened result file: tests/workloads/path/MI200/out/pmc_1/smc4124-25-mi210-3c48/2522439_results.db (UUID=0001fa6f-9a21-7a21-a972-0017682460b2)
   |-> [rocprofiler-sdk] W20260526 18:54:01.857270 136794023558976 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007938 sec
   |-> [rocprofiler-sdk] W20260526 18:54:01.858454 136794023558976 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001169 sec
   |-> [rocprofiler-sdk] W20260526 18:54:01.860071 136794023558976 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001602 sec
   |-> [rocprofiler-sdk] W20260526 18:54:01.870548 136794023558976 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008475 sec
   |-> [rocprofiler-sdk] W20260526 18:54:02.168282 136794023558976 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.297719 sec
   |-> [rocprofiler-sdk] W20260526 18:54:02.170591 136794023558976 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002291 sec
   |-> [rocprofiler-sdk] W20260526 18:54:02.170608 136794023558976 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] W20260526 18:54:02.179219 136794023558976 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008604 sec
   |-> [rocprofiler-sdk] W20260526 18:54:02.179233 136794023558976 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 18:54:02.179238 136794023558976 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 18:54:02.179245 136794023558976 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] W20260526 18:54:02.179352 136794023558976 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000099 sec
   |-> [rocprofiler-sdk] W20260526 18:54:02.179567 136794023558976 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.405828 sec
   |-> [rocprofiler-sdk] W20260526 18:54:02.182562 136794023558976 simple_timer.cpp:55] [rocprofv3] output generation ::     0.429409 sec
   |-> [rocprofiler-sdk] W20260526 18:54:02.182660 136794023558976 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.431307 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/path/MI200/out/pmc_1/2522439_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 6/13][Approximate profiling time left: 15 seconds]...
[profiling] Current input file: tests/workloads/path/MI200/perfmon/pmc_perf_5.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 18:54:03.674431 128393956650816 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.183450 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 18:54:03.675050 128393956650816 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 18:54:03.874332 128393956650816 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 18:54:03.954995 128393956650816 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.279945 sec
   |-> [rocprofiler-sdk] W20260526 18:54:03.977086 128393956650816 generateRocpd.cpp:583] writing SQL database for process 2522447 on node 2976770398
   |-> [rocprofiler-sdk] E20260526 18:54:03.977834 128393956650816 generateRocpd.cpp:606] Opened result file: tests/workloads/path/MI200/out/pmc_1/smc4124-25-mi210-3c48/2522447_results.db (UUID=0001fa6f-a2bf-72bf-86f8-f3a85e1c0c8d)
   |-> [rocprofiler-sdk] W20260526 18:54:04.061335 128393956650816 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007951 sec
   |-> [rocprofiler-sdk] W20260526 18:54:04.062544 128393956650816 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001191 sec
   |-> [rocprofiler-sdk] W20260526 18:54:04.064249 128393956650816 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001690 sec
   |-> [rocprofiler-sdk] W20260526 18:54:04.074863 128393956650816 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008433 sec
   |-> [rocprofiler-sdk] W20260526 18:54:04.083396 128393956650816 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.008519 sec
   |-> [rocprofiler-sdk] W20260526 18:54:04.085528 128393956650816 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002116 sec
   |-> [rocprofiler-sdk] W20260526 18:54:04.085547 128393956650816 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] W20260526 18:54:04.094627 128393956650816 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009073 sec
   |-> [rocprofiler-sdk] W20260526 18:54:04.094642 128393956650816 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 18:54:04.094648 128393956650816 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 18:54:04.094654 128393956650816 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] W20260526 18:54:04.094753 128393956650816 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000091 sec
   |-> [rocprofiler-sdk] W20260526 18:54:04.094940 128393956650816 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.117855 sec
   |-> [rocprofiler-sdk] W20260526 18:54:04.098342 128393956650816 simple_timer.cpp:55] [rocprofv3] output generation ::     0.141804 sec
   |-> [rocprofiler-sdk] W20260526 18:54:04.098389 128393956650816 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.143351 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/path/MI200/out/pmc_1/2522447_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 7/13][Approximate profiling time left: 13 seconds]...
[profiling] Current input file: tests/workloads/path/MI200/perfmon/pmc_perf_SQC_DCACHE_INFLIGHT_LEVEL_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 18:54:05.603254 139020945018688 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.190200 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 18:54:05.603882 139020945018688 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 18:54:05.799630 139020945018688 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:54:05.894519 139020945018688 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.290637 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:05.916965 139020945018688 generateRocpd.cpp:583] writing SQL database for process 2522455 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:54:05.917764 139020945018688 generateRocpd.cpp:606] Opened result file: tests/workloads/path/MI200/out/pmc_1/smc4124-25-mi210-3c48/2522455_results.db (UUID=0001fa6f-aa42-7a42-808d-ea9384d28a1c)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:06.002545 139020945018688 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008231 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:06.003759 139020945018688 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001197 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:06.005758 139020945018688 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001984 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:06.016364 139020945018688 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008582 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:06.426372 139020945018688 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.409993 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:06.429797 139020945018688 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.003406 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:06.429814 139020945018688 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:06.438447 139020945018688 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008626 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:06.438461 139020945018688 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:06.438467 139020945018688 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:06.438474 139020945018688 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:06.438584 139020945018688 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000103 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:06.438793 139020945018688 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.521829 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:06.441707 139020945018688 simple_timer.cpp:55] [rocprofv3] output generation ::     0.545429 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:06.441818 139020945018688 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.547258 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/path/MI200/out/pmc_1/2522455_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 8/13][Approximate profiling time left: 11 seconds]...
[profiling] Current input file: tests/workloads/path/MI200/perfmon/pmc_perf_SQC_ICACHE_INFLIGHT_LEVEL_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:54:07.966442 125588551974720 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.189337 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:54:07.967078 125588551974720 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:08.160322 125588551974720 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:54:08.242590 125588551974720 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.275512 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:08.264850 125588551974720 generateRocpd.cpp:583] writing SQL database for process 2522464 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:54:08.265654 125588551974720 generateRocpd.cpp:606] Opened result file: tests/workloads/path/MI200/out/pmc_1/smc4124-25-mi210-3c48/2522464_results.db (UUID=0001fa6f-b37e-737e-b8ff-f320b0111c46)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:08.348791 125588551974720 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007995 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:08.350003 125588551974720 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001196 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:08.351981 125588551974720 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001963 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:08.362372 125588551974720 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008370 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:08.763736 125588551974720 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.401349 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:08.766086 125588551974720 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002326 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:08.766104 125588551974720 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:08.774711 125588551974720 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008600 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:08.774726 125588551974720 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:08.774732 125588551974720 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:08.774739 125588551974720 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:08.774881 125588551974720 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000135 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:08.775152 125588551974720 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.510302 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:08.778083 125588551974720 simple_timer.cpp:55] [rocprofv3] output generation ::     0.533965 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:08.778207 125588551974720 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.535568 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/path/MI200/out/pmc_1/2522464_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 9/13][Approximate profiling time left: 8 seconds]...
[profiling] Current input file: tests/workloads/path/MI200/perfmon/pmc_perf_SQ_IFETCH_LEVEL_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:54:10.335208 131689781665600 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.194916 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:54:10.335790 131689781665600 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:10.534409 131689781665600 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:54:10.621395 131689781665600 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.285606 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:10.643953 131689781665600 generateRocpd.cpp:583] writing SQL database for process 2522473 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:54:10.644756 131689781665600 generateRocpd.cpp:606] Opened result file: tests/workloads/path/MI200/out/pmc_1/smc4124-25-mi210-3c48/2522473_results.db (UUID=0001fa6f-bcb9-7cb9-9bd2-0d92903db928)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:10.726539 131689781665600 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008400 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:10.727697 131689781665600 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001142 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:10.729686 131689781665600 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001975 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:10.739955 131689781665600 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008327 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:11.323432 131689781665600 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.583461 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:11.325734 131689781665600 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002277 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:11.325752 131689781665600 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:11.334550 131689781665600 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008791 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:11.334566 131689781665600 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:11.334572 131689781665600 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:11.334579 131689781665600 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:11.334696 131689781665600 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000107 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:11.334952 131689781665600 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.691000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:11.337947 131689781665600 simple_timer.cpp:55] [rocprofv3] output generation ::     0.714720 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:11.338085 131689781665600 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.716642 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/path/MI200/out/pmc_1/2522473_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 10/13][Approximate profiling time left: 6 seconds]...
[profiling] Current input file: tests/workloads/path/MI200/perfmon/pmc_perf_SQ_INST_LEVEL_LDS_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:54:12.884599 130694178103104 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.193201 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:54:12.885226 130694178103104 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:13.077561 130694178103104 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:54:13.161579 130694178103104 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.276354 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:13.184122 130694178103104 generateRocpd.cpp:583] writing SQL database for process 2522482 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:54:13.184929 130694178103104 generateRocpd.cpp:606] Opened result file: tests/workloads/path/MI200/out/pmc_1/smc4124-25-mi210-3c48/2522482_results.db (UUID=0001fa6f-c6b0-76b0-8175-3126a2f8fe2b)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:13.266299 130694178103104 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008061 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:13.267494 130694178103104 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001178 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:13.269628 130694178103104 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002119 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:13.280043 130694178103104 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008429 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:13.623566 130694178103104 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.343508 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:13.625861 130694178103104 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002278 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:13.625879 130694178103104 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:13.634398 130694178103104 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008512 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:13.634413 130694178103104 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:13.634419 130694178103104 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:13.634426 130694178103104 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:13.634572 130694178103104 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000102 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:13.634782 130694178103104 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.450660 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:13.637717 130694178103104 simple_timer.cpp:55] [rocprofv3] output generation ::     0.474293 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:13.637830 130694178103104 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.476202 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/path/MI200/out/pmc_1/2522482_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 11/13][Approximate profiling time left: 4 seconds]...
[profiling] Current input file: tests/workloads/path/MI200/perfmon/pmc_perf_SQ_INST_LEVEL_SMEM_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:54:15.179368 128375352549184 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.192801 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:54:15.179970 128375352549184 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:15.374312 128375352549184 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:54:15.456864 128375352549184 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.276894 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:15.479182 128375352549184 generateRocpd.cpp:583] writing SQL database for process 2522494 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:54:15.479983 128375352549184 generateRocpd.cpp:606] Opened result file: tests/workloads/path/MI200/out/pmc_1/smc4124-25-mi210-3c48/2522494_results.db (UUID=0001fa6f-cfa7-7fa7-b26f-79a93c39e283)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:15.562971 128375352549184 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008011 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:15.564200 128375352549184 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001207 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:15.566162 128375352549184 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001946 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:15.576552 128375352549184 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008400 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:15.908079 128375352549184 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.331512 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:15.910365 128375352549184 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002270 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:15.910383 128375352549184 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:15.918877 128375352549184 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008486 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:15.918892 128375352549184 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:15.918898 128375352549184 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:15.918905 128375352549184 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:15.919022 128375352549184 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000109 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:15.919270 128375352549184 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.440088 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:15.922195 128375352549184 simple_timer.cpp:55] [rocprofv3] output generation ::     0.463637 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:15.922300 128375352549184 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.465388 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/path/MI200/out/pmc_1/2522494_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 12/13][Approximate profiling time left: 2 seconds]...
[profiling] Current input file: tests/workloads/path/MI200/perfmon/pmc_perf_SQ_INST_LEVEL_VMEM_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:54:17.480533 140192621768512 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.197591 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:54:17.481204 140192621768512 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:17.678060 140192621768512 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:54:17.764300 140192621768512 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.283097 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:17.786807 140192621768512 generateRocpd.cpp:583] writing SQL database for process 2522502 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:54:17.787616 140192621768512 generateRocpd.cpp:606] Opened result file: tests/workloads/path/MI200/out/pmc_1/smc4124-25-mi210-3c48/2522502_results.db (UUID=0001fa6f-d89f-789f-8bb5-fbe8f4a335ad)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:17.871244 140192621768512 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008178 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:17.872459 140192621768512 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001198 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:17.874608 140192621768512 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002134 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:17.885361 140192621768512 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008587 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:18.402255 140192621768512 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.516880 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:18.404793 140192621768512 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002516 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:18.404810 140192621768512 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:18.413816 140192621768512 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008999 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:18.413830 140192621768512 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:18.413837 140192621768512 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:18.413844 140192621768512 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:18.413983 140192621768512 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000130 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:18.414267 140192621768512 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.627460 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:18.417230 140192621768512 simple_timer.cpp:55] [rocprofv3] output generation ::     0.651531 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:18.417359 140192621768512 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.653008 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/path/MI200/out/pmc_1/2522502_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 13/13][Approximate profiling time left: 0 seconds]...
[profiling] Current input file: tests/workloads/path/MI200/perfmon/pmc_perf_SQ_LEVEL_WAVES_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:54:19.942979 134962913599296 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.190269 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:54:19.943562 134962913599296 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:54:20.136021 134962913599296 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 18:54:20.219018 134962913599296 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.275456 sec
   |-> [rocprofiler-sdk] W20260526 18:54:20.242197 134962913599296 generateRocpd.cpp:583] writing SQL database for process 2522510 on node 2976770398
   |-> [rocprofiler-sdk] E20260526 18:54:20.242979 134962913599296 generateRocpd.cpp:606] Opened result file: tests/workloads/path/MI200/out/pmc_1/smc4124-25-mi210-3c48/2522510_results.db (UUID=0001fa6f-e245-7245-970b-0074a1fd7488)
   |-> [rocprofiler-sdk] W20260526 18:54:20.326802 134962913599296 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008082 sec
   |-> [rocprofiler-sdk] W20260526 18:54:20.328024 134962913599296 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001206 sec
   |-> [rocprofiler-sdk] W20260526 18:54:20.329718 134962913599296 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001670 sec
   |-> [rocprofiler-sdk] W20260526 18:54:20.340411 134962913599296 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008504 sec
   |-> [rocprofiler-sdk] W20260526 18:54:20.661061 134962913599296 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.320636 sec
   |-> [rocprofiler-sdk] W20260526 18:54:20.663410 134962913599296 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002334 sec
   |-> [rocprofiler-sdk] W20260526 18:54:20.663428 134962913599296 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] W20260526 18:54:20.672407 134962913599296 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008972 sec
   |-> [rocprofiler-sdk] W20260526 18:54:20.672421 134962913599296 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 18:54:20.672427 134962913599296 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 18:54:20.672433 134962913599296 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] W20260526 18:54:20.672554 134962913599296 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000109 sec
   |-> [rocprofiler-sdk] W20260526 18:54:20.672790 134962913599296 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.430594 sec
   |-> [rocprofiler-sdk] W20260526 18:54:20.675757 134962913599296 simple_timer.cpp:55] [rocprofv3] output generation ::     0.454541 sec
   |-> [rocprofiler-sdk] W20260526 18:54:20.675864 134962913599296 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.456770 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/path/MI200/out/pmc_1/2522510_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
PC sampling data collection skipped as block 21 is not specified.
[roofline] Checking for roofline.csv in tests/workloads/path/MI200
[roofline] Benchmark execution failed: 'L1'. Skipping roofline.
