Rocprofiler-Compute version: 3.7.0
Profiler choice: rocprofiler-sdk
Output directory: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/device_filter/MI200
Target: MI210
Command: ./tests/vcopy -n 1048576 -b 256 -i 3
Kernel Selection: None
Dispatch Selection: None
Filtered sections: All

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Collecting Performance Counters
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Generating native tool project using command: cmake -S /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib -B /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build
-- Checking for module 'libdw'
--   Package 'libdw', required by 'virtual:world', not found
-- Could NOT find libdw (missing: libdw_LIBRARY libdw_INCLUDE_DIR)
-- {fmt} version: 12.1.0
-- Build type:
-- Configuring done (0.1s)
-- Generating done (0.0s)
-- Build files have been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build
Building native tool using command: cmake --build /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build --parallel
[ 22%] Built target gsl_assert
[ 33%] Built target fmt
[100%] Built target rocprofiler-compute-tool
Searching /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src by lib/_build/lib/librocprofiler-compute-tool.so for native collector
Using native collector: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build/lib/librocprofiler-compute-tool.so
Using native counter collection tool: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build/lib/librocprofiler-compute-tool.so
[profiling] Iteration multiplexing: Disabled
[Run 1/13][Approximate profiling time left: pending first measurement...]
[profiling] Current input file: tests/workloads/device_filter/MI200/perfmon/pmc_perf_0.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:02:22.193443 136740105445184 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.190642 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:02:22.194079 136740105445184 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:22.386254 136740105445184 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:02:22.469302 136740105445184 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.275223 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:22.492020 136740105445184 generateRocpd.cpp:583] writing SQL database for process 2524233 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:02:22.492842 136740105445184 generateRocpd.cpp:606] Opened result file: tests/workloads/device_filter/MI200/out/pmc_1/smc4124-25-mi210-3c48/2524233_results.db (UUID=0001fa77-3e0f-7e0f-ac9c-147766ecb88d)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:22.574945 136740105445184 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007975 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:22.576087 136740105445184 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001122 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:22.577697 136740105445184 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001595 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:22.587909 136740105445184 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008215 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:22.911223 136740105445184 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.323298 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:22.913344 136740105445184 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002101 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:22.913364 136740105445184 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:22.923391 136740105445184 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.010019 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:22.923406 136740105445184 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:22.923412 136740105445184 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:22.923419 136740105445184 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:22.923546 136740105445184 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000119 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:22.923810 136740105445184 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.431790 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:22.926893 136740105445184 simple_timer.cpp:55] [rocprofv3] output generation ::     0.455673 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:22.927003 136740105445184 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.457649 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/device_filter/MI200/out/pmc_1/2524233_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 2/13][Approximate profiling time left: 25 seconds]...
[profiling] Current input file: tests/workloads/device_filter/MI200/perfmon/pmc_perf_1.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:02:24.448526 136945999634240 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.191675 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:02:24.449120 136945999634240 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:24.642486 136945999634240 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:02:24.723628 136945999634240 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.274508 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:24.746125 136945999634240 generateRocpd.cpp:583] writing SQL database for process 2524249 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:02:24.746899 136945999634240 generateRocpd.cpp:606] Opened result file: tests/workloads/device_filter/MI200/out/pmc_1/smc4124-25-mi210-3c48/2524249_results.db (UUID=0001fa77-46dd-76dd-9153-5f7b40135b65)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:24.830067 136945999634240 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007968 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:24.831204 136945999634240 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001121 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:24.832809 136945999634240 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001591 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:24.843084 136945999634240 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008280 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:25.163995 136945999634240 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.320897 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:25.166126 136945999634240 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002111 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:25.166143 136945999634240 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:25.176094 136945999634240 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009943 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:25.176109 136945999634240 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:25.176115 136945999634240 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:25.176122 136945999634240 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:25.176248 136945999634240 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000114 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:25.176523 136945999634240 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.430399 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:25.179584 136945999634240 simple_timer.cpp:55] [rocprofv3] output generation ::     0.454311 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:25.179693 136945999634240 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.456021 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/device_filter/MI200/out/pmc_1/2524249_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 3/13][Approximate profiling time left: 22 seconds]...
[profiling] Current input file: tests/workloads/device_filter/MI200/perfmon/pmc_perf_2.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:02:26.728473 126696037265216 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.190346 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:02:26.729048 126696037265216 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:26.924521 126696037265216 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:02:27.006590 126696037265216 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.277542 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:27.029277 126696037265216 generateRocpd.cpp:583] writing SQL database for process 2524261 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:02:27.030092 126696037265216 generateRocpd.cpp:606] Opened result file: tests/workloads/device_filter/MI200/out/pmc_1/smc4124-25-mi210-3c48/2524261_results.db (UUID=0001fa77-4fc7-7fc7-a3d4-9c91ae01dc6b)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:27.114362 126696037265216 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008047 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:27.115589 126696037265216 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001210 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:27.117319 126696037265216 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001715 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:27.127923 126696037265216 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008420 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:27.427414 126696037265216 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.299476 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:27.429697 126696037265216 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002257 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:27.429715 126696037265216 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:27.439785 126696037265216 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.010063 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:27.439800 126696037265216 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:27.439807 126696037265216 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:27.439814 126696037265216 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:27.439968 126696037265216 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000120 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:27.440248 126696037265216 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.410972 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:27.443335 126696037265216 simple_timer.cpp:55] [rocprofv3] output generation ::     0.434966 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:27.443435 126696037265216 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.436792 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/device_filter/MI200/out/pmc_1/2524261_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 4/13][Approximate profiling time left: 20 seconds]...
[profiling] Current input file: tests/workloads/device_filter/MI200/perfmon/pmc_perf_3.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:02:28.993862 140482737766208 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.192216 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:02:28.994482 140482737766208 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:29.188782 140482737766208 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:02:29.271547 140482737766208 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.277066 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:29.294099 140482737766208 generateRocpd.cpp:583] writing SQL database for process 2524271 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:02:29.294866 140482737766208 generateRocpd.cpp:606] Opened result file: tests/workloads/device_filter/MI200/out/pmc_1/smc4124-25-mi210-3c48/2524271_results.db (UUID=0001fa77-589e-789e-b395-0d4597153253)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:29.378093 140482737766208 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008044 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:29.379310 140482737766208 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001202 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:29.381431 140482737766208 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002107 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:29.391892 140482737766208 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008340 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:29.679190 140482737766208 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.287283 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:29.681419 140482737766208 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002213 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:29.681436 140482737766208 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:29.691265 140482737766208 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009821 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:29.691279 140482737766208 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:29.691286 140482737766208 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:29.691292 140482737766208 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:29.691396 140482737766208 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000097 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:29.691622 140482737766208 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.397523 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:29.694661 140482737766208 simple_timer.cpp:55] [rocprofv3] output generation ::     0.421322 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:29.694751 140482737766208 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.423161 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/device_filter/MI200/out/pmc_1/2524271_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 5/13][Approximate profiling time left: 18 seconds]...
[profiling] Current input file: tests/workloads/device_filter/MI200/perfmon/pmc_perf_4.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:02:31.214198 127841683906368 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.192539 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:02:31.214770 127841683906368 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:31.407640 127841683906368 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:02:31.501221 127841683906368 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.286451 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:31.523456 127841683906368 generateRocpd.cpp:583] writing SQL database for process 2524280 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:02:31.524257 127841683906368 generateRocpd.cpp:606] Opened result file: tests/workloads/device_filter/MI200/out/pmc_1/smc4124-25-mi210-3c48/2524280_results.db (UUID=0001fa77-614a-714a-a900-0bbf84c74316)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:31.608337 127841683906368 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008090 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:31.609549 127841683906368 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001196 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:31.611266 127841683906368 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001702 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:31.622052 127841683906368 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008581 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:31.910895 127841683906368 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.288827 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:31.913261 127841683906368 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002351 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:31.913278 127841683906368 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:31.922617 127841683906368 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009332 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:31.922631 127841683906368 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:31.922637 127841683906368 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:31.922644 127841683906368 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:31.922747 127841683906368 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000096 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:31.922959 127841683906368 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.399503 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:31.926040 127841683906368 simple_timer.cpp:55] [rocprofv3] output generation ::     0.423419 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:31.926126 127841683906368 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.424863 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/device_filter/MI200/out/pmc_1/2524280_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 6/13][Approximate profiling time left: 15 seconds]...
[profiling] Current input file: tests/workloads/device_filter/MI200/perfmon/pmc_perf_5.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:02:33.408778 133122341879616 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.184760 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:02:33.409391 133122341879616 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:33.601992 133122341879616 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:02:33.692527 133122341879616 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.283136 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:33.716106 133122341879616 generateRocpd.cpp:583] writing SQL database for process 2524291 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:02:33.716899 133122341879616 generateRocpd.cpp:606] Opened result file: tests/workloads/device_filter/MI200/out/pmc_1/smc4124-25-mi210-3c48/2524291_results.db (UUID=0001fa77-69e4-79e4-8c7b-ee6dbafa9b77)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:33.797294 133122341879616 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007772 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:33.798482 133122341879616 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001172 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:33.800084 133122341879616 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001586 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:33.810553 133122341879616 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008484 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:33.819120 133122341879616 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.008552 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:33.821185 133122341879616 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002050 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:33.821202 133122341879616 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000003 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:33.829778 133122341879616 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008569 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:33.829793 133122341879616 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:33.829799 133122341879616 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:33.829806 133122341879616 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:33.829906 133122341879616 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000093 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:33.830108 133122341879616 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.114002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:33.832917 133122341879616 simple_timer.cpp:55] [rocprofv3] output generation ::     0.137855 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:33.832961 133122341879616 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.140384 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/device_filter/MI200/out/pmc_1/2524291_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 7/13][Approximate profiling time left: 13 seconds]...
[profiling] Current input file: tests/workloads/device_filter/MI200/perfmon/pmc_perf_SQC_DCACHE_INFLIGHT_LEVEL_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:02:35.383812 134570809446208 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.192180 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:02:35.384443 134570809446208 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:35.587211 134570809446208 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:02:35.690261 134570809446208 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.305819 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:35.712396 134570809446208 generateRocpd.cpp:583] writing SQL database for process 2524310 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:02:35.713231 134570809446208 generateRocpd.cpp:606] Opened result file: tests/workloads/device_filter/MI200/out/pmc_1/smc4124-25-mi210-3c48/2524310_results.db (UUID=0001fa77-7194-7194-ab0e-bce870fb1e50)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:35.796928 134570809446208 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008021 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:35.798102 134570809446208 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001157 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:35.800173 134570809446208 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002056 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:35.810429 134570809446208 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008341 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:36.240888 134570809446208 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.430444 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:36.243182 134570809446208 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002276 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:36.243199 134570809446208 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:36.251900 134570809446208 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008693 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:36.251915 134570809446208 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:36.251921 134570809446208 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:36.251928 134570809446208 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:36.252042 134570809446208 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000107 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:36.252270 134570809446208 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.539874 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:36.255302 134570809446208 simple_timer.cpp:55] [rocprofv3] output generation ::     0.563888 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:36.255424 134570809446208 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.565114 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/device_filter/MI200/out/pmc_1/2524310_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 8/13][Approximate profiling time left: 11 seconds]...
[profiling] Current input file: tests/workloads/device_filter/MI200/perfmon/pmc_perf_SQC_ICACHE_INFLIGHT_LEVEL_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:02:37.802059 136448910049088 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.200791 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:02:37.802725 136448910049088 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:38.005674 136448910049088 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:02:38.095773 136448910049088 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.293048 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:38.118637 136448910049088 generateRocpd.cpp:583] writing SQL database for process 2524522 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:02:38.119402 136448910049088 generateRocpd.cpp:606] Opened result file: tests/workloads/device_filter/MI200/out/pmc_1/smc4124-25-mi210-3c48/2524522_results.db (UUID=0001fa77-7afe-7afe-8e62-25a6ff36b0b6)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:38.203634 136448910049088 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008220 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:38.204748 136448910049088 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001097 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:38.206700 136448910049088 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001936 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:38.217058 136448910049088 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008368 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:38.625085 136448910049088 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.408012 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:38.627394 136448910049088 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002283 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:38.627413 136448910049088 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:38.636467 136448910049088 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009046 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:38.636481 136448910049088 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:38.636488 136448910049088 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:38.636495 136448910049088 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:38.636600 136448910049088 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000098 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:38.636803 136448910049088 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.518166 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:38.639839 136448910049088 simple_timer.cpp:55] [rocprofv3] output generation ::     0.542381 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:38.639937 136448910049088 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.544131 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/device_filter/MI200/out/pmc_1/2524522_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 9/13][Approximate profiling time left: 9 seconds]...
[profiling] Current input file: tests/workloads/device_filter/MI200/perfmon/pmc_perf_SQ_IFETCH_LEVEL_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:02:40.211708 139452146401088 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.197036 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:02:40.212322 139452146401088 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:40.405404 139452146401088 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:02:40.502984 139452146401088 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.290663 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:40.525671 139452146401088 generateRocpd.cpp:583] writing SQL database for process 2524548 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:02:40.526486 139452146401088 generateRocpd.cpp:606] Opened result file: tests/workloads/device_filter/MI200/out/pmc_1/smc4124-25-mi210-3c48/2524548_results.db (UUID=0001fa77-846b-746b-90f5-0871185cbce3)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:40.609199 139452146401088 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008225 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:40.610387 139452146401088 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001171 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:40.612539 139452146401088 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002138 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:40.623072 139452146401088 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008519 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:41.206471 139452146401088 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.583383 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:41.208894 139452146401088 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002398 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:41.208911 139452146401088 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:41.218177 139452146401088 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009259 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:41.218192 139452146401088 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:41.218199 139452146401088 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:41.218205 139452146401088 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:41.218313 139452146401088 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000100 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:41.218574 139452146401088 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.692903 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:41.221757 139452146401088 simple_timer.cpp:55] [rocprofv3] output generation ::     0.717211 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:41.221899 139452146401088 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.718859 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/device_filter/MI200/out/pmc_1/2524548_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 10/13][Approximate profiling time left: 6 seconds]...
[profiling] Current input file: tests/workloads/device_filter/MI200/perfmon/pmc_perf_SQ_INST_LEVEL_LDS_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:02:42.751650 125941098553152 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.190654 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:02:42.752284 125941098553152 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:42.946715 125941098553152 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:02:43.033778 125941098553152 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.281494 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:43.056210 125941098553152 generateRocpd.cpp:583] writing SQL database for process 2524557 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:02:43.056993 125941098553152 generateRocpd.cpp:606] Opened result file: tests/workloads/device_filter/MI200/out/pmc_1/smc4124-25-mi210-3c48/2524557_results.db (UUID=0001fa77-8e5d-7e5d-8c68-bb8b227bb5f4)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:43.139826 125941098553152 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008196 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:43.141028 125941098553152 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001186 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:43.142991 125941098553152 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001941 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:43.153441 125941098553152 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008438 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:43.498146 125941098553152 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.344690 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:43.500368 125941098553152 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002204 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:43.500386 125941098553152 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:43.509018 125941098553152 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008625 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:43.509040 125941098553152 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:43.509047 125941098553152 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:43.509055 125941098553152 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:43.509170 125941098553152 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000108 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:43.509419 125941098553152 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.453210 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:43.512458 125941098553152 simple_timer.cpp:55] [rocprofv3] output generation ::     0.476956 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:43.512576 125941098553152 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.478755 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/device_filter/MI200/out/pmc_1/2524557_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 11/13][Approximate profiling time left: 4 seconds]...
[profiling] Current input file: tests/workloads/device_filter/MI200/perfmon/pmc_perf_SQ_INST_LEVEL_SMEM_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:02:45.041192 126290833710912 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.193336 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:02:45.041806 126290833710912 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:45.238156 126290833710912 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:02:45.333875 126290833710912 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.292070 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:45.356826 126290833710912 generateRocpd.cpp:583] writing SQL database for process 2524566 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:02:45.357580 126290833710912 generateRocpd.cpp:606] Opened result file: tests/workloads/device_filter/MI200/out/pmc_1/smc4124-25-mi210-3c48/2524566_results.db (UUID=0001fa77-974c-774c-ac18-35b03eb8238c)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:45.441306 126290833710912 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008175 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:45.442531 126290833710912 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001209 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:45.444729 126290833710912 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002183 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:45.455305 126290833710912 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008476 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:45.789859 126290833710912 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.334540 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:45.792252 126290833710912 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002374 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:45.792270 126290833710912 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:45.802400 126290833710912 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.010123 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:45.802414 126290833710912 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:45.802420 126290833710912 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:45.802427 126290833710912 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:45.802529 126290833710912 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000095 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:45.802732 126290833710912 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.445906 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:45.805719 126290833710912 simple_timer.cpp:55] [rocprofv3] output generation ::     0.469892 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:45.805805 126290833710912 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.471896 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/device_filter/MI200/out/pmc_1/2524566_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 12/13][Approximate profiling time left: 2 seconds]...
[profiling] Current input file: tests/workloads/device_filter/MI200/perfmon/pmc_perf_SQ_INST_LEVEL_VMEM_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:02:47.349288 126319204572992 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.198390 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:02:47.349866 126319204572992 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:47.543588 126319204572992 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:02:47.645611 126319204572992 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.295746 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:47.668068 126319204572992 generateRocpd.cpp:583] writing SQL database for process 2524575 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:02:47.668886 126319204572992 generateRocpd.cpp:606] Opened result file: tests/workloads/device_filter/MI200/out/pmc_1/smc4124-25-mi210-3c48/2524575_results.db (UUID=0001fa77-a04b-704b-a9bb-3ecb132cb68e)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:47.750439 126319204572992 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008131 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:47.751620 126319204572992 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001165 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:47.753751 126319204572992 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002115 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:47.764239 126319204572992 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008357 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:48.283020 126319204572992 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.518766 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:48.285250 126319204572992 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002201 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:48.285268 126319204572992 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:48.293957 126319204572992 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008683 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:48.293972 126319204572992 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:48.293978 126319204572992 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:48.293984 126319204572992 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:48.294092 126319204572992 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000101 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:48.294289 126319204572992 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.626222 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:48.297260 126319204572992 simple_timer.cpp:55] [rocprofv3] output generation ::     0.649773 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:48.297381 126319204572992 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.651725 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/device_filter/MI200/out/pmc_1/2524575_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 13/13][Approximate profiling time left: 0 seconds]...
[profiling] Current input file: tests/workloads/device_filter/MI200/perfmon/pmc_perf_SQ_LEVEL_WAVES_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:02:49.823236 138785579761472 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.191196 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:02:49.823823 138785579761472 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:50.016585 138785579761472 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:02:50.105485 138785579761472 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.281661 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:50.128756 138785579761472 generateRocpd.cpp:583] writing SQL database for process 2524603 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:02:50.129565 138785579761472 generateRocpd.cpp:606] Opened result file: tests/workloads/device_filter/MI200/out/pmc_1/smc4124-25-mi210-3c48/2524603_results.db (UUID=0001fa77-a9fc-79fc-a0f7-257b40191efe)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:50.213639 138785579761472 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008119 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:50.214833 138785579761472 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001179 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:50.216532 138785579761472 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001684 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:50.227192 138785579761472 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008503 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:50.545845 138785579761472 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.318637 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:50.548563 138785579761472 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002688 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:50.548580 138785579761472 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:50.557814 138785579761472 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009226 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:50.557829 138785579761472 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:50.557835 138785579761472 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:50.557842 138785579761472 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:50.557949 138785579761472 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000100 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:50.558169 138785579761472 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.429413 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:50.561152 138785579761472 simple_timer.cpp:55] [rocprofv3] output generation ::     0.453512 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:02:50.561241 138785579761472 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.455707 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/device_filter/MI200/out/pmc_1/2524603_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
PC sampling data collection skipped as block 21 is not specified.
[roofline] Checking for roofline.csv in tests/workloads/device_filter/MI200
[roofline] Benchmark execution failed: 'L1'. Skipping roofline.
