Rocprofiler-Compute version: 3.7.0
Profiler choice: rocprofiler-sdk
Output directory: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/dispatch_0/MI200
Target: MI210
Command: ./tests/vcopy -n 1048576 -b 256 -i 3
Kernel Selection: None
Dispatch Selection: ['1']
Filtered sections: All

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Collecting Performance Counters
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Generating native tool project using command: cmake -S /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib -B /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build
-- Checking for module 'libdw'
--   Package 'libdw', required by 'virtual:world', not found
-- Could NOT find libdw (missing: libdw_LIBRARY libdw_INCLUDE_DIR)
-- {fmt} version: 12.1.0
-- Build type:
-- Configuring done (0.1s)
-- Generating done (0.0s)
-- Build files have been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build
Building native tool using command: cmake --build /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build --parallel
[  0%] Built target gsl_assert
[ 33%] Built target fmt
[100%] Built target rocprofiler-compute-tool
Searching /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src by lib/_build/lib/librocprofiler-compute-tool.so for native collector
Using native collector: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build/lib/librocprofiler-compute-tool.so
Using native counter collection tool: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build/lib/librocprofiler-compute-tool.so
[profiling] Iteration multiplexing: Disabled
[Run 1/13][Approximate profiling time left: pending first measurement...]
[profiling] Current input file: tests/workloads/dispatch_0/MI200/perfmon/pmc_perf_0.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:57:53.657630 123597971799872 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.191048 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:57:53.658288 123597971799872 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:53.850668 123597971799872 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:57:53.932294 123597971799872 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.274006 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:53.954831 123597971799872 generateRocpd.cpp:583] writing SQL database for process 2523174 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:57:53.955645 123597971799872 generateRocpd.cpp:606] Opened result file: tests/workloads/dispatch_0/MI200/out/pmc_1/smc4124-25-mi210-3c48/2523174_results.db (UUID=0001fa73-2517-7517-afb5-6fb4e09d4399)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:54.036841 123597971799872 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007970 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:54.037946 123597971799872 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001088 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:54.039523 123597971799872 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001562 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:54.049733 123597971799872 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008305 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:54.376252 123597971799872 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.326504 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:54.378550 123597971799872 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002280 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:54.378567 123597971799872 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:54.387664 123597971799872 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009090 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:54.387679 123597971799872 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:54.387685 123597971799872 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:54.387692 123597971799872 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:54.387851 123597971799872 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000122 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:54.388105 123597971799872 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.433275 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:54.391077 123597971799872 simple_timer.cpp:55] [rocprofv3] output generation ::     0.457147 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:54.391182 123597971799872 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.458838 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/dispatch_0/MI200/out/pmc_1/2523174_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 2/13][Approximate profiling time left: 25 seconds]...
[profiling] Current input file: tests/workloads/dispatch_0/MI200/perfmon/pmc_perf_1.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:57:55.926762 128959292448576 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.192869 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:57:55.927408 128959292448576 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:56.122891 128959292448576 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:57:56.211295 128959292448576 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.283888 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:56.233478 128959292448576 generateRocpd.cpp:583] writing SQL database for process 2523183 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:57:56.234307 128959292448576 generateRocpd.cpp:606] Opened result file: tests/workloads/dispatch_0/MI200/out/pmc_1/smc4124-25-mi210-3c48/2523183_results.db (UUID=0001fa73-2df2-7df2-9ed2-3bd36f5f3d3b)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:56.317675 128959292448576 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008017 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:56.318886 128959292448576 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001194 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:56.321023 128959292448576 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002122 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:56.331414 128959292448576 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008326 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:56.642889 128959292448576 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.311460 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:56.645241 128959292448576 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002335 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:56.645258 128959292448576 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:56.654649 128959292448576 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009383 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:56.654663 128959292448576 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:56.654669 128959292448576 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:56.654676 128959292448576 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:56.654793 128959292448576 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000109 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:56.655012 128959292448576 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.421534 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:56.658061 128959292448576 simple_timer.cpp:55] [rocprofv3] output generation ::     0.445571 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:56.658161 128959292448576 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.446817 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/dispatch_0/MI200/out/pmc_1/2523183_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 3/13][Approximate profiling time left: 22 seconds]...
[profiling] Current input file: tests/workloads/dispatch_0/MI200/perfmon/pmc_perf_2.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:57:58.204946 133886908972864 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.191085 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:57:58.205559 133886908972864 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:58.398263 133886908972864 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:57:58.490729 133886908972864 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.285170 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:58.513480 133886908972864 generateRocpd.cpp:583] writing SQL database for process 2523191 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:57:58.514285 133886908972864 generateRocpd.cpp:606] Opened result file: tests/workloads/dispatch_0/MI200/out/pmc_1/smc4124-25-mi210-3c48/2523191_results.db (UUID=0001fa73-36da-76da-a297-430ee8ec21cd)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:58.596210 133886908972864 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007997 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:58.597401 133886908972864 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001173 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:58.599060 133886908972864 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001644 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:58.609683 133886908972864 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008504 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:58.908349 133886908972864 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.298650 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:58.910653 133886908972864 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002288 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:58.910671 133886908972864 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:58.919692 133886908972864 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009014 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:58.919707 133886908972864 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:58.919713 133886908972864 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:58.919720 133886908972864 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:58.919838 133886908972864 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000110 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:58.920107 133886908972864 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.406627 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:58.923133 133886908972864 simple_timer.cpp:55] [rocprofv3] output generation ::     0.430472 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:58.923237 133886908972864 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.432460 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/dispatch_0/MI200/out/pmc_1/2523191_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 4/13][Approximate profiling time left: 20 seconds]...
[profiling] Current input file: tests/workloads/dispatch_0/MI200/perfmon/pmc_perf_3.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:58:00.435157 136692738965312 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.189247 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:58:00.435753 136692738965312 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:00.629256 136692738965312 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:58:00.723475 136692738965312 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.287723 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:00.745791 136692738965312 generateRocpd.cpp:583] writing SQL database for process 2523199 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:58:00.746589 136692738965312 generateRocpd.cpp:606] Opened result file: tests/workloads/dispatch_0/MI200/out/pmc_1/smc4124-25-mi210-3c48/2523199_results.db (UUID=0001fa73-3f92-7f92-aea9-4b7afc77f41f)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:00.829583 136692738965312 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007991 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:00.830795 136692738965312 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001196 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:00.832957 136692738965312 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002147 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:00.843550 136692738965312 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008476 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:01.144304 136692738965312 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.300740 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:01.146588 136692738965312 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002264 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:01.146606 136692738965312 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:01.156686 136692738965312 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.010072 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:01.156700 136692738965312 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:01.156706 136692738965312 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:01.156713 136692738965312 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:01.156827 136692738965312 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000107 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:01.157054 136692738965312 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.411263 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:01.159972 136692738965312 simple_timer.cpp:55] [rocprofv3] output generation ::     0.435061 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:01.160065 136692738965312 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.436549 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/dispatch_0/MI200/out/pmc_1/2523199_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 5/13][Approximate profiling time left: 18 seconds]...
[profiling] Current input file: tests/workloads/dispatch_0/MI200/perfmon/pmc_perf_4.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:58:02.680415 124184907226944 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.190178 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:58:02.681049 124184907226944 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:02.875322 124184907226944 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:58:02.969599 124184907226944 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.288550 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:02.991864 124184907226944 generateRocpd.cpp:583] writing SQL database for process 2523209 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:58:02.992679 124184907226944 generateRocpd.cpp:606] Opened result file: tests/workloads/dispatch_0/MI200/out/pmc_1/smc4124-25-mi210-3c48/2523209_results.db (UUID=0001fa73-4857-7857-ae61-b195f81264b1)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:03.075968 124184907226944 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007990 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:03.077186 124184907226944 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001202 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:03.078890 124184907226944 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001688 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:03.089657 124184907226944 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008560 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:03.368520 124184907226944 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.278849 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:03.370875 124184907226944 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002335 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:03.370892 124184907226944 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:03.379714 124184907226944 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008814 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:03.379729 124184907226944 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:03.379736 124184907226944 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:03.379743 124184907226944 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:03.379872 124184907226944 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000121 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:03.380137 124184907226944 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.388274 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:03.383164 124184907226944 simple_timer.cpp:55] [rocprofv3] output generation ::     0.412184 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:03.383267 124184907226944 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.413616 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/dispatch_0/MI200/out/pmc_1/2523209_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 6/13][Approximate profiling time left: 15 seconds]...
[profiling] Current input file: tests/workloads/dispatch_0/MI200/perfmon/pmc_perf_5.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:58:04.861745 129209021456192 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.183742 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:58:04.862334 129209021456192 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:05.056613 129209021456192 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:58:05.142641 129209021456192 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.280308 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:05.164735 129209021456192 generateRocpd.cpp:583] writing SQL database for process 2523218 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:58:05.165532 129209021456192 generateRocpd.cpp:606] Opened result file: tests/workloads/dispatch_0/MI200/out/pmc_1/smc4124-25-mi210-3c48/2523218_results.db (UUID=0001fa73-50e2-70e2-8dcc-b008df3a9c81)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:05.246864 129209021456192 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007732 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:05.248056 129209021456192 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001175 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:05.249719 129209021456192 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001648 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:05.260360 129209021456192 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008480 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:05.268882 129209021456192 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.008507 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:05.270982 129209021456192 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002086 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:05.270999 129209021456192 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:05.279510 129209021456192 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008503 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:05.279524 129209021456192 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:05.279530 129209021456192 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:05.279536 129209021456192 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:05.279639 129209021456192 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000094 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:05.279829 129209021456192 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.115095 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:05.282628 129209021456192 simple_timer.cpp:55] [rocprofv3] output generation ::     0.138610 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:05.282672 129209021456192 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.139982 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/dispatch_0/MI200/out/pmc_1/2523218_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 7/13][Approximate profiling time left: 13 seconds]...
[profiling] Current input file: tests/workloads/dispatch_0/MI200/perfmon/pmc_perf_SQC_DCACHE_INFLIGHT_LEVEL_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:58:06.786554 139961087565632 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.189167 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:58:06.787205 139961087565632 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:06.980227 139961087565632 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:58:07.078316 139961087565632 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.291111 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:07.100982 139961087565632 generateRocpd.cpp:583] writing SQL database for process 2523226 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:58:07.101794 139961087565632 generateRocpd.cpp:606] Opened result file: tests/workloads/dispatch_0/MI200/out/pmc_1/smc4124-25-mi210-3c48/2523226_results.db (UUID=0001fa73-5862-7862-a662-5b5429cbec07)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:07.185384 139961087565632 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008165 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:07.186593 139961087565632 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001185 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:07.188588 139961087565632 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001979 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:07.198907 139961087565632 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008294 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:07.606655 139961087565632 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.407731 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:07.608904 139961087565632 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002229 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:07.608923 139961087565632 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:07.617784 139961087565632 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008849 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:07.617801 139961087565632 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:07.617815 139961087565632 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:07.617825 139961087565632 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:07.617948 139961087565632 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000114 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:07.618200 139961087565632 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.517218 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:07.621196 139961087565632 simple_timer.cpp:55] [rocprofv3] output generation ::     0.541304 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:07.621316 139961087565632 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.542942 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/dispatch_0/MI200/out/pmc_1/2523226_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 8/13][Approximate profiling time left: 11 seconds]...
[profiling] Current input file: tests/workloads/dispatch_0/MI200/perfmon/pmc_perf_SQC_ICACHE_INFLIGHT_LEVEL_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:58:09.157859 139494844612416 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.190592 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:58:09.158471 139494844612416 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:09.351504 139494844612416 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:58:09.438205 139494844612416 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.279735 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:09.460752 139494844612416 generateRocpd.cpp:583] writing SQL database for process 2523234 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:58:09.461551 139494844612416 generateRocpd.cpp:606] Opened result file: tests/workloads/dispatch_0/MI200/out/pmc_1/smc4124-25-mi210-3c48/2523234_results.db (UUID=0001fa73-61a4-71a4-9a30-82c7460cf327)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:09.544816 139494844612416 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008111 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:09.545956 139494844612416 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001124 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:09.547936 139494844612416 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001965 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:09.558493 139494844612416 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008523 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:09.960046 139494844612416 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.401539 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:09.962358 139494844612416 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002293 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:09.962376 139494844612416 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:09.972359 139494844612416 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009976 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:09.972374 139494844612416 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:09.972380 139494844612416 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:09.972387 139494844612416 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:09.972550 139494844612416 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000124 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:09.972824 139494844612416 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.512072 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:09.975829 139494844612416 simple_timer.cpp:55] [rocprofv3] output generation ::     0.536031 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:09.975947 139494844612416 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.537703 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/dispatch_0/MI200/out/pmc_1/2523234_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 9/13][Approximate profiling time left: 8 seconds]...
[profiling] Current input file: tests/workloads/dispatch_0/MI200/perfmon/pmc_perf_SQ_IFETCH_LEVEL_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:58:11.519500 124931657207616 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.199444 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:58:11.520154 124931657207616 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:11.714460 124931657207616 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:58:11.801681 124931657207616 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.281527 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:11.823635 124931657207616 generateRocpd.cpp:583] writing SQL database for process 2523242 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:58:11.824442 124931657207616 generateRocpd.cpp:606] Opened result file: tests/workloads/dispatch_0/MI200/out/pmc_1/smc4124-25-mi210-3c48/2523242_results.db (UUID=0001fa73-6ad4-7ad4-b0e6-b27d33eaa6d5)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:11.907155 124931657207616 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008146 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:11.908290 124931657207616 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001115 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:11.910246 124931657207616 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001941 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:11.920701 124931657207616 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008460 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:12.510122 124931657207616 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.589405 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:12.512303 124931657207616 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002155 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:12.512321 124931657207616 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:12.520821 124931657207616 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008492 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:12.520836 124931657207616 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:12.520842 124931657207616 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:12.520849 124931657207616 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:12.520967 124931657207616 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000111 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:12.521218 124931657207616 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.697584 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:12.524193 124931657207616 simple_timer.cpp:55] [rocprofv3] output generation ::     0.721133 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:12.524329 124931657207616 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.722605 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/dispatch_0/MI200/out/pmc_1/2523242_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 10/13][Approximate profiling time left: 6 seconds]...
[profiling] Current input file: tests/workloads/dispatch_0/MI200/perfmon/pmc_perf_SQ_INST_LEVEL_LDS_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:58:14.052351 134689430323008 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.190367 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:58:14.052966 134689430323008 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:14.246618 134689430323008 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:58:14.332417 134689430323008 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.279452 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:14.354618 134689430323008 generateRocpd.cpp:583] writing SQL database for process 2523251 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:58:14.355421 134689430323008 generateRocpd.cpp:606] Opened result file: tests/workloads/dispatch_0/MI200/out/pmc_1/smc4124-25-mi210-3c48/2523251_results.db (UUID=0001fa73-74c2-74c2-ace0-5956c27e0bdf)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:14.438637 134689430323008 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008098 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:14.439854 134689430323008 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001202 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:14.441898 134689430323008 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002029 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:14.452344 134689430323008 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008414 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:14.867845 134689430323008 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.415485 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:14.870143 134689430323008 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002271 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:14.870160 134689430323008 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:14.878829 134689430323008 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008661 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:14.878845 134689430323008 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:14.878851 134689430323008 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:14.878858 134689430323008 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:14.878984 134689430323008 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000119 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:14.879240 134689430323008 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.524622 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:14.882212 134689430323008 simple_timer.cpp:55] [rocprofv3] output generation ::     0.548289 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:14.882322 134689430323008 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.549854 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/dispatch_0/MI200/out/pmc_1/2523251_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 11/13][Approximate profiling time left: 4 seconds]...
[profiling] Current input file: tests/workloads/dispatch_0/MI200/perfmon/pmc_perf_SQ_INST_LEVEL_SMEM_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:58:16.405548 131473667931968 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.191921 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:58:16.406168 131473667931968 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:16.600690 131473667931968 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:58:16.684210 131473667931968 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.278043 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:16.706594 131473667931968 generateRocpd.cpp:583] writing SQL database for process 2523259 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:58:16.707399 131473667931968 generateRocpd.cpp:606] Opened result file: tests/workloads/dispatch_0/MI200/out/pmc_1/smc4124-25-mi210-3c48/2523259_results.db (UUID=0001fa73-7df2-7df2-80b6-4384d04e4023)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:16.788543 131473667931968 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007999 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:16.789720 131473667931968 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001161 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:16.791852 131473667931968 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002117 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:16.802242 131473667931968 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008396 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:17.136676 131473667931968 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.334420 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:17.139057 131473667931968 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002363 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:17.139075 131473667931968 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:17.148458 131473667931968 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009376 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:17.148473 131473667931968 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:17.148480 131473667931968 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:17.148486 131473667931968 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:17.148602 131473667931968 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000109 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:17.148852 131473667931968 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.442259 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:17.151757 131473667931968 simple_timer.cpp:55] [rocprofv3] output generation ::     0.465827 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:17.151854 131473667931968 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.467593 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/dispatch_0/MI200/out/pmc_1/2523259_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 12/13][Approximate profiling time left: 2 seconds]...
[profiling] Current input file: tests/workloads/dispatch_0/MI200/perfmon/pmc_perf_SQ_INST_LEVEL_VMEM_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:58:18.699429 130275112066880 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.199541 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:58:18.700017 130275112066880 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:18.893488 130275112066880 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:58:18.984086 130275112066880 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.284070 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:19.006753 130275112066880 generateRocpd.cpp:583] writing SQL database for process 2523268 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:58:19.007533 130275112066880 generateRocpd.cpp:606] Opened result file: tests/workloads/dispatch_0/MI200/out/pmc_1/smc4124-25-mi210-3c48/2523268_results.db (UUID=0001fa73-86e0-76e0-be72-3269db96f6d7)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:19.090761 130275112066880 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008248 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:19.091980 130275112066880 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001203 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:19.094127 130275112066880 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002132 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:19.104824 130275112066880 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008613 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:19.629436 130275112066880 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.524597 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:19.631737 130275112066880 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002274 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:19.631754 130275112066880 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:19.640356 130275112066880 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008595 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:19.640371 130275112066880 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:19.640378 130275112066880 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:19.640385 130275112066880 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:19.640527 130275112066880 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000135 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:19.640796 130275112066880 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.634043 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:19.643782 130275112066880 simple_timer.cpp:55] [rocprofv3] output generation ::     0.657907 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:19.643910 130275112066880 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.659780 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/dispatch_0/MI200/out/pmc_1/2523268_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 13/13][Approximate profiling time left: 0 seconds]...
[profiling] Current input file: tests/workloads/dispatch_0/MI200/perfmon/pmc_perf_SQ_LEVEL_WAVES_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:58:21.168506 137780607409984 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.189479 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:58:21.169123 137780607409984 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:21.361905 137780607409984 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:58:21.445601 137780607409984 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.276478 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:21.467837 137780607409984 generateRocpd.cpp:583] writing SQL database for process 2523277 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:58:21.468634 137780607409984 generateRocpd.cpp:606] Opened result file: tests/workloads/dispatch_0/MI200/out/pmc_1/smc4124-25-mi210-3c48/2523277_results.db (UUID=0001fa73-9090-7090-82b4-18e758668f85)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:21.551472 137780607409984 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007905 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:21.552697 137780607409984 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001209 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:21.554303 137780607409984 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001590 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:21.564669 137780607409984 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008381 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:21.884010 137780607409984 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.319325 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:21.886303 137780607409984 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002274 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:21.886321 137780607409984 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:21.896253 137780607409984 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009924 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:21.896270 137780607409984 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:21.896276 137780607409984 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:21.896283 137780607409984 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:21.896414 137780607409984 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000124 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:21.896683 137780607409984 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.428846 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:21.899663 137780607409984 simple_timer.cpp:55] [rocprofv3] output generation ::     0.452597 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:58:21.899778 137780607409984 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.454135 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/dispatch_0/MI200/out/pmc_1/2523277_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
PC sampling data collection skipped as block 21 is not specified.
[roofline] Checking for roofline.csv in tests/workloads/dispatch_0/MI200
[roofline] Benchmark execution failed: 'L1'. Skipping roofline.
