Rocprofiler-Compute version: 3.7.0
Profiler choice: rocprofiler-sdk
Output directory: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/sort_kernels/MI200
Target: MI210
Command: ./tests/vcopy -n 1048576 -b 256 -i 3
Kernel Selection: None
Dispatch Selection: None
Filtered sections: All

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Collecting Performance Counters
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Generating native tool project using command: cmake -S /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib -B /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build
-- Checking for module 'libdw'
--   Package 'libdw', required by 'virtual:world', not found
-- Could NOT find libdw (missing: libdw_LIBRARY libdw_INCLUDE_DIR)
-- {fmt} version: 12.1.0
-- Build type:
-- Configuring done (0.1s)
-- Generating done (0.0s)
-- Build files have been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build
Building native tool using command: cmake --build /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build --parallel
[  0%] Built target gsl_assert
[ 33%] Built target fmt
[100%] Built target rocprofiler-compute-tool
Searching /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src by lib/_build/lib/librocprofiler-compute-tool.so for native collector
Using native collector: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build/lib/librocprofiler-compute-tool.so
Using native counter collection tool: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build/lib/librocprofiler-compute-tool.so
[profiling] Iteration multiplexing: Disabled
[Run 1/13][Approximate profiling time left: pending first measurement...]
[profiling] Current input file: tests/workloads/sort_kernels/MI200/perfmon/pmc_perf_0.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:17:37.864169 124342554419008 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.191532 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:17:37.864822 124342554419008 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:38.059833 124342554419008 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:17:38.143694 124342554419008 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.278872 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:38.166247 124342554419008 generateRocpd.cpp:583] writing SQL database for process 2528189 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:17:38.167055 124342554419008 generateRocpd.cpp:606] Opened result file: tests/workloads/sort_kernels/MI200/out/pmc_1/smc4124-25-mi210-3c48/2528189_results.db (UUID=0001fa85-36e5-76e5-bc64-908b767365c8)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:38.249738 124342554419008 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007990 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:38.250873 124342554419008 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001118 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:38.252440 124342554419008 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001553 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:38.262595 124342554419008 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008204 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:38.622227 124342554419008 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.359616 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:38.625393 124342554419008 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.003141 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:38.625411 124342554419008 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:38.635865 124342554419008 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.010448 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:38.635880 124342554419008 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:38.635887 124342554419008 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:38.635894 124342554419008 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:38.636051 124342554419008 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000128 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:38.636303 124342554419008 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.470056 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:38.640053 124342554419008 simple_timer.cpp:55] [rocprofv3] output generation ::     0.494557 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:38.640159 124342554419008 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.496416 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/sort_kernels/MI200/out/pmc_1/2528189_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 2/13][Approximate profiling time left: 25 seconds]...
[profiling] Current input file: tests/workloads/sort_kernels/MI200/perfmon/pmc_perf_1.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:17:40.171245 124993791016768 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.189424 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:17:40.171870 124993791016768 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:40.365568 124993791016768 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:17:40.449377 124993791016768 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.277508 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:40.471788 124993791016768 generateRocpd.cpp:583] writing SQL database for process 2528198 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:17:40.472607 124993791016768 generateRocpd.cpp:606] Opened result file: tests/workloads/sort_kernels/MI200/out/pmc_1/smc4124-25-mi210-3c48/2528198_results.db (UUID=0001fa85-3fea-7fea-abcf-e3bbd5153c27)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:40.555626 124993791016768 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008025 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:40.556813 124993791016768 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001170 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:40.558412 124993791016768 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001584 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:40.568581 124993791016768 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008191 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:40.881147 124993791016768 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.312550 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:40.883565 124993791016768 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002388 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:40.883582 124993791016768 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:40.892915 124993791016768 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009326 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:40.892930 124993791016768 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:40.892936 124993791016768 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:40.892943 124993791016768 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:40.893064 124993791016768 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000114 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:40.893271 124993791016768 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.421484 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:40.896531 124993791016768 simple_timer.cpp:55] [rocprofv3] output generation ::     0.445729 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:40.896646 124993791016768 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.447223 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/sort_kernels/MI200/out/pmc_1/2528198_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 3/13][Approximate profiling time left: 22 seconds]...
[profiling] Current input file: tests/workloads/sort_kernels/MI200/perfmon/pmc_perf_2.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:17:42.441256 132370311765824 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.191389 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:17:42.441856 132370311765824 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:42.643402 132370311765824 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:17:42.733322 132370311765824 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.291466 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:42.755595 132370311765824 generateRocpd.cpp:583] writing SQL database for process 2528206 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:17:42.756423 132370311765824 generateRocpd.cpp:606] Opened result file: tests/workloads/sort_kernels/MI200/out/pmc_1/smc4124-25-mi210-3c48/2528206_results.db (UUID=0001fa85-48c6-78c6-ab85-28eafb0b11cd)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:42.839940 132370311765824 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007868 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:42.841149 132370311765824 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001194 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:42.842835 132370311765824 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001670 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:42.853313 132370311765824 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008346 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:43.152330 132370311765824 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.299002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:43.154552 132370311765824 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002206 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:43.154569 132370311765824 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:43.163401 132370311765824 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008825 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:43.163416 132370311765824 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:43.163422 132370311765824 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:43.163428 132370311765824 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:43.163537 132370311765824 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000096 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:43.163738 132370311765824 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.408143 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:43.167025 132370311765824 simple_timer.cpp:55] [rocprofv3] output generation ::     0.432241 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:43.167146 132370311765824 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.433763 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/sort_kernels/MI200/out/pmc_1/2528206_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 4/13][Approximate profiling time left: 20 seconds]...
[profiling] Current input file: tests/workloads/sort_kernels/MI200/perfmon/pmc_perf_3.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:17:44.693322 128522782195520 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.189251 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:17:44.693906 128522782195520 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:44.887894 128522782195520 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:17:44.968062 128522782195520 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.274156 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:44.990643 128522782195520 generateRocpd.cpp:583] writing SQL database for process 2528216 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:17:44.991448 128522782195520 generateRocpd.cpp:606] Opened result file: tests/workloads/sort_kernels/MI200/out/pmc_1/smc4124-25-mi210-3c48/2528216_results.db (UUID=0001fa85-5195-7195-84db-af348ec37858)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:45.074292 128522782195520 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008018 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:45.075501 128522782195520 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001193 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:45.077615 128522782195520 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002099 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:45.087918 128522782195520 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008255 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:45.375289 128522782195520 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.287357 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:45.377561 128522782195520 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002254 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:45.377578 128522782195520 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:45.387175 128522782195520 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009590 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:45.387190 128522782195520 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:45.387196 128522782195520 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:45.387203 128522782195520 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:45.387325 128522782195520 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000111 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:45.387559 128522782195520 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.396917 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:45.390748 128522782195520 simple_timer.cpp:55] [rocprofv3] output generation ::     0.421060 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:45.390857 128522782195520 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.422745 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/sort_kernels/MI200/out/pmc_1/2528216_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 5/13][Approximate profiling time left: 18 seconds]...
[profiling] Current input file: tests/workloads/sort_kernels/MI200/perfmon/pmc_perf_4.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:17:46.923134 137977454198592 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.189853 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:17:46.923727 137977454198592 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:47.115828 137977454198592 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:17:47.197571 137977454198592 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.273845 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:47.219783 137977454198592 generateRocpd.cpp:583] writing SQL database for process 2528224 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:17:47.220591 137977454198592 generateRocpd.cpp:606] Opened result file: tests/workloads/sort_kernels/MI200/out/pmc_1/smc4124-25-mi210-3c48/2528224_results.db (UUID=0001fa85-5a4a-7a4a-b62c-813d85599d71)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:47.302505 137977454198592 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007966 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:47.303684 137977454198592 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001162 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:47.305375 137977454198592 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001676 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:47.315809 137977454198592 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008331 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:47.595660 137977454198592 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.279835 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:47.597912 137977454198592 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002237 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:47.597930 137977454198592 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:47.607245 137977454198592 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009308 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:47.607259 137977454198592 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:47.607265 137977454198592 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:47.607272 137977454198592 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:47.607396 137977454198592 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000094 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:47.607605 137977454198592 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.387823 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:47.610624 137977454198592 simple_timer.cpp:55] [rocprofv3] output generation ::     0.411612 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:47.610722 137977454198592 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.413099 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/sort_kernels/MI200/out/pmc_1/2528224_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 6/13][Approximate profiling time left: 15 seconds]...
[profiling] Current input file: tests/workloads/sort_kernels/MI200/perfmon/pmc_perf_5.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:17:49.099728 131318244007744 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.185264 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:17:49.100380 131318244007744 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:49.299253 131318244007744 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:17:49.381211 131318244007744 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.280831 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:49.403531 131318244007744 generateRocpd.cpp:583] writing SQL database for process 2528232 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:17:49.404348 131318244007744 generateRocpd.cpp:606] Opened result file: tests/workloads/sort_kernels/MI200/out/pmc_1/smc4124-25-mi210-3c48/2528232_results.db (UUID=0001fa85-62cf-72cf-9005-7fcb676aa330)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:49.486748 131318244007744 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007828 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:49.487898 131318244007744 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001129 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:49.489501 131318244007744 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001588 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:49.499866 131318244007744 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008392 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:49.508486 131318244007744 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.008604 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:49.510589 131318244007744 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002089 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:49.510606 131318244007744 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:49.519300 131318244007744 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008687 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:49.519314 131318244007744 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:49.519321 131318244007744 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:49.519327 131318244007744 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:49.519435 131318244007744 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000097 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:49.519606 131318244007744 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.116076 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:49.522448 131318244007744 simple_timer.cpp:55] [rocprofv3] output generation ::     0.139600 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:49.522504 131318244007744 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.141249 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/sort_kernels/MI200/out/pmc_1/2528232_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 7/13][Approximate profiling time left: 13 seconds]...
[profiling] Current input file: tests/workloads/sort_kernels/MI200/perfmon/pmc_perf_SQC_DCACHE_INFLIGHT_LEVEL_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:17:51.045700 123724303523648 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.191127 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:17:51.046306 123724303523648 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:51.239989 123724303523648 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:17:51.321897 123724303523648 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.275591 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:51.344691 123724303523648 generateRocpd.cpp:583] writing SQL database for process 2528241 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:17:51.345502 123724303523648 generateRocpd.cpp:606] Opened result file: tests/workloads/sort_kernels/MI200/out/pmc_1/smc4124-25-mi210-3c48/2528241_results.db (UUID=0001fa85-6a63-7a63-a829-4e939e51b34c)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:51.428283 123724303523648 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008096 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:51.429472 123724303523648 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001174 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:51.431466 123724303523648 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001979 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:51.441919 123724303523648 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008471 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:51.852706 123724303523648 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.410772 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:51.854913 123724303523648 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002186 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:51.854931 123724303523648 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:51.864350 123724303523648 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009412 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:51.864365 123724303523648 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:51.864371 123724303523648 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:51.864378 123724303523648 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:51.864528 123724303523648 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000121 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:51.864774 123724303523648 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.520084 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:51.867808 123724303523648 simple_timer.cpp:55] [rocprofv3] output generation ::     0.544173 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:51.867921 123724303523648 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.545979 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/sort_kernels/MI200/out/pmc_1/2528241_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 8/13][Approximate profiling time left: 11 seconds]...
[profiling] Current input file: tests/workloads/sort_kernels/MI200/perfmon/pmc_perf_SQC_ICACHE_INFLIGHT_LEVEL_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:17:53.400717 138376091623232 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.190475 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:17:53.401345 138376091623232 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:53.595769 138376091623232 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:17:53.679723 138376091623232 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.278378 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:53.702026 138376091623232 generateRocpd.cpp:583] writing SQL database for process 2528249 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:17:53.702827 138376091623232 generateRocpd.cpp:606] Opened result file: tests/workloads/sort_kernels/MI200/out/pmc_1/smc4124-25-mi210-3c48/2528249_results.db (UUID=0001fa85-7397-7397-8e73-ddbfa033ddd6)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:53.787312 138376091623232 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008093 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:53.788446 138376091623232 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001117 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:53.790418 138376091623232 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001958 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:53.800579 138376091623232 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008146 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:54.202334 138376091623232 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.401740 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:54.204402 138376091623232 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002042 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:54.204420 138376091623232 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:54.213671 138376091623232 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009244 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:54.213685 138376091623232 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:54.213692 138376091623232 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:54.213698 138376091623232 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:54.213830 138376091623232 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000100 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:54.214060 138376091623232 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.512035 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:54.217090 138376091623232 simple_timer.cpp:55] [rocprofv3] output generation ::     0.536007 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:54.217205 138376091623232 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.537433 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/sort_kernels/MI200/out/pmc_1/2528249_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 9/13][Approximate profiling time left: 8 seconds]...
[profiling] Current input file: tests/workloads/sort_kernels/MI200/perfmon/pmc_perf_SQ_IFETCH_LEVEL_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:17:55.778962 136645675093824 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.197266 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:17:55.779557 136645675093824 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:55.974515 136645675093824 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:17:56.083331 136645675093824 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.303774 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:56.105792 136645675093824 generateRocpd.cpp:583] writing SQL database for process 2528257 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:17:56.106582 136645675093824 generateRocpd.cpp:606] Opened result file: tests/workloads/sort_kernels/MI200/out/pmc_1/smc4124-25-mi210-3c48/2528257_results.db (UUID=0001fa85-7cda-7cda-9ff0-94d4df9ec7bf)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:56.188837 136645675093824 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008138 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:56.189941 136645675093824 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001087 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:56.191906 136645675093824 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001950 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:56.202392 136645675093824 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008504 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:56.788240 136645675093824 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.585833 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:56.790309 136645675093824 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002050 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:56.790327 136645675093824 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000005 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:56.799407 136645675093824 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009073 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:56.799421 136645675093824 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:56.799427 136645675093824 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:56.799434 136645675093824 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:56.799540 136645675093824 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000100 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:56.799756 136645675093824 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.693964 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:56.802878 136645675093824 simple_timer.cpp:55] [rocprofv3] output generation ::     0.717928 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:56.803010 136645675093824 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.719640 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/sort_kernels/MI200/out/pmc_1/2528257_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 10/13][Approximate profiling time left: 6 seconds]...
[profiling] Current input file: tests/workloads/sort_kernels/MI200/perfmon/pmc_perf_SQ_INST_LEVEL_LDS_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:17:58.350259 131662457667392 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.191323 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:17:58.350830 131662457667392 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:58.543685 131662457667392 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:17:58.642310 131662457667392 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.291480 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:58.665584 131662457667392 generateRocpd.cpp:583] writing SQL database for process 2528269 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:17:58.666389 131662457667392 generateRocpd.cpp:606] Opened result file: tests/workloads/sort_kernels/MI200/out/pmc_1/smc4124-25-mi210-3c48/2528269_results.db (UUID=0001fa85-86eb-76eb-b18d-95322a0df2f6)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:58.748291 131662457667392 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007913 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:58.749387 131662457667392 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001080 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:58.751316 131662457667392 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001915 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:58.761727 131662457667392 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008443 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:59.103543 131662457667392 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.341801 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:59.105627 131662457667392 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002066 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:59.105645 131662457667392 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:59.114568 131662457667392 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008916 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:59.114582 131662457667392 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:59.114589 131662457667392 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:59.114596 131662457667392 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:59.114727 131662457667392 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000120 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:59.114980 131662457667392 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.449396 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:59.118108 131662457667392 simple_timer.cpp:55] [rocprofv3] output generation ::     0.474255 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:17:59.118218 131662457667392 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.475858 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/sort_kernels/MI200/out/pmc_1/2528269_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 11/13][Approximate profiling time left: 4 seconds]...
[profiling] Current input file: tests/workloads/sort_kernels/MI200/perfmon/pmc_perf_SQ_INST_LEVEL_SMEM_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:18:00.645612 139158417211200 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.189796 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:18:00.646231 139158417211200 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:18:00.840514 139158417211200 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:18:00.936652 139158417211200 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.290421 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:18:00.958986 139158417211200 generateRocpd.cpp:583] writing SQL database for process 2528278 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:18:00.959789 139158417211200 generateRocpd.cpp:606] Opened result file: tests/workloads/sort_kernels/MI200/out/pmc_1/smc4124-25-mi210-3c48/2528278_results.db (UUID=0001fa85-8fe4-7fe4-8556-6532c299eac9)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:18:01.042393 139158417211200 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007956 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:18:01.043573 139158417211200 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001164 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:18:01.045501 139158417211200 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001913 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:18:01.055773 139158417211200 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008303 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:18:01.386762 139158417211200 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.330975 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:18:01.388929 139158417211200 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002148 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:18:01.388947 139158417211200 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:18:01.398002 139158417211200 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009049 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:18:01.398017 139158417211200 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:18:01.398023 139158417211200 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:18:01.398038 139158417211200 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000009 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:18:01.398153 139158417211200 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000107 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:18:01.398386 139158417211200 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.439401 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:18:01.401488 139158417211200 simple_timer.cpp:55] [rocprofv3] output generation ::     0.463409 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:18:01.401601 139158417211200 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.464902 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/sort_kernels/MI200/out/pmc_1/2528278_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 12/13][Approximate profiling time left: 2 seconds]...
[profiling] Current input file: tests/workloads/sort_kernels/MI200/perfmon/pmc_perf_SQ_INST_LEVEL_VMEM_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:18:02.971623 133572881375040 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.199422 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:18:02.972224 133572881375040 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:18:03.168550 133572881375040 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:18:03.270633 133572881375040 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.298409 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:18:03.292817 133572881375040 generateRocpd.cpp:583] writing SQL database for process 2528286 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:18:03.293613 133572881375040 generateRocpd.cpp:606] Opened result file: tests/workloads/sort_kernels/MI200/out/pmc_1/smc4124-25-mi210-3c48/2528286_results.db (UUID=0001fa85-98f1-78f1-bf5a-0e6c2dccbb26)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:18:03.375759 133572881375040 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008036 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:18:03.376869 133572881375040 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001094 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:18:03.378780 133572881375040 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001897 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:18:03.388898 133572881375040 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008128 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:18:03.907359 133572881375040 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.518445 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:18:03.909354 133572881375040 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.001976 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:18:03.909372 133572881375040 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:18:03.918960 133572881375040 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009581 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:18:03.918975 133572881375040 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:18:03.918981 133572881375040 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:18:03.918988 133572881375040 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:18:03.919149 133572881375040 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000129 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:18:03.919393 133572881375040 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.626577 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:18:03.922451 133572881375040 simple_timer.cpp:55] [rocprofv3] output generation ::     0.650396 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:18:03.922575 133572881375040 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.651901 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/sort_kernels/MI200/out/pmc_1/2528286_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 13/13][Approximate profiling time left: 0 seconds]...
[profiling] Current input file: tests/workloads/sort_kernels/MI200/perfmon/pmc_perf_SQ_LEVEL_WAVES_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:18:05.464972 128555315347264 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.190326 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:18:05.465550 128555315347264 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:18:05.658510 128555315347264 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:18:05.748760 128555315347264 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.283211 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:18:05.770621 128555315347264 generateRocpd.cpp:583] writing SQL database for process 2528295 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:18:05.771406 128555315347264 generateRocpd.cpp:606] Opened result file: tests/workloads/sort_kernels/MI200/out/pmc_1/smc4124-25-mi210-3c48/2528295_results.db (UUID=0001fa85-a2b7-72b7-810f-89b1dc40a50a)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:18:05.854058 128555315347264 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008039 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:18:05.855241 128555315347264 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001166 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:18:05.856923 128555315347264 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001668 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:18:05.867592 128555315347264 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008495 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:18:06.186126 128555315347264 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.318519 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:18:06.188568 128555315347264 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002412 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:18:06.188586 128555315347264 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:18:06.198284 128555315347264 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009691 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:18:06.198298 128555315347264 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:18:06.198305 128555315347264 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:18:06.198311 128555315347264 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:18:06.198434 128555315347264 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000092 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:18:06.198612 128555315347264 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.427991 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:18:06.201642 128555315347264 simple_timer.cpp:55] [rocprofv3] output generation ::     0.451531 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:18:06.201735 128555315347264 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.452937 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/sort_kernels/MI200/out/pmc_1/2528295_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
PC sampling data collection skipped as block 21 is not specified.
[roofline] Checking for roofline.csv in tests/workloads/sort_kernels/MI200
[roofline] Benchmark execution failed: 'L1'. Skipping roofline.
