Rocprofiler-Compute version: 3.7.0
Profiler choice: rocprofiler-sdk
Output directory: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/vcopy/MI200
Target: MI210
Command: ./tests/vcopy -n 1048576 -b 256 -i 3
Kernel Selection: None
Dispatch Selection: None
Filtered sections: All

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Collecting Performance Counters
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Generating native tool project using command: cmake -S /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib -B /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build
-- Checking for module 'libdw'
--   Package 'libdw', required by 'virtual:world', not found
-- Could NOT find libdw (missing: libdw_LIBRARY libdw_INCLUDE_DIR)
-- {fmt} version: 12.1.0
-- Build type:
-- Configuring done (0.1s)
-- Generating done (0.0s)
-- Build files have been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build
Building native tool using command: cmake --build /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build --parallel
[  0%] Built target gsl_assert
[ 33%] Built target fmt
[100%] Built target rocprofiler-compute-tool
Searching /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src by lib/_build/lib/librocprofiler-compute-tool.so for native collector
Using native collector: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build/lib/librocprofiler-compute-tool.so
Using native counter collection tool: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build/lib/librocprofiler-compute-tool.so
[profiling] Iteration multiplexing: Disabled
[Run 1/13][Approximate profiling time left: pending first measurement...]
[profiling] Current input file: tests/workloads/vcopy/MI200/perfmon/pmc_perf_0.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 19:06:22.377184 132597270015808 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.190507 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 19:06:22.377808 132597270015808 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000001 sec
   |-> [rocprofiler-sdk] W20260526 19:06:22.570959 132597270015808 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 19:06:22.653525 132597270015808 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.275717 sec
   |-> [rocprofiler-sdk] W20260526 19:06:22.675665 132597270015808 generateRocpd.cpp:583] writing SQL database for process 2525720 on node 2976770398
   |-> [rocprofiler-sdk] E20260526 19:06:22.676477 132597270015808 generateRocpd.cpp:606] Opened result file: tests/workloads/vcopy/MI200/out/pmc_1/smc4124-25-mi210-3c48/2525720_results.db (UUID=0001fa7a-e847-7847-a3ce-6b658a65651b)
   |-> [rocprofiler-sdk] W20260526 19:06:22.756842 132597270015808 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008015 sec
   |-> [rocprofiler-sdk] W20260526 19:06:22.758020 132597270015808 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001161 sec
   |-> [rocprofiler-sdk] W20260526 19:06:22.759613 132597270015808 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001570 sec
   |-> [rocprofiler-sdk] W20260526 19:06:22.769750 132597270015808 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008198 sec
   |-> [rocprofiler-sdk] W20260526 19:06:23.095452 132597270015808 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.325686 sec
   |-> [rocprofiler-sdk] W20260526 19:06:23.097641 132597270015808 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002173 sec
   |-> [rocprofiler-sdk] W20260526 19:06:23.097659 132597270015808 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] W20260526 19:06:23.106920 132597270015808 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009253 sec
   |-> [rocprofiler-sdk] W20260526 19:06:23.106936 132597270015808 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:06:23.106943 132597270015808 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:06:23.106950 132597270015808 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] W20260526 19:06:23.107077 132597270015808 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000119 sec
   |-> [rocprofiler-sdk] W20260526 19:06:23.107308 132597270015808 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.431643 sec
   |-> [rocprofiler-sdk] W20260526 19:06:23.110337 132597270015808 simple_timer.cpp:55] [rocprofv3] output generation ::     0.455376 sec
   |-> [rocprofiler-sdk] W20260526 19:06:23.110452 132597270015808 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.456876 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/vcopy/MI200/out/pmc_1/2525720_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 2/13][Approximate profiling time left: 25 seconds]...
[profiling] Current input file: tests/workloads/vcopy/MI200/perfmon/pmc_perf_1.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 19:06:24.665134 123359110176576 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.190461 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 19:06:24.665731 123359110176576 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:06:24.858439 123359110176576 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 19:06:24.942468 123359110176576 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.276737 sec
   |-> [rocprofiler-sdk] W20260526 19:06:24.965132 123359110176576 generateRocpd.cpp:583] writing SQL database for process 2525729 on node 2976770398
   |-> [rocprofiler-sdk] E20260526 19:06:24.965908 123359110176576 generateRocpd.cpp:606] Opened result file: tests/workloads/vcopy/MI200/out/pmc_1/smc4124-25-mi210-3c48/2525729_results.db (UUID=0001fa7a-f137-7137-b112-f97f8eab414f)
   |-> [rocprofiler-sdk] W20260526 19:06:25.046874 123359110176576 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008014 sec
   |-> [rocprofiler-sdk] W20260526 19:06:25.047990 123359110176576 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001098 sec
   |-> [rocprofiler-sdk] W20260526 19:06:25.049588 123359110176576 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001583 sec
   |-> [rocprofiler-sdk] W20260526 19:06:25.059728 123359110176576 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008184 sec
   |-> [rocprofiler-sdk] W20260526 19:06:25.372594 123359110176576 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.312851 sec
   |-> [rocprofiler-sdk] W20260526 19:06:25.374904 123359110176576 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002294 sec
   |-> [rocprofiler-sdk] W20260526 19:06:25.374922 123359110176576 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] W20260526 19:06:25.384592 123359110176576 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009663 sec
   |-> [rocprofiler-sdk] W20260526 19:06:25.384608 123359110176576 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:06:25.384615 123359110176576 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:06:25.384621 123359110176576 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] W20260526 19:06:25.384733 123359110176576 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000104 sec
   |-> [rocprofiler-sdk] W20260526 19:06:25.384946 123359110176576 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.419814 sec
   |-> [rocprofiler-sdk] W20260526 19:06:25.388039 123359110176576 simple_timer.cpp:55] [rocprofv3] output generation ::     0.444163 sec
   |-> [rocprofiler-sdk] W20260526 19:06:25.388143 123359110176576 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.445633 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/vcopy/MI200/out/pmc_1/2525729_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 3/13][Approximate profiling time left: 22 seconds]...
[profiling] Current input file: tests/workloads/vcopy/MI200/perfmon/pmc_perf_2.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 19:06:26.956867 133052991516480 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.189421 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 19:06:26.957482 133052991516480 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:06:27.151275 133052991516480 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 19:06:27.248450 133052991516480 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.290968 sec
   |-> [rocprofiler-sdk] W20260526 19:06:27.270954 133052991516480 generateRocpd.cpp:583] writing SQL database for process 2525737 on node 2976770398
   |-> [rocprofiler-sdk] E20260526 19:06:27.271748 133052991516480 generateRocpd.cpp:606] Opened result file: tests/workloads/vcopy/MI200/out/pmc_1/smc4124-25-mi210-3c48/2525737_results.db (UUID=0001fa7a-fa2c-7a2c-99b7-36df04c58621)
   |-> [rocprofiler-sdk] W20260526 19:06:27.353732 133052991516480 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007955 sec
   |-> [rocprofiler-sdk] W20260526 19:06:27.354961 133052991516480 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001214 sec
   |-> [rocprofiler-sdk] W20260526 19:06:27.356645 133052991516480 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001668 sec
   |-> [rocprofiler-sdk] W20260526 19:06:27.366829 133052991516480 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008231 sec
   |-> [rocprofiler-sdk] W20260526 19:06:27.673788 133052991516480 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.306943 sec
   |-> [rocprofiler-sdk] W20260526 19:06:27.676094 133052991516480 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002280 sec
   |-> [rocprofiler-sdk] W20260526 19:06:27.676112 133052991516480 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] W20260526 19:06:27.686057 133052991516480 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009939 sec
   |-> [rocprofiler-sdk] W20260526 19:06:27.686072 133052991516480 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:06:27.686079 133052991516480 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:06:27.686085 133052991516480 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] W20260526 19:06:27.686211 133052991516480 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000118 sec
   |-> [rocprofiler-sdk] W20260526 19:06:27.686472 133052991516480 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.415518 sec
   |-> [rocprofiler-sdk] W20260526 19:06:27.689484 133052991516480 simple_timer.cpp:55] [rocprofv3] output generation ::     0.439560 sec
   |-> [rocprofiler-sdk] W20260526 19:06:27.689588 133052991516480 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.441088 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/vcopy/MI200/out/pmc_1/2525737_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 4/13][Approximate profiling time left: 20 seconds]...
[profiling] Current input file: tests/workloads/vcopy/MI200/perfmon/pmc_perf_3.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 19:06:29.210593 133094638477120 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.190798 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 19:06:29.211223 133094638477120 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000001 sec
   |-> [rocprofiler-sdk] W20260526 19:06:29.406345 133094638477120 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 19:06:29.502696 133094638477120 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.291474 sec
   |-> [rocprofiler-sdk] W20260526 19:06:29.524832 133094638477120 generateRocpd.cpp:583] writing SQL database for process 2525746 on node 2976770398
   |-> [rocprofiler-sdk] E20260526 19:06:29.525633 133094638477120 generateRocpd.cpp:606] Opened result file: tests/workloads/vcopy/MI200/out/pmc_1/smc4124-25-mi210-3c48/2525746_results.db (UUID=0001fa7b-02f8-72f8-8bc8-4c31155c8896)
   |-> [rocprofiler-sdk] W20260526 19:06:29.608568 133094638477120 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008044 sec
   |-> [rocprofiler-sdk] W20260526 19:06:29.609773 133094638477120 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001189 sec
   |-> [rocprofiler-sdk] W20260526 19:06:29.611843 133094638477120 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002055 sec
   |-> [rocprofiler-sdk] W20260526 19:06:29.622121 133094638477120 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008296 sec
   |-> [rocprofiler-sdk] W20260526 19:06:29.908393 133094638477120 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.286258 sec
   |-> [rocprofiler-sdk] W20260526 19:06:29.910982 133094638477120 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002573 sec
   |-> [rocprofiler-sdk] W20260526 19:06:29.911001 133094638477120 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] W20260526 19:06:29.919810 133094638477120 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008801 sec
   |-> [rocprofiler-sdk] W20260526 19:06:29.919827 133094638477120 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:06:29.919833 133094638477120 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:06:29.919841 133094638477120 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] W20260526 19:06:29.919965 133094638477120 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000092 sec
   |-> [rocprofiler-sdk] W20260526 19:06:29.920183 133094638477120 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.395352 sec
   |-> [rocprofiler-sdk] W20260526 19:06:29.923247 133094638477120 simple_timer.cpp:55] [rocprofv3] output generation ::     0.419190 sec
   |-> [rocprofiler-sdk] W20260526 19:06:29.923358 133094638477120 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.420612 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/vcopy/MI200/out/pmc_1/2525746_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 5/13][Approximate profiling time left: 18 seconds]...
[profiling] Current input file: tests/workloads/vcopy/MI200/perfmon/pmc_perf_4.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 19:06:31.469134 138799959392064 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.192809 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 19:06:31.469741 138799959392064 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:06:31.664109 138799959392064 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 19:06:31.752494 138799959392064 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.282753 sec
   |-> [rocprofiler-sdk] W20260526 19:06:31.774668 138799959392064 generateRocpd.cpp:583] writing SQL database for process 2525755 on node 2976770398
   |-> [rocprofiler-sdk] E20260526 19:06:31.775466 138799959392064 generateRocpd.cpp:606] Opened result file: tests/workloads/vcopy/MI200/out/pmc_1/smc4124-25-mi210-3c48/2525755_results.db (UUID=0001fa7b-0bc9-7bc9-8526-70d6feb70888)
   |-> [rocprofiler-sdk] W20260526 19:06:31.858358 138799959392064 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007996 sec
   |-> [rocprofiler-sdk] W20260526 19:06:31.859587 138799959392064 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001213 sec
   |-> [rocprofiler-sdk] W20260526 19:06:31.861173 138799959392064 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001572 sec
   |-> [rocprofiler-sdk] W20260526 19:06:31.871493 138799959392064 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008352 sec
   |-> [rocprofiler-sdk] W20260526 19:06:32.150820 138799959392064 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.279312 sec
   |-> [rocprofiler-sdk] W20260526 19:06:32.153132 138799959392064 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002296 sec
   |-> [rocprofiler-sdk] W20260526 19:06:32.153150 138799959392064 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] W20260526 19:06:32.162146 138799959392064 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008989 sec
   |-> [rocprofiler-sdk] W20260526 19:06:32.162162 138799959392064 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:06:32.162168 138799959392064 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:06:32.162175 138799959392064 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] W20260526 19:06:32.162290 138799959392064 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000108 sec
   |-> [rocprofiler-sdk] W20260526 19:06:32.162517 138799959392064 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.387849 sec
   |-> [rocprofiler-sdk] W20260526 19:06:32.165505 138799959392064 simple_timer.cpp:55] [rocprofv3] output generation ::     0.411561 sec
   |-> [rocprofiler-sdk] W20260526 19:06:32.165593 138799959392064 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.413050 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/vcopy/MI200/out/pmc_1/2525755_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 6/13][Approximate profiling time left: 15 seconds]...
[profiling] Current input file: tests/workloads/vcopy/MI200/perfmon/pmc_perf_5.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 19:06:33.657098 128106143014720 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.182609 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 19:06:33.657681 128106143014720 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:06:33.850313 128106143014720 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 19:06:33.946172 128106143014720 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.288491 sec
   |-> [rocprofiler-sdk] W20260526 19:06:33.967916 128106143014720 generateRocpd.cpp:583] writing SQL database for process 2525763 on node 2976770398
   |-> [rocprofiler-sdk] E20260526 19:06:33.968718 128106143014720 generateRocpd.cpp:606] Opened result file: tests/workloads/vcopy/MI200/out/pmc_1/smc4124-25-mi210-3c48/2525763_results.db (UUID=0001fa7b-145f-745f-847d-5de670106db4)
   |-> [rocprofiler-sdk] W20260526 19:06:34.051487 128106143014720 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007669 sec
   |-> [rocprofiler-sdk] W20260526 19:06:34.052700 128106143014720 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001197 sec
   |-> [rocprofiler-sdk] W20260526 19:06:34.054413 128106143014720 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001698 sec
   |-> [rocprofiler-sdk] W20260526 19:06:34.064956 128106143014720 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008351 sec
   |-> [rocprofiler-sdk] W20260526 19:06:34.073551 128106143014720 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.008581 sec
   |-> [rocprofiler-sdk] W20260526 19:06:34.075743 128106143014720 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002177 sec
   |-> [rocprofiler-sdk] W20260526 19:06:34.075761 128106143014720 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] W20260526 19:06:34.084406 128106143014720 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008638 sec
   |-> [rocprofiler-sdk] W20260526 19:06:34.084421 128106143014720 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:06:34.084427 128106143014720 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:06:34.084434 128106143014720 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] W20260526 19:06:34.084536 128106143014720 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000094 sec
   |-> [rocprofiler-sdk] W20260526 19:06:34.084736 128106143014720 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.116820 sec
   |-> [rocprofiler-sdk] W20260526 19:06:34.087556 128106143014720 simple_timer.cpp:55] [rocprofv3] output generation ::     0.139791 sec
   |-> [rocprofiler-sdk] W20260526 19:06:34.087603 128106143014720 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.141384 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/vcopy/MI200/out/pmc_1/2525763_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 7/13][Approximate profiling time left: 13 seconds]...
[profiling] Current input file: tests/workloads/vcopy/MI200/perfmon/pmc_perf_SQC_DCACHE_INFLIGHT_LEVEL_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:06:35.603270 125124977172288 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.192933 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:06:35.603894 125124977172288 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:35.798419 125124977172288 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:06:35.886672 125124977172288 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.282778 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:35.908811 125124977172288 generateRocpd.cpp:583] writing SQL database for process 2525771 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:06:35.909614 125124977172288 generateRocpd.cpp:606] Opened result file: tests/workloads/vcopy/MI200/out/pmc_1/smc4124-25-mi210-3c48/2525771_results.db (UUID=0001fa7b-1bef-7bef-9e34-5bc206bd3d1e)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:35.992870 125124977172288 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008137 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:35.994110 125124977172288 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001224 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:35.996098 125124977172288 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001972 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:36.006581 125124977172288 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008472 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:36.416870 125124977172288 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.410274 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:36.419704 125124977172288 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002810 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:36.419723 125124977172288 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:36.428337 125124977172288 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008607 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:36.428351 125124977172288 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:36.428357 125124977172288 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:36.428364 125124977172288 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:36.428472 125124977172288 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000101 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:36.428692 125124977172288 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.519881 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:36.431781 125124977172288 simple_timer.cpp:55] [rocprofv3] output generation ::     0.543745 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:36.431898 125124977172288 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.545178 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/vcopy/MI200/out/pmc_1/2525771_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 8/13][Approximate profiling time left: 11 seconds]...
[profiling] Current input file: tests/workloads/vcopy/MI200/perfmon/pmc_perf_SQC_ICACHE_INFLIGHT_LEVEL_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:06:37.955400 131584774848320 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.191958 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:06:37.955991 131584774848320 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:38.150063 131584774848320 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:06:38.240291 131584774848320 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.284301 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:38.263342 131584774848320 generateRocpd.cpp:583] writing SQL database for process 2525779 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:06:38.264136 131584774848320 generateRocpd.cpp:606] Opened result file: tests/workloads/vcopy/MI200/out/pmc_1/smc4124-25-mi210-3c48/2525779_results.db (UUID=0001fa7b-2520-7520-a8b8-bd8d47ea328f)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:38.346313 131584774848320 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008100 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:38.347516 131584774848320 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001186 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:38.349625 131584774848320 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002094 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:38.359861 131584774848320 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008216 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:38.764698 131584774848320 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.404821 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:38.767020 131584774848320 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002299 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:38.767044 131584774848320 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:38.775627 131584774848320 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008573 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:38.775643 131584774848320 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:38.775649 131584774848320 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:38.775656 131584774848320 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:38.775762 131584774848320 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000100 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:38.775987 131584774848320 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.512646 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:38.779006 131584774848320 simple_timer.cpp:55] [rocprofv3] output generation ::     0.537163 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:38.779118 131584774848320 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.538790 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/vcopy/MI200/out/pmc_1/2525779_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 9/13][Approximate profiling time left: 8 seconds]...
[profiling] Current input file: tests/workloads/vcopy/MI200/perfmon/pmc_perf_SQ_IFETCH_LEVEL_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:06:40.362651 136166128238400 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.197907 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:06:40.363250 136166128238400 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:40.559705 136166128238400 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:06:40.646404 136166128238400 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.283154 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:40.668944 136166128238400 generateRocpd.cpp:583] writing SQL database for process 2525787 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:06:40.669735 136166128238400 generateRocpd.cpp:606] Opened result file: tests/workloads/vcopy/MI200/out/pmc_1/smc4124-25-mi210-3c48/2525787_results.db (UUID=0001fa7b-2e81-7e81-ac53-c60694a217f0)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:40.753653 136166128238400 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008163 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:40.754863 136166128238400 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001194 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:40.757005 136166128238400 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002127 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:40.767326 136166128238400 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008261 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:41.351449 136166128238400 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.584107 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:41.353841 136166128238400 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002366 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:41.353858 136166128238400 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000003 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:41.362928 136166128238400 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009063 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:41.362948 136166128238400 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:41.362954 136166128238400 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:41.362961 136166128238400 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:41.363138 136166128238400 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000142 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:41.363438 136166128238400 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.694495 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:41.366470 136166128238400 simple_timer.cpp:55] [rocprofv3] output generation ::     0.718578 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:41.366605 136166128238400 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.720154 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/vcopy/MI200/out/pmc_1/2525787_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 10/13][Approximate profiling time left: 6 seconds]...
[profiling] Current input file: tests/workloads/vcopy/MI200/perfmon/pmc_perf_SQ_INST_LEVEL_LDS_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:06:42.903198 129925934227264 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.189416 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:06:42.903789 129925934227264 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:43.096963 129925934227264 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:06:43.188264 129925934227264 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.284475 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:43.210786 129925934227264 generateRocpd.cpp:583] writing SQL database for process 2525796 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:06:43.211633 129925934227264 generateRocpd.cpp:606] Opened result file: tests/workloads/vcopy/MI200/out/pmc_1/smc4124-25-mi210-3c48/2525796_results.db (UUID=0001fa7b-3876-7876-9530-90c594909d3c)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:43.293905 129925934227264 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008019 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:43.295120 129925934227264 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001199 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:43.297248 129925934227264 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002112 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:43.307562 129925934227264 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008253 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:43.658993 129925934227264 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.351415 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:43.661317 129925934227264 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002307 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:43.661335 129925934227264 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:43.669907 129925934227264 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008566 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:43.669922 129925934227264 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:43.669928 129925934227264 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:43.669934 129925934227264 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:43.670050 129925934227264 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000108 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:43.670240 129925934227264 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.459455 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:43.673211 129925934227264 simple_timer.cpp:55] [rocprofv3] output generation ::     0.483347 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:43.673302 129925934227264 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.484992 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/vcopy/MI200/out/pmc_1/2525796_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 11/13][Approximate profiling time left: 4 seconds]...
[profiling] Current input file: tests/workloads/vcopy/MI200/perfmon/pmc_perf_SQ_INST_LEVEL_SMEM_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:06:45.195498 130413995786048 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.189919 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:06:45.196115 130413995786048 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:45.390360 130413995786048 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:06:45.486369 130413995786048 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.290255 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:45.508631 130413995786048 generateRocpd.cpp:583] writing SQL database for process 2525804 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:06:45.509469 130413995786048 generateRocpd.cpp:606] Opened result file: tests/workloads/vcopy/MI200/out/pmc_1/smc4124-25-mi210-3c48/2525804_results.db (UUID=0001fa7b-416a-716a-a645-06e5b8916351)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:45.590775 130413995786048 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008047 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:45.591962 130413995786048 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001171 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:45.594088 130413995786048 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002111 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:45.604198 130413995786048 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008150 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:45.934063 130413995786048 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.329851 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:45.936398 130413995786048 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002318 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:45.936415 130413995786048 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:45.944967 130413995786048 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008545 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:45.944981 130413995786048 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:45.944987 130413995786048 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:45.944994 130413995786048 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:45.945105 130413995786048 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000103 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:45.945293 130413995786048 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.436662 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:45.948222 130413995786048 simple_timer.cpp:55] [rocprofv3] output generation ::     0.460323 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:45.948322 130413995786048 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.461902 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/vcopy/MI200/out/pmc_1/2525804_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 12/13][Approximate profiling time left: 2 seconds]...
[profiling] Current input file: tests/workloads/vcopy/MI200/perfmon/pmc_perf_SQ_INST_LEVEL_VMEM_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:06:47.526258 138169686122304 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.198460 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:06:47.526825 138169686122304 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:47.722813 138169686122304 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:06:47.806607 138169686122304 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.279782 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:47.828940 138169686122304 generateRocpd.cpp:583] writing SQL database for process 2525812 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:06:47.829741 138169686122304 generateRocpd.cpp:606] Opened result file: tests/workloads/vcopy/MI200/out/pmc_1/smc4124-25-mi210-3c48/2525812_results.db (UUID=0001fa7b-4a7c-7a7c-85aa-696327407e49)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:47.913370 138169686122304 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008209 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:47.914587 138169686122304 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001200 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:47.916734 138169686122304 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002132 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:47.927167 138169686122304 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008357 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:48.449832 138169686122304 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.522650 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:48.452058 138169686122304 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002206 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:48.452076 138169686122304 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:48.461243 138169686122304 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009160 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:48.461257 138169686122304 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:48.461263 138169686122304 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:48.461270 138169686122304 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:48.461403 138169686122304 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000122 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:48.461679 138169686122304 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.632739 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:48.464694 138169686122304 simple_timer.cpp:55] [rocprofv3] output generation ::     0.656645 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:48.464813 138169686122304 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.658155 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/vcopy/MI200/out/pmc_1/2525812_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 13/13][Approximate profiling time left: 0 seconds]...
[profiling] Current input file: tests/workloads/vcopy/MI200/perfmon/pmc_perf_SQ_LEVEL_WAVES_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:06:50.015287 136093135937344 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.194949 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:06:50.015907 136093135937344 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:50.212489 136093135937344 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:06:50.306345 136093135937344 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.290439 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:06:50.328860 136093135937344 generateRocpd.cpp:583] writing SQL database for process 2525821 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:06:50.329668 136093135937344 generateRocpd.cpp:606] Opened result file: tests/workloads/vcopy/MI200/out/pmc_1/smc4124-25-mi210-3c48/2525821_results.db (UUID=0001fa7b-5439-7439-aae9-21eb2d9f9a0a)
   |-> [rocprofiler-sdk] W20260526 19:06:50.413028 136093135937344 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008039 sec
   |-> [rocprofiler-sdk] W20260526 19:06:50.414242 136093135937344 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001188 sec
   |-> [rocprofiler-sdk] W20260526 19:06:50.415950 136093135937344 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001693 sec
   |-> [rocprofiler-sdk] W20260526 19:06:50.426589 136093135937344 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008443 sec
   |-> [rocprofiler-sdk] W20260526 19:06:50.755079 136093135937344 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.328475 sec
   |-> [rocprofiler-sdk] W20260526 19:06:50.757440 136093135937344 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002346 sec
   |-> [rocprofiler-sdk] W20260526 19:06:50.757457 136093135937344 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] W20260526 19:06:50.766047 136093135937344 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008582 sec
   |-> [rocprofiler-sdk] W20260526 19:06:50.766061 136093135937344 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:06:50.766068 136093135937344 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000001 sec
   |-> [rocprofiler-sdk] W20260526 19:06:50.766074 136093135937344 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] W20260526 19:06:50.766191 136093135937344 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000110 sec
   |-> [rocprofiler-sdk] W20260526 19:06:50.766397 136093135937344 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.437538 sec
   |-> [rocprofiler-sdk] W20260526 19:06:50.769378 136093135937344 simple_timer.cpp:55] [rocprofv3] output generation ::     0.461455 sec
   |-> [rocprofiler-sdk] W20260526 19:06:50.769478 136093135937344 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.463084 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/vcopy/MI200/out/pmc_1/2525821_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
PC sampling data collection skipped as block 21 is not specified.
[roofline] Checking for roofline.csv in tests/workloads/vcopy/MI200
[roofline] Benchmark execution failed: 'L1'. Skipping roofline.
