Rocprofiler-Compute version: 3.7.0
Profiler choice: rocprofiler-sdk
Output directory: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/mem_levels_HBM_LDS/MI200
Target: MI210
Command: ./tests/vcopy -n 1048576 -b 256 -i 3
Kernel Selection: None
Dispatch Selection: None
Filtered sections: All

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Collecting Performance Counters
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Generating native tool project using command: cmake -S /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib -B /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build
-- Checking for module 'libdw'
--   Package 'libdw', required by 'virtual:world', not found
-- Could NOT find libdw (missing: libdw_LIBRARY libdw_INCLUDE_DIR)
-- {fmt} version: 12.1.0
-- Build type:
-- Configuring done (0.1s)
-- Generating done (0.0s)
-- Build files have been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build
Building native tool using command: cmake --build /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build --parallel
[  0%] Built target gsl_assert
[ 33%] Built target fmt
[100%] Built target rocprofiler-compute-tool
Searching /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src by lib/_build/lib/librocprofiler-compute-tool.so for native collector
Using native collector: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build/lib/librocprofiler-compute-tool.so
Using native counter collection tool: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build/lib/librocprofiler-compute-tool.so
[profiling] Iteration multiplexing: Disabled
[Run 1/13][Approximate profiling time left: pending first measurement...]
[profiling] Current input file: tests/workloads/mem_levels_HBM_LDS/MI200/perfmon/pmc_perf_0.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:11:03.574814 123964030918464 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.191980 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:11:03.575428 123964030918464 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:03.779572 123964030918464 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:11:03.875374 123964030918464 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.299946 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:03.897997 123964030918464 generateRocpd.cpp:583] writing SQL database for process 2526738 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:11:03.898792 123964030918464 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_HBM_LDS/MI200/out/pmc_1/smc4124-25-mi210-3c48/2526738_results.db (UUID=0001fa7f-32b3-72b3-adb5-51f9ce6dca4c)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:03.982936 123964030918464 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008040 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:03.984136 123964030918464 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001184 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:03.985700 123964030918464 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001548 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:03.995836 123964030918464 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008196 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:04.319909 123964030918464 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.324058 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:04.322168 123964030918464 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002241 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:04.322186 123964030918464 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:04.331359 123964030918464 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009166 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:04.331374 123964030918464 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:04.331380 123964030918464 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:04.331386 123964030918464 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:04.331506 123964030918464 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000109 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:04.331713 123964030918464 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.433717 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:04.334662 123964030918464 simple_timer.cpp:55] [rocprofv3] output generation ::     0.457465 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:04.334762 123964030918464 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.459345 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_HBM_LDS/MI200/out/pmc_1/2526738_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 2/13][Approximate profiling time left: 25 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_HBM_LDS/MI200/perfmon/pmc_perf_1.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:11:05.890904 126349648686912 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.189271 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:11:05.891542 126349648686912 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:06.085700 126349648686912 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:11:06.174614 126349648686912 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.283072 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:06.197077 126349648686912 generateRocpd.cpp:583] writing SQL database for process 2526752 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:11:06.197871 126349648686912 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_HBM_LDS/MI200/out/pmc_1/smc4124-25-mi210-3c48/2526752_results.db (UUID=0001fa7f-3bc2-7bc2-882b-a54e3ed1501d)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:06.280911 126349648686912 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007864 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:06.282102 126349648686912 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001175 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:06.283652 126349648686912 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001535 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:06.294086 126349648686912 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008534 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:06.606748 126349648686912 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.312647 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:06.609162 126349648686912 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002382 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:06.609179 126349648686912 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:06.618175 126349648686912 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008988 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:06.618190 126349648686912 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:06.618196 126349648686912 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:06.618203 126349648686912 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:06.618321 126349648686912 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000110 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:06.618550 126349648686912 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.421473 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:06.621572 126349648686912 simple_timer.cpp:55] [rocprofv3] output generation ::     0.445278 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:06.621685 126349648686912 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.447024 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_HBM_LDS/MI200/out/pmc_1/2526752_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 3/13][Approximate profiling time left: 23 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_HBM_LDS/MI200/perfmon/pmc_perf_2.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:11:08.188706 137549858041664 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.188582 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:11:08.189286 137549858041664 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:08.382610 137549858041664 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:11:08.474617 137549858041664 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.285332 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:08.497190 137549858041664 generateRocpd.cpp:583] writing SQL database for process 2526764 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:11:08.497975 137549858041664 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_HBM_LDS/MI200/out/pmc_1/smc4124-25-mi210-3c48/2526764_results.db (UUID=0001fa7f-44bd-74bd-af4e-e3d968117e44)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:08.581861 137549858041664 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007974 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:08.583054 137549858041664 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001175 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:08.584728 137549858041664 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001659 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:08.595253 137549858041664 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008321 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:08.893667 137549858041664 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.298400 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:08.895994 137549858041664 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002301 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:08.896012 137549858041664 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:08.904883 137549858041664 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008864 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:08.904898 137549858041664 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:08.904904 137549858041664 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:08.904911 137549858041664 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:08.905067 137549858041664 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000120 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:08.905308 137549858041664 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.408118 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:08.908303 137549858041664 simple_timer.cpp:55] [rocprofv3] output generation ::     0.432009 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:08.908406 137549858041664 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.433740 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_HBM_LDS/MI200/out/pmc_1/2526764_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 4/13][Approximate profiling time left: 20 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_HBM_LDS/MI200/perfmon/pmc_perf_3.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:11:10.473358 132191619333952 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.192199 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:11:10.473934 132191619333952 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:10.668936 132191619333952 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:11:10.768957 132191619333952 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.295024 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:10.791280 132191619333952 generateRocpd.cpp:583] writing SQL database for process 2526772 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:11:10.792065 132191619333952 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_HBM_LDS/MI200/out/pmc_1/smc4124-25-mi210-3c48/2526772_results.db (UUID=0001fa7f-4da6-7da6-aa90-f0fd29776180)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:10.875630 132191619333952 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008013 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:10.876840 132191619333952 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001193 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:10.878985 132191619333952 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002129 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:10.889582 132191619333952 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008410 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:11.176272 132191619333952 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.286675 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:11.178558 132191619333952 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002268 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:11.178576 132191619333952 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:11.187411 132191619333952 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008828 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:11.187427 132191619333952 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:11.187433 132191619333952 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:11.187439 132191619333952 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:11.187585 132191619333952 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000101 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:11.187774 132191619333952 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.396494 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:11.190812 132191619333952 simple_timer.cpp:55] [rocprofv3] output generation ::     0.420340 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:11.190918 132191619333952 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.421923 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_HBM_LDS/MI200/out/pmc_1/2526772_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 5/13][Approximate profiling time left: 18 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_HBM_LDS/MI200/perfmon/pmc_perf_4.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:11:12.737629 138530775793472 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.190385 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:11:12.738212 138530775793472 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:12.931143 138530775793472 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:11:13.022805 138530775793472 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.284592 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:13.045451 138530775793472 generateRocpd.cpp:583] writing SQL database for process 2526782 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:11:13.046225 138530775793472 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_HBM_LDS/MI200/out/pmc_1/smc4124-25-mi210-3c48/2526782_results.db (UUID=0001fa7f-5680-7680-b681-616c79719bb7)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:13.129638 138530775793472 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007904 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:13.130764 138530775793472 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001109 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:13.132399 138530775793472 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001620 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:13.142627 138530775793472 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008234 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:13.422078 138530775793472 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.279435 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:13.424319 138530775793472 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002224 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:13.424336 138530775793472 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:13.433914 138530775793472 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009571 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:13.433928 138530775793472 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:13.433935 138530775793472 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:13.433941 138530775793472 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:13.434056 138530775793472 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000106 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:13.434254 138530775793472 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.388803 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:13.437272 138530775793472 simple_timer.cpp:55] [rocprofv3] output generation ::     0.412872 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:13.437355 138530775793472 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.414512 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_HBM_LDS/MI200/out/pmc_1/2526782_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 6/13][Approximate profiling time left: 16 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_HBM_LDS/MI200/perfmon/pmc_perf_5.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:11:14.965054 134787799334720 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.183613 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:11:14.965641 134787799334720 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:15.167142 134787799334720 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:11:15.257643 134787799334720 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.292003 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:15.280208 134787799334720 generateRocpd.cpp:583] writing SQL database for process 2526791 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:11:15.281003 134787799334720 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_HBM_LDS/MI200/out/pmc_1/smc4124-25-mi210-3c48/2526791_results.db (UUID=0001fa7f-5f3a-7f3a-aecf-0f659e44dcd1)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:15.361867 134787799334720 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007689 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:15.363083 134787799334720 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001199 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:15.364734 134787799334720 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001636 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:15.374842 134787799334720 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008185 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:15.383163 134787799334720 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.008306 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:15.385188 134787799334720 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002011 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:15.385205 134787799334720 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:15.393495 134787799334720 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008284 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:15.393509 134787799334720 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:15.393515 134787799334720 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:15.393521 134787799334720 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:15.393625 134787799334720 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000096 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:15.393817 134787799334720 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.113609 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:15.397063 134787799334720 simple_timer.cpp:55] [rocprofv3] output generation ::     0.137729 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:15.397104 134787799334720 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.139419 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_HBM_LDS/MI200/out/pmc_1/2526791_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 7/13][Approximate profiling time left: 13 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_HBM_LDS/MI200/perfmon/pmc_perf_SQC_DCACHE_INFLIGHT_LEVEL_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:11:16.923320 129327570706240 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.191741 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:11:16.923932 129327570706240 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:17.116610 129327570706240 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:11:17.215148 129327570706240 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.291216 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:17.237426 129327570706240 generateRocpd.cpp:583] writing SQL database for process 2526799 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:11:17.238196 129327570706240 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_HBM_LDS/MI200/out/pmc_1/smc4124-25-mi210-3c48/2526799_results.db (UUID=0001fa7f-66d8-76d8-9571-000cdadf89ce)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:17.320571 129327570706240 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008042 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:17.321694 129327570706240 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001107 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:17.323591 129327570706240 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001882 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:17.334003 129327570706240 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008458 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:17.746080 129327570706240 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.412062 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:17.748331 129327570706240 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002234 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:17.748348 129327570706240 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:17.757321 129327570706240 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008966 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:17.757337 129327570706240 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:17.757344 129327570706240 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:17.757350 129327570706240 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:17.757461 129327570706240 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000103 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:17.757713 129327570706240 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.520287 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:17.760680 129327570706240 simple_timer.cpp:55] [rocprofv3] output generation ::     0.544118 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:17.760787 129327570706240 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.545603 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_HBM_LDS/MI200/out/pmc_1/2526799_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 8/13][Approximate profiling time left: 11 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_HBM_LDS/MI200/perfmon/pmc_perf_SQC_ICACHE_INFLIGHT_LEVEL_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:11:19.302558 128139450605376 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.190971 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:11:19.303157 128139450605376 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:19.497240 128139450605376 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:11:19.582918 128139450605376 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.279761 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:19.605317 128139450605376 generateRocpd.cpp:583] writing SQL database for process 2526807 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:11:19.606111 128139450605376 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_HBM_LDS/MI200/out/pmc_1/smc4124-25-mi210-3c48/2526807_results.db (UUID=0001fa7f-7024-7024-b012-54f5bbf11acb)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:19.689263 128139450605376 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008132 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:19.690490 128139450605376 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001209 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:19.692465 128139450605376 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001959 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:19.702874 128139450605376 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008423 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:20.106104 128139450605376 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.403215 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:20.108365 128139450605376 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002243 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:20.108384 128139450605376 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:20.118334 128139450605376 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009943 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:20.118351 128139450605376 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:20.118357 128139450605376 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:20.118363 128139450605376 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:20.118472 128139450605376 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000100 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:20.118676 128139450605376 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.513361 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:20.121697 128139450605376 simple_timer.cpp:55] [rocprofv3] output generation ::     0.537230 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:20.121820 128139450605376 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.538859 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_HBM_LDS/MI200/out/pmc_1/2526807_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 9/13][Approximate profiling time left: 9 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_HBM_LDS/MI200/perfmon/pmc_perf_SQ_IFETCH_LEVEL_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:11:21.728570 134433932255040 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.198784 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:11:21.729164 134433932255040 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:21.922763 134433932255040 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:11:22.010344 134433932255040 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.281180 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:22.032558 134433932255040 generateRocpd.cpp:583] writing SQL database for process 2526815 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:11:22.033359 134433932255040 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_HBM_LDS/MI200/out/pmc_1/smc4124-25-mi210-3c48/2526815_results.db (UUID=0001fa7f-7996-7996-bd11-a612f4faaed9)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:22.116820 134433932255040 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008239 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:22.118021 134433932255040 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001183 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:22.119995 134433932255040 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001944 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:22.130447 134433932255040 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008447 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:22.715665 134433932255040 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.585203 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:22.717971 134433932255040 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002285 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:22.717989 134433932255040 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000003 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:22.727005 134433932255040 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009009 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:22.727019 134433932255040 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:22.727025 134433932255040 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:22.727039 134433932255040 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:22.727154 134433932255040 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000106 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:22.727392 134433932255040 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.694834 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:22.730405 134433932255040 simple_timer.cpp:55] [rocprofv3] output generation ::     0.718706 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:22.730540 134433932255040 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.720137 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_HBM_LDS/MI200/out/pmc_1/2526815_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 10/13][Approximate profiling time left: 6 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_HBM_LDS/MI200/perfmon/pmc_perf_SQ_INST_LEVEL_LDS_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:11:24.295260 123266320973632 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.192969 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:11:24.295877 123266320973632 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:24.496320 123266320973632 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:11:24.585028 123266320973632 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.289151 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:24.607461 123266320973632 generateRocpd.cpp:583] writing SQL database for process 2526823 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:11:24.608275 123266320973632 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_HBM_LDS/MI200/out/pmc_1/smc4124-25-mi210-3c48/2526823_results.db (UUID=0001fa7f-83a3-73a3-b9b0-84195268cdb9)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:24.691605 123266320973632 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008171 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:24.692812 123266320973632 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001191 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:24.694957 123266320973632 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002131 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:24.705515 123266320973632 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008342 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:25.050469 123266320973632 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.344938 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:25.052831 123266320973632 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002344 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:25.052849 123266320973632 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:25.061417 123266320973632 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008561 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:25.061432 123266320973632 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:25.061438 123266320973632 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:25.061445 123266320973632 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:25.061551 123266320973632 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000099 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:25.061764 123266320973632 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.454304 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:25.064759 123266320973632 simple_timer.cpp:55] [rocprofv3] output generation ::     0.478179 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:25.064861 123266320973632 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.479759 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_HBM_LDS/MI200/out/pmc_1/2526823_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 11/13][Approximate profiling time left: 4 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_HBM_LDS/MI200/perfmon/pmc_perf_SQ_INST_LEVEL_SMEM_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:11:26.609783 139730578980672 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.191517 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:11:26.610358 139730578980672 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:26.804221 139730578980672 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:11:26.896781 139730578980672 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.286424 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:26.919289 139730578980672 generateRocpd.cpp:583] writing SQL database for process 2526832 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:11:26.920096 139730578980672 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_HBM_LDS/MI200/out/pmc_1/smc4124-25-mi210-3c48/2526832_results.db (UUID=0001fa7f-8caf-7caf-9819-28cce3796876)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:27.003313 139730578980672 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008069 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:27.004546 139730578980672 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001217 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:27.006710 139730578980672 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002149 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:27.017028 139730578980672 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008230 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:27.348567 139730578980672 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.331514 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:27.351210 139730578980672 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002622 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:27.351228 139730578980672 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:27.361020 139730578980672 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009784 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:27.361040 139730578980672 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:27.361047 139730578980672 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:27.361058 139730578980672 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:27.361174 139730578980672 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000107 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:27.361386 139730578980672 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.442097 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:27.364420 139730578980672 simple_timer.cpp:55] [rocprofv3] output generation ::     0.465961 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:27.364530 139730578980672 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.467700 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_HBM_LDS/MI200/out/pmc_1/2526832_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 12/13][Approximate profiling time left: 2 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_HBM_LDS/MI200/perfmon/pmc_perf_SQ_INST_LEVEL_VMEM_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:11:28.929698 125474861080384 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.196019 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:11:28.930280 125474861080384 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:29.123692 125474861080384 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:11:29.209654 125474861080384 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.279374 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:29.232117 125474861080384 generateRocpd.cpp:583] writing SQL database for process 2526841 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:11:29.232900 125474861080384 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_HBM_LDS/MI200/out/pmc_1/smc4124-25-mi210-3c48/2526841_results.db (UUID=0001fa7f-95ba-75ba-bb24-335455bebde2)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:29.315359 125474861080384 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008047 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:29.316538 125474861080384 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001162 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:29.318636 125474861080384 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002084 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:29.328873 125474861080384 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008264 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:29.846556 125474861080384 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.517669 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:29.848777 125474861080384 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002202 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:29.848795 125474861080384 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:29.857853 125474861080384 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009050 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:29.857868 125474861080384 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:29.857874 125474861080384 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:29.857881 125474861080384 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:29.857996 125474861080384 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000103 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:29.858236 125474861080384 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.626120 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:29.861282 125474861080384 simple_timer.cpp:55] [rocprofv3] output generation ::     0.649901 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:29.861409 125474861080384 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.651712 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_HBM_LDS/MI200/out/pmc_1/2526841_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 13/13][Approximate profiling time left: 0 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_HBM_LDS/MI200/perfmon/pmc_perf_SQ_LEVEL_WAVES_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:11:31.411153 128996950212416 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.192932 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:11:31.411792 128996950212416 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:31.608494 128996950212416 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:11:31.703122 128996950212416 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.291330 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:31.725419 128996950212416 generateRocpd.cpp:583] writing SQL database for process 2526849 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:11:31.726224 128996950212416 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_HBM_LDS/MI200/out/pmc_1/smc4124-25-mi210-3c48/2526849_results.db (UUID=0001fa7f-9f6f-7f6f-a071-6977cab3e459)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:31.808873 128996950212416 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007908 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:31.810087 128996950212416 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001197 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:31.811772 128996950212416 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001671 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:31.822256 128996950212416 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008241 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:32.140308 128996950212416 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.318036 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:32.142567 128996950212416 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002244 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:32.142584 128996950212416 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:32.151296 128996950212416 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008704 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:32.151311 128996950212416 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:32.151318 128996950212416 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:32.151324 128996950212416 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:32.151452 128996950212416 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000117 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:32.151713 128996950212416 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.426295 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:32.154761 128996950212416 simple_timer.cpp:55] [rocprofv3] output generation ::     0.450142 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:11:32.154872 128996950212416 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.451702 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_HBM_LDS/MI200/out/pmc_1/2526849_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
PC sampling data collection skipped as block 21 is not specified.
[roofline] Checking for roofline.csv in tests/workloads/mem_levels_HBM_LDS/MI200
[roofline] Benchmark execution failed: 'L1'. Skipping roofline.
