Rocprofiler-Compute version: 3.7.0
Profiler choice: rocprofiler-sdk
Output directory: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/mem_levels_vL1D/MI200
Target: MI210
Command: ./tests/vcopy -n 1048576 -b 256 -i 3
Kernel Selection: None
Dispatch Selection: None
Filtered sections: All

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Collecting Performance Counters
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Generating native tool project using command: cmake -S /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib -B /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build
-- Checking for module 'libdw'
--   Package 'libdw', required by 'virtual:world', not found
-- Could NOT find libdw (missing: libdw_LIBRARY libdw_INCLUDE_DIR)
-- {fmt} version: 12.1.0
-- Build type:
-- Configuring done (0.1s)
-- Generating done (0.0s)
-- Build files have been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build
Building native tool using command: cmake --build /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build --parallel
[  0%] Built target gsl_assert
[ 33%] Built target fmt
[100%] Built target rocprofiler-compute-tool
Searching /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src by lib/_build/lib/librocprofiler-compute-tool.so for native collector
Using native collector: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build/lib/librocprofiler-compute-tool.so
Using native counter collection tool: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build/lib/librocprofiler-compute-tool.so
[profiling] Iteration multiplexing: Disabled
[Run 1/13][Approximate profiling time left: pending first measurement...]
[profiling] Current input file: tests/workloads/mem_levels_vL1D/MI200/perfmon/pmc_perf_0.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:14:48.883337 131612440796992 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.191467 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:14:48.883973 131612440796992 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:49.078126 131612440796992 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:14:49.160585 131612440796992 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.276612 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:49.182820 131612440796992 generateRocpd.cpp:583] writing SQL database for process 2527672 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:14:49.183637 131612440796992 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_vL1D/MI200/out/pmc_1/smc4124-25-mi210-3c48/2527672_results.db (UUID=0001fa82-a2d0-72d0-bb19-910c57d09d5c)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:49.268710 131612440796992 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007956 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:49.269887 131612440796992 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001161 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:49.271536 131612440796992 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001634 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:49.282083 131612440796992 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008449 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:49.608770 131612440796992 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.326672 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:49.611068 131612440796992 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002273 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:49.611087 131612440796992 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:49.620617 131612440796992 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009523 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:49.620634 131612440796992 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:49.620642 131612440796992 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:49.620649 131612440796992 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:49.620807 131612440796992 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000124 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:49.621046 131612440796992 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.438226 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:49.624550 131612440796992 simple_timer.cpp:55] [rocprofv3] output generation ::     0.462596 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:49.624651 131612440796992 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.464017 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_vL1D/MI200/out/pmc_1/2527672_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 2/13][Approximate profiling time left: 25 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_vL1D/MI200/perfmon/pmc_perf_1.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:14:51.176409 125994273226560 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.189145 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:14:51.177043 125994273226560 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:51.370285 125994273226560 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:14:51.466331 125994273226560 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.289288 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:51.488855 125994273226560 generateRocpd.cpp:583] writing SQL database for process 2527681 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:14:51.489677 125994273226560 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_vL1D/MI200/out/pmc_1/smc4124-25-mi210-3c48/2527681_results.db (UUID=0001fa82-abc8-7bc8-96c0-3d8b70857d48)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:51.574228 125994273226560 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007954 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:51.575432 125994273226560 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001189 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:51.577044 125994273226560 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001598 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:51.587399 125994273226560 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008342 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:51.899160 125994273226560 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.311746 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:51.901534 125994273226560 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002350 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:51.901551 125994273226560 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:51.910559 125994273226560 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:51.910573 125994273226560 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:51.910579 125994273226560 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:51.910586 125994273226560 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:51.910707 125994273226560 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000110 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:51.910962 125994273226560 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.422107 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:51.914042 125994273226560 simple_timer.cpp:55] [rocprofv3] output generation ::     0.446222 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:51.914143 125994273226560 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.447762 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_vL1D/MI200/out/pmc_1/2527681_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 3/13][Approximate profiling time left: 22 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_vL1D/MI200/perfmon/pmc_perf_2.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:14:53.476332 128377030926144 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.189253 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:14:53.476940 128377030926144 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:53.671926 128377030926144 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:14:53.753720 128377030926144 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.276780 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:53.776383 128377030926144 generateRocpd.cpp:583] writing SQL database for process 2527690 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:14:53.777195 128377030926144 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_vL1D/MI200/out/pmc_1/smc4124-25-mi210-3c48/2527690_results.db (UUID=0001fa82-b4c3-74c3-a3e2-9fe3c96af62f)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:53.860932 128377030926144 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008043 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:53.862148 128377030926144 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001200 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:53.863729 128377030926144 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001566 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:53.874284 128377030926144 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008571 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:54.175229 128377030926144 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.300930 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:54.177933 128377030926144 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002687 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:54.177950 128377030926144 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:54.188230 128377030926144 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.010273 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:54.188248 128377030926144 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:54.188256 128377030926144 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:54.188270 128377030926144 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:54.188399 128377030926144 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000122 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:54.188656 128377030926144 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.412273 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:54.192105 128377030926144 simple_timer.cpp:55] [rocprofv3] output generation ::     0.436563 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:54.192222 128377030926144 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.438453 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_vL1D/MI200/out/pmc_1/2527690_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 4/13][Approximate profiling time left: 20 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_vL1D/MI200/perfmon/pmc_perf_3.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:14:55.732469 137879976533824 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.193356 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:14:55.733078 137879976533824 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:55.928249 137879976533824 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:14:56.011198 137879976533824 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.278120 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:56.033715 137879976533824 generateRocpd.cpp:583] writing SQL database for process 2527699 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:14:56.034527 137879976533824 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_vL1D/MI200/out/pmc_1/smc4124-25-mi210-3c48/2527699_results.db (UUID=0001fa82-bd90-7d90-a8f4-3ec00a71fc9d)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:56.118916 137879976533824 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008108 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:56.120139 137879976533824 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001207 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:56.122118 137879976533824 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001964 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:56.132719 137879976533824 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008567 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:56.419353 137879976533824 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.286619 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:56.421627 137879976533824 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002253 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:56.421645 137879976533824 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:56.431081 137879976533824 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009429 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:56.431096 137879976533824 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:56.431102 137879976533824 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:56.431109 137879976533824 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:56.431224 137879976533824 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000107 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:56.431469 137879976533824 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.397755 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:56.434528 137879976533824 simple_timer.cpp:55] [rocprofv3] output generation ::     0.421691 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:56.434633 137879976533824 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.423385 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_vL1D/MI200/out/pmc_1/2527699_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 5/13][Approximate profiling time left: 18 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_vL1D/MI200/perfmon/pmc_perf_4.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:14:57.973478 130118872211264 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.190847 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:14:57.974057 130118872211264 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:58.170314 130118872211264 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:14:58.266066 130118872211264 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.292009 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:58.290227 130118872211264 generateRocpd.cpp:583] writing SQL database for process 2527708 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:14:58.291092 130118872211264 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_vL1D/MI200/out/pmc_1/smc4124-25-mi210-3c48/2527708_results.db (UUID=0001fa82-c653-7653-9082-a42300e9943a)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:58.375154 130118872211264 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008054 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:58.376345 130118872211264 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001174 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:58.378018 130118872211264 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001658 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:58.388824 130118872211264 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008623 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:58.670306 130118872211264 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.281466 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:58.672691 130118872211264 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002368 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:58.672709 130118872211264 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000003 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:58.682159 130118872211264 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009443 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:58.682175 130118872211264 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:58.682181 130118872211264 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:58.682188 130118872211264 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:58.682290 130118872211264 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000095 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:58.682582 130118872211264 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.392355 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:58.685768 130118872211264 simple_timer.cpp:55] [rocprofv3] output generation ::     0.418027 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:58.685958 130118872211264 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.419809 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_vL1D/MI200/out/pmc_1/2527708_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 6/13][Approximate profiling time left: 15 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_vL1D/MI200/perfmon/pmc_perf_5.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:15:00.203251 128723404144448 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.186761 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:15:00.203883 128723404144448 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:00.397630 128723404144448 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:15:00.501821 128723404144448 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.297938 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:00.524518 128723404144448 generateRocpd.cpp:583] writing SQL database for process 2527726 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:15:00.525334 128723404144448 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_vL1D/MI200/out/pmc_1/smc4124-25-mi210-3c48/2527726_results.db (UUID=0001fa82-cf0d-7f0d-9acf-b4a641c691e9)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:00.609460 128723404144448 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007800 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:00.610690 128723404144448 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001211 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:00.612281 128723404144448 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001576 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:00.622883 128723404144448 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008598 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:00.631550 128723404144448 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.008653 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:00.633695 128723404144448 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002131 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:00.633712 128723404144448 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:00.642476 128723404144448 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008756 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:00.642493 128723404144448 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:00.642499 128723404144448 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:00.642506 128723404144448 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:00.642623 128723404144448 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000089 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:00.642811 128723404144448 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.118293 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:00.645615 128723404144448 simple_timer.cpp:55] [rocprofv3] output generation ::     0.141979 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:00.645662 128723404144448 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.143796 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_vL1D/MI200/out/pmc_1/2527726_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 7/13][Approximate profiling time left: 13 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_vL1D/MI200/perfmon/pmc_perf_SQC_DCACHE_INFLIGHT_LEVEL_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:15:02.169653 138620909162304 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.191030 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:15:02.170287 138620909162304 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:02.370791 138620909162304 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:15:02.459277 138620909162304 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.288991 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:02.481880 138620909162304 generateRocpd.cpp:583] writing SQL database for process 2527738 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:15:02.482705 138620909162304 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_vL1D/MI200/out/pmc_1/smc4124-25-mi210-3c48/2527738_results.db (UUID=0001fa82-d6b7-76b7-bde8-34a1b55696f0)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:02.564913 138620909162304 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008038 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:02.566008 138620909162304 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001078 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:02.567931 138620909162304 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001909 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:02.578542 138620909162304 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008486 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:02.987917 138620909162304 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.409360 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:02.990250 138620909162304 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002313 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:02.990266 138620909162304 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000003 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:02.999385 138620909162304 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009111 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:02.999399 138620909162304 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:02.999405 138620909162304 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:02.999412 138620909162304 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:02.999546 138620909162304 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000125 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:02.999804 138620909162304 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.517924 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:03.002859 138620909162304 simple_timer.cpp:55] [rocprofv3] output generation ::     0.541990 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:03.002980 138620909162304 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.543654 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_vL1D/MI200/out/pmc_1/2527738_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 8/13][Approximate profiling time left: 11 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_vL1D/MI200/perfmon/pmc_perf_SQC_ICACHE_INFLIGHT_LEVEL_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:15:04.531224 137776400260928 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.188649 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:15:04.531811 137776400260928 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:04.726080 137776400260928 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:15:04.815064 137776400260928 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.283253 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:04.839089 137776400260928 generateRocpd.cpp:583] writing SQL database for process 2527747 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:15:04.839887 137776400260928 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_vL1D/MI200/out/pmc_1/smc4124-25-mi210-3c48/2527747_results.db (UUID=0001fa82-dff3-7ff3-8c64-f975379d69d3)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:04.924937 137776400260928 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008121 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:04.926167 137776400260928 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001213 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:04.928281 137776400260928 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002099 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:04.938836 137776400260928 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008430 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:05.341647 137776400260928 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.402796 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:05.343911 137776400260928 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002247 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:05.343928 137776400260928 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000003 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:05.353076 137776400260928 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009141 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:05.353091 137776400260928 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:05.353097 137776400260928 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:05.353104 137776400260928 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:05.353228 137776400260928 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000112 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:05.353472 137776400260928 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.514383 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:05.356510 137776400260928 simple_timer.cpp:55] [rocprofv3] output generation ::     0.539507 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:05.356629 137776400260928 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.541489 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_vL1D/MI200/out/pmc_1/2527747_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 9/13][Approximate profiling time left: 9 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_vL1D/MI200/perfmon/pmc_perf_SQ_IFETCH_LEVEL_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:15:06.944503 127213062676288 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.203661 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:15:06.945196 127213062676288 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:07.139119 127213062676288 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:15:07.236708 127213062676288 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.291513 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:07.259328 127213062676288 generateRocpd.cpp:583] writing SQL database for process 2527756 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:15:07.260127 127213062676288 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_vL1D/MI200/out/pmc_1/smc4124-25-mi210-3c48/2527756_results.db (UUID=0001fa82-e951-7951-9382-8329e0f2b2aa)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:07.343995 127213062676288 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008282 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:07.345197 127213062676288 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001181 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:07.347341 127213062676288 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002129 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:07.357972 127213062676288 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008579 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:07.943102 127213062676288 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.585114 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:07.945580 127213062676288 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002452 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:07.945598 127213062676288 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:07.954692 127213062676288 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009087 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:07.954707 127213062676288 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:07.954713 127213062676288 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:07.954720 127213062676288 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:07.954858 127213062676288 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000129 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:07.955158 127213062676288 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.695831 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:07.958250 127213062676288 simple_timer.cpp:55] [rocprofv3] output generation ::     0.719672 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:07.958395 127213062676288 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.721639 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_vL1D/MI200/out/pmc_1/2527756_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 10/13][Approximate profiling time left: 6 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_vL1D/MI200/perfmon/pmc_perf_SQ_INST_LEVEL_LDS_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:15:09.505022 136959968915264 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.192088 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:15:09.505618 136959968915264 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:09.703695 136959968915264 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:15:09.791459 136959968915264 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.285841 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:09.813637 136959968915264 generateRocpd.cpp:583] writing SQL database for process 2527766 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:15:09.814439 136959968915264 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_vL1D/MI200/out/pmc_1/smc4124-25-mi210-3c48/2527766_results.db (UUID=0001fa82-f35d-735d-b67c-57ff88e4619c)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:09.898965 136959968915264 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008079 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:09.900173 136959968915264 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001193 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:09.902349 136959968915264 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002161 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:09.913262 136959968915264 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008733 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:10.256730 136959968915264 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.343453 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:10.259129 136959968915264 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002383 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:10.259147 136959968915264 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:10.268113 136959968915264 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008959 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:10.268127 136959968915264 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:10.268133 136959968915264 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:10.268139 136959968915264 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:10.268250 136959968915264 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000100 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:10.268455 136959968915264 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.454818 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:10.271464 136959968915264 simple_timer.cpp:55] [rocprofv3] output generation ::     0.478560 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:10.271576 136959968915264 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.480069 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_vL1D/MI200/out/pmc_1/2527766_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 11/13][Approximate profiling time left: 4 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_vL1D/MI200/perfmon/pmc_perf_SQ_INST_LEVEL_SMEM_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:15:11.812223 138406311264064 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.192141 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:15:11.812788 138406311264064 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:12.007336 138406311264064 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:15:12.088112 138406311264064 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.275324 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:12.110792 138406311264064 generateRocpd.cpp:583] writing SQL database for process 2527775 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:15:12.111598 138406311264064 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_vL1D/MI200/out/pmc_1/smc4124-25-mi210-3c48/2527775_results.db (UUID=0001fa82-fc60-7c60-9d91-3fca22cc0f39)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:12.196624 138406311264064 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008123 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:12.197824 138406311264064 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001185 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:12.199811 138406311264064 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001972 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:12.210356 138406311264064 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008544 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:12.540574 138406311264064 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.330204 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:12.542855 138406311264064 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002261 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:12.542872 138406311264064 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:12.553076 138406311264064 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.010197 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:12.553092 138406311264064 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:12.553099 138406311264064 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:12.553106 138406311264064 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:12.553233 138406311264064 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000119 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:12.553470 138406311264064 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.442678 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:12.556585 138406311264064 simple_timer.cpp:55] [rocprofv3] output generation ::     0.466162 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:12.556704 138406311264064 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.468543 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_vL1D/MI200/out/pmc_1/2527775_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 12/13][Approximate profiling time left: 2 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_vL1D/MI200/perfmon/pmc_perf_SQ_INST_LEVEL_VMEM_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:15:14.137047 140106349485888 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.197005 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:15:14.137663 140106349485888 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:14.334902 140106349485888 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:15:14.419814 140106349485888 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.282151 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:14.442656 140106349485888 generateRocpd.cpp:583] writing SQL database for process 2527784 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:15:14.443457 140106349485888 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_vL1D/MI200/out/pmc_1/smc4124-25-mi210-3c48/2527784_results.db (UUID=0001fa83-0570-7570-963f-d45368c3488f)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:14.527804 140106349485888 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008099 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:14.529016 140106349485888 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001195 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:14.531138 140106349485888 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002099 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:14.541551 140106349485888 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008428 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:15.059830 140106349485888 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.518264 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:15.062221 140106349485888 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002367 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:15.062238 140106349485888 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:15.072192 140106349485888 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009947 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:15.072207 140106349485888 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:15.072213 140106349485888 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:15.072220 140106349485888 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:15.072331 140106349485888 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000103 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:15.072546 140106349485888 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.629891 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:15.075553 140106349485888 simple_timer.cpp:55] [rocprofv3] output generation ::     0.653831 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:15.075675 140106349485888 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.655812 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_vL1D/MI200/out/pmc_1/2527784_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 13/13][Approximate profiling time left: 0 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_vL1D/MI200/perfmon/pmc_perf_SQ_LEVEL_WAVES_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:15:16.623118 140644509638464 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.187955 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:15:16.623697 140644509638464 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:16.817101 140644509638464 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:15:16.904347 140644509638464 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.280650 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:16.926691 140644509638464 generateRocpd.cpp:583] writing SQL database for process 2527793 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:15:16.927493 140644509638464 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_vL1D/MI200/out/pmc_1/smc4124-25-mi210-3c48/2527793_results.db (UUID=0001fa83-0f30-7f30-983b-d4f2e88384b2)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:17.011558 140644509638464 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008121 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:17.012749 140644509638464 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001174 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:17.014341 140644509638464 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001576 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:17.024856 140644509638464 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008535 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:17.342811 140644509638464 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.317940 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:17.345091 140644509638464 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002262 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:17.345108 140644509638464 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:17.354377 140644509638464 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009261 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:17.354391 140644509638464 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:17.354398 140644509638464 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:17.354406 140644509638464 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:17.354534 140644509638464 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000120 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:17.354791 140644509638464 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.428100 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:17.357849 140644509638464 simple_timer.cpp:55] [rocprofv3] output generation ::     0.451985 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:17.357961 140644509638464 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.453564 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_vL1D/MI200/out/pmc_1/2527793_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
PC sampling data collection skipped as block 21 is not specified.
[roofline] Checking for roofline.csv in tests/workloads/mem_levels_vL1D/MI200
[roofline] Benchmark execution failed: 'L1'. Skipping roofline.
