Rocprofiler-Compute version: 3.7.0
Profiler choice: rocprofiler-sdk
Output directory: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/mem_levels_L2_vL1d_LDS/MI200
Target: MI210
Command: ./tests/vcopy -n 1048576 -b 256 -i 3
Kernel Selection: None
Dispatch Selection: None
Filtered sections: All

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Collecting Performance Counters
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Generating native tool project using command: cmake -S /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib -B /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build
-- Checking for module 'libdw'
--   Package 'libdw', required by 'virtual:world', not found
-- Could NOT find libdw (missing: libdw_LIBRARY libdw_INCLUDE_DIR)
-- {fmt} version: 12.1.0
-- Build type:
-- Configuring done (0.1s)
-- Generating done (0.0s)
-- Build files have been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build
Building native tool using command: cmake --build /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build --parallel
[  0%] Built target gsl_assert
[ 33%] Built target fmt
[100%] Built target rocprofiler-compute-tool
Searching /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src by lib/_build/lib/librocprofiler-compute-tool.so for native collector
Using native collector: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build/lib/librocprofiler-compute-tool.so
Using native counter collection tool: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build/lib/librocprofiler-compute-tool.so
[profiling] Iteration multiplexing: Disabled
[Run 1/13][Approximate profiling time left: pending first measurement...]
[profiling] Current input file: tests/workloads/mem_levels_L2_vL1d_LDS/MI200/perfmon/pmc_perf_0.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 19:12:56.300793 139877444837184 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.192448 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 19:12:56.301427 139877444837184 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000001 sec
   |-> [rocprofiler-sdk] W20260526 19:12:56.494422 139877444837184 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 19:12:56.578673 139877444837184 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.277246 sec
   |-> [rocprofiler-sdk] W20260526 19:12:56.601241 139877444837184 generateRocpd.cpp:583] writing SQL database for process 2527083 on node 2976770398
   |-> [rocprofiler-sdk] E20260526 19:12:56.602056 139877444837184 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_L2_vL1d_LDS/MI200/out/pmc_1/smc4124-25-mi210-3c48/2527083_results.db (UUID=0001fa80-eb09-7b09-a447-a4f4e6544364)
   |-> [rocprofiler-sdk] W20260526 19:12:56.687096 139877444837184 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008121 sec
   |-> [rocprofiler-sdk] W20260526 19:12:56.688313 139877444837184 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001200 sec
   |-> [rocprofiler-sdk] W20260526 19:12:56.689954 139877444837184 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001627 sec
   |-> [rocprofiler-sdk] W20260526 19:12:56.700403 139877444837184 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008414 sec
   |-> [rocprofiler-sdk] W20260526 19:12:57.065719 139877444837184 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.365300 sec
   |-> [rocprofiler-sdk] W20260526 19:12:57.068629 139877444837184 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002880 sec
   |-> [rocprofiler-sdk] W20260526 19:12:57.068647 139877444837184 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] W20260526 19:12:57.079080 139877444837184 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.010427 sec
   |-> [rocprofiler-sdk] W20260526 19:12:57.079094 139877444837184 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:12:57.079100 139877444837184 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:12:57.079107 139877444837184 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] W20260526 19:12:57.079232 139877444837184 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000093 sec
   |-> [rocprofiler-sdk] W20260526 19:12:57.079448 139877444837184 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.478207 sec
   |-> [rocprofiler-sdk] W20260526 19:12:57.083371 139877444837184 simple_timer.cpp:55] [rocprofv3] output generation ::     0.503180 sec
   |-> [rocprofiler-sdk] W20260526 19:12:57.083476 139877444837184 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.504754 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_L2_vL1d_LDS/MI200/out/pmc_1/2527083_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 2/13][Approximate profiling time left: 25 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_L2_vL1d_LDS/MI200/perfmon/pmc_perf_1.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 19:12:58.659015 128653153804096 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.195454 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 19:12:58.659719 128653153804096 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000001 sec
   |-> [rocprofiler-sdk] W20260526 19:12:58.852741 128653153804096 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 19:12:58.938009 128653153804096 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.278290 sec
   |-> [rocprofiler-sdk] W20260526 19:12:58.961470 128653153804096 generateRocpd.cpp:583] writing SQL database for process 2527185 on node 2976770398
   |-> [rocprofiler-sdk] E20260526 19:12:58.962296 128653153804096 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_L2_vL1d_LDS/MI200/out/pmc_1/smc4124-25-mi210-3c48/2527185_results.db (UUID=0001fa80-f43c-743c-bf8f-ed8aecbc6388)
   |-> [rocprofiler-sdk] W20260526 19:12:59.046260 128653153804096 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008000 sec
   |-> [rocprofiler-sdk] W20260526 19:12:59.047466 128653153804096 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001189 sec
   |-> [rocprofiler-sdk] W20260526 19:12:59.049566 128653153804096 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002085 sec
   |-> [rocprofiler-sdk] W20260526 19:12:59.060249 128653153804096 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008506 sec
   |-> [rocprofiler-sdk] W20260526 19:12:59.372940 128653153804096 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.312677 sec
   |-> [rocprofiler-sdk] W20260526 19:12:59.375282 128653153804096 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002327 sec
   |-> [rocprofiler-sdk] W20260526 19:12:59.375300 128653153804096 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] W20260526 19:12:59.384513 128653153804096 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009205 sec
   |-> [rocprofiler-sdk] W20260526 19:12:59.384527 128653153804096 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:12:59.384533 128653153804096 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:12:59.384540 128653153804096 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] W20260526 19:12:59.384667 128653153804096 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000119 sec
   |-> [rocprofiler-sdk] W20260526 19:12:59.384891 128653153804096 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.423421 sec
   |-> [rocprofiler-sdk] W20260526 19:12:59.388194 128653153804096 simple_timer.cpp:55] [rocprofv3] output generation ::     0.448095 sec
   |-> [rocprofiler-sdk] W20260526 19:12:59.388296 128653153804096 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.450228 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_L2_vL1d_LDS/MI200/out/pmc_1/2527185_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 3/13][Approximate profiling time left: 23 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_L2_vL1d_LDS/MI200/perfmon/pmc_perf_2.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 19:13:00.945490 136916293189440 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.190994 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 19:13:00.946085 136916293189440 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:13:01.139506 136916293189440 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 19:13:01.230372 136916293189440 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.284288 sec
   |-> [rocprofiler-sdk] W20260526 19:13:01.252794 136916293189440 generateRocpd.cpp:583] writing SQL database for process 2527258 on node 2976770398
   |-> [rocprofiler-sdk] E20260526 19:13:01.253588 136916293189440 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_L2_vL1d_LDS/MI200/out/pmc_1/smc4124-25-mi210-3c48/2527258_results.db (UUID=0001fa80-fd2f-7d2f-a57c-adb52527954e)
   |-> [rocprofiler-sdk] W20260526 19:13:01.334838 136916293189440 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007894 sec
   |-> [rocprofiler-sdk] W20260526 19:13:01.335882 136916293189440 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001028 sec
   |-> [rocprofiler-sdk] W20260526 19:13:01.337418 136916293189440 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001521 sec
   |-> [rocprofiler-sdk] W20260526 19:13:01.347669 136916293189440 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008384 sec
   |-> [rocprofiler-sdk] W20260526 19:13:01.646701 136916293189440 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.299018 sec
   |-> [rocprofiler-sdk] W20260526 19:13:01.648759 136916293189440 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002037 sec
   |-> [rocprofiler-sdk] W20260526 19:13:01.648776 136916293189440 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] W20260526 19:13:01.657886 136916293189440 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009103 sec
   |-> [rocprofiler-sdk] W20260526 19:13:01.657900 136916293189440 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:13:01.657906 136916293189440 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:13:01.657912 136916293189440 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] W20260526 19:13:01.658015 136916293189440 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000096 sec
   |-> [rocprofiler-sdk] W20260526 19:13:01.658227 136916293189440 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.405433 sec
   |-> [rocprofiler-sdk] W20260526 19:13:01.661338 136916293189440 simple_timer.cpp:55] [rocprofv3] output generation ::     0.429324 sec
   |-> [rocprofiler-sdk] W20260526 19:13:01.661446 136916293189440 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.431035 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_L2_vL1d_LDS/MI200/out/pmc_1/2527258_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 4/13][Approximate profiling time left: 20 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_L2_vL1d_LDS/MI200/perfmon/pmc_perf_3.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 19:13:03.202839 139239644290880 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.194076 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 19:13:03.203471 139239644290880 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:13:03.397917 139239644290880 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 19:13:03.492406 139239644290880 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.288935 sec
   |-> [rocprofiler-sdk] W20260526 19:13:03.514934 139239644290880 generateRocpd.cpp:583] writing SQL database for process 2527317 on node 2976770398
   |-> [rocprofiler-sdk] E20260526 19:13:03.515739 139239644290880 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_L2_vL1d_LDS/MI200/out/pmc_1/smc4124-25-mi210-3c48/2527317_results.db (UUID=0001fa81-05fd-75fd-8743-a10a1b219182)
   |-> [rocprofiler-sdk] W20260526 19:13:03.598083 139239644290880 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007963 sec
   |-> [rocprofiler-sdk] W20260526 19:13:03.599197 139239644290880 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001098 sec
   |-> [rocprofiler-sdk] W20260526 19:13:03.601209 139239644290880 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001997 sec
   |-> [rocprofiler-sdk] W20260526 19:13:03.611442 139239644290880 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008268 sec
   |-> [rocprofiler-sdk] W20260526 19:13:03.897418 139239644290880 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.285961 sec
   |-> [rocprofiler-sdk] W20260526 19:13:03.899528 139239644290880 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002079 sec
   |-> [rocprofiler-sdk] W20260526 19:13:03.899545 139239644290880 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] W20260526 19:13:03.908442 139239644290880 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008890 sec
   |-> [rocprofiler-sdk] W20260526 19:13:03.908458 139239644290880 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:13:03.908464 139239644290880 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:13:03.908471 139239644290880 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] W20260526 19:13:03.908621 139239644290880 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000122 sec
   |-> [rocprofiler-sdk] W20260526 19:13:03.908847 139239644290880 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.393913 sec
   |-> [rocprofiler-sdk] W20260526 19:13:03.911965 139239644290880 simple_timer.cpp:55] [rocprofv3] output generation ::     0.417926 sec
   |-> [rocprofiler-sdk] W20260526 19:13:03.912064 139239644290880 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.419607 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_L2_vL1d_LDS/MI200/out/pmc_1/2527317_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 5/13][Approximate profiling time left: 18 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_L2_vL1d_LDS/MI200/perfmon/pmc_perf_4.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 19:13:05.444713 126366864068416 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.195824 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 19:13:05.445369 126366864068416 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:13:05.640263 126366864068416 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 19:13:05.732815 126366864068416 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.287446 sec
   |-> [rocprofiler-sdk] W20260526 19:13:05.755401 126366864068416 generateRocpd.cpp:583] writing SQL database for process 2527350 on node 2976770398
   |-> [rocprofiler-sdk] E20260526 19:13:05.756206 126366864068416 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_L2_vL1d_LDS/MI200/out/pmc_1/smc4124-25-mi210-3c48/2527350_results.db (UUID=0001fa81-0ebd-7ebd-9d2c-1b5dd631cef6)
   |-> [rocprofiler-sdk] W20260526 19:13:05.835906 126366864068416 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007779 sec
   |-> [rocprofiler-sdk] W20260526 19:13:05.836987 126366864068416 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001064 sec
   |-> [rocprofiler-sdk] W20260526 19:13:05.838556 126366864068416 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001553 sec
   |-> [rocprofiler-sdk] W20260526 19:13:05.848563 126366864068416 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008106 sec
   |-> [rocprofiler-sdk] W20260526 19:13:06.127333 126366864068416 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.278754 sec
   |-> [rocprofiler-sdk] W20260526 19:13:06.129498 126366864068416 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002138 sec
   |-> [rocprofiler-sdk] W20260526 19:13:06.129516 126366864068416 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] W20260526 19:13:06.138277 126366864068416 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008754 sec
   |-> [rocprofiler-sdk] W20260526 19:13:06.138291 126366864068416 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:13:06.138298 126366864068416 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:13:06.138305 126366864068416 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] W20260526 19:13:06.138407 126366864068416 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000093 sec
   |-> [rocprofiler-sdk] W20260526 19:13:06.138613 126366864068416 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.383212 sec
   |-> [rocprofiler-sdk] W20260526 19:13:06.141505 126366864068416 simple_timer.cpp:55] [rocprofv3] output generation ::     0.407240 sec
   |-> [rocprofiler-sdk] W20260526 19:13:06.141596 126366864068416 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.408730 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_L2_vL1d_LDS/MI200/out/pmc_1/2527350_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 6/13][Approximate profiling time left: 15 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_L2_vL1d_LDS/MI200/perfmon/pmc_perf_5.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 19:13:07.650223 130065250721600 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.185223 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 19:13:07.650779 130065250721600 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:13:07.844491 130065250721600 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 19:13:07.932602 130065250721600 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.281824 sec
   |-> [rocprofiler-sdk] W20260526 19:13:07.955217 130065250721600 generateRocpd.cpp:583] writing SQL database for process 2527361 on node 2976770398
   |-> [rocprofiler-sdk] E20260526 19:13:07.956004 130065250721600 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_L2_vL1d_LDS/MI200/out/pmc_1/smc4124-25-mi210-3c48/2527361_results.db (UUID=0001fa81-1765-7765-9703-7ebb09fd4f8c)
   |-> [rocprofiler-sdk] W20260526 19:13:08.034795 130065250721600 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007603 sec
   |-> [rocprofiler-sdk] W20260526 19:13:08.035842 130065250721600 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001030 sec
   |-> [rocprofiler-sdk] W20260526 19:13:08.037382 130065250721600 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001525 sec
   |-> [rocprofiler-sdk] W20260526 19:13:08.047495 130065250721600 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008190 sec
   |-> [rocprofiler-sdk] W20260526 19:13:08.055855 130065250721600 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.008345 sec
   |-> [rocprofiler-sdk] W20260526 19:13:08.057810 130065250721600 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.001941 sec
   |-> [rocprofiler-sdk] W20260526 19:13:08.057828 130065250721600 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] W20260526 19:13:08.066483 130065250721600 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008648 sec
   |-> [rocprofiler-sdk] W20260526 19:13:08.066498 130065250721600 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:13:08.066504 130065250721600 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:13:08.066511 130065250721600 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] W20260526 19:13:08.066611 130065250721600 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000093 sec
   |-> [rocprofiler-sdk] W20260526 19:13:08.066805 130065250721600 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.111588 sec
   |-> [rocprofiler-sdk] W20260526 19:13:08.069782 130065250721600 simple_timer.cpp:55] [rocprofv3] output generation ::     0.135653 sec
   |-> [rocprofiler-sdk] W20260526 19:13:08.069830 130065250721600 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.137175 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_L2_vL1d_LDS/MI200/out/pmc_1/2527361_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 7/13][Approximate profiling time left: 13 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_L2_vL1d_LDS/MI200/perfmon/pmc_perf_SQC_DCACHE_INFLIGHT_LEVEL_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 19:13:09.606645 139051124801344 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.189628 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 19:13:09.607276 139051124801344 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:13:09.804519 139051124801344 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 19:13:09.901750 139051124801344 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.294474 sec
   |-> [rocprofiler-sdk] W20260526 19:13:09.923936 139051124801344 generateRocpd.cpp:583] writing SQL database for process 2527370 on node 2976770398
   |-> [rocprofiler-sdk] E20260526 19:13:09.924744 139051124801344 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_L2_vL1d_LDS/MI200/out/pmc_1/smc4124-25-mi210-3c48/2527370_results.db (UUID=0001fa81-1f05-7f05-ac36-8b08e5f0fcca)
   |-> [rocprofiler-sdk] W20260526 19:13:10.009841 139051124801344 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008192 sec
   |-> [rocprofiler-sdk] W20260526 19:13:10.011065 139051124801344 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001208 sec
   |-> [rocprofiler-sdk] W20260526 19:13:10.013066 139051124801344 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001986 sec
   |-> [rocprofiler-sdk] W20260526 19:13:10.023537 139051124801344 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008443 sec
   |-> [rocprofiler-sdk] W20260526 19:13:10.434372 139051124801344 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.410819 sec
   |-> [rocprofiler-sdk] W20260526 19:13:10.436722 139051124801344 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002330 sec
   |-> [rocprofiler-sdk] W20260526 19:13:10.436739 139051124801344 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] W20260526 19:13:10.445363 139051124801344 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008617 sec
   |-> [rocprofiler-sdk] W20260526 19:13:10.445376 139051124801344 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:13:10.445382 139051124801344 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:13:10.445389 139051124801344 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] W20260526 19:13:10.445516 139051124801344 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000116 sec
   |-> [rocprofiler-sdk] W20260526 19:13:10.445766 139051124801344 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.521831 sec
   |-> [rocprofiler-sdk] W20260526 19:13:10.448776 139051124801344 simple_timer.cpp:55] [rocprofv3] output generation ::     0.545508 sec
   |-> [rocprofiler-sdk] W20260526 19:13:10.448887 139051124801344 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.547088 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_L2_vL1d_LDS/MI200/out/pmc_1/2527370_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 8/13][Approximate profiling time left: 11 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_L2_vL1d_LDS/MI200/perfmon/pmc_perf_SQC_ICACHE_INFLIGHT_LEVEL_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 19:13:11.971846 129788950437696 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.188765 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 19:13:11.972442 129788950437696 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:13:12.166562 129788950437696 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 19:13:12.260800 129788950437696 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.288359 sec
   |-> [rocprofiler-sdk] W20260526 19:13:12.283298 129788950437696 generateRocpd.cpp:583] writing SQL database for process 2527378 on node 2976770398
   |-> [rocprofiler-sdk] E20260526 19:13:12.284104 129788950437696 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_L2_vL1d_LDS/MI200/out/pmc_1/smc4124-25-mi210-3c48/2527378_results.db (UUID=0001fa81-2844-7844-9abe-02d18276a5d6)
   |-> [rocprofiler-sdk] W20260526 19:13:12.363423 129788950437696 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007942 sec
   |-> [rocprofiler-sdk] W20260526 19:13:12.364486 129788950437696 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001045 sec
   |-> [rocprofiler-sdk] W20260526 19:13:12.366344 129788950437696 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001843 sec
   |-> [rocprofiler-sdk] W20260526 19:13:12.376418 129788950437696 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008195 sec
   |-> [rocprofiler-sdk] W20260526 19:13:12.775856 129788950437696 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.399423 sec
   |-> [rocprofiler-sdk] W20260526 19:13:12.777828 129788950437696 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.001944 sec
   |-> [rocprofiler-sdk] W20260526 19:13:12.777845 129788950437696 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] W20260526 19:13:12.787190 129788950437696 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009338 sec
   |-> [rocprofiler-sdk] W20260526 19:13:12.787203 129788950437696 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:13:12.787210 129788950437696 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:13:12.787217 129788950437696 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] W20260526 19:13:12.787347 129788950437696 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000121 sec
   |-> [rocprofiler-sdk] W20260526 19:13:12.787610 129788950437696 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.504312 sec
   |-> [rocprofiler-sdk] W20260526 19:13:12.790705 129788950437696 simple_timer.cpp:55] [rocprofv3] output generation ::     0.528430 sec
   |-> [rocprofiler-sdk] W20260526 19:13:12.790824 129788950437696 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.529974 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_L2_vL1d_LDS/MI200/out/pmc_1/2527378_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 9/13][Approximate profiling time left: 9 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_L2_vL1d_LDS/MI200/perfmon/pmc_perf_SQ_IFETCH_LEVEL_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 19:13:14.353800 127969402560320 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.197135 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 19:13:14.354454 127969402560320 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:13:14.561170 127969402560320 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 19:13:14.650867 127969402560320 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.296413 sec
   |-> [rocprofiler-sdk] W20260526 19:13:14.673463 127969402560320 generateRocpd.cpp:583] writing SQL database for process 2527386 on node 2976770398
   |-> [rocprofiler-sdk] E20260526 19:13:14.674264 127969402560320 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_L2_vL1d_LDS/MI200/out/pmc_1/smc4124-25-mi210-3c48/2527386_results.db (UUID=0001fa81-3189-7189-b78d-562d72fa3b5e)
   |-> [rocprofiler-sdk] W20260526 19:13:14.756338 127969402560320 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008207 sec
   |-> [rocprofiler-sdk] W20260526 19:13:14.757427 127969402560320 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001072 sec
   |-> [rocprofiler-sdk] W20260526 19:13:14.759329 127969402560320 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001888 sec
   |-> [rocprofiler-sdk] W20260526 19:13:14.769819 127969402560320 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008555 sec
   |-> [rocprofiler-sdk] W20260526 19:13:15.351264 127969402560320 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.581427 sec
   |-> [rocprofiler-sdk] W20260526 19:13:15.353242 127969402560320 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.001945 sec
   |-> [rocprofiler-sdk] W20260526 19:13:15.353259 127969402560320 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] W20260526 19:13:15.362204 127969402560320 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008938 sec
   |-> [rocprofiler-sdk] W20260526 19:13:15.362219 127969402560320 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:13:15.362226 127969402560320 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:13:15.362233 127969402560320 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] W20260526 19:13:15.362377 127969402560320 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000134 sec
   |-> [rocprofiler-sdk] W20260526 19:13:15.362658 127969402560320 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.689196 sec
   |-> [rocprofiler-sdk] W20260526 19:13:15.365744 127969402560320 simple_timer.cpp:55] [rocprofv3] output generation ::     0.713426 sec
   |-> [rocprofiler-sdk] W20260526 19:13:15.365894 127969402560320 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.714972 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_L2_vL1d_LDS/MI200/out/pmc_1/2527386_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 10/13][Approximate profiling time left: 6 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_L2_vL1d_LDS/MI200/perfmon/pmc_perf_SQ_INST_LEVEL_LDS_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 19:13:16.908776 131282617507648 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.192387 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 19:13:16.909405 131282617507648 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000001 sec
   |-> [rocprofiler-sdk] W20260526 19:13:17.105069 131282617507648 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 19:13:17.193358 131282617507648 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.283953 sec
   |-> [rocprofiler-sdk] W20260526 19:13:17.215572 131282617507648 generateRocpd.cpp:583] writing SQL database for process 2527409 on node 2976770398
   |-> [rocprofiler-sdk] E20260526 19:13:17.216369 131282617507648 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_L2_vL1d_LDS/MI200/out/pmc_1/smc4124-25-mi210-3c48/2527409_results.db (UUID=0001fa81-3b89-7b89-abbb-7018080c8d01)
   |-> [rocprofiler-sdk] W20260526 19:13:17.297718 131282617507648 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007972 sec
   |-> [rocprofiler-sdk] W20260526 19:13:17.298824 131282617507648 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001084 sec
   |-> [rocprofiler-sdk] W20260526 19:13:17.300748 131282617507648 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001909 sec
   |-> [rocprofiler-sdk] W20260526 19:13:17.310940 131282617507648 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008283 sec
   |-> [rocprofiler-sdk] W20260526 19:13:17.652690 131282617507648 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.341736 sec
   |-> [rocprofiler-sdk] W20260526 19:13:17.654754 131282617507648 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002045 sec
   |-> [rocprofiler-sdk] W20260526 19:13:17.654772 131282617507648 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] W20260526 19:13:17.664137 131282617507648 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009359 sec
   |-> [rocprofiler-sdk] W20260526 19:13:17.664152 131282617507648 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:13:17.664159 131282617507648 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:13:17.664165 131282617507648 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] W20260526 19:13:17.664287 131282617507648 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000115 sec
   |-> [rocprofiler-sdk] W20260526 19:13:17.664529 131282617507648 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.448957 sec
   |-> [rocprofiler-sdk] W20260526 19:13:17.667566 131282617507648 simple_timer.cpp:55] [rocprofv3] output generation ::     0.472495 sec
   |-> [rocprofiler-sdk] W20260526 19:13:17.667669 131282617507648 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.474264 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_L2_vL1d_LDS/MI200/out/pmc_1/2527409_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 11/13][Approximate profiling time left: 4 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_L2_vL1d_LDS/MI200/perfmon/pmc_perf_SQ_INST_LEVEL_SMEM_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 19:13:19.207579 134668351651648 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.192679 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 19:13:19.208201 134668351651648 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:13:19.401769 134668351651648 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 19:13:19.497879 134668351651648 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.289678 sec
   |-> [rocprofiler-sdk] W20260526 19:13:19.520191 134668351651648 generateRocpd.cpp:583] writing SQL database for process 2527418 on node 2976770398
   |-> [rocprofiler-sdk] E20260526 19:13:19.520998 134668351651648 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_L2_vL1d_LDS/MI200/out/pmc_1/smc4124-25-mi210-3c48/2527418_results.db (UUID=0001fa81-4483-7483-854c-94118cf755ce)
   |-> [rocprofiler-sdk] W20260526 19:13:19.602678 134668351651648 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007982 sec
   |-> [rocprofiler-sdk] W20260526 19:13:19.603772 134668351651648 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001078 sec
   |-> [rocprofiler-sdk] W20260526 19:13:19.605691 134668351651648 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001904 sec
   |-> [rocprofiler-sdk] W20260526 19:13:19.615887 134668351651648 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008255 sec
   |-> [rocprofiler-sdk] W20260526 19:13:19.950226 134668351651648 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.334323 sec
   |-> [rocprofiler-sdk] W20260526 19:13:19.952323 134668351651648 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002075 sec
   |-> [rocprofiler-sdk] W20260526 19:13:19.952341 134668351651648 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] W20260526 19:13:19.961098 134668351651648 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008750 sec
   |-> [rocprofiler-sdk] W20260526 19:13:19.961112 134668351651648 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:13:19.961118 134668351651648 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:13:19.961125 134668351651648 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] W20260526 19:13:19.961268 134668351651648 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000114 sec
   |-> [rocprofiler-sdk] W20260526 19:13:19.961516 134668351651648 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.441325 sec
   |-> [rocprofiler-sdk] W20260526 19:13:19.964527 134668351651648 simple_timer.cpp:55] [rocprofv3] output generation ::     0.465185 sec
   |-> [rocprofiler-sdk] W20260526 19:13:19.964633 134668351651648 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.466706 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_L2_vL1d_LDS/MI200/out/pmc_1/2527418_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 12/13][Approximate profiling time left: 2 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_L2_vL1d_LDS/MI200/perfmon/pmc_perf_SQ_INST_LEVEL_VMEM_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 19:13:21.524514 135619764821824 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.196050 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 19:13:21.525081 135619764821824 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:13:21.717045 135619764821824 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 19:13:21.814065 135619764821824 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.288985 sec
   |-> [rocprofiler-sdk] W20260526 19:13:21.837328 135619764821824 generateRocpd.cpp:583] writing SQL database for process 2527426 on node 2976770398
   |-> [rocprofiler-sdk] E20260526 19:13:21.838140 135619764821824 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_L2_vL1d_LDS/MI200/out/pmc_1/smc4124-25-mi210-3c48/2527426_results.db (UUID=0001fa81-4d8d-7d8d-892c-b1e546fca117)
   |-> [rocprofiler-sdk] W20260526 19:13:21.920060 135619764821824 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008060 sec
   |-> [rocprofiler-sdk] W20260526 19:13:21.921167 135619764821824 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001090 sec
   |-> [rocprofiler-sdk] W20260526 19:13:21.923091 135619764821824 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001909 sec
   |-> [rocprofiler-sdk] W20260526 19:13:21.933289 135619764821824 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008246 sec
   |-> [rocprofiler-sdk] W20260526 19:13:22.451247 135619764821824 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.517941 sec
   |-> [rocprofiler-sdk] W20260526 19:13:22.453287 135619764821824 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002007 sec
   |-> [rocprofiler-sdk] W20260526 19:13:22.453304 135619764821824 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] W20260526 19:13:22.462588 135619764821824 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009277 sec
   |-> [rocprofiler-sdk] W20260526 19:13:22.462604 135619764821824 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:13:22.462610 135619764821824 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:13:22.462617 135619764821824 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] W20260526 19:13:22.462757 135619764821824 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000131 sec
   |-> [rocprofiler-sdk] W20260526 19:13:22.463044 135619764821824 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.625716 sec
   |-> [rocprofiler-sdk] W20260526 19:13:22.466075 135619764821824 simple_timer.cpp:55] [rocprofv3] output generation ::     0.650151 sec
   |-> [rocprofiler-sdk] W20260526 19:13:22.466209 135619764821824 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.652092 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_L2_vL1d_LDS/MI200/out/pmc_1/2527426_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 13/13][Approximate profiling time left: 0 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_L2_vL1d_LDS/MI200/perfmon/pmc_perf_SQ_LEVEL_WAVES_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 19:13:23.997464 139861263097664 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.192743 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 19:13:23.998094 139861263097664 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:13:24.191889 139861263097664 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 19:13:24.280017 139861263097664 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.281923 sec
   |-> [rocprofiler-sdk] W20260526 19:13:24.302026 139861263097664 generateRocpd.cpp:583] writing SQL database for process 2527436 on node 2976770398
   |-> [rocprofiler-sdk] E20260526 19:13:24.302814 139861263097664 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_L2_vL1d_LDS/MI200/out/pmc_1/smc4124-25-mi210-3c48/2527436_results.db (UUID=0001fa81-5739-7739-bae3-90a110d57392)
   |-> [rocprofiler-sdk] W20260526 19:13:24.381459 139861263097664 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007921 sec
   |-> [rocprofiler-sdk] W20260526 19:13:24.382514 139861263097664 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001039 sec
   |-> [rocprofiler-sdk] W20260526 19:13:24.384066 139861263097664 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001537 sec
   |-> [rocprofiler-sdk] W20260526 19:13:24.394148 139861263097664 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008196 sec
   |-> [rocprofiler-sdk] W20260526 19:13:24.712310 139861263097664 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.318146 sec
   |-> [rocprofiler-sdk] W20260526 19:13:24.714464 139861263097664 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002119 sec
   |-> [rocprofiler-sdk] W20260526 19:13:24.714481 139861263097664 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] W20260526 19:13:24.723204 139861263097664 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008716 sec
   |-> [rocprofiler-sdk] W20260526 19:13:24.723219 139861263097664 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:13:24.723225 139861263097664 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 19:13:24.723231 139861263097664 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] W20260526 19:13:24.723336 139861263097664 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000097 sec
   |-> [rocprofiler-sdk] W20260526 19:13:24.723536 139861263097664 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.421511 sec
   |-> [rocprofiler-sdk] W20260526 19:13:24.726495 139861263097664 simple_timer.cpp:55] [rocprofv3] output generation ::     0.445062 sec
   |-> [rocprofiler-sdk] W20260526 19:13:24.726592 139861263097664 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.446505 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_L2_vL1d_LDS/MI200/out/pmc_1/2527436_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
PC sampling data collection skipped as block 21 is not specified.
[roofline] Checking for roofline.csv in tests/workloads/mem_levels_L2_vL1d_LDS/MI200
[roofline] Benchmark execution failed: 'L1'. Skipping roofline.
