Rocprofiler-Compute version: 3.7.0
Profiler choice: rocprofiler-sdk
Output directory: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/device_inv_int/MI200
Target: MI210
Command: ./tests/vcopy -n 1048576 -b 256 -i 3
Kernel Selection: None
Dispatch Selection: None
Filtered sections: All

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Collecting Performance Counters
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Generating native tool project using command: cmake -S /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib -B /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build
-- Checking for module 'libdw'
--   Package 'libdw', required by 'virtual:world', not found
-- Could NOT find libdw (missing: libdw_LIBRARY libdw_INCLUDE_DIR)
-- {fmt} version: 12.1.0
-- Build type:
-- Configuring done (0.1s)
-- Generating done (0.0s)
-- Build files have been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build
Building native tool using command: cmake --build /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build --parallel
[  0%] Built target gsl_assert
[ 33%] Built target fmt
[100%] Built target rocprofiler-compute-tool
Searching /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src by lib/_build/lib/librocprofiler-compute-tool.so for native collector
Using native collector: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build/lib/librocprofiler-compute-tool.so
Using native counter collection tool: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build/lib/librocprofiler-compute-tool.so
[profiling] Iteration multiplexing: Disabled
[Run 1/13][Approximate profiling time left: pending first measurement...]
[profiling] Current input file: tests/workloads/device_inv_int/MI200/perfmon/pmc_perf_0.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:00:52.934985 134597170233152 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.192312 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:00:52.935619 134597170233152 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:53.127972 134597170233152 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:00:53.215512 134597170233152 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.279893 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:53.238139 134597170233152 generateRocpd.cpp:583] writing SQL database for process 2523902 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:00:53.238944 134597170233152 generateRocpd.cpp:606] Opened result file: tests/workloads/device_inv_int/MI200/out/pmc_1/smc4124-25-mi210-3c48/2523902_results.db (UUID=0001fa75-e163-7163-b818-d60729c77bc1)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:53.321879 134597170233152 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008026 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:53.322996 134597170233152 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001102 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:53.324592 134597170233152 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001581 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:53.334880 134597170233152 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008315 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:53.659969 134597170233152 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.325073 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:53.662117 134597170233152 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002123 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:53.662135 134597170233152 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:53.671654 134597170233152 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009513 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:53.671669 134597170233152 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:53.671675 134597170233152 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:53.671681 134597170233152 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:53.671792 134597170233152 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000103 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:53.671996 134597170233152 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.433857 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:53.674964 134597170233152 simple_timer.cpp:55] [rocprofv3] output generation ::     0.457927 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:53.675070 134597170233152 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.459501 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/device_inv_int/MI200/out/pmc_1/2523902_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 2/13][Approximate profiling time left: 25 seconds]...
[profiling] Current input file: tests/workloads/device_inv_int/MI200/perfmon/pmc_perf_1.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:00:55.224796 132999831314240 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.192160 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:00:55.225380 132999831314240 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:55.419676 132999831314240 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:00:55.510330 132999831314240 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.284950 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:55.532384 132999831314240 generateRocpd.cpp:583] writing SQL database for process 2523913 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:00:55.533182 132999831314240 generateRocpd.cpp:606] Opened result file: tests/workloads/device_inv_int/MI200/out/pmc_1/smc4124-25-mi210-3c48/2523913_results.db (UUID=0001fa75-ea55-7a55-8169-b60a941be2ee)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:55.615841 132999831314240 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007869 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:55.616961 132999831314240 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001103 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:55.618579 132999831314240 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001601 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:55.628872 132999831314240 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008286 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:55.941606 132999831314240 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.312716 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:55.943684 132999831314240 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002061 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:55.943703 132999831314240 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:55.952683 132999831314240 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008969 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:55.952699 132999831314240 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:55.952713 132999831314240 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:55.952724 132999831314240 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:55.952835 132999831314240 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000096 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:55.953045 132999831314240 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.420662 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:55.956025 132999831314240 simple_timer.cpp:55] [rocprofv3] output generation ::     0.444146 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:55.956137 132999831314240 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.445765 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/device_inv_int/MI200/out/pmc_1/2523913_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 3/13][Approximate profiling time left: 22 seconds]...
[profiling] Current input file: tests/workloads/device_inv_int/MI200/perfmon/pmc_perf_2.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:00:57.534828 130878347099968 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.193328 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:00:57.535443 130878347099968 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:57.729013 130878347099968 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:00:57.832444 130878347099968 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.297002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:57.854833 130878347099968 generateRocpd.cpp:583] writing SQL database for process 2523923 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:00:57.855614 130878347099968 generateRocpd.cpp:606] Opened result file: tests/workloads/device_inv_int/MI200/out/pmc_1/smc4124-25-mi210-3c48/2523923_results.db (UUID=0001fa75-f35a-735a-8ff6-8a9a8e74f332)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:57.938327 130878347099968 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008029 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:57.939465 130878347099968 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001121 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:57.941080 130878347099968 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001600 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:57.951377 130878347099968 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008261 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:58.251772 130878347099968 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.300381 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:58.253920 130878347099968 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002131 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:58.253937 130878347099968 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:58.262988 130878347099968 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009043 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:58.263002 130878347099968 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:58.263008 130878347099968 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:58.263015 130878347099968 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:58.263134 130878347099968 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000111 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:58.263392 130878347099968 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.408559 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:58.266337 130878347099968 simple_timer.cpp:55] [rocprofv3] output generation ::     0.432368 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:58.266428 130878347099968 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.433942 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/device_inv_int/MI200/out/pmc_1/2523923_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 4/13][Approximate profiling time left: 20 seconds]...
[profiling] Current input file: tests/workloads/device_inv_int/MI200/perfmon/pmc_perf_3.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:00:59.788231 129578889002816 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.192007 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:00:59.788824 129578889002816 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:00:59.983651 129578889002816 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:01:00.078653 129578889002816 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.289830 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:00.101026 129578889002816 generateRocpd.cpp:583] writing SQL database for process 2523932 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:01:00.101829 129578889002816 generateRocpd.cpp:606] Opened result file: tests/workloads/device_inv_int/MI200/out/pmc_1/smc4124-25-mi210-3c48/2523932_results.db (UUID=0001fa75-fc29-7c29-9efd-c724e99178bf)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:00.184618 129578889002816 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007994 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:00.185744 129578889002816 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001108 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:00.187732 129578889002816 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001973 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:00.198069 129578889002816 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008326 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:00.485498 129578889002816 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.287414 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:00.487715 129578889002816 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002201 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:00.487732 129578889002816 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:00.497540 129578889002816 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009801 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:00.497555 129578889002816 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:00.497561 129578889002816 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:00.497568 129578889002816 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:00.497683 129578889002816 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000107 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:00.497925 129578889002816 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.396899 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:00.500930 129578889002816 simple_timer.cpp:55] [rocprofv3] output generation ::     0.420929 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:00.501042 129578889002816 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.422345 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/device_inv_int/MI200/out/pmc_1/2523932_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 5/13][Approximate profiling time left: 18 seconds]...
[profiling] Current input file: tests/workloads/device_inv_int/MI200/perfmon/pmc_perf_4.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:01:02.053577 139587091816256 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.191914 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:01:02.054176 139587091816256 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:02.247784 139587091816256 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:01:02.337822 139587091816256 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.283646 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:02.359879 139587091816256 generateRocpd.cpp:583] writing SQL database for process 2523942 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:01:02.360675 139587091816256 generateRocpd.cpp:606] Opened result file: tests/workloads/device_inv_int/MI200/out/pmc_1/smc4124-25-mi210-3c48/2523942_results.db (UUID=0001fa76-0502-7502-a715-a16e91403480)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:02.443096 139587091816256 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007937 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:02.444236 139587091816256 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001123 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:02.445855 139587091816256 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001603 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:02.456142 139587091816256 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008289 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:02.738423 139587091816256 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.282266 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:02.740559 139587091816256 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002118 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:02.740576 139587091816256 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:02.749387 139587091816256 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008803 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:02.749401 139587091816256 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:02.749408 139587091816256 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:02.749414 139587091816256 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:02.749543 139587091816256 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000121 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:02.749799 139587091816256 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.389920 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:02.752781 139587091816256 simple_timer.cpp:55] [rocprofv3] output generation ::     0.413654 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:02.752879 139587091816256 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.415012 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/device_inv_int/MI200/out/pmc_1/2523942_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 6/13][Approximate profiling time left: 15 seconds]...
[profiling] Current input file: tests/workloads/device_inv_int/MI200/perfmon/pmc_perf_5.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:01:04.265735 134587775991616 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.182718 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:01:04.266331 134587775991616 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:04.460326 134587775991616 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:01:04.544673 134587775991616 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.278342 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:04.566929 134587775991616 generateRocpd.cpp:583] writing SQL database for process 2523950 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:01:04.567722 134587775991616 generateRocpd.cpp:606] Opened result file: tests/workloads/device_inv_int/MI200/out/pmc_1/smc4124-25-mi210-3c48/2523950_results.db (UUID=0001fa76-0daf-7daf-bdf2-dc13960c4a4c)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:04.649774 134587775991616 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007731 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:04.650893 134587775991616 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001103 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:04.652536 134587775991616 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001628 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:04.663006 134587775991616 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008482 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:04.671474 134587775991616 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.008453 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:04.673474 134587775991616 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.001985 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:04.673491 134587775991616 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:04.682112 134587775991616 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008613 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:04.682126 134587775991616 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:04.682132 134587775991616 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:04.682138 134587775991616 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:04.682236 134587775991616 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000090 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:04.682434 134587775991616 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.115505 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:04.685212 134587775991616 simple_timer.cpp:55] [rocprofv3] output generation ::     0.139133 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:04.685262 134587775991616 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.140539 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/device_inv_int/MI200/out/pmc_1/2523950_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 7/13][Approximate profiling time left: 13 seconds]...
[profiling] Current input file: tests/workloads/device_inv_int/MI200/perfmon/pmc_perf_SQC_DCACHE_INFLIGHT_LEVEL_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:01:06.190376 139122581061440 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.190387 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:01:06.190939 139122581061440 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:06.386024 139122581061440 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:01:06.482644 139122581061440 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.291705 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:06.504686 139122581061440 generateRocpd.cpp:583] writing SQL database for process 2523959 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:01:06.505456 139122581061440 generateRocpd.cpp:606] Opened result file: tests/workloads/device_inv_int/MI200/out/pmc_1/smc4124-25-mi210-3c48/2523959_results.db (UUID=0001fa76-152c-752c-bccb-54923599a380)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:06.586846 139122581061440 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008078 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:06.587958 139122581061440 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001095 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:06.589985 139122581061440 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002013 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:06.600419 139122581061440 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008434 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:07.017750 139122581061440 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.417316 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:07.019898 139122581061440 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002132 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:07.019916 139122581061440 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:07.028512 139122581061440 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008589 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:07.028527 139122581061440 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:07.028533 139122581061440 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:07.028540 139122581061440 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:07.028650 139122581061440 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000102 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:07.028872 139122581061440 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.524186 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:07.031850 139122581061440 simple_timer.cpp:55] [rocprofv3] output generation ::     0.547792 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:07.031958 139122581061440 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.549275 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/device_inv_int/MI200/out/pmc_1/2523959_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 8/13][Approximate profiling time left: 11 seconds]...
[profiling] Current input file: tests/workloads/device_inv_int/MI200/perfmon/pmc_perf_SQC_ICACHE_INFLIGHT_LEVEL_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:01:08.551841 134333525466944 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.191145 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:01:08.552414 134333525466944 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:08.745000 134333525466944 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:01:08.835273 134333525466944 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.282860 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:08.857637 134333525466944 generateRocpd.cpp:583] writing SQL database for process 2523968 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:01:08.858436 134333525466944 generateRocpd.cpp:606] Opened result file: tests/workloads/device_inv_int/MI200/out/pmc_1/smc4124-25-mi210-3c48/2523968_results.db (UUID=0001fa76-1e65-7e65-bca9-8601aa5989d1)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:08.941120 134333525466944 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008090 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:08.942243 134333525466944 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001107 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:08.944235 134333525466944 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001977 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:08.954592 134333525466944 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008341 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:09.355767 134333525466944 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.401160 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:09.357870 134333525466944 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002075 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:09.357887 134333525466944 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:09.366548 134333525466944 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008654 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:09.366563 134333525466944 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:09.366569 134333525466944 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:09.366576 134333525466944 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:09.366705 134333525466944 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000122 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:09.366970 134333525466944 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.509333 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:09.369946 134333525466944 simple_timer.cpp:55] [rocprofv3] output generation ::     0.533065 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:09.370065 134333525466944 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.534745 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/device_inv_int/MI200/out/pmc_1/2523968_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 9/13][Approximate profiling time left: 8 seconds]...
[profiling] Current input file: tests/workloads/device_inv_int/MI200/perfmon/pmc_perf_SQ_IFETCH_LEVEL_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:01:10.922422 132729413394240 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.197350 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:01:10.923014 132729413394240 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:11.119295 132729413394240 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:01:11.207468 132729413394240 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.284454 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:11.229664 132729413394240 generateRocpd.cpp:583] writing SQL database for process 2523976 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:01:11.230450 132729413394240 generateRocpd.cpp:606] Opened result file: tests/workloads/device_inv_int/MI200/out/pmc_1/smc4124-25-mi210-3c48/2523976_results.db (UUID=0001fa76-27a2-77a2-a9ae-166d8b736f61)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:11.313079 132729413394240 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008241 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:11.314188 132729413394240 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001092 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:11.316175 132729413394240 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001970 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:11.326669 132729413394240 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008504 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:11.909410 132729413394240 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.582724 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:11.911792 132729413394240 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002350 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:11.911812 132729413394240 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:11.920923 132729413394240 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009097 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:11.920938 132729413394240 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:11.920944 132729413394240 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:11.920951 132729413394240 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:11.921126 132729413394240 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000139 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:11.921387 132729413394240 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.691723 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:11.924405 132729413394240 simple_timer.cpp:55] [rocprofv3] output generation ::     0.715416 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:11.924547 132729413394240 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.717034 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/device_inv_int/MI200/out/pmc_1/2523976_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 10/13][Approximate profiling time left: 6 seconds]...
[profiling] Current input file: tests/workloads/device_inv_int/MI200/perfmon/pmc_perf_SQ_INST_LEVEL_LDS_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:01:13.474691 140218909925184 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.189264 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:01:13.475333 140218909925184 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:13.669393 140218909925184 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:01:13.761683 140218909925184 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.286350 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:13.784012 140218909925184 generateRocpd.cpp:583] writing SQL database for process 2523984 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:01:13.784800 140218909925184 generateRocpd.cpp:606] Opened result file: tests/workloads/device_inv_int/MI200/out/pmc_1/smc4124-25-mi210-3c48/2523984_results.db (UUID=0001fa76-31a2-71a2-99c1-366c3c8b3209)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:13.864252 140218909925184 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007876 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:13.865342 140218909925184 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001075 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:13.867277 140218909925184 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001920 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:13.877257 140218909925184 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008058 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:14.220374 140218909925184 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.343099 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:14.222431 140218909925184 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002020 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:14.222449 140218909925184 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:14.231336 140218909925184 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008879 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:14.231351 140218909925184 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:14.231358 140218909925184 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:14.231365 140218909925184 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:14.231499 140218909925184 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000126 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:14.231758 140218909925184 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.447746 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:14.234737 140218909925184 simple_timer.cpp:55] [rocprofv3] output generation ::     0.471615 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:14.234855 140218909925184 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.473122 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/device_inv_int/MI200/out/pmc_1/2523984_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 11/13][Approximate profiling time left: 4 seconds]...
[profiling] Current input file: tests/workloads/device_inv_int/MI200/perfmon/pmc_perf_SQ_INST_LEVEL_SMEM_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:01:15.788937 134166780530496 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.189802 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:01:15.789525 134166780530496 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:15.982305 134166780530496 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:01:16.080531 134166780530496 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.291006 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:16.102822 134166780530496 generateRocpd.cpp:583] writing SQL database for process 2523992 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:01:16.103602 134166780530496 generateRocpd.cpp:606] Opened result file: tests/workloads/device_inv_int/MI200/out/pmc_1/smc4124-25-mi210-3c48/2523992_results.db (UUID=0001fa76-3aac-7aac-beca-c05fa2d79d7f)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:16.185964 134166780530496 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008016 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:16.187092 134166780530496 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001113 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:16.189085 134166780530496 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001979 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:16.199482 134166780530496 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008402 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:16.531332 134166780530496 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.331836 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:16.533483 134166780530496 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002128 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:16.533501 134166780530496 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:16.542391 134166780530496 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008883 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:16.542407 134166780530496 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:16.542414 134166780530496 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:16.542421 134166780530496 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:16.542540 134166780530496 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000110 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:16.542791 134166780530496 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.439970 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:16.545803 134166780530496 simple_timer.cpp:55] [rocprofv3] output generation ::     0.463697 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:16.545916 134166780530496 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.465334 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/device_inv_int/MI200/out/pmc_1/2523992_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 12/13][Approximate profiling time left: 2 seconds]...
[profiling] Current input file: tests/workloads/device_inv_int/MI200/perfmon/pmc_perf_SQ_INST_LEVEL_VMEM_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:01:18.094885 128086445088576 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.195670 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:01:18.095461 128086445088576 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:18.294126 128086445088576 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:01:18.385189 128086445088576 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.289728 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:18.407730 128086445088576 generateRocpd.cpp:583] writing SQL database for process 2524002 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:01:18.408530 128086445088576 generateRocpd.cpp:606] Opened result file: tests/workloads/device_inv_int/MI200/out/pmc_1/smc4124-25-mi210-3c48/2524002_results.db (UUID=0001fa76-43a8-73a8-9f3b-23ee57bb8f95)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:18.492012 128086445088576 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008326 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:18.493242 128086445088576 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001214 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:18.495213 128086445088576 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001955 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:18.505671 128086445088576 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008458 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:19.027509 128086445088576 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.521822 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:19.029829 128086445088576 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002303 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:19.029847 128086445088576 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:19.038771 128086445088576 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008917 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:19.038785 128086445088576 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:19.038791 128086445088576 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:19.038797 128086445088576 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:19.038902 128086445088576 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000097 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:19.039126 128086445088576 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.631396 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:19.042151 128086445088576 simple_timer.cpp:55] [rocprofv3] output generation ::     0.655192 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:19.042272 128086445088576 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.657034 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/device_inv_int/MI200/out/pmc_1/2524002_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 13/13][Approximate profiling time left: 0 seconds]...
[profiling] Current input file: tests/workloads/device_inv_int/MI200/perfmon/pmc_perf_SQ_LEVEL_WAVES_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:01:20.579895 136344803286848 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.191344 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:01:20.580464 136344803286848 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:20.776603 136344803286848 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:01:20.868501 136344803286848 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.288037 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:20.891185 136344803286848 generateRocpd.cpp:583] writing SQL database for process 2524010 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:01:20.891950 136344803286848 generateRocpd.cpp:606] Opened result file: tests/workloads/device_inv_int/MI200/out/pmc_1/smc4124-25-mi210-3c48/2524010_results.db (UUID=0001fa76-4d61-7d61-8631-c9c48485c37d)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:20.970844 136344803286848 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007928 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:20.971908 136344803286848 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001048 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:20.973473 136344803286848 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001550 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:20.983670 136344803286848 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008279 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:21.299356 136344803286848 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.315671 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:21.301326 136344803286848 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.001950 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:21.301342 136344803286848 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:21.310795 136344803286848 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009445 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:21.310809 136344803286848 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:21.310816 136344803286848 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:21.310822 136344803286848 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:21.310931 136344803286848 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000101 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:21.311124 136344803286848 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.419939 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:21.314010 136344803286848 simple_timer.cpp:55] [rocprofv3] output generation ::     0.443676 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:01:21.314115 136344803286848 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.445566 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/device_inv_int/MI200/out/pmc_1/2524010_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
PC sampling data collection skipped as block 21 is not specified.
[roofline] Checking for roofline.csv in tests/workloads/device_inv_int/MI200
[roofline] Benchmark execution failed: Failed to load benchmark for devices -1: HIP Error 101. Skipping roofline.
