scripts/bench_cpu_mem_bw
script to run cpu_mem_bw and process its results
scripts/roofline
script and documentation for drawing a roofline model graph.
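
As a reminder of what a roofline graph plots, here is a minimal sketch of the standard roofline formula. It is not taken from scripts/roofline; the function and parameter names (`roofline_gflops`, `ridge_point`, `peak_gflops`, `peak_bw_gbs`, `flops_per_byte`) are hypothetical and chosen only for illustration.

```cpp
#include <algorithm>

// Attainable performance (GFLOP/s) under the roofline model:
// the lower of the compute ceiling and the bandwidth ceiling.
// All names here are illustrative assumptions, not mperf APIs.
double roofline_gflops(double peak_gflops, double peak_bw_gbs,
                       double flops_per_byte) {
    return std::min(peak_gflops, flops_per_byte * peak_bw_gbs);
}

// Arithmetic intensity (FLOP/byte) at which a kernel stops being
// memory-bound and becomes compute-bound.
double ridge_point(double peak_gflops, double peak_bw_gbs) {
    return peak_gflops / peak_bw_gbs;
}
```

With a peak of 100 GFLOP/s and 20 GB/s of bandwidth, a kernel at 2 FLOP/byte is capped at 40 GFLOP/s by memory, while any kernel above the ridge point of 5 FLOP/byte is capped at 100 GFLOP/s by compute. The peak values themselves would come from measurements such as those produced by the benchmarks listed here.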
cpu_info_test.cpp
get CPU information (e.g. number of big cores, frequency)
cpu_inst_gflops_latency.cpp
measure instruction throughput/latency
cpu_mem_bw.cpp
measure CPU memory-hierarchy bandwidth/latency of micro-kernels
cpu_stream.cpp
mperf version of John McCalpin's STREAM benchmark
cpu_spec_dram_bw.cpp
measure DRAM bandwidth
cpu_pmu_transpose.cpp
collect CPU PMU event data
cpu_tma_transpose.cpp
ARM TMA example
gpu_march_probe.cpp
get GPU micro-architecture parameters (number of registers, warp size, cache line size)
gpu_spec_dram_bw.cpp
measure GPU DRAM bandwidth
gpu_mem_bw.cpp
measure bandwidth of GPU multi-level caches
gpu_adreno_pmu_test.cpp
collect Adreno GPU PMU event data
gpu_mali_pmu_test.cpp
collect Mali GPU PMU event data
gpu_inst_gflops_latency.cpp
measure GPU/OpenCL instruction throughput/latency
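
To give a feel for what a STREAM-style bandwidth measurement involves, here is a minimal single-threaded sketch of the Triad kernel from McCalpin's STREAM benchmark. It is not the code in cpu_stream.cpp; the function name `triad_bandwidth_gbs` is hypothetical, and the sketch assumes the three arrays have the same length and are much larger than the last-level cache.

```cpp
#include <chrono>
#include <cstddef>
#include <vector>

// Runs the STREAM Triad kernel a[i] = b[i] + scalar * c[i] once and
// returns the measured bandwidth in GB/s.
// Assumption: a, b and c all have the same number of elements.
double triad_bandwidth_gbs(std::vector<double>& a,
                           const std::vector<double>& b,
                           const std::vector<double>& c,
                           double scalar) {
    const std::size_t n = a.size();
    const auto start = std::chrono::steady_clock::now();
    for (std::size_t i = 0; i < n; ++i) {
        a[i] = b[i] + scalar * c[i];
    }
    const auto stop = std::chrono::steady_clock::now();
    const double seconds =
        std::chrono::duration<double>(stop - start).count();
    // Triad moves three arrays: two reads and one write per element.
    const double bytes = 3.0 * sizeof(double) * static_cast<double>(n);
    return bytes / seconds / 1e9;
}
```

A real benchmark would repeat the kernel several times, discard the first run, and report the best time; a single pass like this one is only meant to show how the byte count and the timing combine into a bandwidth figure.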
cpu_pmu_analysis/
- case studies on the ARM CPU platform; more will be added.
mali_pmu_analysis/
- case studies on the Mali GPU platform; more will be added.
adreno_pmu_analysis/
- case studies on the Adreno GPU platform; more will be added.