Paper
7 August 2024 Value evaluation and automatic implementation of software prefetching in general program optimization
Dongsheng Qi, Naijie Gu, Junjie Su
Author Affiliations +
Proceedings Volume 13229, Seventh International Conference on Advanced Electronic Materials, Computers, and Software Engineering (AEMCSE 2024); 1322926 (2024) https://doi.org/10.1117/12.3038147
Event: Seventh International Conference on Advanced Electronic Materials, Computers, and Software Engineering (AEMCSE 2024), 2024, Nanchang, China
Abstract
Modern CPU’s memory data access speed is much lower than the calculation speed. Data visits are often a bottleneck of program performance. Cache has a higher data access speed compared to memory, which can load data in memory in advance to improve the efficiency of CPU access to memory data. The cache is usually a multi-level structure. The closer to the CPU level, the faster the cache visits. The cache miss is that the data which CPU needs to be visited is not in the current cache. The processor needs to read data from the next level of cache or memory. The occurrence of cache misses will reduce the memory performance of the CPU. Prefetch can load the data visited by the processor to the cache in advance, reduce the occurrence of a cache miss, improve the efficiency of access to access, and then improve program performance. Related scholars have studied key technologies in different scenarios and on key technologies such as prefetch data, prefetch distance, and prefetch methods. However, related studies are mainly aimed at specific scenarios or access modes, and less prefetch research on general programs. For the prefetch optimization problem of general programs, this article proposes value evaluation model and automatic implementation method of software prefetching in general program optimization. The model can automatically analyze the data that needs to be prefetch and the method can automatically insert the data prefetch into the program. Based on the value evaluation model and automatic implementation method, this article implements the automatic software prefetch tool SP-Frame under the LLVM compiler. In the Crono Benchmark, the SP-Frame tool can reduce 54 %of the L1 cache miss on average, and the average speedup has achieved 23%.
(2024) Published by SPIE. Downloading of the abstract is permitted for personal use only.
Dongsheng Qi, Naijie Gu, and Junjie Su "Value evaluation and automatic implementation of software prefetching in general program optimization", Proc. SPIE 13229, Seventh International Conference on Advanced Electronic Materials, Computers, and Software Engineering (AEMCSE 2024), 1322926 (7 August 2024); https://doi.org/10.1117/12.3038147
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Mathematical optimization

Data modeling

Logic

Performance modeling

Analytical research

Automation

Reflection

Back to Top