Fine-Grained Memory Profiling of GPGPU Kernels

Buelow, Max von; Guthe, Stefan; Fellner, Dieter W.

dc.contributor.author	Buelow, Max von	en_US
dc.contributor.author	Guthe, Stefan	en_US
dc.contributor.author	Fellner, Dieter W.	en_US
dc.contributor.editor	Umetani, Nobuyuki	en_US
dc.contributor.editor	Wojtan, Chris	en_US
dc.contributor.editor	Vouga, Etienne	en_US
dc.date.accessioned	2022-10-04T06:39:53Z
dc.date.available	2022-10-04T06:39:53Z
dc.date.issued	2022
dc.identifier.issn	1467-8659
dc.identifier.uri	https://doi.org/10.1111/cgf.14671
dc.identifier.uri	https://diglib.eg.org:443/handle/10.1111/cgf14671
dc.description.abstract	Memory performance is a crucial bottleneck in many GPGPU applications, making optimizations for hardware and software mandatory. While hardware vendors already use highly efficient caching architectures, software engineers usually have to organize their data accordingly in order to efficiently make use of these, requiring deep knowledge of the actual hardware. In this paper we present a novel technique for fine-grained memory profiling that simulates the whole pipeline of memory flow and finally accumulates profiling values in a way that the user retains information about the potential region in the GPU program by showing these values separately for each allocation. Our memory simulator turns out to outperform state-of-theart memory models of NVIDIA architectures by a magnitude of 2.4 for the L1 cache and 1.3 for the L2 cache, in terms of accuracy. Additionally, we find our technique of fine grained memory profiling a useful tool for memory optimizations, which we successfully show in case of ray tracing and machine learning applications.	en_US
dc.publisher	The Eurographics Association and John Wiley & Sons Ltd.	en_US
dc.subject	CCS Concepts: Hardware → Simulation and emulation; Computing methodologies → Graphics processors; Theory of computation → Program analysis
dc.subject	Hardware → Simulation and emulation
dc.subject	Computing methodologies → Graphics processors
dc.subject	Theory of computation → Program analysis
dc.title	Fine-Grained Memory Profiling of GPGPU Kernels	en_US
dc.description.seriesinformation	Computer Graphics Forum
dc.description.sectionheaders	Fast Geometric Computation
dc.description.volume	41
dc.description.number	7
dc.identifier.doi	10.1111/cgf.14671
dc.identifier.pages	227-235
dc.identifier.pages	9 pages

Files in this item

Name:: v41i7pp227-235.pdf
Size:: 633.9Kb
Format:: PDF

View/Open

Name:: paper1060_supplemental_material.pdf
Size:: 143.3Kb
Format:: PDF

View/Open

Name:: cgf14671_v41i7pp227-235.pdf
Size:: 707.1Kb
Format:: PDF
Description:: ProjektDeal version

View/Open

This item appears in the following Collection(s)

41-Issue 7
Pacific Graphics 2022 - Symposium Proceedings

Show simple item record