New sparse matrix storage format to improve the performance of total SPMV time

dc.contributor.authorBayyapu, B.
dc.contributor.authorRaghavendra, S.R.
dc.contributor.authorGuddeti, G.
dc.date.accessioned2026-02-05T09:35:05Z
dc.date.issued2012
dc.description.abstractGraphics Processing Units (GPUs) are massive data parallel processors. High performance comes only at the cost of identifying data parallelism in the applications while using data parallel processors like GPU. This is an easy effort for applications that have regular memory access and high computation intensity. GPUs are equally attractive for sparse matrix vector multiplications (SPMV for short) that have irregular memory access. SPMV is an important computation in most of the scientific and engineering applications and scaling the performance, bandwidth utilization and compute intensity (ratio of computation to the data access) of SPMV computation is a priority in both academia and industry. There are various data structures and access patterns proposed for sparse matrix representation on GPUs and optimizations and improvements on these data structures is a continuous effort. This paper proposes a new format for the sparse matrix representation that reduces the data organization time and the memory transfer time from CPU to GPU for the memory bound SPMV computation. The BLSI (Bit Level Single Indexing) sparse matrix representation is up to 204% faster than COO (Co-ordinate), 104% faster than CSR (Compressed Sparse Row) and 217% faster than HYB (Hybrid) formats in memory transfer time from CPU to GPU. The proposed sparse matrix format is implemented in CUDA-C on CUDA (Compute Unified Device Architecture) supported NVIDIA graphics cards. © 2012 SCPE.
dc.identifier.citationScalable Computing, 2012, 13, 2, pp. 159-171
dc.identifier.issn18951767
dc.identifier.urihttps://idr.nitk.ac.in/handle/123456789/26890
dc.subjectCompute intensity
dc.subjectCUDA-C
dc.subjectData parallelism
dc.subjectGraphics Processing Unit
dc.subjectNvidia graphics
dc.subjectSparse matrices
dc.subjectTransfer time
dc.subjectComputer graphics
dc.subjectData structures
dc.subjectDigital storage
dc.subjectParallel architectures
dc.subjectParallel processing systems
dc.subjectProgram processors
dc.subjectPsychophysiology
dc.subjectComputer graphics equipment
dc.titleNew sparse matrix storage format to improve the performance of total SPMV time

Files

Collections