Cache analysis and software optimizations for faster on-chip network simulations

dc.contributor.authorParane, K.
dc.contributor.authorPrabhu, Prasad, B.M.
dc.contributor.authorTalawar, B.
dc.date.accessioned2020-03-30T10:02:36Z
dc.date.available2020-03-30T10:02:36Z
dc.date.issued2016
dc.description.abstractFast simulations are critical in reducing time to market in CMPs and SoCs. Several simulators have been used to evaluate the performance and power consumed by Network-on-Chips. Researchers and designers rely upon these simulators for design space exploration of NoC architectures. Our experiments show that simulating large NoC topologies take hours to several days for completion. To speedup the simulations, it is necessary to investigate and optimize the hotspots in simulator source code. Among several simulators available, we choose Booksim2.0, as it is being extensively used in the NoC community. In this paper, we analyze the cache and memory system behaviour of Booksim2.0 to accurately monitor input dependent performance bottlenecks. Our measurements show that cache and memory usage patterns vary widely based on the input parameters given to Booksim2.0. Based on these measurements, the cache configuration having least misses has been identified. We also employ thread parallelization and vectorization to improve the overall performance of Booksim2.0. The OpenMP programming model and SIMD are used for parallelizing and vectorizing the more time-consuming portions of Booksim2.0. Speedups of 2.93� and 3.97� were observed for the Mesh topology with 30 � 30 network size by employing thread parallelization and vectorization respectively. � 2016 IEEE.en_US
dc.identifier.citation11th International Conference on Industrial and Information Systems, ICIIS 2016 - Conference Proceedings, 2016, Vol.2018-January, , pp.83-88en_US
dc.identifier.urihttps://idr.nitk.ac.in/handle/123456789/7657
dc.titleCache analysis and software optimizations for faster on-chip network simulationsen_US
dc.typeBook chapteren_US

Files