[ESL] Enhancing Matrix Multiplication with a Monolithic 3D Based Scrat…

SMRL 0 1,133 2020.05.11 15:48

Cong Thuan Do, Jeong Hwan Choi, Young Seo Lee, Cheol Hong Kim, and Sung Woo Chung, "Enhancing Matrix Multiplication with a Monolithic 3D Based Scratchpad Memory", IEEE Embedded Systems Letters, vol. 13, no. 2, pp. 57-60, June 2021.

Abstract

Convolutional neural networks (CNNs) are one of the most popular machine learning algorithms. The convolutional layers, which account for most execution time of CNNs, are implemented with matrix multiplication because the convolution operation performs dot products between filters and local regions of the input. On the other hand, GPUs with thousands of cores were proven to significantly accelerate matrix multiplication, compared to CPUs with a limited number of cores, especially for large matrices. However, the current memory architecture allows only one row access at a time so that multiple accesses are necessary to read the column data of the second matrix, thus slowing down matrix multiplication. In this study, we adopt the monolithic 3D integration for the GPU scratchpad memory, called M3D SPM, to enhance matrix multiplication. The M3D SPM allows one access to read the column data of the second matrix, similar to the case of the first matrix. The simulation results show that our M3D SPM improves the system performance by 46.3% for the 32×32 matrix multiplication, over the conventional 2D SPM where the column data of the second matrix are read sequentially.

Comments

로그인한 회원만 댓글 등록이 가능합니다.

번호	제목	글쓴이	날짜	조회
42	[TETC] Near-Memory Computing with Compressed Embedding Table…	SMRL	12.20	240
41	[DATE] Twin ECC: A Data Duplication Based ECC for Strong DRA…	SMRL	11.16	618
40	[DATE] Stealth ECC: A Data-Width Aware Adaptive ECC Scheme f…	SMRL	11.11	798
39	[ESL] IDRA: An In-storage Data Reorganization Accelerator fo…	SMRL	03.10	1004
38	[MICRO] On-demand Mobile CPU Cooling with Thin-Film Thermoel…	SMRL	02.22	970
37	[ESL] Quant-PIM: An Energy-efficient Processing-in-memory Ac…	SMRL	01.07	960
열람중	[ESL] Enhancing Matrix Multiplication with a Monolithic 3D B…	SMRL	05.11	1134
35	[TC] An Adaptive Thermal Management Framework for Heterogene…	SMRL	01.27	1157
34	[ICCD] A High-Performance Processing-in-Memory Accelerator f…	SMRL	09.23	1767
33	[TC] Signal Strength-aware Adaptive Offloading with Local Im…	SMRL	09.02	1412
32	[ISLPED] Exploring the Relation between Monolithic 3D L1 GPU…	SMRL	05.07	1914
31	[ISLPED] Temperature-aware Adaptive VM Allocation in Heterog…	SMRL	05.07	1714
30	[TETC] Quantifying the Impact of Monolithic 3D (M3D) Integra…	SMRL	01.26	1986
29	[TPDS] A Survey on Recent OS-level Energy Management Techniq…	SMRL	04.02	2941
28	[TC] Enhancing Energy Efficiency of Multimedia Applications …	SMRL	05.24	3759

Category

Publication Highlights

[ESL] Enhancing Matrix Multiplication with a Monolithic 3D Based Scrat…

Comments