Saved in:
Bibliographic Details
Main Authors: Xu, Zhicheng, Liu, Che-Kai, Li, Chao, Mao, Ruibin, Yang, Jianyi, Kämpfe, Thomas, Imani, Mohsen, Li, Can, Zhuo, Cheng, Yin, Xunzhao
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2401.05708
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866929206678519808
author Xu, Zhicheng
Liu, Che-Kai
Li, Chao
Mao, Ruibin
Yang, Jianyi
Kämpfe, Thomas
Imani, Mohsen
Li, Can
Zhuo, Cheng
Yin, Xunzhao
author_facet Xu, Zhicheng
Liu, Che-Kai
Li, Chao
Mao, Ruibin
Yang, Jianyi
Kämpfe, Thomas
Imani, Mohsen
Li, Can
Zhuo, Cheng
Yin, Xunzhao
contents Rapid advancements in artificial intelligence have given rise to transformative models, profoundly impacting our lives. These models demand massive volumes of data to operate effectively, exacerbating the data-transfer bottleneck inherent in the conventional von-Neumann architecture. Compute-in-memory (CIM), a novel computing paradigm, tackles these issues by seamlessly embedding in-memory search functions, thereby obviating the need for data transfers. However, existing non-volatile memory (NVM)-based accelerators are application specific. During the similarity based associative search operation, they only support a single, specific distance metric, such as Hamming, Manhattan, or Euclidean distance in measuring the query against the stored data, calling for reconfigurable in-memory solutions adaptable to various applications. To overcome such a limitation, in this paper, we present FeReX, a reconfigurable associative memory (AM) that accommodates various distance metrics including Hamming, Manhattan, and Euclidean distances. Leveraging multi-bit ferroelectric field-effect transistors (FeFETs) as the proxy and a hardware-software co-design approach, we introduce a constrained satisfaction problem (CSP)-based method to automate AM search input voltage and stored voltage configurations for different distance based search functions. Device-circuit co-simulations first validate the effectiveness of the proposed FeReX methodology for reconfigurable search distance functions. Then, we benchmark FeReX in the context of k-nearest neighbor (KNN) and hyperdimensional computing (HDC), which highlights the robustness of FeReX and demonstrates up to 250x speedup and 10^4 energy savings compared with GPU.
format Preprint
id arxiv_https___arxiv_org_abs_2401_05708
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle FeReX: A Reconfigurable Design of Multi-bit Ferroelectric Compute-in-Memory for Nearest Neighbor Search
Xu, Zhicheng
Liu, Che-Kai
Li, Chao
Mao, Ruibin
Yang, Jianyi
Kämpfe, Thomas
Imani, Mohsen
Li, Can
Zhuo, Cheng
Yin, Xunzhao
Emerging Technologies
Rapid advancements in artificial intelligence have given rise to transformative models, profoundly impacting our lives. These models demand massive volumes of data to operate effectively, exacerbating the data-transfer bottleneck inherent in the conventional von-Neumann architecture. Compute-in-memory (CIM), a novel computing paradigm, tackles these issues by seamlessly embedding in-memory search functions, thereby obviating the need for data transfers. However, existing non-volatile memory (NVM)-based accelerators are application specific. During the similarity based associative search operation, they only support a single, specific distance metric, such as Hamming, Manhattan, or Euclidean distance in measuring the query against the stored data, calling for reconfigurable in-memory solutions adaptable to various applications. To overcome such a limitation, in this paper, we present FeReX, a reconfigurable associative memory (AM) that accommodates various distance metrics including Hamming, Manhattan, and Euclidean distances. Leveraging multi-bit ferroelectric field-effect transistors (FeFETs) as the proxy and a hardware-software co-design approach, we introduce a constrained satisfaction problem (CSP)-based method to automate AM search input voltage and stored voltage configurations for different distance based search functions. Device-circuit co-simulations first validate the effectiveness of the proposed FeReX methodology for reconfigurable search distance functions. Then, we benchmark FeReX in the context of k-nearest neighbor (KNN) and hyperdimensional computing (HDC), which highlights the robustness of FeReX and demonstrates up to 250x speedup and 10^4 energy savings compared with GPU.
title FeReX: A Reconfigurable Design of Multi-bit Ferroelectric Compute-in-Memory for Nearest Neighbor Search
topic Emerging Technologies
url https://arxiv.org/abs/2401.05708