:: Library Catalog

Bibliographic Details
Main Authors:	Shao, Kunming; Zhao, Liang; Yu, Jiangnan; Liao, Zhipeng; Wang, Xiaomeng; Zou, Yi; Cheng, Tim Kwang-Ting; Tsui, Chi-Ying
Format:	Preprint
Published:	2026
Subjects:	Hardware Architecture; Machine Learning
Online Access:	https://arxiv.org/abs/2601.06724

Similar Items

Balancing FP8 Computation Accuracy and Efficiency on Digital CIM via Shift-Aware On-the-fly Aligned-Mantissa Bitwidth Prediction
by: Zhao, Liang, et al.
Published: (2026)

DIRC-RAG: Accelerating Edge RAG with Robust High-Density and High-Loading-Bandwidth Digital In-ReRAM Computation
by: Shao, Kunming, et al.
Published: (2025)

A Memory-Efficient Retrieval Architecture for RAG-Enabled Wearable Medical LLMs-Agents
by: Liao, Zhipeng, et al.
Published: (2025)

A Flexible Precision Scaling Deep Neural Network Accelerator with Efficient Weight Combination
by: Zhao, Liang, et al.
Published: (2025)

SynDCIM: A Performance-Aware Digital Computing-in-Memory Compiler with Multi-Spec-Oriented Subcircuit Synthesis
by: Shao, Kunming, et al.
Published: (2024)

Be CIM or Be Memory: A Dual-mode-aware DNN Compiler for CIM Accelerators
by: Zhao, Shixin, et al.
Published: (2025)

RCW-CIM: A Digital CIM-based LLM Accelerator with Read-Compute/Write
by: Guo, Yan-Cheng, et al.
Published: (2026)

3DGauCIM: Accelerating Static/Dynamic 3D Gaussian Splatting via Digital CIM for High Frame Rate Real-Time Edge Rendering
by: Huang, Wei-Hsing, et al.
Published: (2025)

EdgeCIM: A Hardware-Software Co-Design for CIM-Based Acceleration of Small Language Models
by: Bazzi, Jinane, et al.
Published: (2026)

AccelCIM: Systematic Dataflow Exploration for SRAM Compute-in-Memory Accelerator
by: Xue, Chenhao, et al.
Published: (2026)

Unicorn-CIM: Uncovering the Vulnerability and Improving the Resilience of High-Precision Compute-in-Memory
by: Li, Qiufeng, et al.
Published: (2025)

FusionCIM: Accelerating LLM Inference with Fusion-Driven Computing-in-Memory Architecture
by: Xuan, Zihao, et al.
Published: (2026)

Voxel-CIM: An Efficient Compute-in-Memory Accelerator for Voxel-based Point Cloud Neural Networks
by: Lin, Xipeng, et al.
Published: (2024)

CIMR-V: An End-to-End SRAM-based CIM Accelerator with RISC-V for AI Edge Device
by: Guo, Yan-Cheng, et al.
Published: (2025)

CIMFlow: An Integrated Framework for Systematic Design and Evaluation of Digital CIM Architectures
by: Qi, Yingjie, et al.
Published: (2025)

Acore-CIM: build accurate and reliable mixed-signal CIM cores with RISC-V controlled self-calibration
by: Numan, Omar, et al.
Published: (2025)

CIM-Tuner: Balancing the Compute and Storage Capacity of SRAM-CIM Accelerator via Hardware-mapping Co-exploration
by: Chen, Jinwu, et al.
Published: (2026)

SEGA-DCIM: Design Space Exploration-Guided Automatic Digital CIM Compiler with Multiple Precision Support
by: Diao, Haikang, et al.
Published: (2025)

GEM3D CIM General Purpose Matrix Computation Using 3D Integrated SRAM eDRAM Hybrid Compute In Memory on Memory Architecture
by: Chakraborty, Subhradip, et al.
Published: (2026)

CLSA-CIM: A Cross-Layer Scheduling Approach for Computing-in-Memory Architectures
by: Pelke, Rebecca, et al.
Published: (2024)

StreamDCIM: A Tile-based Streaming Digital CIM Accelerator with Mixed-stationary Cross-forwarding Dataflow for Multimodal Transformer
by: Qin, Shantian, et al.
Published: (2025)

ASDR: Exploiting Adaptive Sampling and Data Reuse for CIM-based Instant Neural Rendering
by: Liu, Fangxin, et al.
Published: (2025)

CIM-MLC: A Multi-level Compilation Stack for Computing-In-Memory Accelerators
by: Qu, Songyun, et al.
Published: (2024)

From Quarter to All: Accelerating Speculative LLM Decoding via Floating-Point Exponent Remapping and Parameter Sharing
by: Zhao, Yushu, et al.
Published: (2025)

High-Level Surface Code Decoding via Parallel FFNNs on CIM Platforms
by: Wang, Hao, et al.
Published: (2024)

CIMinus: Empowering Sparse DNN Workloads Modeling and Exploration on SRAM-based CIM Architectures
by: Qi, Yingjie, et al.
Published: (2025)

CIMple: Standard-cell SRAM-based CIM with LUT-based split softmax for attention acceleration
by: Ahn, Bas, et al.
Published: (2026)

Ouroboros: Wafer-Scale SRAM CIM with Token-Grained Pipelining for Large Language Model Inference
by: Liu, Yiqi, et al.
Published: (2026)

Dynamic neural network with memristive CIM and CAM for 2D and 3D vision
by: Zhang, Yue, et al.
Published: (2024)

A Reconfigurable Computing In-Memory Macro with Charge-sharing-based Weighted Accumulator
by: Yang, Junyi, et al.
Published: (2026)

MGS: Markov Greedy Sums for Accurate Low-Bitwidth Floating-Point Accumulation
by: Natesh, Vikas, et al.
Published: (2025)

A 28nm 1.80Mb/mm2 Digital/Analog Hybrid SRAM-CIM Macro Using 2D-Weighted Capacitor Array for Complex Number Mac Operations
by: Konno, Shota, et al.
Published: (2025)

31.1 A 14.08-to-135.69Token/s ReRAM-on-Logic Stacked Outlier-Free Large-Language-Model Accelerator with Block-Clustered Weight-Compression and Adaptive Parallel-Speculative-Decoding
by: Dong, Pingcheng, et al.
Published: (2026)

Computing-In-Memory Aware Model Adaption For Edge Devices
by: Lin, Ming-Han, et al.
Published: (2025)

Flexible Bit-Truncation Memory for Approximate Applications on the Edge
by: Oswald, William, et al.
Published: (2025)

CiMLoop: A Flexible, Accurate, and Fast Compute-In-Memory Modeling Tool
by: Andrulis, Tanner, et al.
Published: (2024)

H3DFact: Heterogeneous 3D Integrated CIM for Factorization with Holographic Perceptual Representations
by: Wan, Zishen, et al.
Published: (2024)

NASiC: 3D NAND-based CAM-Selected Multibit CIM Architecture for Efficient On-Device Mixture-of-Experts LLM Inference
by: Xu, Weikai, et al.
Published: (2026)

UniCAIM: A Unified CAM/CIM Architecture with Static-Dynamic KV Cache Pruning for Efficient Long-Context LLM Inference
by: Xu, Weikai, et al.
Published: (2025)

A$^3$PIM: An Automated, Analytic and Accurate Processing-in-Memory Offloader
by: Jiang, Qingcai, et al.
Published: (2024)