Guardado en:
| Autores principales: | Park, Jihoon, Choe, Jeongin, Kim, Dohyun, Kim, Jae-Joon |
|---|---|
| Formato: | Preprint |
| Publicado: |
2025
|
| Materias: | |
| Acceso en línea: | https://arxiv.org/abs/2501.06780 |
| Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
Ejemplares similares
CLAASIC: a Cortex-Inspired Hardware Accelerator
por: Puente, Valentin, et al.
Publicado: (2016)
por: Puente, Valentin, et al.
Publicado: (2016)
From GPUs to RRAMs: Distributed In-Memory Primal-Dual Hybrid Gradient Method for Solving Large-Scale Linear Optimization Problem
por: Vo, Huynh Q. N., et al.
Publicado: (2025)
por: Vo, Huynh Q. N., et al.
Publicado: (2025)
Open Challenges for a Production-ready Cloud Environment on top of RISC-V hardware
por: Call, Aaron, et al.
Publicado: (2025)
por: Call, Aaron, et al.
Publicado: (2025)
DFabric: Scaling Out Data Parallel Applications with CXL-Ethernet Hybrid Interconnects
por: Zhang, Xu, et al.
Publicado: (2024)
por: Zhang, Xu, et al.
Publicado: (2024)
Managed-Retention Memory: A New Class of Memory for the AI Era
por: Legtchenko, Sergey, et al.
Publicado: (2025)
por: Legtchenko, Sergey, et al.
Publicado: (2025)
Efficient Optimization Accelerator Framework for Multistate Ising Problems
por: Garg, Chirag, et al.
Publicado: (2025)
por: Garg, Chirag, et al.
Publicado: (2025)
Architecting Distributed Quantum Computers: Design Insights from Resource Estimation
por: Filippov, Dmitry, et al.
Publicado: (2025)
por: Filippov, Dmitry, et al.
Publicado: (2025)
Wattlytics: A Web Platform for Co-Optimizing Performance, Energy, and TCO in HPC Clusters
por: Afzal, Ayesha, et al.
Publicado: (2026)
por: Afzal, Ayesha, et al.
Publicado: (2026)
TreeVQA: A Tree-Structured Execution Framework for Shot Reduction in Variational Quantum Algorithms
por: Hou, Yuewen, et al.
Publicado: (2025)
por: Hou, Yuewen, et al.
Publicado: (2025)
Harnessing the Full Potential of RRAMs through Scalable and Distributed In-Memory Computing with Integrated Error Correction
por: Vo, Huynh Q. N., et al.
Publicado: (2025)
por: Vo, Huynh Q. N., et al.
Publicado: (2025)
ForgetMeNot: Understanding and Modeling the Impact of Forever Chemicals Toward Sustainable Large-Scale Computing
por: Roy, Rohan Basu, et al.
Publicado: (2025)
por: Roy, Rohan Basu, et al.
Publicado: (2025)
Carbon Connect: An Ecosystem for Sustainable Computing
por: Lee, Benjamin C., et al.
Publicado: (2024)
por: Lee, Benjamin C., et al.
Publicado: (2024)
Reference Architecture of a Quantum-Centric Supercomputer
por: Seelam, Seetharami, et al.
Publicado: (2026)
por: Seelam, Seetharami, et al.
Publicado: (2026)
PIM-AI: A Novel Architecture for High-Efficiency LLM Inference
por: Ortega, Cristobal, et al.
Publicado: (2024)
por: Ortega, Cristobal, et al.
Publicado: (2024)
Scaling Intelligence: Designing Data Centers for Next-Gen Language Models
por: Tithi, Jesmin Jahan, et al.
Publicado: (2025)
por: Tithi, Jesmin Jahan, et al.
Publicado: (2025)
Transforming the Hybrid Cloud for Emerging AI Workloads
por: Chen, Deming, et al.
Publicado: (2024)
por: Chen, Deming, et al.
Publicado: (2024)
WaferLLM: Large Language Model Inference at Wafer Scale
por: He, Congjie, et al.
Publicado: (2025)
por: He, Congjie, et al.
Publicado: (2025)
PASS: An Asynchronous Probabilistic Processor for Next Generation Intelligence
por: Patel, Saavan, et al.
Publicado: (2024)
por: Patel, Saavan, et al.
Publicado: (2024)
Experience Deploying Containerized GenAI Services at an HPC Center
por: Beltre, Angel M., et al.
Publicado: (2025)
por: Beltre, Angel M., et al.
Publicado: (2025)
Highly Versatile FPGA-Implemented Cyber Coherent Ising Machine
por: Aonishi, Toru, et al.
Publicado: (2024)
por: Aonishi, Toru, et al.
Publicado: (2024)
PID-Comm: A Fast and Flexible Collective Communication Framework for Commodity Processing-in-DIMM Devices
por: Noh, Si Ung, et al.
Publicado: (2024)
por: Noh, Si Ung, et al.
Publicado: (2024)
A System Level Compiler for Massively-Parallel, Spatial, Dataflow Architectures
por: Van Essendelft, Dirk, et al.
Publicado: (2025)
por: Van Essendelft, Dirk, et al.
Publicado: (2025)
Flex-PE: Flexible and SIMD Multi-Precision Processing Element for AI Workloads
por: Lokhande, Mukul, et al.
Publicado: (2024)
por: Lokhande, Mukul, et al.
Publicado: (2024)
Accelerating Triangle Counting with Real Processing-in-Memory Systems
por: Asquini, Lorenzo, et al.
Publicado: (2025)
por: Asquini, Lorenzo, et al.
Publicado: (2025)
Balanced Data Placement for GEMV Acceleration with Processing-In-Memory
por: Ibrahim, Mohamed Assem, et al.
Publicado: (2024)
por: Ibrahim, Mohamed Assem, et al.
Publicado: (2024)
DiP: A Scalable, Energy-Efficient Systolic Array for Matrix Multiplication Acceleration
por: Abdelmaksoud, Ahmed J., et al.
Publicado: (2024)
por: Abdelmaksoud, Ahmed J., et al.
Publicado: (2024)
NMP-PaK: Near-Memory Processing Acceleration of Scalable De Novo Genome Assembly
por: Kim, Heewoo, et al.
Publicado: (2025)
por: Kim, Heewoo, et al.
Publicado: (2025)
Optimal Multi-Constrained Workflow Scheduling for Cyber-Physical Systems in the Edge-Cloud Continuum
por: Kouloumpris, Andreas, et al.
Publicado: (2025)
por: Kouloumpris, Andreas, et al.
Publicado: (2025)
CMDS: Cross-layer Dataflow Optimization for DNN Accelerators Exploiting Multi-bank Memories
por: Shi, Man, et al.
Publicado: (2024)
por: Shi, Man, et al.
Publicado: (2024)
Atomique: A Quantum Compiler for Reconfigurable Neutral Atom Arrays
por: Wang, Hanrui, et al.
Publicado: (2023)
por: Wang, Hanrui, et al.
Publicado: (2023)
RevaMp3D: Architecting the Processor Core and Cache Hierarchy for Systems with Monolithically-Integrated Logic and Memory
por: Ghiasi, Nika Mansouri, et al.
Publicado: (2022)
por: Ghiasi, Nika Mansouri, et al.
Publicado: (2022)
Workload-Aware Hardware Accelerator Mining for Distributed Deep Learning Training
por: Adnan, Muhammad, et al.
Publicado: (2024)
por: Adnan, Muhammad, et al.
Publicado: (2024)
KiSS: A Novel Container Size-Aware Memory Management Policy for Serverless in Edge-Cloud Continuum
por: Gupta, Sabyasachi, et al.
Publicado: (2025)
por: Gupta, Sabyasachi, et al.
Publicado: (2025)
DeepStack: Scalable and Accurate Design Space Exploration for Distributed 3D-Stacked AI Accelerators
por: Mo, Zhiwen, et al.
Publicado: (2026)
por: Mo, Zhiwen, et al.
Publicado: (2026)
DP-HLS: A High-Level Synthesis Framework for Accelerating Dynamic Programming Algorithms in Bioinformatics
por: Cao, Yingqi, et al.
Publicado: (2024)
por: Cao, Yingqi, et al.
Publicado: (2024)
Memory-Centric Computing: Solving Computing's Memory Problem
por: Mutlu, Onur, et al.
Publicado: (2025)
por: Mutlu, Onur, et al.
Publicado: (2025)
cMPI: Using CXL Memory Sharing for MPI One-Sided and Two-Sided Inter-Node Communications
por: Wang, Xi, et al.
Publicado: (2025)
por: Wang, Xi, et al.
Publicado: (2025)
ZipFlow: a Compiler-based Framework to Unleash Compressed Data Movement for Modern GPUs
por: Yeo, Gwangoo, et al.
Publicado: (2026)
por: Yeo, Gwangoo, et al.
Publicado: (2026)
Efficient and Scalable Architecture for Multiple-chip Implementation of Simulated Bifurcation Machines
por: Kashimata, Tomoya, et al.
Publicado: (2023)
por: Kashimata, Tomoya, et al.
Publicado: (2023)
Analyzing a Two-Tier Disaggregated Memory Protection Scheme Based on Memory Replication
por: Volos, Haris, et al.
Publicado: (2025)
por: Volos, Haris, et al.
Publicado: (2025)
Ejemplares similares
-
CLAASIC: a Cortex-Inspired Hardware Accelerator
por: Puente, Valentin, et al.
Publicado: (2016) -
From GPUs to RRAMs: Distributed In-Memory Primal-Dual Hybrid Gradient Method for Solving Large-Scale Linear Optimization Problem
por: Vo, Huynh Q. N., et al.
Publicado: (2025) -
Open Challenges for a Production-ready Cloud Environment on top of RISC-V hardware
por: Call, Aaron, et al.
Publicado: (2025) -
DFabric: Scaling Out Data Parallel Applications with CXL-Ethernet Hybrid Interconnects
por: Zhang, Xu, et al.
Publicado: (2024) -
Managed-Retention Memory: A New Class of Memory for the AI Era
por: Legtchenko, Sergey, et al.
Publicado: (2025)