:: Library Catalog

Imagen de Portada

Guardado en:

Detalles Bibliográficos
Autores principales:	Park, Jihoon, Choe, Jeongin, Kim, Dohyun, Kim, Jae-Joon
Formato:	Preprint
Publicado:	2025
Materias:	Hardware Architecture Distributed, Parallel, and Cluster Computing Emerging Technologies Machine Learning Programming Languages
Acceso en línea:	https://arxiv.org/abs/2501.06780
Etiquetas:	Agregar Etiqueta Sin Etiquetas, Sea el primero en etiquetar este registro!

Ejemplares similares

CLAASIC: a Cortex-Inspired Hardware Accelerator
por: Puente, Valentin, et al.
Publicado: (2016)

From GPUs to RRAMs: Distributed In-Memory Primal-Dual Hybrid Gradient Method for Solving Large-Scale Linear Optimization Problem
por: Vo, Huynh Q. N., et al.
Publicado: (2025)

Open Challenges for a Production-ready Cloud Environment on top of RISC-V hardware
por: Call, Aaron, et al.
Publicado: (2025)

DFabric: Scaling Out Data Parallel Applications with CXL-Ethernet Hybrid Interconnects
por: Zhang, Xu, et al.
Publicado: (2024)

Managed-Retention Memory: A New Class of Memory for the AI Era
por: Legtchenko, Sergey, et al.
Publicado: (2025)

Efficient Optimization Accelerator Framework for Multistate Ising Problems
por: Garg, Chirag, et al.
Publicado: (2025)

Architecting Distributed Quantum Computers: Design Insights from Resource Estimation
por: Filippov, Dmitry, et al.
Publicado: (2025)

Wattlytics: A Web Platform for Co-Optimizing Performance, Energy, and TCO in HPC Clusters
por: Afzal, Ayesha, et al.
Publicado: (2026)

TreeVQA: A Tree-Structured Execution Framework for Shot Reduction in Variational Quantum Algorithms
por: Hou, Yuewen, et al.
Publicado: (2025)

Harnessing the Full Potential of RRAMs through Scalable and Distributed In-Memory Computing with Integrated Error Correction
por: Vo, Huynh Q. N., et al.
Publicado: (2025)

ForgetMeNot: Understanding and Modeling the Impact of Forever Chemicals Toward Sustainable Large-Scale Computing
por: Roy, Rohan Basu, et al.
Publicado: (2025)

Carbon Connect: An Ecosystem for Sustainable Computing
por: Lee, Benjamin C., et al.
Publicado: (2024)

Reference Architecture of a Quantum-Centric Supercomputer
por: Seelam, Seetharami, et al.
Publicado: (2026)

PIM-AI: A Novel Architecture for High-Efficiency LLM Inference
por: Ortega, Cristobal, et al.
Publicado: (2024)

Scaling Intelligence: Designing Data Centers for Next-Gen Language Models
por: Tithi, Jesmin Jahan, et al.
Publicado: (2025)

Transforming the Hybrid Cloud for Emerging AI Workloads
por: Chen, Deming, et al.
Publicado: (2024)

WaferLLM: Large Language Model Inference at Wafer Scale
por: He, Congjie, et al.
Publicado: (2025)

PASS: An Asynchronous Probabilistic Processor for Next Generation Intelligence
por: Patel, Saavan, et al.
Publicado: (2024)

Experience Deploying Containerized GenAI Services at an HPC Center
por: Beltre, Angel M., et al.
Publicado: (2025)

Highly Versatile FPGA-Implemented Cyber Coherent Ising Machine
por: Aonishi, Toru, et al.
Publicado: (2024)

PID-Comm: A Fast and Flexible Collective Communication Framework for Commodity Processing-in-DIMM Devices
por: Noh, Si Ung, et al.
Publicado: (2024)

A System Level Compiler for Massively-Parallel, Spatial, Dataflow Architectures
por: Van Essendelft, Dirk, et al.
Publicado: (2025)

Flex-PE: Flexible and SIMD Multi-Precision Processing Element for AI Workloads
por: Lokhande, Mukul, et al.
Publicado: (2024)

Accelerating Triangle Counting with Real Processing-in-Memory Systems
por: Asquini, Lorenzo, et al.
Publicado: (2025)

Balanced Data Placement for GEMV Acceleration with Processing-In-Memory
por: Ibrahim, Mohamed Assem, et al.
Publicado: (2024)

DiP: A Scalable, Energy-Efficient Systolic Array for Matrix Multiplication Acceleration
por: Abdelmaksoud, Ahmed J., et al.
Publicado: (2024)

NMP-PaK: Near-Memory Processing Acceleration of Scalable De Novo Genome Assembly
por: Kim, Heewoo, et al.
Publicado: (2025)

Optimal Multi-Constrained Workflow Scheduling for Cyber-Physical Systems in the Edge-Cloud Continuum
por: Kouloumpris, Andreas, et al.
Publicado: (2025)

CMDS: Cross-layer Dataflow Optimization for DNN Accelerators Exploiting Multi-bank Memories
por: Shi, Man, et al.
Publicado: (2024)

Atomique: A Quantum Compiler for Reconfigurable Neutral Atom Arrays
por: Wang, Hanrui, et al.
Publicado: (2023)

RevaMp3D: Architecting the Processor Core and Cache Hierarchy for Systems with Monolithically-Integrated Logic and Memory
por: Ghiasi, Nika Mansouri, et al.
Publicado: (2022)

Workload-Aware Hardware Accelerator Mining for Distributed Deep Learning Training
por: Adnan, Muhammad, et al.
Publicado: (2024)

KiSS: A Novel Container Size-Aware Memory Management Policy for Serverless in Edge-Cloud Continuum
por: Gupta, Sabyasachi, et al.
Publicado: (2025)

DeepStack: Scalable and Accurate Design Space Exploration for Distributed 3D-Stacked AI Accelerators
por: Mo, Zhiwen, et al.
Publicado: (2026)

DP-HLS: A High-Level Synthesis Framework for Accelerating Dynamic Programming Algorithms in Bioinformatics
por: Cao, Yingqi, et al.
Publicado: (2024)

Memory-Centric Computing: Solving Computing's Memory Problem
por: Mutlu, Onur, et al.
Publicado: (2025)

cMPI: Using CXL Memory Sharing for MPI One-Sided and Two-Sided Inter-Node Communications
por: Wang, Xi, et al.
Publicado: (2025)

ZipFlow: a Compiler-based Framework to Unleash Compressed Data Movement for Modern GPUs
por: Yeo, Gwangoo, et al.
Publicado: (2026)

Efficient and Scalable Architecture for Multiple-chip Implementation of Simulated Bifurcation Machines
por: Kashimata, Tomoya, et al.
Publicado: (2023)

Analyzing a Two-Tier Disaggregated Memory Protection Scheme Based on Memory Replication
por: Volos, Haris, et al.
Publicado: (2025)