:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Park, Jiyoung, Jang, Hankyu, Song, Changseok, Jung, Wookeun
Format:	Preprint
Published:	2026
Subjects:	Machine Learning Artificial Intelligence
Online Access:	https://arxiv.org/abs/2602.05145
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

WebLLM: A High-Performance In-Browser LLM Inference Engine
by: Ruan, Charlie F., et al.
Published: (2024)

Speculative Decoding with CTC-based Draft Model for LLM Inference Acceleration
by: Wen, Zhuofan, et al.
Published: (2024)

TIDE: Every Layer Knows the Token Beneath the Context
by: Jaiswal, Ajay, et al.
Published: (2026)

SpoT-Mamba: Learning Long-Range Dependency on Spatio-Temporal Graphs with Selective State Spaces
by: Choi, Jinhyeok, et al.
Published: (2024)

Structural Reasoning Improves Molecular Understanding of LLM
by: Jang, Yunhui, et al.
Published: (2024)

Exploring and Improving Drafts in Blockwise Parallel Decoding
by: Kim, Taehyeon, et al.
Published: (2024)

PRISM: Parametrically Refactoring Inference for Speculative Sampling Draft Models
by: Wang, Xuliang, et al.
Published: (2026)

TRINE: A Token-Aware, Runtime-Adaptive FPGA Inference Engine for Multimodal AI
by: Oh, Hyunwoo, et al.
Published: (2026)

Balancing Graph Embedding Smoothness in Self-Supervised Learning via Information-Theoretic Decomposition
by: Jung, Heesoo, et al.
Published: (2025)

P-EAGLE: Parallel-Drafting EAGLE with Scalable Training
by: Hui, Mude, et al.
Published: (2026)

Temporal Imbalance of Positive and Negative Supervision in Class-Incremental Learning
by: Ma, Jinge, et al.
Published: (2026)

KITS: Inductive Spatio-Temporal Kriging with Increment Training Strategy
by: Xu, Qianxiong, et al.
Published: (2023)

Temporal Alignment Guidance: On-Manifold Sampling in Diffusion Models
by: Park, Youngrok, et al.
Published: (2025)

Multi-LLM Adaptive Conformal Inference for Reliable LLM Responses
by: Noh, Kangjun, et al.
Published: (2026)

FlashInfer: Efficient and Customizable Attention Engine for LLM Inference Serving
by: Ye, Zihao, et al.
Published: (2025)

Decocted Experience Improves Test-Time Inference in LLM Agents
by: Shen, Maohao, et al.
Published: (2026)

Expanding Foundational Language Capabilities in Open-Source LLMs through a Korean Case Study
by: Lim, Junghwan, et al.
Published: (2025)

Time Series Imputation with Multivariate Radial Basis Function Neural Network
by: Jung, Chanyoung, et al.
Published: (2024)

MicroFlow: An Efficient Rust-Based Inference Engine for TinyML
by: Carnelos, Matteo, et al.
Published: (2024)

Improving Instruction Following in Language Models through Proxy-Based Uncertainty Estimation
by: Lee, JoonHo, et al.
Published: (2024)

Fair Class-Incremental Learning using Sample Weighting
by: Park, Jaeyoung, et al.
Published: (2024)

Improving LLM Reasoning through Scaling Inference Computation with Collaborative Verification
by: Liang, Zhenwen, et al.
Published: (2024)

Experiential Reflective Learning for Self-Improving LLM Agents
by: Allard, Marc-Antoine, et al.
Published: (2026)

Unifying Inductive, Cross-Domain, and Multimodal Learning for Robust and Generalizable Recommendation
by: Chung, Chanyoung, et al.
Published: (2025)

Self-Supervised Pre-Training for Precipitation Post-Processor
by: An, Sojung, et al.
Published: (2023)

CSAttention: Centroid-Scoring Attention for Accelerating LLM Inference
by: Song, Chuxu, et al.
Published: (2026)

Evaluating Temporal Plasticity in Foundation Time Series Models for Incremental Fine-tuning
by: Liu, Jia, et al.
Published: (2025)

Enhancing Federated Class-Incremental Learning via Spatial-Temporal Statistics Aggregation
by: Guan, Zenghao, et al.
Published: (2025)

A Slices Perspective for Incremental Nonparametric Inference in High Dimensional State Spaces
by: Shienman, Moshe, et al.
Published: (2024)

DEER: Draft with Diffusion, Verify with Autoregressive Models
by: Cheng, Zicong, et al.
Published: (2025)

Can We Rely on LLM Agents to Draft Long-Horizon Plans? Let's Take TravelPlanner as an Example
by: Chen, Yanan, et al.
Published: (2024)

Energy-Efficient Wireless LLM Inference via Uncertainty and Importance-Aware Speculative Decoding
by: Park, Jihoon, et al.
Published: (2025)

Carbon Intensity-Aware Adaptive Inference of DNNs
by: Jung, Jiwan
Published: (2024)

Accelerating Large-Scale Reasoning Model Inference with Sparse Self-Speculative Decoding
by: Zhao, Yilong, et al.
Published: (2025)

TTKV: Temporal-Tiered KV Cache for Long-Context LLM Inference
by: Dzikanyanga, Gradwell, et al.
Published: (2026)

Generative Representation Learning on Hyper-relational Knowledge Graphs via Masked Discrete Diffusion
by: Lee, Jaejun, et al.
Published: (2026)

PAC-Bayesian Generalization Bounds for Knowledge Graph Representation Learning
by: Lee, Jaejun, et al.
Published: (2024)

Representation Learning on Hyper-Relational and Numeric Knowledge Graphs with Transformers
by: Chung, Chanyoung, et al.
Published: (2023)

Direct Alignment of Draft Model for Speculative Decoding with Chat-Fine-Tuned LLMs
by: Goel, Raghavv, et al.
Published: (2024)

Scaling Laws Meet Model Architecture: Toward Inference-Efficient LLMs
by: Bian, Song, et al.
Published: (2025)