Saved in:
| Main Authors: | Park, Jiyoung, Jang, Hankyu, Song, Changseok, Jung, Wookeun |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.05145 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
WebLLM: A High-Performance In-Browser LLM Inference Engine
by: Ruan, Charlie F., et al.
Published: (2024)
by: Ruan, Charlie F., et al.
Published: (2024)
Speculative Decoding with CTC-based Draft Model for LLM Inference Acceleration
by: Wen, Zhuofan, et al.
Published: (2024)
by: Wen, Zhuofan, et al.
Published: (2024)
TIDE: Every Layer Knows the Token Beneath the Context
by: Jaiswal, Ajay, et al.
Published: (2026)
by: Jaiswal, Ajay, et al.
Published: (2026)
SpoT-Mamba: Learning Long-Range Dependency on Spatio-Temporal Graphs with Selective State Spaces
by: Choi, Jinhyeok, et al.
Published: (2024)
by: Choi, Jinhyeok, et al.
Published: (2024)
Structural Reasoning Improves Molecular Understanding of LLM
by: Jang, Yunhui, et al.
Published: (2024)
by: Jang, Yunhui, et al.
Published: (2024)
Exploring and Improving Drafts in Blockwise Parallel Decoding
by: Kim, Taehyeon, et al.
Published: (2024)
by: Kim, Taehyeon, et al.
Published: (2024)
PRISM: Parametrically Refactoring Inference for Speculative Sampling Draft Models
by: Wang, Xuliang, et al.
Published: (2026)
by: Wang, Xuliang, et al.
Published: (2026)
TRINE: A Token-Aware, Runtime-Adaptive FPGA Inference Engine for Multimodal AI
by: Oh, Hyunwoo, et al.
Published: (2026)
by: Oh, Hyunwoo, et al.
Published: (2026)
Balancing Graph Embedding Smoothness in Self-Supervised Learning via Information-Theoretic Decomposition
by: Jung, Heesoo, et al.
Published: (2025)
by: Jung, Heesoo, et al.
Published: (2025)
P-EAGLE: Parallel-Drafting EAGLE with Scalable Training
by: Hui, Mude, et al.
Published: (2026)
by: Hui, Mude, et al.
Published: (2026)
Temporal Imbalance of Positive and Negative Supervision in Class-Incremental Learning
by: Ma, Jinge, et al.
Published: (2026)
by: Ma, Jinge, et al.
Published: (2026)
KITS: Inductive Spatio-Temporal Kriging with Increment Training Strategy
by: Xu, Qianxiong, et al.
Published: (2023)
by: Xu, Qianxiong, et al.
Published: (2023)
Temporal Alignment Guidance: On-Manifold Sampling in Diffusion Models
by: Park, Youngrok, et al.
Published: (2025)
by: Park, Youngrok, et al.
Published: (2025)
Multi-LLM Adaptive Conformal Inference for Reliable LLM Responses
by: Noh, Kangjun, et al.
Published: (2026)
by: Noh, Kangjun, et al.
Published: (2026)
FlashInfer: Efficient and Customizable Attention Engine for LLM Inference Serving
by: Ye, Zihao, et al.
Published: (2025)
by: Ye, Zihao, et al.
Published: (2025)
Decocted Experience Improves Test-Time Inference in LLM Agents
by: Shen, Maohao, et al.
Published: (2026)
by: Shen, Maohao, et al.
Published: (2026)
Expanding Foundational Language Capabilities in Open-Source LLMs through a Korean Case Study
by: Lim, Junghwan, et al.
Published: (2025)
by: Lim, Junghwan, et al.
Published: (2025)
Time Series Imputation with Multivariate Radial Basis Function Neural Network
by: Jung, Chanyoung, et al.
Published: (2024)
by: Jung, Chanyoung, et al.
Published: (2024)
MicroFlow: An Efficient Rust-Based Inference Engine for TinyML
by: Carnelos, Matteo, et al.
Published: (2024)
by: Carnelos, Matteo, et al.
Published: (2024)
Improving Instruction Following in Language Models through Proxy-Based Uncertainty Estimation
by: Lee, JoonHo, et al.
Published: (2024)
by: Lee, JoonHo, et al.
Published: (2024)
Fair Class-Incremental Learning using Sample Weighting
by: Park, Jaeyoung, et al.
Published: (2024)
by: Park, Jaeyoung, et al.
Published: (2024)
Improving LLM Reasoning through Scaling Inference Computation with Collaborative Verification
by: Liang, Zhenwen, et al.
Published: (2024)
by: Liang, Zhenwen, et al.
Published: (2024)
Experiential Reflective Learning for Self-Improving LLM Agents
by: Allard, Marc-Antoine, et al.
Published: (2026)
by: Allard, Marc-Antoine, et al.
Published: (2026)
Unifying Inductive, Cross-Domain, and Multimodal Learning for Robust and Generalizable Recommendation
by: Chung, Chanyoung, et al.
Published: (2025)
by: Chung, Chanyoung, et al.
Published: (2025)
Self-Supervised Pre-Training for Precipitation Post-Processor
by: An, Sojung, et al.
Published: (2023)
by: An, Sojung, et al.
Published: (2023)
CSAttention: Centroid-Scoring Attention for Accelerating LLM Inference
by: Song, Chuxu, et al.
Published: (2026)
by: Song, Chuxu, et al.
Published: (2026)
Evaluating Temporal Plasticity in Foundation Time Series Models for Incremental Fine-tuning
by: Liu, Jia, et al.
Published: (2025)
by: Liu, Jia, et al.
Published: (2025)
Enhancing Federated Class-Incremental Learning via Spatial-Temporal Statistics Aggregation
by: Guan, Zenghao, et al.
Published: (2025)
by: Guan, Zenghao, et al.
Published: (2025)
A Slices Perspective for Incremental Nonparametric Inference in High Dimensional State Spaces
by: Shienman, Moshe, et al.
Published: (2024)
by: Shienman, Moshe, et al.
Published: (2024)
DEER: Draft with Diffusion, Verify with Autoregressive Models
by: Cheng, Zicong, et al.
Published: (2025)
by: Cheng, Zicong, et al.
Published: (2025)
Can We Rely on LLM Agents to Draft Long-Horizon Plans? Let's Take TravelPlanner as an Example
by: Chen, Yanan, et al.
Published: (2024)
by: Chen, Yanan, et al.
Published: (2024)
Energy-Efficient Wireless LLM Inference via Uncertainty and Importance-Aware Speculative Decoding
by: Park, Jihoon, et al.
Published: (2025)
by: Park, Jihoon, et al.
Published: (2025)
Carbon Intensity-Aware Adaptive Inference of DNNs
by: Jung, Jiwan
Published: (2024)
by: Jung, Jiwan
Published: (2024)
Accelerating Large-Scale Reasoning Model Inference with Sparse Self-Speculative Decoding
by: Zhao, Yilong, et al.
Published: (2025)
by: Zhao, Yilong, et al.
Published: (2025)
TTKV: Temporal-Tiered KV Cache for Long-Context LLM Inference
by: Dzikanyanga, Gradwell, et al.
Published: (2026)
by: Dzikanyanga, Gradwell, et al.
Published: (2026)
Generative Representation Learning on Hyper-relational Knowledge Graphs via Masked Discrete Diffusion
by: Lee, Jaejun, et al.
Published: (2026)
by: Lee, Jaejun, et al.
Published: (2026)
PAC-Bayesian Generalization Bounds for Knowledge Graph Representation Learning
by: Lee, Jaejun, et al.
Published: (2024)
by: Lee, Jaejun, et al.
Published: (2024)
Representation Learning on Hyper-Relational and Numeric Knowledge Graphs with Transformers
by: Chung, Chanyoung, et al.
Published: (2023)
by: Chung, Chanyoung, et al.
Published: (2023)
Direct Alignment of Draft Model for Speculative Decoding with Chat-Fine-Tuned LLMs
by: Goel, Raghavv, et al.
Published: (2024)
by: Goel, Raghavv, et al.
Published: (2024)
Scaling Laws Meet Model Architecture: Toward Inference-Efficient LLMs
by: Bian, Song, et al.
Published: (2025)
by: Bian, Song, et al.
Published: (2025)
Similar Items
-
WebLLM: A High-Performance In-Browser LLM Inference Engine
by: Ruan, Charlie F., et al.
Published: (2024) -
Speculative Decoding with CTC-based Draft Model for LLM Inference Acceleration
by: Wen, Zhuofan, et al.
Published: (2024) -
TIDE: Every Layer Knows the Token Beneath the Context
by: Jaiswal, Ajay, et al.
Published: (2026) -
SpoT-Mamba: Learning Long-Range Dependency on Spatio-Temporal Graphs with Selective State Spaces
by: Choi, Jinhyeok, et al.
Published: (2024) -
Structural Reasoning Improves Molecular Understanding of LLM
by: Jang, Yunhui, et al.
Published: (2024)