Saved in:
| Main Authors: | Hariri, Mohsen, Luo, Alan, Chen, Weicong, Zhong, Shaochen, Zhang, Tianyi, Wang, Qifan, Hu, Xia, Han, Xiaotian, Chaudhary, Vipin |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2502.15075 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
70% Size, 100% Accuracy: Lossless LLM Compression for Efficient GPU Inference via Dynamic-Length Float (DFloat11)
by: Zhang, Tianyi, et al.
Published: (2025)
by: Zhang, Tianyi, et al.
Published: (2025)
100-LongBench: Are de facto Long-Context Benchmarks Literally Evaluating Long-Context Ability?
by: Yang, Wang, et al.
Published: (2025)
by: Yang, Wang, et al.
Published: (2025)
$K^4$: Online Log Anomaly Detection Via Unsupervised Typicality Learning
by: Chen, Weicong, et al.
Published: (2025)
by: Chen, Weicong, et al.
Published: (2025)
Scorio.jl: A Julia package for ranking stochastic responses
by: Hariri, Mohsen, et al.
Published: (2026)
by: Hariri, Mohsen, et al.
Published: (2026)
Ranking Reasoning LLMs under Test-Time Scaling
by: Hariri, Mohsen, et al.
Published: (2026)
by: Hariri, Mohsen, et al.
Published: (2026)
Don't Pass@k: A Bayesian Framework for Large Language Model Evaluation
by: Hariri, Mohsen, et al.
Published: (2025)
by: Hariri, Mohsen, et al.
Published: (2025)
Demystifying Hybrid Thinking: Can LLMs Truly Switch Between Think and No-Think?
by: Wang, Shouren, et al.
Published: (2025)
by: Wang, Shouren, et al.
Published: (2025)
Reliability-Gated Source Anchoring for Continual Test-Time Adaptation
by: Singh, Vikash, et al.
Published: (2026)
by: Singh, Vikash, et al.
Published: (2026)
SELF: Self-Extend the Context Length With Logistic Growth Function
by: Dang, Phat Thanh, et al.
Published: (2025)
by: Dang, Phat Thanh, et al.
Published: (2025)
Medical Image Spatial Grounding with Semantic Sampling
by: Yu, Andrew Seohwan, et al.
Published: (2026)
by: Yu, Andrew Seohwan, et al.
Published: (2026)
Thinking Preference Optimization
by: Yang, Wang, et al.
Published: (2025)
by: Yang, Wang, et al.
Published: (2025)
CausalGuard: Conformal Inference under Graph Uncertainty
by: Singh, Vikash, et al.
Published: (2026)
by: Singh, Vikash, et al.
Published: (2026)
Robust Ultra-High-Dimensional Variable Selection With Correlated Structure Using Group Testing
by: Guo, Wanru, et al.
Published: (2026)
by: Guo, Wanru, et al.
Published: (2026)
Winner-Take-All Column Row Sampling for Memory Efficient Adaptation of Language Model
by: Liu, Zirui, et al.
Published: (2023)
by: Liu, Zirui, et al.
Published: (2023)
Longer Context, Deeper Thinking: Uncovering the Role of Long-Context Ability in Reasoning
by: Yang, Wang, et al.
Published: (2025)
by: Yang, Wang, et al.
Published: (2025)
AutoL2S: Auto Long-Short Reasoning for Efficient Large Language Models
by: Luo, Feng, et al.
Published: (2025)
by: Luo, Feng, et al.
Published: (2025)
KIVI: A Tuning-Free Asymmetric 2bit Quantization for KV Cache
by: Liu, Zirui, et al.
Published: (2024)
by: Liu, Zirui, et al.
Published: (2024)
Trust The Typical
by: Ganguly, Debargha, et al.
Published: (2026)
by: Ganguly, Debargha, et al.
Published: (2026)
FFB: A Fair Fairness Benchmark for In-Processing Group Fairness Methods
by: Han, Xiaotian, et al.
Published: (2023)
by: Han, Xiaotian, et al.
Published: (2023)
Taylor Unswift: Secured Weight Release for Large Language Models via Taylor Expansion
by: Wang, Guanchu, et al.
Published: (2024)
by: Wang, Guanchu, et al.
Published: (2024)
Novel adaptation of video segmentation to 3D MRI: efficient zero-shot knee segmentation with SAM2
by: Yu, Andrew Seohwan, et al.
Published: (2024)
by: Yu, Andrew Seohwan, et al.
Published: (2024)
Less Noise, More Voice: Reinforcement Learning for Reasoning via Instruction Purification
by: Guo, Yiju, et al.
Published: (2026)
by: Guo, Yiju, et al.
Published: (2026)
Speculative Thinking: Enhancing Small-Model Reasoning with Large Model Guidance at Inference Time
by: Yang, Wang, et al.
Published: (2025)
by: Yang, Wang, et al.
Published: (2025)
WKVQuant: Quantizing Weight and Key/Value Cache for Large Language Models Gains More
by: Yue, Yuxuan, et al.
Published: (2024)
by: Yue, Yuxuan, et al.
Published: (2024)
Mid-Think: Training-Free Intermediate-Budget Reasoning via Token-Level Triggers
by: Yang, Wang, et al.
Published: (2026)
by: Yang, Wang, et al.
Published: (2026)
LoRATK: LoRA Once, Backdoor Everywhere in the Share-and-Play Ecosystem
by: Liu, Hongyi, et al.
Published: (2024)
by: Liu, Hongyi, et al.
Published: (2024)
When Less is More: The LLM Scaling Paradox in Context Compression
by: Guo, Ruishan, et al.
Published: (2026)
by: Guo, Ruishan, et al.
Published: (2026)
Visual Concept Networks: A Graph-Based Approach to Detecting Anomalous Data in Deep Neural Networks
by: Ganguly, Debargha, et al.
Published: (2024)
by: Ganguly, Debargha, et al.
Published: (2024)
When Less is More: 8-bit Quantization Improves Continual Learning in Large Language Models
by: Zhang, Michael S., et al.
Published: (2025)
by: Zhang, Michael S., et al.
Published: (2025)
Less is More: Improving LLM Alignment via Preference Data Selection
by: Deng, Xun, et al.
Published: (2025)
by: Deng, Xun, et al.
Published: (2025)
Less is More: Parameter-Efficient Selection of Intermediate Tasks for Transfer Learning
by: Schulte, David, et al.
Published: (2024)
by: Schulte, David, et al.
Published: (2024)
When Domains Interact: Asymmetric and Order-Sensitive Cross-Domain Effects in Reinforcement Learning for Reasoning
by: Yang, Wang, et al.
Published: (2026)
by: Yang, Wang, et al.
Published: (2026)
Forget Less, Generalize More: Unifying Temporal and Structural Adaptation for Dynamic Graphs
by: Chang, Qian, et al.
Published: (2026)
by: Chang, Qian, et al.
Published: (2026)
CausalRAG: Integrating Causal Graphs into Retrieval-Augmented Generation
by: Wang, Nengbo, et al.
Published: (2025)
by: Wang, Nengbo, et al.
Published: (2025)
Less Random, More Private: What is the Optimal Subsampling Scheme for DP-SGD?
by: Dong, Andy, et al.
Published: (2026)
by: Dong, Andy, et al.
Published: (2026)
Less or More From Teacher: Exploiting Trilateral Geometry For Knowledge Distillation
by: Hu, Chengming, et al.
Published: (2023)
by: Hu, Chengming, et al.
Published: (2023)
KV Cache Compression, But What Must We Give in Return? A Comprehensive Benchmark of Long Context Capable Approaches
by: Yuan, Jiayi, et al.
Published: (2024)
by: Yuan, Jiayi, et al.
Published: (2024)
When Less is More: On the Value of "Co-training" for Semi-Supervised Software Defect Predictors
by: Majumder, Suvodeep, et al.
Published: (2022)
by: Majumder, Suvodeep, et al.
Published: (2022)
Draft Less, Retrieve More: Hybrid Tree Construction for Speculative Decoding
by: Shen, Yuhao, et al.
Published: (2026)
by: Shen, Yuhao, et al.
Published: (2026)
Less is More: Pseudo-Label Filtering for Continual Test-Time Adaptation
by: Tan, Jiayao, et al.
Published: (2024)
by: Tan, Jiayao, et al.
Published: (2024)
Similar Items
-
70% Size, 100% Accuracy: Lossless LLM Compression for Efficient GPU Inference via Dynamic-Length Float (DFloat11)
by: Zhang, Tianyi, et al.
Published: (2025) -
100-LongBench: Are de facto Long-Context Benchmarks Literally Evaluating Long-Context Ability?
by: Yang, Wang, et al.
Published: (2025) -
$K^4$: Online Log Anomaly Detection Via Unsupervised Typicality Learning
by: Chen, Weicong, et al.
Published: (2025) -
Scorio.jl: A Julia package for ranking stochastic responses
by: Hariri, Mohsen, et al.
Published: (2026) -
Ranking Reasoning LLMs under Test-Time Scaling
by: Hariri, Mohsen, et al.
Published: (2026)