:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Hariri, Mohsen, Luo, Alan, Chen, Weicong, Zhong, Shaochen, Zhang, Tianyi, Wang, Qifan, Hu, Xia, Han, Xiaotian, Chaudhary, Vipin
Format:	Preprint
Published:	2025
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2502.15075
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

70% Size, 100% Accuracy: Lossless LLM Compression for Efficient GPU Inference via Dynamic-Length Float (DFloat11)
by: Zhang, Tianyi, et al.
Published: (2025)

100-LongBench: Are de facto Long-Context Benchmarks Literally Evaluating Long-Context Ability?
by: Yang, Wang, et al.
Published: (2025)

$K^4$: Online Log Anomaly Detection Via Unsupervised Typicality Learning
by: Chen, Weicong, et al.
Published: (2025)

Scorio.jl: A Julia package for ranking stochastic responses
by: Hariri, Mohsen, et al.
Published: (2026)

Ranking Reasoning LLMs under Test-Time Scaling
by: Hariri, Mohsen, et al.
Published: (2026)

Don't Pass@k: A Bayesian Framework for Large Language Model Evaluation
by: Hariri, Mohsen, et al.
Published: (2025)

Demystifying Hybrid Thinking: Can LLMs Truly Switch Between Think and No-Think?
by: Wang, Shouren, et al.
Published: (2025)

Reliability-Gated Source Anchoring for Continual Test-Time Adaptation
by: Singh, Vikash, et al.
Published: (2026)

SELF: Self-Extend the Context Length With Logistic Growth Function
by: Dang, Phat Thanh, et al.
Published: (2025)

Medical Image Spatial Grounding with Semantic Sampling
by: Yu, Andrew Seohwan, et al.
Published: (2026)

Thinking Preference Optimization
by: Yang, Wang, et al.
Published: (2025)

CausalGuard: Conformal Inference under Graph Uncertainty
by: Singh, Vikash, et al.
Published: (2026)

Robust Ultra-High-Dimensional Variable Selection With Correlated Structure Using Group Testing
by: Guo, Wanru, et al.
Published: (2026)

Winner-Take-All Column Row Sampling for Memory Efficient Adaptation of Language Model
by: Liu, Zirui, et al.
Published: (2023)

Longer Context, Deeper Thinking: Uncovering the Role of Long-Context Ability in Reasoning
by: Yang, Wang, et al.
Published: (2025)

AutoL2S: Auto Long-Short Reasoning for Efficient Large Language Models
by: Luo, Feng, et al.
Published: (2025)

KIVI: A Tuning-Free Asymmetric 2bit Quantization for KV Cache
by: Liu, Zirui, et al.
Published: (2024)

Trust The Typical
by: Ganguly, Debargha, et al.
Published: (2026)

FFB: A Fair Fairness Benchmark for In-Processing Group Fairness Methods
by: Han, Xiaotian, et al.
Published: (2023)

Taylor Unswift: Secured Weight Release for Large Language Models via Taylor Expansion
by: Wang, Guanchu, et al.
Published: (2024)

Novel adaptation of video segmentation to 3D MRI: efficient zero-shot knee segmentation with SAM2
by: Yu, Andrew Seohwan, et al.
Published: (2024)

Less Noise, More Voice: Reinforcement Learning for Reasoning via Instruction Purification
by: Guo, Yiju, et al.
Published: (2026)

Speculative Thinking: Enhancing Small-Model Reasoning with Large Model Guidance at Inference Time
by: Yang, Wang, et al.
Published: (2025)

WKVQuant: Quantizing Weight and Key/Value Cache for Large Language Models Gains More
by: Yue, Yuxuan, et al.
Published: (2024)

Mid-Think: Training-Free Intermediate-Budget Reasoning via Token-Level Triggers
by: Yang, Wang, et al.
Published: (2026)

LoRATK: LoRA Once, Backdoor Everywhere in the Share-and-Play Ecosystem
by: Liu, Hongyi, et al.
Published: (2024)

When Less is More: The LLM Scaling Paradox in Context Compression
by: Guo, Ruishan, et al.
Published: (2026)

Visual Concept Networks: A Graph-Based Approach to Detecting Anomalous Data in Deep Neural Networks
by: Ganguly, Debargha, et al.
Published: (2024)

When Less is More: 8-bit Quantization Improves Continual Learning in Large Language Models
by: Zhang, Michael S., et al.
Published: (2025)

Less is More: Improving LLM Alignment via Preference Data Selection
by: Deng, Xun, et al.
Published: (2025)

Less is More: Parameter-Efficient Selection of Intermediate Tasks for Transfer Learning
by: Schulte, David, et al.
Published: (2024)

When Domains Interact: Asymmetric and Order-Sensitive Cross-Domain Effects in Reinforcement Learning for Reasoning
by: Yang, Wang, et al.
Published: (2026)

Forget Less, Generalize More: Unifying Temporal and Structural Adaptation for Dynamic Graphs
by: Chang, Qian, et al.
Published: (2026)

CausalRAG: Integrating Causal Graphs into Retrieval-Augmented Generation
by: Wang, Nengbo, et al.
Published: (2025)

Less Random, More Private: What is the Optimal Subsampling Scheme for DP-SGD?
by: Dong, Andy, et al.
Published: (2026)

Less or More From Teacher: Exploiting Trilateral Geometry For Knowledge Distillation
by: Hu, Chengming, et al.
Published: (2023)

KV Cache Compression, But What Must We Give in Return? A Comprehensive Benchmark of Long Context Capable Approaches
by: Yuan, Jiayi, et al.
Published: (2024)

When Less is More: On the Value of "Co-training" for Semi-Supervised Software Defect Predictors
by: Majumder, Suvodeep, et al.
Published: (2022)

Draft Less, Retrieve More: Hybrid Tree Construction for Speculative Decoding
by: Shen, Yuhao, et al.
Published: (2026)

Less is More: Pseudo-Label Filtering for Continual Test-Time Adaptation
by: Tan, Jiayao, et al.
Published: (2024)