:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Umer, Muhammad, Mohsin, Muhammad Ahmed, Bilal, Ahsan, Chaudhry, Arslan, Haupt, Andreas, Koyejo, Sanmi, Fox, Emily, Cioffi, John M.
Format:	Preprint
Published:	2026
Subjects:	Machine Learning Computation and Language
Online Access:	https://arxiv.org/abs/2605.18721
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Continuous-Utility Direct Preference Optimization
by: Mohsin, Muhammad Ahmed, et al.
Published: (2026)

Meta-Thinking in LLMs via Multi-Agent Reinforcement Learning: A Survey
by: Bilal, Ahsan, et al.
Published: (2025)

Epistemic Uncertainty for Test-Time Discovery
by: Riaz, Kainat, et al.
Published: (2026)

Neural Gaussian Radio Fields for Channel Estimation
by: Umer, Muhammad, et al.
Published: (2025)

Canonical Optimization for MIMO MAC Design
by: Umer, Muhammad, et al.
Published: (2026)

Pressure, What Pressure? Sycophancy Disentanglement in Language Models via Reward Decomposition
by: Mohsin, Muhammad Ahmed, et al.
Published: (2026)

Scalable Ensembling For Mitigating Reward Overoptimisation
by: Ahmed, Ahmed M., et al.
Published: (2024)

Continual Learning for Wireless Channel Prediction
by: Mohsin, Muhammad Ahmed, et al.
Published: (2025)

Discovering Implicit Large Language Model Alignment Objectives
by: Chen, Edward, et al.
Published: (2026)

Extracting books from production language models
by: Ahmed, Ahmed, et al.
Published: (2026)

Hierarchical Deep Reinforcement Learning for Adaptive Resource Management in Integrated Terrestrial and Non-Terrestrial Networks
by: Mohsin, Muhammad Ahmed, et al.
Published: (2025)

What If We Allocate Test-Time Compute Adaptively?
by: Bilal, Ahsan, et al.
Published: (2026)

Reasoning Models Don't Just Think Longer, They Move Differently
by: Gjølbye, Anders, et al.
Published: (2026)

$S^3$: Stratified Scaling Search for Test-Time in Diffusion Language Models
by: Bilal, Ahsan, et al.
Published: (2026)

6G Twin: Hybrid Gaussian Radio Fields for Channel Estimation and Non-Linear Precoder Design for Radio Access Networks
by: Mohsin, Muhammad Ahmed, et al.
Published: (2025)

Channel Prediction under Network Distribution Shift Using Continual Learning-based Loss Regularization
by: Mohsin, Muhammad Ahmed, et al.
Published: (2025)

Welfare, Improvability, and Variance: A Principal-Agent Approach to Optimal Benchmark Item Aggregation
by: Haupt, Andreas, et al.
Published: (2026)

Meta-Reinforcement Learning for Fast and Data-Efficient Spectrum Allocation in Dynamic Wireless Networks
by: Giwa, Oluwaseyi, et al.
Published: (2025)

Transformer-Based Sparse CSI Estimation for Non-Stationary Channels
by: Mohsin, Muhammad Ahmed, et al.
Published: (2025)

Don't Walk the Line: Boundary Guidance for Filtered Generation
by: Ball, Sarah, et al.
Published: (2025)

Finetuning Language Models to Emit Linguistic Expressions of Uncertainty
by: Chaudhry, Arslan, et al.
Published: (2024)

Why Do Safety Guardrails Degrade Across Languages?
by: Zhang, Max, et al.
Published: (2026)

Logits are All We Need to Adapt Closed Models
by: Hiranandani, Gaurush, et al.
Published: (2025)

Latent Adversarial Regularization for Offline Preference Optimization
by: Jiang, Enyi, et al.
Published: (2026)

Conditional Prior-based Non-stationary Channel Estimation Using Accelerated Diffusion Models
by: Mohsin, Muhammad Ahmed, et al.
Published: (2025)

Quantifying the Effect of Test Set Contamination on Generative Evaluations
by: Schaeffer, Rylan, et al.
Published: (2026)

Scaling Laws for Downstream Task Performance of Large Language Models
by: Isik, Berivan, et al.
Published: (2024)

Is Pre-training Truly Better Than Meta-Learning?
by: Miranda, Brando, et al.
Published: (2023)

Retrieval Augmented Generation with Multi-Modal LLM Framework for Wireless Environments
by: Mohsin, Muhammad Ahmed, et al.
Published: (2025)

The Collapse of Heterogeneity in Silicon Philosophers
by: Shi, Yuanming, et al.
Published: (2026)

Reliable and Efficient Amortized Model-based Evaluation
by: Truong, Sang, et al.
Published: (2025)

Position: Machine Learning Conferences Should Establish a "Refutations and Critiques" Track
by: Schaeffer, Rylan, et al.
Published: (2025)

Structured Prompts Improve Evaluation of Language Models
by: Aali, Asad, et al.
Published: (2025)

From Passive to Active Reasoning: Can Large Language Models Ask the Right Questions under Incomplete Information?
by: Zhou, Zhanke, et al.
Published: (2025)

Single layer tiny Co$^4$ outpaces GPT-2 and GPT-BERT
by: Zain, Noor Ul, et al.
Published: (2025)

Lean-ing on Quality: How High-Quality Data Beats Diverse Multilingual Data in AutoFormalization
by: Chan, Willy, et al.
Published: (2025)

Task and Perception-aware Distributed Source Coding for Correlated Speech under Bandwidth-constrained Channels
by: Bhattacharya, Sagnik, et al.
Published: (2025)

On the Fundamental Limits of LLMs at Scale
by: Mohsin, Muhammad Ahmed, et al.
Published: (2025)

Quantifying the Importance of Data Alignment in Downstream Model Performance
by: Chawla, Krrish, et al.
Published: (2025)

ZIP-FIT: Embedding-Free Data Selection via Compression-Based Alignment
by: Obbad, Elyas, et al.
Published: (2024)