Saved in:
| Main Authors: | Umer, Muhammad, Mohsin, Muhammad Ahmed, Bilal, Ahsan, Chaudhry, Arslan, Haupt, Andreas, Koyejo, Sanmi, Fox, Emily, Cioffi, John M. |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2605.18721 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Continuous-Utility Direct Preference Optimization
by: Mohsin, Muhammad Ahmed, et al.
Published: (2026)
by: Mohsin, Muhammad Ahmed, et al.
Published: (2026)
Meta-Thinking in LLMs via Multi-Agent Reinforcement Learning: A Survey
by: Bilal, Ahsan, et al.
Published: (2025)
by: Bilal, Ahsan, et al.
Published: (2025)
Epistemic Uncertainty for Test-Time Discovery
by: Riaz, Kainat, et al.
Published: (2026)
by: Riaz, Kainat, et al.
Published: (2026)
Neural Gaussian Radio Fields for Channel Estimation
by: Umer, Muhammad, et al.
Published: (2025)
by: Umer, Muhammad, et al.
Published: (2025)
Canonical Optimization for MIMO MAC Design
by: Umer, Muhammad, et al.
Published: (2026)
by: Umer, Muhammad, et al.
Published: (2026)
Pressure, What Pressure? Sycophancy Disentanglement in Language Models via Reward Decomposition
by: Mohsin, Muhammad Ahmed, et al.
Published: (2026)
by: Mohsin, Muhammad Ahmed, et al.
Published: (2026)
Scalable Ensembling For Mitigating Reward Overoptimisation
by: Ahmed, Ahmed M., et al.
Published: (2024)
by: Ahmed, Ahmed M., et al.
Published: (2024)
Continual Learning for Wireless Channel Prediction
by: Mohsin, Muhammad Ahmed, et al.
Published: (2025)
by: Mohsin, Muhammad Ahmed, et al.
Published: (2025)
Discovering Implicit Large Language Model Alignment Objectives
by: Chen, Edward, et al.
Published: (2026)
by: Chen, Edward, et al.
Published: (2026)
Extracting books from production language models
by: Ahmed, Ahmed, et al.
Published: (2026)
by: Ahmed, Ahmed, et al.
Published: (2026)
Hierarchical Deep Reinforcement Learning for Adaptive Resource Management in Integrated Terrestrial and Non-Terrestrial Networks
by: Mohsin, Muhammad Ahmed, et al.
Published: (2025)
by: Mohsin, Muhammad Ahmed, et al.
Published: (2025)
What If We Allocate Test-Time Compute Adaptively?
by: Bilal, Ahsan, et al.
Published: (2026)
by: Bilal, Ahsan, et al.
Published: (2026)
Reasoning Models Don't Just Think Longer, They Move Differently
by: Gjølbye, Anders, et al.
Published: (2026)
by: Gjølbye, Anders, et al.
Published: (2026)
$S^3$: Stratified Scaling Search for Test-Time in Diffusion Language Models
by: Bilal, Ahsan, et al.
Published: (2026)
by: Bilal, Ahsan, et al.
Published: (2026)
6G Twin: Hybrid Gaussian Radio Fields for Channel Estimation and Non-Linear Precoder Design for Radio Access Networks
by: Mohsin, Muhammad Ahmed, et al.
Published: (2025)
by: Mohsin, Muhammad Ahmed, et al.
Published: (2025)
Channel Prediction under Network Distribution Shift Using Continual Learning-based Loss Regularization
by: Mohsin, Muhammad Ahmed, et al.
Published: (2025)
by: Mohsin, Muhammad Ahmed, et al.
Published: (2025)
Welfare, Improvability, and Variance: A Principal-Agent Approach to Optimal Benchmark Item Aggregation
by: Haupt, Andreas, et al.
Published: (2026)
by: Haupt, Andreas, et al.
Published: (2026)
Meta-Reinforcement Learning for Fast and Data-Efficient Spectrum Allocation in Dynamic Wireless Networks
by: Giwa, Oluwaseyi, et al.
Published: (2025)
by: Giwa, Oluwaseyi, et al.
Published: (2025)
Transformer-Based Sparse CSI Estimation for Non-Stationary Channels
by: Mohsin, Muhammad Ahmed, et al.
Published: (2025)
by: Mohsin, Muhammad Ahmed, et al.
Published: (2025)
Don't Walk the Line: Boundary Guidance for Filtered Generation
by: Ball, Sarah, et al.
Published: (2025)
by: Ball, Sarah, et al.
Published: (2025)
Finetuning Language Models to Emit Linguistic Expressions of Uncertainty
by: Chaudhry, Arslan, et al.
Published: (2024)
by: Chaudhry, Arslan, et al.
Published: (2024)
Why Do Safety Guardrails Degrade Across Languages?
by: Zhang, Max, et al.
Published: (2026)
by: Zhang, Max, et al.
Published: (2026)
Logits are All We Need to Adapt Closed Models
by: Hiranandani, Gaurush, et al.
Published: (2025)
by: Hiranandani, Gaurush, et al.
Published: (2025)
Latent Adversarial Regularization for Offline Preference Optimization
by: Jiang, Enyi, et al.
Published: (2026)
by: Jiang, Enyi, et al.
Published: (2026)
Conditional Prior-based Non-stationary Channel Estimation Using Accelerated Diffusion Models
by: Mohsin, Muhammad Ahmed, et al.
Published: (2025)
by: Mohsin, Muhammad Ahmed, et al.
Published: (2025)
Quantifying the Effect of Test Set Contamination on Generative Evaluations
by: Schaeffer, Rylan, et al.
Published: (2026)
by: Schaeffer, Rylan, et al.
Published: (2026)
Scaling Laws for Downstream Task Performance of Large Language Models
by: Isik, Berivan, et al.
Published: (2024)
by: Isik, Berivan, et al.
Published: (2024)
Is Pre-training Truly Better Than Meta-Learning?
by: Miranda, Brando, et al.
Published: (2023)
by: Miranda, Brando, et al.
Published: (2023)
Retrieval Augmented Generation with Multi-Modal LLM Framework for Wireless Environments
by: Mohsin, Muhammad Ahmed, et al.
Published: (2025)
by: Mohsin, Muhammad Ahmed, et al.
Published: (2025)
The Collapse of Heterogeneity in Silicon Philosophers
by: Shi, Yuanming, et al.
Published: (2026)
by: Shi, Yuanming, et al.
Published: (2026)
Reliable and Efficient Amortized Model-based Evaluation
by: Truong, Sang, et al.
Published: (2025)
by: Truong, Sang, et al.
Published: (2025)
Position: Machine Learning Conferences Should Establish a "Refutations and Critiques" Track
by: Schaeffer, Rylan, et al.
Published: (2025)
by: Schaeffer, Rylan, et al.
Published: (2025)
Structured Prompts Improve Evaluation of Language Models
by: Aali, Asad, et al.
Published: (2025)
by: Aali, Asad, et al.
Published: (2025)
From Passive to Active Reasoning: Can Large Language Models Ask the Right Questions under Incomplete Information?
by: Zhou, Zhanke, et al.
Published: (2025)
by: Zhou, Zhanke, et al.
Published: (2025)
Single layer tiny Co$^4$ outpaces GPT-2 and GPT-BERT
by: Zain, Noor Ul, et al.
Published: (2025)
by: Zain, Noor Ul, et al.
Published: (2025)
Lean-ing on Quality: How High-Quality Data Beats Diverse Multilingual Data in AutoFormalization
by: Chan, Willy, et al.
Published: (2025)
by: Chan, Willy, et al.
Published: (2025)
Task and Perception-aware Distributed Source Coding for Correlated Speech under Bandwidth-constrained Channels
by: Bhattacharya, Sagnik, et al.
Published: (2025)
by: Bhattacharya, Sagnik, et al.
Published: (2025)
On the Fundamental Limits of LLMs at Scale
by: Mohsin, Muhammad Ahmed, et al.
Published: (2025)
by: Mohsin, Muhammad Ahmed, et al.
Published: (2025)
Quantifying the Importance of Data Alignment in Downstream Model Performance
by: Chawla, Krrish, et al.
Published: (2025)
by: Chawla, Krrish, et al.
Published: (2025)
ZIP-FIT: Embedding-Free Data Selection via Compression-Based Alignment
by: Obbad, Elyas, et al.
Published: (2024)
by: Obbad, Elyas, et al.
Published: (2024)
Similar Items
-
Continuous-Utility Direct Preference Optimization
by: Mohsin, Muhammad Ahmed, et al.
Published: (2026) -
Meta-Thinking in LLMs via Multi-Agent Reinforcement Learning: A Survey
by: Bilal, Ahsan, et al.
Published: (2025) -
Epistemic Uncertainty for Test-Time Discovery
by: Riaz, Kainat, et al.
Published: (2026) -
Neural Gaussian Radio Fields for Channel Estimation
by: Umer, Muhammad, et al.
Published: (2025) -
Canonical Optimization for MIMO MAC Design
by: Umer, Muhammad, et al.
Published: (2026)