| Main Authors: | Zhang, Qizhen, Garg, Ankush, Foerster, Jakob, Chatterji, Niladri, Malik, Kshitiz, Lewis, Mike |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.02400 |
Similar Items
Benign Overfitting without Linearity: Neural Network Classifiers Trained by Gradient Descent for Noisy Linear Data
by: Frei, Spencer, et al.
Published: (2022)
Compute Optimal Scaling of Skills: Knowledge vs Reasoning
by: Roberts, Nicholas, et al.
Published: (2025)
BTS: Harmonizing Specialized Experts into a Generalist LLM
by: Zhang, Qizhen, et al.
Published: (2025)
Noisy Zero-Shot Coordination: Breaking The Common Knowledge Assumption In Zero-Shot Coordination Games
by: Anwar, Usman, et al.
Published: (2024)
Analysing the Sample Complexity of Opponent Shaping
by: Fung, Kitty, et al.
Published: (2024)
BAM! Just Like That: Simple and Efficient Parameter Upcycling for Mixture of Experts
by: Zhang, Qizhen, et al.
Published: (2024)
JaxUED: A simple and useable UED library in Jax
by: Coward, Samuel, et al.
Published: (2024)
PARDEN, Can You Repeat That? Defending against Jailbreaks via Repetition
by: Zhang, Ziyang, et al.
Published: (2024)
Think Smart, Act SMARL! Analyzing Probabilistic Logic Shields for Multi-Agent Reinforcement Learning
by: Chatterji, Satchit, et al.
Published: (2024)
Improving Regret Approximation for Unsupervised Dynamic Environment Generation
by: Mead, Harry, et al.
Published: (2026)
A Model-Based Solution to the Offline Multi-Agent Reinforcement Learning Coordination Problem
by: Barde, Paul, et al.
Published: (2023)
Learning to Forget with Information Divergence Reweighted Objectives for Noisy Labels
by: Birrell, Jeremiah, et al.
Published: (2025)
Accelerated Smoothing: A Scalable Approach to Randomized Smoothing
by: Bhardwaj, Devansh, et al.
Published: (2024)
TICKing All the Boxes: Generated Checklists Improve LLM Evaluation and Generation
by: Cook, Jonathan, et al.
Published: (2024)
Unsolvability Ceiling in Multi-LLM Routing: An Empirical Study of Evaluation Artifacts
by: Garg, Saloni, et al.
Published: (2026)
Performance of Small Language Model Pretraining on FABRIC: An Empirical Study
by: Rao, Praveen
Published: (2026)
HelloFresh: LLM Evaluations on Streams of Real-World Human Editorial Actions across X Community Notes and Wikipedia edits
by: Franzmeyer, Tim, et al.
Published: (2024)
Kinetix: Investigating the Training of General Agents through Open-Ended Physics-Based Control Tasks
by: Matthews, Michael, et al.
Published: (2024)
General Formulation and PCL-Analysis for Restless Bandits with Limited Observability
by: Liu, Keqin, et al.
Published: (2023)
Empirical Risk Minimization with $f$-Divergence Regularization
by: Daunas, Francisco, et al.
Published: (2026)
LLM Unlearning on Noisy Forget Sets: A Study of Incomplete, Rewritten, and Watermarked Data
by: Wang, Changsheng, et al.
Published: (2025)
An Empirical Study of the Impact of Federated Learning on Machine Learning Model Accuracy
by: Yang, Haotian, et al.
Published: (2025)
Decoupled Kullback-Leibler Divergence Loss
by: Cui, Jiequan, et al.
Published: (2023)
Loss Functions and Operators Generated by f-Divergences
by: Roulet, Vincent, et al.
Published: (2025)
On Symmetric Losses for Robust Policy Optimization with Noisy Preferences
by: Nishimori, Soichiro, et al.
Published: (2025)
Revisiting Auxiliary Losses for Conditional Depth Routing: An Empirical Study
by: Lin, Qingwei
Published: (2026)
Explainability-Guided Adversarial Attacks on Transformer-Based Malware Detectors Using Control Flow Graphs
by: Wheeler, Andrew, et al.
Published: (2026)
The Edge-of-Reach Problem in Offline Model-Based Reinforcement Learning
by: Sims, Anya, et al.
Published: (2024)
Mirror Learning: A Unifying Framework of Policy Optimisation
by: Kuba, Jakub Grudzien, et al.
Published: (2022)
Learning Multi-Agent Communication with Contrastive Learning
by: Lo, Yat Long, et al.
Published: (2023)
Minimum Empirical Divergence for Sub-Gaussian Linear Bandits
by: Balagopalan, Kapilan, et al.
Published: (2024)
Towards Knowledge Guided Pretraining Approaches for Multimodal Foundation Models: Applications in Remote Sensing
by: Ravirathinam, Praveen, et al.
Published: (2024)
The Complexity Dynamics of Grokking
by: DeMoss, Branton, et al.
Published: (2024)
LLM-Inspired Pretrain-Then-Finetune for Small-Data, Large-Scale Optimization
by: Zhang, Zishi, et al.
Published: (2026)
Self-Supervised Learning from Noisy and Incomplete Data
by: Tachella, Julián, et al.
Published: (2026)
Missing Data Imputation by Reducing Mutual Information with Rectified Flows
by: Yu, Jiahao, et al.
Published: (2025)
Generalized Kullback-Leibler Divergence Loss
by: Cui, Jiequan, et al.
Published: (2025)
Infusion: Shaping Model Behavior by Editing Training Data via Influence Functions
by: Rosser, J, et al.
Published: (2026)
CLID-MU: Cross-Layer Information Divergence Based Meta Update Strategy for Learning with Noisy Labels
by: Hu, Ruofan, et al.
Published: (2025)
Introducing Fractional Classification Loss for Robust Learning with Noisy Labels
by: Kurucu, Mert Can, et al.
Published: (2025)