:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Zhang, Qizhen, Garg, Ankush, Foerster, Jakob, Chatterji, Niladri, Malik, Kshitiz, Lewis, Mike
Format:	Preprint
Published:	2026
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2602.02400
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Benign Overfitting without Linearity: Neural Network Classifiers Trained by Gradient Descent for Noisy Linear Data
by: Frei, Spencer, et al.
Published: (2022)

Compute Optimal Scaling of Skills: Knowledge vs Reasoning
by: Roberts, Nicholas, et al.
Published: (2025)

BTS: Harmonizing Specialized Experts into a Generalist LLM
by: Zhang, Qizhen, et al.
Published: (2025)

Noisy Zero-Shot Coordination: Breaking The Common Knowledge Assumption In Zero-Shot Coordination Games
by: Anwar, Usman, et al.
Published: (2024)

Analysing the Sample Complexity of Opponent Shaping
by: Fung, Kitty, et al.
Published: (2024)

BAM! Just Like That: Simple and Efficient Parameter Upcycling for Mixture of Experts
by: Zhang, Qizhen, et al.
Published: (2024)

JaxUED: A simple and useable UED library in Jax
by: Coward, Samuel, et al.
Published: (2024)

PARDEN, Can You Repeat That? Defending against Jailbreaks via Repetition
by: Zhang, Ziyang, et al.
Published: (2024)

Think Smart, Act SMARL! Analyzing Probabilistic Logic Shields for Multi-Agent Reinforcement Learning
by: Chatterji, Satchit, et al.
Published: (2024)

Improving Regret Approximation for Unsupervised Dynamic Environment Generation
by: Mead, Harry, et al.
Published: (2026)

A Model-Based Solution to the Offline Multi-Agent Reinforcement Learning Coordination Problem
by: Barde, Paul, et al.
Published: (2023)

Learning to Forget with Information Divergence Reweighted Objectives for Noisy Labels
by: Birrell, Jeremiah, et al.
Published: (2025)

Accelerated Smoothing: A Scalable Approach to Randomized Smoothing
by: Bhardwaj, Devansh, et al.
Published: (2024)

TICKing All the Boxes: Generated Checklists Improve LLM Evaluation and Generation
by: Cook, Jonathan, et al.
Published: (2024)

Unsolvability Ceiling in Multi-LLM Routing: An Empirical Study of Evaluation Artifacts
by: Garg, Saloni, et al.
Published: (2026)

Performance of Small Language Model Pretraining on FABRIC: An Empirical Study
by: Rao, Praveen
Published: (2026)

HelloFresh: LLM Evaluations on Streams of Real-World Human Editorial Actions across X Community Notes and Wikipedia edits
by: Franzmeyer, Tim, et al.
Published: (2024)

Kinetix: Investigating the Training of General Agents through Open-Ended Physics-Based Control Tasks
by: Matthews, Michael, et al.
Published: (2024)

General Formulation and PCL-Analysis for Restless Bandits with Limited Observability
by: Liu, Keqin, et al.
Published: (2023)

Empirical Risk Minimization with $f$-Divergence Regularization
by: Daunas, Francisco, et al.
Published: (2026)

LLM Unlearning on Noisy Forget Sets: A Study of Incomplete, Rewritten, and Watermarked Data
by: Wang, Changsheng, et al.
Published: (2025)

An Empirical Study of the Impact of Federated Learning on Machine Learning Model Accuracy
by: Yang, Haotian, et al.
Published: (2025)

Decoupled Kullback-Leibler Divergence Loss
by: Cui, Jiequan, et al.
Published: (2023)

Loss Functions and Operators Generated by f-Divergences
by: Roulet, Vincent, et al.
Published: (2025)

On Symmetric Losses for Robust Policy Optimization with Noisy Preferences
by: Nishimori, Soichiro, et al.
Published: (2025)

Revisiting Auxiliary Losses for Conditional Depth Routing: An Empirical Study
by: Lin, Qingwei
Published: (2026)

Explainability-Guided Adversarial Attacks on Transformer-Based Malware Detectors Using Control Flow Graphs
by: Wheeler, Andrew, et al.
Published: (2026)

The Edge-of-Reach Problem in Offline Model-Based Reinforcement Learning
by: Sims, Anya, et al.
Published: (2024)

Mirror Learning: A Unifying Framework of Policy Optimisation
by: Kuba, Jakub Grudzien, et al.
Published: (2022)

Learning Multi-Agent Communication with Contrastive Learning
by: Lo, Yat Long, et al.
Published: (2023)

Minimum Empirical Divergence for Sub-Gaussian Linear Bandits
by: Balagopalan, Kapilan, et al.
Published: (2024)

Towards Knowledge Guided Pretraining Approaches for Multimodal Foundation Models: Applications in Remote Sensing
by: Ravirathinam, Praveen, et al.
Published: (2024)

The Complexity Dynamics of Grokking
by: DeMoss, Branton, et al.
Published: (2024)

LLM-Inspired Pretrain-Then-Finetune for Small-Data, Large-Scale Optimization
by: Zhang, Zishi, et al.
Published: (2026)

Self-Supervised Learning from Noisy and Incomplete Data
by: Tachella, Julián, et al.
Published: (2026)

Missing Data Imputation by Reducing Mutual Information with Rectified Flows
by: Yu, Jiahao, et al.
Published: (2025)

Generalized Kullback-Leibler Divergence Loss
by: Cui, Jiequan, et al.
Published: (2025)

Infusion: Shaping Model Behavior by Editing Training Data via Influence Functions
by: Rosser, J, et al.
Published: (2026)

CLID-MU: Cross-Layer Information Divergence Based Meta Update Strategy for Learning with Noisy Labels
by: Hu, Ruofan, et al.
Published: (2025)

Introducing Fractional Classification Loss for Robust Learning with Noisy Labels
by: Kurucu, Mert Can, et al.
Published: (2025)