:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Author:	Luo, Haipeng
Format:	Preprint
Published:	2026
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2603.03409
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

A Note on How to Remove the $\ln\ln T$ Term from the Squint Bound
by: Orabona, Francesco
Published: (2026)

ACPO: A Policy Optimization Algorithm for Average MDPs with Constraints
by: Agnihotri, Akhil, et al.
Published: (2023)

Comparator-Adaptive $Φ$-Regret: Improved Bounds, Simpler Algorithms, and Applications to Games
by: Hait, Soumita, et al.
Published: (2025)

A Short Note on Batch-efficient Divide-and-Conquer Algorithm for EigenDecomposition
by: Song, Yue
Published: (2026)

Contextual Multinomial Logit Bandits with General Value Functions
by: Zhang, Mengxiao, et al.
Published: (2024)

Squint: Fast Visual Reinforcement Learning for Sim-to-Real Robotics
by: Almuzairee, Abdulaziz, et al.
Published: (2026)

One Good Source is All You Need: Near-Optimal Regret for Bandits under Heterogeneous Noise
by: Bhat, Amith, et al.
Published: (2026)

Improved Bounds for Swap Multicalibration and Swap Omniprediction
by: Luo, Haipeng, et al.
Published: (2025)

Optimal Multiclass U-Calibration Error and Beyond
by: Luo, Haipeng, et al.
Published: (2024)

Contextual Linear Bandits with Delay as Payoff
by: Zhang, Mengxiao, et al.
Published: (2025)

Reinforcement Learning from Adversarial Preferences in Tabular MDPs
by: Tsuchiya, Taira, et al.
Published: (2025)

Adaptive Calibration in Non-Stationary Environments
by: Liu, Junyan, et al.
Published: (2026)

Compositions of Variant Experts for Integrating Short-Term and Long-Term Preferences
by: Do, Jaime Hieu, et al.
Published: (2025)

Clustering Mixtures of Discrete Distributions: A Note on Mitra's Algorithm
by: Seif, Mohamed, et al.
Published: (2024)

Near-Optimal Last-Iterate Convergence for Zero-Sum Games with Bandit Feedback and Opponent Actions
by: Hait, Soumita, et al.
Published: (2026)

Interaction-Grounded Learning for Contextual Markov Decision Processes with Personalized Feedback
by: Zhang, Mengxiao, et al.
Published: (2026)

Efficient Swap Multicalibration of Elicitable Properties
by: Hu, Lunjia, et al.
Published: (2025)

Alternating Regret for Online Convex Optimization
by: Hait, Soumita, et al.
Published: (2025)

Efficient Contextual Bandits with Uninformed Feedback Graphs
by: Zhang, Mengxiao, et al.
Published: (2024)

Near-Optimal Regret in Linear MDPs with Aggregate Bandit Feedback
by: Cassel, Asaf, et al.
Published: (2024)

Provably Efficient Interactive-Grounded Learning with Personalized Reward
by: Zhang, Mengxiao, et al.
Published: (2024)

Scale-Invariant Fast Convergence in Games
by: Tsuchiya, Taira, et al.
Published: (2026)

Corrupted Learning Dynamics in Games
by: Tsuchiya, Taira, et al.
Published: (2024)

Last-Iterate Convergence Properties of Regret-Matching Algorithms in Games
by: Cai, Yang, et al.
Published: (2023)

Decentralized Online Learning in General-Sum Stackelberg Games
by: Yu, Yaolong, et al.
Published: (2024)

Swap Regret Minimization Through Response-Based Approachability
by: Anagnostides, Ioannis, et al.
Published: (2026)

Adapting to Stochastic and Adversarial Losses in Episodic MDPs with Aggregate Bandit Feedback
by: Ito, Shinji, et al.
Published: (2025)

Beyond Short Steps in Frank-Wolfe Algorithms
by: Martínez-Rubio, David, et al.
Published: (2025)

Simultaneous Swap Regret Minimization via KL-Calibration
by: Luo, Haipeng, et al.
Published: (2025)

Fast Last-Iterate Convergence of Learning in Games Requires Forgetful Algorithms
by: Cai, Yang, et al.
Published: (2024)

A Short Note on Evaluating RepNet for Temporal Repetition Counting in Videos
by: Dwibedi, Debidatta, et al.
Published: (2024)

Group Distributionally Robust Optimization with Flexible Sample Queries
by: Bai, Haomin, et al.
Published: (2025)

Constructing a Question-Answering Simulator through the Distillation of LLMs
by: Liu, Haipeng, et al.
Published: (2025)

Large-Scale Spectral Graph Neural Networks via Laplacian Sparsification: Technical Report
by: Ding, Haipeng, et al.
Published: (2025)

Online Learning for Uninformed Markov Games: Empirical Nash-Value Regret and Non-Stationarity Adaptation
by: Liu, Junyan, et al.
Published: (2026)

Is Online Linear Optimization Sufficient for Strategic Robustness?
by: Cai, Yang, et al.
Published: (2026)

From Average-Iterate to Last-Iterate Convergence in Games: A Reduction and Its Applications
by: Cai, Yang, et al.
Published: (2025)

AgriVariant: Variant Effect Prediction using DeepChem-Variant for Precision Breeding in Rice
by: Bisoi, Ankita Vaishnobi, et al.
Published: (2026)

No-Regret Learning for Fair Multi-Agent Social Welfare Optimization
by: Zhang, Mengxiao, et al.
Published: (2024)

Proximal Regret and Proximal Correlated Equilibria: A New Tractable Solution Concept for Online Learning and Games
by: Cai, Yang, et al.
Published: (2025)