| Main Author: | Luo, Haipeng |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2603.03409 |
Similar Items
A Note on How to Remove the $\ln\ln T$ Term from the Squint Bound
by: Orabona, Francesco
Published: (2026)
ACPO: A Policy Optimization Algorithm for Average MDPs with Constraints
by: Agnihotri, Akhil, et al.
Published: (2023)
Comparator-Adaptive $Φ$-Regret: Improved Bounds, Simpler Algorithms, and Applications to Games
by: Hait, Soumita, et al.
Published: (2025)
A Short Note on Batch-efficient Divide-and-Conquer Algorithm for EigenDecomposition
by: Song, Yue
Published: (2026)
Contextual Multinomial Logit Bandits with General Value Functions
by: Zhang, Mengxiao, et al.
Published: (2024)
Squint: Fast Visual Reinforcement Learning for Sim-to-Real Robotics
by: Almuzairee, Abdulaziz, et al.
Published: (2026)
One Good Source is All You Need: Near-Optimal Regret for Bandits under Heterogeneous Noise
by: Bhat, Amith, et al.
Published: (2026)
Improved Bounds for Swap Multicalibration and Swap Omniprediction
by: Luo, Haipeng, et al.
Published: (2025)
Optimal Multiclass U-Calibration Error and Beyond
by: Luo, Haipeng, et al.
Published: (2024)
Contextual Linear Bandits with Delay as Payoff
by: Zhang, Mengxiao, et al.
Published: (2025)
Reinforcement Learning from Adversarial Preferences in Tabular MDPs
by: Tsuchiya, Taira, et al.
Published: (2025)
Adaptive Calibration in Non-Stationary Environments
by: Liu, Junyan, et al.
Published: (2026)
Compositions of Variant Experts for Integrating Short-Term and Long-Term Preferences
by: Do, Jaime Hieu, et al.
Published: (2025)
Clustering Mixtures of Discrete Distributions: A Note on Mitra's Algorithm
by: Seif, Mohamed, et al.
Published: (2024)
Near-Optimal Last-Iterate Convergence for Zero-Sum Games with Bandit Feedback and Opponent Actions
by: Hait, Soumita, et al.
Published: (2026)
Interaction-Grounded Learning for Contextual Markov Decision Processes with Personalized Feedback
by: Zhang, Mengxiao, et al.
Published: (2026)
Efficient Swap Multicalibration of Elicitable Properties
by: Hu, Lunjia, et al.
Published: (2025)
Alternating Regret for Online Convex Optimization
by: Hait, Soumita, et al.
Published: (2025)
Efficient Contextual Bandits with Uninformed Feedback Graphs
by: Zhang, Mengxiao, et al.
Published: (2024)
Near-Optimal Regret in Linear MDPs with Aggregate Bandit Feedback
by: Cassel, Asaf, et al.
Published: (2024)
Provably Efficient Interactive-Grounded Learning with Personalized Reward
by: Zhang, Mengxiao, et al.
Published: (2024)
Scale-Invariant Fast Convergence in Games
by: Tsuchiya, Taira, et al.
Published: (2026)
Corrupted Learning Dynamics in Games
by: Tsuchiya, Taira, et al.
Published: (2024)
Last-Iterate Convergence Properties of Regret-Matching Algorithms in Games
by: Cai, Yang, et al.
Published: (2023)
Decentralized Online Learning in General-Sum Stackelberg Games
by: Yu, Yaolong, et al.
Published: (2024)
Swap Regret Minimization Through Response-Based Approachability
by: Anagnostides, Ioannis, et al.
Published: (2026)
Adapting to Stochastic and Adversarial Losses in Episodic MDPs with Aggregate Bandit Feedback
by: Ito, Shinji, et al.
Published: (2025)
Beyond Short Steps in Frank-Wolfe Algorithms
by: Martínez-Rubio, David, et al.
Published: (2025)
Simultaneous Swap Regret Minimization via KL-Calibration
by: Luo, Haipeng, et al.
Published: (2025)
Fast Last-Iterate Convergence of Learning in Games Requires Forgetful Algorithms
by: Cai, Yang, et al.
Published: (2024)
A Short Note on Evaluating RepNet for Temporal Repetition Counting in Videos
by: Dwibedi, Debidatta, et al.
Published: (2024)
Group Distributionally Robust Optimization with Flexible Sample Queries
by: Bai, Haomin, et al.
Published: (2025)
Constructing a Question-Answering Simulator through the Distillation of LLMs
by: Liu, Haipeng, et al.
Published: (2025)
Large-Scale Spectral Graph Neural Networks via Laplacian Sparsification: Technical Report
by: Ding, Haipeng, et al.
Published: (2025)
Online Learning for Uninformed Markov Games: Empirical Nash-Value Regret and Non-Stationarity Adaptation
by: Liu, Junyan, et al.
Published: (2026)
Is Online Linear Optimization Sufficient for Strategic Robustness?
by: Cai, Yang, et al.
Published: (2026)
From Average-Iterate to Last-Iterate Convergence in Games: A Reduction and Its Applications
by: Cai, Yang, et al.
Published: (2025)
AgriVariant: Variant Effect Prediction using DeepChem-Variant for Precision Breeding in Rice
by: Bisoi, Ankita Vaishnobi, et al.
Published: (2026)
No-Regret Learning for Fair Multi-Agent Social Welfare Optimization
by: Zhang, Mengxiao, et al.
Published: (2024)
Proximal Regret and Proximal Correlated Equilibria: A New Tractable Solution Concept for Online Learning and Games
by: Cai, Yang, et al.
Published: (2025)