Enregistré dans:
| Auteurs principaux: | Ito, Shinji, Luo, Haipeng, Maiti, Arnab, Tsuchiya, Taira, Wu, Yue |
|---|---|
| Format: | Preprint |
| Publié: |
2026
|
| Sujets: | |
| Accès en ligne: | https://arxiv.org/abs/2602.06348 |
| Tags: |
Ajouter un tag
Pas de tags, Soyez le premier à ajouter un tag!
|
Documents similaires
Instance-Dependent Regret Bounds for Learning Two-Player Zero-Sum Games with Bandit Feedback
par: Ito, Shinji, et autres
Publié: (2025)
par: Ito, Shinji, et autres
Publié: (2025)
Reinforcement Learning for Reachability: Guaranteeing Asymptotic Optimality
par: Palasamudram, Amogh, et autres
Publié: (2026)
par: Palasamudram, Amogh, et autres
Publié: (2026)
A geometric decomposition of finite games: Convergence vs. recurrence under exponential weights
par: Legacci, Davide, et autres
Publié: (2024)
par: Legacci, Davide, et autres
Publié: (2024)
A Parallelizable Approach for Characterizing NE in Zero-Sum Games After a Linear Number of Iterations of Gradient Descent
par: Kim, Taemin, et autres
Publié: (2025)
par: Kim, Taemin, et autres
Publié: (2025)
Regret Bounds for Robust Online Decision Making
par: Appel, Alexander, et autres
Publié: (2025)
par: Appel, Alexander, et autres
Publié: (2025)
Robust equilibria in continuous games: From strategic to dynamic robustness
par: Lotidis, Kyriakos, et autres
Publié: (2025)
par: Lotidis, Kyriakos, et autres
Publié: (2025)
Accelerated regularized learning in finite N-person games
par: Lotidis, Kyriakos, et autres
Publié: (2024)
par: Lotidis, Kyriakos, et autres
Publié: (2024)
Frequency Principle: Fourier Analysis Sheds Light on Deep Neural Networks
par: Xu, Zhi-Qin John, et autres
Publié: (2019)
par: Xu, Zhi-Qin John, et autres
Publié: (2019)
Breaking $1/ε$ Barrier in Quantum Zero-Sum Games: Generalizing Metric Subregularity for Spectraplexes
par: Su, Yiheng, et autres
Publié: (2025)
par: Su, Yiheng, et autres
Publié: (2025)
No-regret learning in harmonic games: Extrapolation in the face of conflicting interests
par: Legacci, Davide, et autres
Publié: (2024)
par: Legacci, Davide, et autres
Publié: (2024)
Computing Game Symmetries and Equilibria That Respect Them
par: Tewolde, Emanuel, et autres
Publié: (2025)
par: Tewolde, Emanuel, et autres
Publié: (2025)
Online Decision Making with Generative Action Sets
par: Xu, Jianyu, et autres
Publié: (2025)
par: Xu, Jianyu, et autres
Publié: (2025)
The Current and Future Perspectives of Zinc Oxide Nanoparticles in the Treatment of Diabetes Mellitus
par: Yousaf, Iqra
Publié: (2024)
par: Yousaf, Iqra
Publié: (2024)
Backpropagation Through Time For Networks With Long-Term Dependencies
par: Bird, George, et autres
Publié: (2021)
par: Bird, George, et autres
Publié: (2021)
Nested replicator dynamics, nested logit choice, and similarity-based learning
par: Mertikopoulos, Panayotis, et autres
Publié: (2024)
par: Mertikopoulos, Panayotis, et autres
Publié: (2024)
Sparse Knowledge Distillation: A Mathematical Framework for Probability-Domain Temperature Scaling and Multi-Stage Compression
par: Flouro, Aaron R., et autres
Publié: (2026)
par: Flouro, Aaron R., et autres
Publié: (2026)
Aligning Inductive Bias for Data-Efficient Generalization in State Space Models
par: Chen, Qiyu, et autres
Publié: (2025)
par: Chen, Qiyu, et autres
Publié: (2025)
Ambiguous Online Learning
par: Kosoy, Vanessa
Publié: (2025)
par: Kosoy, Vanessa
Publié: (2025)
Adaptive Discretization in Online Reinforcement Learning
par: Sinclair, Sean R., et autres
Publié: (2021)
par: Sinclair, Sean R., et autres
Publié: (2021)
Improved Approximation Ratio for Strategyproof Facility Location on a Cycle
par: Rogowski, Krzysztof, et autres
Publié: (2025)
par: Rogowski, Krzysztof, et autres
Publié: (2025)
Understanding the Nature of Generative AI as Threshold Logic in High-Dimensional Space
par: Levin, Ilya
Publié: (2026)
par: Levin, Ilya
Publié: (2026)
A Quadratic Speedup in Finding Nash Equilibria of Quantum Zero-Sum Games
par: Vasconcelos, Francisca, et autres
Publié: (2023)
par: Vasconcelos, Francisca, et autres
Publié: (2023)
Decision Making under Imperfect Recall: Algorithms and Benchmarks
par: Tewolde, Emanuel, et autres
Publié: (2026)
par: Tewolde, Emanuel, et autres
Publié: (2026)
Agnostic Learning under Targeted Poisoning: Optimal Rates and the Role of Randomness
par: Chornomaz, Bogdan, et autres
Publié: (2025)
par: Chornomaz, Bogdan, et autres
Publié: (2025)
Near-Optimal Consistency-Robustness Trade-Offs for Learning-Augmented Online Knapsack Problems
par: Daneshvaramoli, Mohammadreza, et autres
Publié: (2024)
par: Daneshvaramoli, Mohammadreza, et autres
Publié: (2024)
Superior Scoring Rules for Probabilistic Evaluation of Single-Label Multi-Class Classification Tasks
par: Ahmadian, Rouhollah, et autres
Publié: (2024)
par: Ahmadian, Rouhollah, et autres
Publié: (2024)
Golden Handcuffs make safer AI agents
par: Ebtekar, Aram, et autres
Publié: (2026)
par: Ebtekar, Aram, et autres
Publié: (2026)
Integration of Deep Reinforcement Learning and Agent-based Simulation to Explore Strategies Counteracting Information Disorder
par: Lomasto, Luigi, et autres
Publié: (2026)
par: Lomasto, Luigi, et autres
Publié: (2026)
Margin in Abstract Spaces
par: Ashlagi, Yair, et autres
Publié: (2026)
par: Ashlagi, Yair, et autres
Publié: (2026)
Reinforcement Learning in MDPs with Information-Ordered Policies
par: Zhang, Zhongjun, et autres
Publié: (2025)
par: Zhang, Zhongjun, et autres
Publié: (2025)
MenuNet: A Strategy-Proof Mechanism for Matching Markets
par: Sun, Zhaohong, et autres
Publié: (2026)
par: Sun, Zhaohong, et autres
Publié: (2026)
Mitigating Catastrophic Forgetting in Streaming Generative and Predictive Learning via Stateful Replay
par: Du, Wenzhang
Publié: (2025)
par: Du, Wenzhang
Publié: (2025)
AI Agents for the Dhumbal Card Game: A Comparative Study
par: Malla, Sahaj Raj
Publié: (2025)
par: Malla, Sahaj Raj
Publié: (2025)
Think Thrice Before You Speak: Dual knowledge-enhanced Theory-of-Mind Reasoning for Persuasive Agents
par: Ma, Minghui, et autres
Publié: (2026)
par: Ma, Minghui, et autres
Publié: (2026)
Statistical Guarantees for Lifelong Reinforcement Learning using PAC-Bayes Theory
par: Zhang, Zhi, et autres
Publié: (2024)
par: Zhang, Zhi, et autres
Publié: (2024)
Multi-agent learning under uncertainty: Recurrence vs. concentration
par: Lotidis, Kyriakos, et autres
Publié: (2025)
par: Lotidis, Kyriakos, et autres
Publié: (2025)
Batched Nonparametric Bandits via k-Nearest Neighbor UCB
par: Arya, Sakshi
Publié: (2025)
par: Arya, Sakshi
Publié: (2025)
MMD-Balls as Credal Sets: A PAC-Bayesian Framework for Epistemic Uncertainty in Test-Time Adaptation
par: Ariq, Ahanaf Hasan
Publié: (2026)
par: Ariq, Ahanaf Hasan
Publié: (2026)
Inductive Venn-Abers and related regressors
par: Petej, Ivan, et autres
Publié: (2026)
par: Petej, Ivan, et autres
Publié: (2026)
Aggregation in conformal e-classification
par: Vovk, Vladimir
Publié: (2026)
par: Vovk, Vladimir
Publié: (2026)
Documents similaires
-
Instance-Dependent Regret Bounds for Learning Two-Player Zero-Sum Games with Bandit Feedback
par: Ito, Shinji, et autres
Publié: (2025) -
Reinforcement Learning for Reachability: Guaranteeing Asymptotic Optimality
par: Palasamudram, Amogh, et autres
Publié: (2026) -
A geometric decomposition of finite games: Convergence vs. recurrence under exponential weights
par: Legacci, Davide, et autres
Publié: (2024) -
A Parallelizable Approach for Characterizing NE in Zero-Sum Games After a Linear Number of Iterations of Gradient Descent
par: Kim, Taemin, et autres
Publié: (2025) -
Regret Bounds for Robust Online Decision Making
par: Appel, Alexander, et autres
Publié: (2025)