Saved in:
| Main Authors: | Jedra, Yassir, Réveillard, William, Stojanovic, Stefan, Proutiere, Alexandre |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2402.15739 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Model-free Low-Rank Reinforcement Learning via Leveraged Entry-wise Matrix Estimation
by: Stojanovic, Stefan, et al.
Published: (2024)
by: Stojanovic, Stefan, et al.
Published: (2024)
Near-optimal Rank Adaptive Inference of High Dimensional Matrices
by: Zheng, Frédéric, et al.
Published: (2025)
by: Zheng, Frédéric, et al.
Published: (2025)
Minimal Order Recovery through Rank-adaptive Identification
by: Zheng, Frédéric, et al.
Published: (2025)
by: Zheng, Frédéric, et al.
Published: (2025)
Shift Before You Learn: Enabling Low-Rank Representations in Reinforcement Learning
by: Dubail, Bastien, et al.
Published: (2025)
by: Dubail, Bastien, et al.
Published: (2025)
Near-Optimal Clustering in Mixture of Markov Chains
by: Lee, Junghyun, et al.
Published: (2025)
by: Lee, Junghyun, et al.
Published: (2025)
Switching Successor Measures for Hierarchical Zero-shot Reinforcement Learning
by: Stojanovic, Stefan, et al.
Published: (2026)
by: Stojanovic, Stefan, et al.
Published: (2026)
Minimizing Human Intervention in Online Classification
by: Réveillard, William, et al.
Published: (2025)
by: Réveillard, William, et al.
Published: (2025)
Instance-Optimal Estimation with Multiple LLM Judges on a Budget
by: Lee, Junghyun, et al.
Published: (2026)
by: Lee, Junghyun, et al.
Published: (2026)
Multimodal Bandits: Regret Lower Bounds and Optimal Algorithms
by: Réveillard, William, et al.
Published: (2025)
by: Réveillard, William, et al.
Published: (2025)
$k$-SVD with Gradient Descent
by: Jedra, Yassir, et al.
Published: (2025)
by: Jedra, Yassir, et al.
Published: (2025)
Explore-then-Commit for Nonstationary Linear Bandits with Latent Dynamics
by: Choi, Sunmook, et al.
Published: (2025)
by: Choi, Sunmook, et al.
Published: (2025)
Learning Linear Dynamics from Bilinear Observations
by: Sattar, Yahya, et al.
Published: (2024)
by: Sattar, Yahya, et al.
Published: (2024)
Exploiting Observation Bias to Improve Matrix Completion
by: Jedra, Yassir, et al.
Published: (2023)
by: Jedra, Yassir, et al.
Published: (2023)
Finite Sample Identification of Partially Observed Bilinear Dynamical Systems
by: Sattar, Yahya, et al.
Published: (2025)
by: Sattar, Yahya, et al.
Published: (2025)
Revisiting Instance-Optimal Cluster Recovery in the Labeled Stochastic Block Model
by: Ariu, Kaito, et al.
Published: (2023)
by: Ariu, Kaito, et al.
Published: (2023)
Optimal Transfer Learning for Missing Not-at-Random Matrix Completion
by: Jalan, Akhil, et al.
Published: (2025)
by: Jalan, Akhil, et al.
Published: (2025)
Model-Free Active Exploration in Reinforcement Learning
by: Russo, Alessio, et al.
Published: (2024)
by: Russo, Alessio, et al.
Published: (2024)
Curvature-Guided LoRA: Steering in the pretrained NTK subspace
by: Zheng, Frédéric, et al.
Published: (2026)
by: Zheng, Frédéric, et al.
Published: (2026)
Sub-optimality of the Separation Principle for Quadratic Control from Bilinear Observations
by: Sattar, Yahya, et al.
Published: (2025)
by: Sattar, Yahya, et al.
Published: (2025)
Catching a Moving Subspace: Low-Rank Bandits Beyond Stationarity
by: Khosravi, Hamed, et al.
Published: (2026)
by: Khosravi, Hamed, et al.
Published: (2026)
Conformal Predictions under Markovian Data
by: Zheng, Frédéric, et al.
Published: (2024)
by: Zheng, Frédéric, et al.
Published: (2024)
Optimal Centered Active Excitation in Linear System Identification
by: Ito, Kaito, et al.
Published: (2026)
by: Ito, Kaito, et al.
Published: (2026)
A Tutorial on the Non-Asymptotic Theory of System Identification
by: Ziemann, Ingvar, et al.
Published: (2023)
by: Ziemann, Ingvar, et al.
Published: (2023)
On Universally Optimal Algorithms for A/B Testing
by: Wang, Po-An, et al.
Published: (2023)
by: Wang, Po-An, et al.
Published: (2023)
Adversarial Diffusion for Robust Reinforcement Learning
by: Foffano, Daniele, et al.
Published: (2025)
by: Foffano, Daniele, et al.
Published: (2025)
Best Arm Identification with Fixed Budget: A Large Deviation Perspective
by: Wang, Po-An, et al.
Published: (2023)
by: Wang, Po-An, et al.
Published: (2023)
Adaptive Reinforcement Learning for Unobservable Random Delays
by: Wikman, John, et al.
Published: (2025)
by: Wikman, John, et al.
Published: (2025)
Conformal Off-Policy Evaluation in Markov Decision Processes
by: Foffano, Daniele, et al.
Published: (2023)
by: Foffano, Daniele, et al.
Published: (2023)
Mixture-of-Subspaces in Low-Rank Adaptation
by: Wu, Taiqiang, et al.
Published: (2024)
by: Wu, Taiqiang, et al.
Published: (2024)
From Low Rank Gradient Subspace Stabilization to Low-Rank Weights: Observations, Theories, and Applications
by: Jaiswal, Ajay, et al.
Published: (2024)
by: Jaiswal, Ajay, et al.
Published: (2024)
Optimal Clustering from Noisy Binary Feedback
by: Ariu, Kaito, et al.
Published: (2019)
by: Ariu, Kaito, et al.
Published: (2019)
Policy Testing in Markov Decision Processes
by: Ariu, Kaito, et al.
Published: (2025)
by: Ariu, Kaito, et al.
Published: (2025)
Efficient Generalized Low-Rank Tensor Contextual Bandits
by: Yi, Qianxin, et al.
Published: (2023)
by: Yi, Qianxin, et al.
Published: (2023)
Low-Rank Compression of Pretrained Models via Randomized Subspace Iteration
by: Pourkamali-Anaraki, Farhad
Published: (2026)
by: Pourkamali-Anaraki, Farhad
Published: (2026)
Efficient Low-Rank Matrix Estimation, Experimental Design, and Arm-Set-Dependent Low-Rank Bandits
by: Jang, Kyoungseok, et al.
Published: (2024)
by: Jang, Kyoungseok, et al.
Published: (2024)
Online Minimization of Polarization and Disagreement via Low-Rank Matrix Bandits
by: Cinus, Federico, et al.
Published: (2025)
by: Cinus, Federico, et al.
Published: (2025)
Efficient Frameworks for Generalized Low-Rank Matrix Bandit Problems
by: Kang, Yue, et al.
Published: (2024)
by: Kang, Yue, et al.
Published: (2024)
Generalized Low-Rank Matrix Contextual Bandits with Graph Information
by: Wang, Yao, et al.
Published: (2025)
by: Wang, Yao, et al.
Published: (2025)
Tight Rates for Bandit Control Beyond Quadratics
by: Sun, Y. Jennifer, et al.
Published: (2024)
by: Sun, Y. Jennifer, et al.
Published: (2024)
Interpretable Safety Alignment via SAE-Constructed Low-Rank Subspace Adaptation
by: Wang, Dianyun, et al.
Published: (2025)
by: Wang, Dianyun, et al.
Published: (2025)
Similar Items
-
Model-free Low-Rank Reinforcement Learning via Leveraged Entry-wise Matrix Estimation
by: Stojanovic, Stefan, et al.
Published: (2024) -
Near-optimal Rank Adaptive Inference of High Dimensional Matrices
by: Zheng, Frédéric, et al.
Published: (2025) -
Minimal Order Recovery through Rank-adaptive Identification
by: Zheng, Frédéric, et al.
Published: (2025) -
Shift Before You Learn: Enabling Low-Rank Representations in Reinforcement Learning
by: Dubail, Bastien, et al.
Published: (2025) -
Near-Optimal Clustering in Mixture of Markov Chains
by: Lee, Junghyun, et al.
Published: (2025)