:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Jedra, Yassir, Réveillard, William, Stojanovic, Stefan, Proutiere, Alexandre
Format:	Preprint
Published:	2024
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2402.15739
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Model-free Low-Rank Reinforcement Learning via Leveraged Entry-wise Matrix Estimation
by: Stojanovic, Stefan, et al.
Published: (2024)

Near-optimal Rank Adaptive Inference of High Dimensional Matrices
by: Zheng, Frédéric, et al.
Published: (2025)

Minimal Order Recovery through Rank-adaptive Identification
by: Zheng, Frédéric, et al.
Published: (2025)

Shift Before You Learn: Enabling Low-Rank Representations in Reinforcement Learning
by: Dubail, Bastien, et al.
Published: (2025)

Near-Optimal Clustering in Mixture of Markov Chains
by: Lee, Junghyun, et al.
Published: (2025)

Switching Successor Measures for Hierarchical Zero-shot Reinforcement Learning
by: Stojanovic, Stefan, et al.
Published: (2026)

Minimizing Human Intervention in Online Classification
by: Réveillard, William, et al.
Published: (2025)

Instance-Optimal Estimation with Multiple LLM Judges on a Budget
by: Lee, Junghyun, et al.
Published: (2026)

Multimodal Bandits: Regret Lower Bounds and Optimal Algorithms
by: Réveillard, William, et al.
Published: (2025)

$k$-SVD with Gradient Descent
by: Jedra, Yassir, et al.
Published: (2025)

Explore-then-Commit for Nonstationary Linear Bandits with Latent Dynamics
by: Choi, Sunmook, et al.
Published: (2025)

Learning Linear Dynamics from Bilinear Observations
by: Sattar, Yahya, et al.
Published: (2024)

Exploiting Observation Bias to Improve Matrix Completion
by: Jedra, Yassir, et al.
Published: (2023)

Finite Sample Identification of Partially Observed Bilinear Dynamical Systems
by: Sattar, Yahya, et al.
Published: (2025)

Revisiting Instance-Optimal Cluster Recovery in the Labeled Stochastic Block Model
by: Ariu, Kaito, et al.
Published: (2023)

Optimal Transfer Learning for Missing Not-at-Random Matrix Completion
by: Jalan, Akhil, et al.
Published: (2025)

Model-Free Active Exploration in Reinforcement Learning
by: Russo, Alessio, et al.
Published: (2024)

Curvature-Guided LoRA: Steering in the pretrained NTK subspace
by: Zheng, Frédéric, et al.
Published: (2026)

Sub-optimality of the Separation Principle for Quadratic Control from Bilinear Observations
by: Sattar, Yahya, et al.
Published: (2025)

Catching a Moving Subspace: Low-Rank Bandits Beyond Stationarity
by: Khosravi, Hamed, et al.
Published: (2026)

Conformal Predictions under Markovian Data
by: Zheng, Frédéric, et al.
Published: (2024)

Optimal Centered Active Excitation in Linear System Identification
by: Ito, Kaito, et al.
Published: (2026)

A Tutorial on the Non-Asymptotic Theory of System Identification
by: Ziemann, Ingvar, et al.
Published: (2023)

On Universally Optimal Algorithms for A/B Testing
by: Wang, Po-An, et al.
Published: (2023)

Adversarial Diffusion for Robust Reinforcement Learning
by: Foffano, Daniele, et al.
Published: (2025)

Best Arm Identification with Fixed Budget: A Large Deviation Perspective
by: Wang, Po-An, et al.
Published: (2023)

Adaptive Reinforcement Learning for Unobservable Random Delays
by: Wikman, John, et al.
Published: (2025)

Conformal Off-Policy Evaluation in Markov Decision Processes
by: Foffano, Daniele, et al.
Published: (2023)

Mixture-of-Subspaces in Low-Rank Adaptation
by: Wu, Taiqiang, et al.
Published: (2024)

From Low Rank Gradient Subspace Stabilization to Low-Rank Weights: Observations, Theories, and Applications
by: Jaiswal, Ajay, et al.
Published: (2024)

Optimal Clustering from Noisy Binary Feedback
by: Ariu, Kaito, et al.
Published: (2019)

Policy Testing in Markov Decision Processes
by: Ariu, Kaito, et al.
Published: (2025)

Efficient Generalized Low-Rank Tensor Contextual Bandits
by: Yi, Qianxin, et al.
Published: (2023)

Low-Rank Compression of Pretrained Models via Randomized Subspace Iteration
by: Pourkamali-Anaraki, Farhad
Published: (2026)

Efficient Low-Rank Matrix Estimation, Experimental Design, and Arm-Set-Dependent Low-Rank Bandits
by: Jang, Kyoungseok, et al.
Published: (2024)

Online Minimization of Polarization and Disagreement via Low-Rank Matrix Bandits
by: Cinus, Federico, et al.
Published: (2025)

Efficient Frameworks for Generalized Low-Rank Matrix Bandit Problems
by: Kang, Yue, et al.
Published: (2024)

Generalized Low-Rank Matrix Contextual Bandits with Graph Information
by: Wang, Yao, et al.
Published: (2025)

Tight Rates for Bandit Control Beyond Quadratics
by: Sun, Y. Jennifer, et al.
Published: (2024)

Interpretable Safety Alignment via SAE-Constructed Low-Rank Subspace Adaptation
by: Wang, Dianyun, et al.
Published: (2025)