| Main Authors: | Kapoor, Vansh, Nair, Jayakrishnan |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2505.03280 |
Similar Items
Capacity Provisioning Motivated Online Non-Convex Optimization Problem with Memory and Switching Cost
by: Vaze, Rahul, et al.
Published: (2024)
Influence of Recommender Systems on Users: A Dynamical Systems Analysis
by: Lankireddy, Prabhat, et al.
Published: (2024)
Representative Arm Identification: A fixed confidence approach to identify cluster representatives
by: Gharat, Sarvesh, et al.
Published: (2024)
TRIM: Hybrid Inference via Targeted Stepwise Routing in Multi-Step Reasoning Tasks
by: Kapoor, Vansh, et al.
Published: (2026)
Primal-Only Actor Critic Algorithm for Robust Constrained Average Cost MDPs
by: Satheesh, Anirudh, et al.
Published: (2025)
A Reliable Knowledge Processing Framework for Combustion Science using Foundation Models
by: Sharma, Vansh, et al.
Published: (2023)
Adaptive Multi-Scale Goodness Aggregation for Forward-Forward Learning
by: Beigzad, Salar, et al.
Published: (2026)
Score-Guided Proximal Projection: A Unified Geometric Framework for Rectified Flow Editing
by: Bansal, Vansh, et al.
Published: (2026)
Convergence of Natural Policy Gradient for a Family of Infinite-State Queueing MDPs
by: Grosof, Isaac, et al.
Published: (2024)
A Machine Learning Based Approach for Statistical Analysis of Detonation Cells from Soot Foils
by: Sharma, Vansh, et al.
Published: (2024)
On the Optimizer Dependence of Neural Scaling Laws
by: Ramani, Vansh, et al.
Published: (2026)
Conformal C2ST: Turning weak classifiers into strong two-sample tests
by: Bansal, Vansh, et al.
Published: (2025)
CoLT: The conditional localization test for assessing the accuracy of neural posterior estimates
by: Chen, Tianyu, et al.
Published: (2025)
Time-Constrained Robust MDPs
by: Zouitine, Adil, et al.
Published: (2024)
Landscape of Policy Optimization for Finite Horizon MDPs with General State and Action
by: Chen, Xin, et al.
Published: (2024)
Reinforcement Learning for Infinite-Horizon Average-Reward Linear MDPs via Approximation by Discounted-Reward MDPs
by: Hong, Kihyuk, et al.
Published: (2024)
On the Convergence and Straightness of Rectified Flow
by: Bansal, Vansh, et al.
Published: (2024)
On Value Iteration Convergence in Connected MDPs
by: Mustafin, Arsenii, et al.
Published: (2024)
Robust Parameter Learning for Uncertain MDPs
by: Schnitzer, Yannik, et al.
Published: (2026)
Truly No-Regret Learning in Constrained MDPs
by: Müller, Adrian, et al.
Published: (2024)
Soft Robust MDPs and Risk-Sensitive MDPs: Equivalence, Policy Gradient, and Sample Complexity
by: Zhang, Runyu, et al.
Published: (2023)
Offline Bayesian Aleatoric and Epistemic Uncertainty Quantification and Posterior Value Optimisation in Finite-State MDPs
by: Valdettaro, Filippo, et al.
Published: (2024)
Position: Graph Condensation Needs a Reset -- Move Beyond Full-dataset Training and Model-Dependence
by: Gupta, Mridul, et al.
Published: (2026)
Prospect-Theory Behavior from Bellman Optimality in MDPs with Catastrophic States
by: Chen, Yujiao
Published: (2026)
An approximate graph elicits detonation lattice
by: Sharma, Vansh, et al.
Published: (2026)
Learning Adversarial MDPs with Stochastic Hard Constraints
by: Stradi, Francesco Emanuele, et al.
Published: (2024)
Solving Robust MDPs through No-Regret Dynamics
by: Guha, Etash Kumar
Published: (2023)
Eluder-based Regret for Stochastic Contextual MDPs
by: Levy, Orin, et al.
Published: (2022)
Sample Complexity Characterization for Linear Contextual MDPs
by: Deng, Junze, et al.
Published: (2024)
Conditional diffusions for amortized neural posterior estimation
by: Chen, Tianyu, et al.
Published: (2024)
Solving robust MDPs as a sequence of static RL problems
by: Zouitine, Adil, et al.
Published: (2024)
No-Regret Reinforcement Learning in Smooth MDPs
by: Maran, Davide, et al.
Published: (2024)
Runtime Monitoring of Perception-Based Autonomous Systems via Embedding Temporal Logic
by: Kapoor, Parv, et al.
Published: (2026)
Efficiently Solving Discounted MDPs with Predictions on Transition Matrices
by: Lyu, Lixing, et al.
Published: (2025)
Reinforcement Learning from Adversarial Preferences in Tabular MDPs
by: Tsuchiya, Taira, et al.
Published: (2025)
Demystifying Linear MDPs and Novel Dynamics Aggregation Framework
by: Lee, Joongkyu, et al.
Published: (2024)
Policy Gradient in Robust MDPs with Global Convergence Guarantee
by: Wang, Qiuhao, et al.
Published: (2022)
Near-Optimal Sample Complexity for Online Constrained MDPs
by: Liu, Chang, et al.
Published: (2026)
Online Learning in MDPs with Partially Adversarial Transitions and Losses
by: Schlisselberg, Ofir, et al.
Published: (2026)
Sample Complexity Bounds for Linear Constrained MDPs with a Generative Model
by: Liu, Xingtu, et al.
Published: (2025)