:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Kapoor, Vansh, Nair, Jayakrishnan
Format:	Preprint
Published:	2025
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2505.03280
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Capacity Provisioning Motivated Online Non-Convex Optimization Problem with Memory and Switching Cost
by: Vaze, Rahul, et al.
Published: (2024)

Influence of Recommender Systems on Users: A Dynamical Systems Analysis
by: Lankireddy, Prabhat, et al.
Published: (2024)

Representative Arm Identification: A fixed confidence approach to identify cluster representatives
by: Gharat, Sarvesh, et al.
Published: (2024)

TRIM: Hybrid Inference via Targeted Stepwise Routing in Multi-Step Reasoning Tasks
by: Kapoor, Vansh, et al.
Published: (2026)

Primal-Only Actor Critic Algorithm for Robust Constrained Average Cost MDPs
by: Satheesh, Anirudh, et al.
Published: (2025)

A Reliable Knowledge Processing Framework for Combustion Science using Foundation Models
by: Sharma, Vansh, et al.
Published: (2023)

Adaptive Multi-Scale Goodness Aggregation for Forward-Forward Learning
by: Beigzad, Salar, et al.
Published: (2026)

Score-Guided Proximal Projection: A Unified Geometric Framework for Rectified Flow Editing
by: Bansal, Vansh, et al.
Published: (2026)

Convergence of Natural Policy Gradient for a Family of Infinite-State Queueing MDPs
by: Grosof, Isaac, et al.
Published: (2024)

A Machine Learning Based Approach for Statistical Analysis of Detonation Cells from Soot Foils
by: Sharma, Vansh, et al.
Published: (2024)

On the Optimizer Dependence of Neural Scaling Laws
by: Ramani, Vansh, et al.
Published: (2026)

Conformal C2ST: Turning weak classifiers into strong two-sample tests
by: Bansal, Vansh, et al.
Published: (2025)

CoLT: The conditional localization test for assessing the accuracy of neural posterior estimates
by: Chen, Tianyu, et al.
Published: (2025)

Time-Constrained Robust MDPs
by: Zouitine, Adil, et al.
Published: (2024)

Landscape of Policy Optimization for Finite Horizon MDPs with General State and Action
by: Chen, Xin, et al.
Published: (2024)

Reinforcement Learning for Infinite-Horizon Average-Reward Linear MDPs via Approximation by Discounted-Reward MDPs
by: Hong, Kihyuk, et al.
Published: (2024)

On the Convergence and Straightness of Rectified Flow
by: Bansal, Vansh, et al.
Published: (2024)

On Value Iteration Convergence in Connected MDPs
by: Mustafin, Arsenii, et al.
Published: (2024)

Robust Parameter Learning for Uncertain MDPs
by: Schnitzer, Yannik, et al.
Published: (2026)

Truly No-Regret Learning in Constrained MDPs
by: Müller, Adrian, et al.
Published: (2024)

Soft Robust MDPs and Risk-Sensitive MDPs: Equivalence, Policy Gradient, and Sample Complexity
by: Zhang, Runyu, et al.
Published: (2023)

Offline Bayesian Aleatoric and Epistemic Uncertainty Quantification and Posterior Value Optimisation in Finite-State MDPs
by: Valdettaro, Filippo, et al.
Published: (2024)

Position: Graph Condensation Needs a Reset -- Move Beyond Full-dataset Training and Model-Dependence
by: Gupta, Mridul, et al.
Published: (2026)

Prospect-Theory Behavior from Bellman Optimality in MDPs with Catastrophic States
by: Chen, Yujiao
Published: (2026)

An approximate graph elicits detonation lattice
by: Sharma, Vansh, et al.
Published: (2026)

Learning Adversarial MDPs with Stochastic Hard Constraints
by: Stradi, Francesco Emanuele, et al.
Published: (2024)

Solving Robust MDPs through No-Regret Dynamics
by: Guha, Etash Kumar
Published: (2023)

Eluder-based Regret for Stochastic Contextual MDPs
by: Levy, Orin, et al.
Published: (2022)

Sample Complexity Characterization for Linear Contextual MDPs
by: Deng, Junze, et al.
Published: (2024)

Conditional diffusions for amortized neural posterior estimation
by: Chen, Tianyu, et al.
Published: (2024)

Solving robust MDPs as a sequence of static RL problems
by: Zouitine, Adil, et al.
Published: (2024)

No-Regret Reinforcement Learning in Smooth MDPs
by: Maran, Davide, et al.
Published: (2024)

Runtime Monitoring of Perception-Based Autonomous Systems via Embedding Temporal Logic
by: Kapoor, Parv, et al.
Published: (2026)

Efficiently Solving Discounted MDPs with Predictions on Transition Matrices
by: Lyu, Lixing, et al.
Published: (2025)

Reinforcement Learning from Adversarial Preferences in Tabular MDPs
by: Tsuchiya, Taira, et al.
Published: (2025)

Demystifying Linear MDPs and Novel Dynamics Aggregation Framework
by: Lee, Joongkyu, et al.
Published: (2024)

Policy Gradient in Robust MDPs with Global Convergence Guarantee
by: Wang, Qiuhao, et al.
Published: (2022)

Near-Optimal Sample Complexity for Online Constrained MDPs
by: Liu, Chang, et al.
Published: (2026)

Online Learning in MDPs with Partially Adversarial Transitions and Losses
by: Schlisselberg, Ofir, et al.
Published: (2026)

Sample Complexity Bounds for Linear Constrained MDPs with a Generative Model
by: Liu, Xingtu, et al.
Published: (2025)