:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Pula, Sai Gana Sandeep, Kumar, Sathish A. P., Jha, Sumit, Ramanathan, Arvind
Format:	Preprint
Published:	2025
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2504.03163
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

MORAL: A Multimodal Reinforcement Learning Framework for Decision Making in Autonomous Laboratories
by: Tirabassi, Natalie, et al.
Published: (2025)

ACE-RLHF: Automated Code Evaluation and Socratic Feedback Generation Tool using Large Language Models and Reinforcement Learning with Human Feedback
by: Rahman, Tasnia, et al.
Published: (2025)

DML-RAM: Deep Multimodal Learning Framework for Robotic Arm Manipulation using Pre-trained Models
by: Kumar, Sathish, et al.
Published: (2025)

Principled Penalty-based Methods for Bilevel Reinforcement Learning and RLHF
by: Shen, Han, et al.
Published: (2024)

Automaton Distillation: Neuro-Symbolic Transfer Learning for Deep Reinforcement Learning
by: Singireddy, Suraj, et al.
Published: (2023)

Machine Learning Algorithms in Statistical Modelling Bridging Theory and Application
by: Rao, A. Ganapathi, et al.
Published: (2025)

BIONIX: A Wireless, Low-Cost Prosthetic Arm with Dual-Signal EEG and EMG Control
by: Kumar, Pranesh Sathish
Published: (2025)

Quantum-Enhanced Hybrid Reinforcement Learning Framework for Dynamic Path Planning in Autonomous Systems
by: Tomar, Sahil, et al.
Published: (2025)

Improving Reinforcement Learning Sample-Efficiency using Local Approximation
by: Prashant, Mohit, et al.
Published: (2025)

Solving Richly Constrained Reinforcement Learning through State Augmentation and Reward Penalties
by: Jiang, Hao, et al.
Published: (2023)

A Lyapunov Drift-Plus-Penalty Method Tailored for Reinforcement Learning with Queue Stability
by: Xu, Wenhan, et al.
Published: (2025)

Restless Bandits with Individual Penalty Constraints: Near-Optimal Indices and Deep Reinforcement Learning
by: Zamir, Nida, et al.
Published: (2026)

Just Enough Thinking: Efficient Reasoning with Adaptive Length Penalties Reinforcement Learning
by: Xiang, Violet, et al.
Published: (2025)

Overconfident Errors Need Stronger Correction: Asymmetric Confidence Penalties for Reinforcement Learning
by: Xu, Yuanda, et al.
Published: (2026)

PokeRL: Reinforcement Learning for Pokemon Red
by: Mudireddy, Dheeraj, et al.
Published: (2026)

Enhancing Robustness of Graph Neural Networks through p-Laplacian
by: Sirohi, Anuj Kumar, et al.
Published: (2025)

Enhancing Robustness of Graph Neural Networks through p-Laplacian
by: Sirohi, Anuj Kumar, et al.
Published: (2024)

A Random Matrix Theory Perspective on the Learning Dynamics of Multi-head Latent Attention
by: Jha, Nandan Kumar, et al.
Published: (2025)

Bidirectional-Reachable Hierarchical Reinforcement Learning with Mutually Responsive Policies
by: Luo, Yu, et al.
Published: (2024)

On Penalty-based Bilevel Gradient Descent Method
by: Shen, Han, et al.
Published: (2023)

Primal-Only Actor Critic Algorithm for Robust Constrained Average Cost MDPs
by: Satheesh, Anirudh, et al.
Published: (2025)

Penalty Learning for Optimal Partitioning using Multilayer Perceptron
by: Nguyen, Tung L, et al.
Published: (2024)

CAPSULE: Control-Theoretic Action Perturbations for Safe Uncertainty-Aware Reinforcement Learning
by: Narava, Rahul, et al.
Published: (2026)

Exterior Penalty Policy Optimization with Penalty Metric Network under Constraints
by: Gao, Shiqing, et al.
Published: (2024)

DQ4FairIM: Fairness-aware Influence Maximization using Deep Reinforcement Learning
by: Saxena, Akrati, et al.
Published: (2025)

From Sequential to Recursive: Enhancing Decision-Focused Learning with Bidirectional Feedback
by: Wang, Xinyu, et al.
Published: (2025)

BiTrajDiff: Bidirectional Trajectory Generation with Diffusion Models for Offline Reinforcement Learning
by: Qing, Yunpeng, et al.
Published: (2025)

Flight Delay Prediction using Hybrid Machine Learning Approach: A Case Study of Major Airlines in the United States
by: Jha, Rajesh Kumar, et al.
Published: (2024)

SHARP-QoS: Sparsely-gated Hierarchical Adaptive Routing for joint Prediction of QoS
by: Kumar, Suraj, et al.
Published: (2025)

Convex Regression with a Penalty
by: Lim, Eunji
Published: (2025)

Quantum-Enhanced Forecasting for Deep Reinforcement Learning in Algorithmic Trading
by: Chen, Jun-Hao, et al.
Published: (2025)

SafeOR-Gym: A Benchmark Suite for Safe Reinforcement Learning Algorithms on Practical Operations Research Problems
by: Ramanujam, Asha, et al.
Published: (2025)

Curriculum Learning for Safety Alignment
by: Kumar, Sandeep, et al.
Published: (2026)

Enhancing Reinforcement Learning for Radiology Report Generation with Evidence-aware Rewards and Self-correcting Preference Learning
by: Zhou, Qin, et al.
Published: (2026)

CDSA: Conservative Denoising Score-based Algorithm for Offline Reinforcement Learning
by: Liu, Zeyuan, et al.
Published: (2024)

GraphFLEx: Structure Learning Framework for Large Expanding Graphs
by: Kataria, Mohit, et al.
Published: (2025)

SLVR: Securely Leveraging Client Validation for Robust Federated Learning
by: Choi, Jihye, et al.
Published: (2025)

Stochastic Penalty-Barrier Methods for Constrained Machine Learning
by: Bosák, Adam, et al.
Published: (2026)

Task Aware Modulation Using Representation Learning for Upsaling of Terrestrial Carbon Fluxes
by: Rozanov, Aleksei, et al.
Published: (2026)

Learning Penalty for Optimal Partitioning via Automatic Feature Extraction
by: Nguyen, Tung L, et al.
Published: (2025)