| Main Authors: | Pula, Sai Gana Sandeep, Kumar, Sathish A. P., Jha, Sumit, Ramanathan, Arvind |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Online Access: | https://arxiv.org/abs/2504.03163 |
Similar Items
MORAL: A Multimodal Reinforcement Learning Framework for Decision Making in Autonomous Laboratories
by: Tirabassi, Natalie, et al.
Published: (2025)
ACE-RLHF: Automated Code Evaluation and Socratic Feedback Generation Tool using Large Language Models and Reinforcement Learning with Human Feedback
by: Rahman, Tasnia, et al.
Published: (2025)
DML-RAM: Deep Multimodal Learning Framework for Robotic Arm Manipulation using Pre-trained Models
by: Kumar, Sathish, et al.
Published: (2025)
Principled Penalty-based Methods for Bilevel Reinforcement Learning and RLHF
by: Shen, Han, et al.
Published: (2024)
Automaton Distillation: Neuro-Symbolic Transfer Learning for Deep Reinforcement Learning
by: Singireddy, Suraj, et al.
Published: (2023)
Machine Learning Algorithms in Statistical Modelling Bridging Theory and Application
by: Rao, A. Ganapathi, et al.
Published: (2025)
BIONIX: A Wireless, Low-Cost Prosthetic Arm with Dual-Signal EEG and EMG Control
by: Kumar, Pranesh Sathish
Published: (2025)
Quantum-Enhanced Hybrid Reinforcement Learning Framework for Dynamic Path Planning in Autonomous Systems
by: Tomar, Sahil, et al.
Published: (2025)
Improving Reinforcement Learning Sample-Efficiency using Local Approximation
by: Prashant, Mohit, et al.
Published: (2025)
Solving Richly Constrained Reinforcement Learning through State Augmentation and Reward Penalties
by: Jiang, Hao, et al.
Published: (2023)
A Lyapunov Drift-Plus-Penalty Method Tailored for Reinforcement Learning with Queue Stability
by: Xu, Wenhan, et al.
Published: (2025)
Restless Bandits with Individual Penalty Constraints: Near-Optimal Indices and Deep Reinforcement Learning
by: Zamir, Nida, et al.
Published: (2026)
Just Enough Thinking: Efficient Reasoning with Adaptive Length Penalties Reinforcement Learning
by: Xiang, Violet, et al.
Published: (2025)
Overconfident Errors Need Stronger Correction: Asymmetric Confidence Penalties for Reinforcement Learning
by: Xu, Yuanda, et al.
Published: (2026)
PokeRL: Reinforcement Learning for Pokemon Red
by: Mudireddy, Dheeraj, et al.
Published: (2026)
Enhancing Robustness of Graph Neural Networks through p-Laplacian
by: Sirohi, Anuj Kumar, et al.
Published: (2025)
Enhancing Robustness of Graph Neural Networks through p-Laplacian
by: Sirohi, Anuj Kumar, et al.
Published: (2024)
A Random Matrix Theory Perspective on the Learning Dynamics of Multi-head Latent Attention
by: Jha, Nandan Kumar, et al.
Published: (2025)
Bidirectional-Reachable Hierarchical Reinforcement Learning with Mutually Responsive Policies
by: Luo, Yu, et al.
Published: (2024)
On Penalty-based Bilevel Gradient Descent Method
by: Shen, Han, et al.
Published: (2023)
Primal-Only Actor Critic Algorithm for Robust Constrained Average Cost MDPs
by: Satheesh, Anirudh, et al.
Published: (2025)
Penalty Learning for Optimal Partitioning using Multilayer Perceptron
by: Nguyen, Tung L, et al.
Published: (2024)
CAPSULE: Control-Theoretic Action Perturbations for Safe Uncertainty-Aware Reinforcement Learning
by: Narava, Rahul, et al.
Published: (2026)
Exterior Penalty Policy Optimization with Penalty Metric Network under Constraints
by: Gao, Shiqing, et al.
Published: (2024)
DQ4FairIM: Fairness-aware Influence Maximization using Deep Reinforcement Learning
by: Saxena, Akrati, et al.
Published: (2025)
From Sequential to Recursive: Enhancing Decision-Focused Learning with Bidirectional Feedback
by: Wang, Xinyu, et al.
Published: (2025)
BiTrajDiff: Bidirectional Trajectory Generation with Diffusion Models for Offline Reinforcement Learning
by: Qing, Yunpeng, et al.
Published: (2025)
Flight Delay Prediction using Hybrid Machine Learning Approach: A Case Study of Major Airlines in the United States
by: Jha, Rajesh Kumar, et al.
Published: (2024)
SHARP-QoS: Sparsely-gated Hierarchical Adaptive Routing for joint Prediction of QoS
by: Kumar, Suraj, et al.
Published: (2025)
Convex Regression with a Penalty
by: Lim, Eunji
Published: (2025)
Quantum-Enhanced Forecasting for Deep Reinforcement Learning in Algorithmic Trading
by: Chen, Jun-Hao, et al.
Published: (2025)
SafeOR-Gym: A Benchmark Suite for Safe Reinforcement Learning Algorithms on Practical Operations Research Problems
by: Ramanujam, Asha, et al.
Published: (2025)
Curriculum Learning for Safety Alignment
by: Kumar, Sandeep, et al.
Published: (2026)
Enhancing Reinforcement Learning for Radiology Report Generation with Evidence-aware Rewards and Self-correcting Preference Learning
by: Zhou, Qin, et al.
Published: (2026)
CDSA: Conservative Denoising Score-based Algorithm for Offline Reinforcement Learning
by: Liu, Zeyuan, et al.
Published: (2024)
GraphFLEx: Structure Learning Framework for Large Expanding Graphs
by: Kataria, Mohit, et al.
Published: (2025)
SLVR: Securely Leveraging Client Validation for Robust Federated Learning
by: Choi, Jihye, et al.
Published: (2025)
Stochastic Penalty-Barrier Methods for Constrained Machine Learning
by: Bosák, Adam, et al.
Published: (2026)
Task Aware Modulation Using Representation Learning for Upscaling of Terrestrial Carbon Fluxes
by: Rozanov, Aleksei, et al.
Published: (2026)
Learning Penalty for Optimal Partitioning via Automatic Feature Extraction
by: Nguyen, Tung L, et al.
Published: (2025)