Saved in:
| Main Authors: | Margapuri, Venkat, Kazanjian, Garik, Kosaraju, Naren |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2509.20105 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Hybrid Safety Verification of Multi-Agent Systems using $ψ$-Weighted CBFs and PAC Guarantees
by: Margapuri, Venkat, et al.
Published: (2025)
by: Margapuri, Venkat, et al.
Published: (2025)
Diagnosis and Severity Assessment of Ulcerative Colitis using Self Supervised Learning
by: Margapuri, Venkat
Published: (2024)
by: Margapuri, Venkat
Published: (2024)
Predicting Mortality and Functional Status Scores of Traumatic Brain Injury Patients using Supervised Machine Learning
by: Steinmetz, Lucas, et al.
Published: (2024)
by: Steinmetz, Lucas, et al.
Published: (2024)
Seed Kernel Counting using Domain Randomization and Object Tracking Neural Networks
by: Margapuri, Venkat, et al.
Published: (2023)
by: Margapuri, Venkat, et al.
Published: (2023)
Leaf Angle Estimation using Mask R-CNN and LETR Vision Transformer
by: Margapuri, Venkat, et al.
Published: (2024)
by: Margapuri, Venkat, et al.
Published: (2024)
Prompt Informed Reinforcement Learning for Visual Coverage Path Planning
by: Margapuri, Venkat
Published: (2025)
by: Margapuri, Venkat
Published: (2025)
TSSR: Two-Stage Swap-Reward-Driven Reinforcement Learning for Character-Level SMILES Generation
by: Levine, Jacob Ede, et al.
Published: (2026)
by: Levine, Jacob Ede, et al.
Published: (2026)
LLMs as Layout Designers: Enhanced Spatial Reasoning for Content-Aware Layout Generation
by: Li, Sha, et al.
Published: (2025)
by: Li, Sha, et al.
Published: (2025)
BERT Learns (and Teaches) Chemistry
by: Payne, Josh, et al.
Published: (2020)
by: Payne, Josh, et al.
Published: (2020)
Agentic Reasoning and Tool Integration for LLMs via Reinforcement Learning
by: Singh, Joykirat, et al.
Published: (2025)
by: Singh, Joykirat, et al.
Published: (2025)
Answering the Wrong Question: Reasoning Trace Inversion for Abstention in LLMs
by: Gourabathina, Abinitha, et al.
Published: (2026)
by: Gourabathina, Abinitha, et al.
Published: (2026)
Emergent Hierarchical Reasoning in LLMs through Reinforcement Learning
by: Wang, Haozhe, et al.
Published: (2025)
by: Wang, Haozhe, et al.
Published: (2025)
Experience as a Compass: Multi-agent RAG with Evolving Orchestration and Agent Prompts
by: Li, Sha, et al.
Published: (2026)
by: Li, Sha, et al.
Published: (2026)
Beyond Accuracy: Dissecting Mathematical Reasoning for LLMs Under Reinforcement Learning
by: Wang, Jiayu, et al.
Published: (2025)
by: Wang, Jiayu, et al.
Published: (2025)
CosmicFish-HRM: Adaptive Reasoning via Hierarchical Recurrent Mechanisms in Compact Language Models
by: Lakkapragada, Venkat Akhil
Published: (2026)
by: Lakkapragada, Venkat Akhil
Published: (2026)
Hidden Thoughts Are Not Secret: Reasoning Trace Exposure in LLMs
by: Lu, Yu-An, et al.
Published: (2026)
by: Lu, Yu-An, et al.
Published: (2026)
ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning
by: Chen, Mingyang, et al.
Published: (2025)
by: Chen, Mingyang, et al.
Published: (2025)
G1: Teaching LLMs to Reason on Graphs with Reinforcement Learning
by: Guo, Xiaojun, et al.
Published: (2025)
by: Guo, Xiaojun, et al.
Published: (2025)
SATURN: SAT-based Reinforcement Learning to Unleash LLMs Reasoning
by: Liu, Huanyu, et al.
Published: (2025)
by: Liu, Huanyu, et al.
Published: (2025)
Toward Better EHR Reasoning in LLMs: Reinforcement Learning with Expert Attention Guidance
by: Fang, Yue, et al.
Published: (2025)
by: Fang, Yue, et al.
Published: (2025)
The Loupe: A Plug-and-Play Attention Module for Amplifying Discriminative Features in Vision Transformers
by: Sengodan, Naren
Published: (2025)
by: Sengodan, Naren
Published: (2025)
Strat-Reasoner: Reinforcing Strategic Reasoning of LLMs in Multi-Agent Games
by: He, Yidong, et al.
Published: (2026)
by: He, Yidong, et al.
Published: (2026)
LLMSense: Harnessing LLMs for High-level Reasoning Over Spatiotemporal Sensor Traces
by: Ouyang, Xiaomin, et al.
Published: (2024)
by: Ouyang, Xiaomin, et al.
Published: (2024)
Toward Scientific Reasoning in LLMs: Training from Expert Discussions via Reinforcement Learning
by: Yin, Ming, et al.
Published: (2025)
by: Yin, Ming, et al.
Published: (2025)
DRAFT-RL: Multi-Agent Chain-of-Draft Reasoning for Reinforcement Learning-Enhanced LLMs
by: Li, Yuanhao, et al.
Published: (2025)
by: Li, Yuanhao, et al.
Published: (2025)
Boosting Accuracy and Efficiency of Budget Forcing in LLMs via Reinforcement Learning for Mathematical Reasoning
by: Tarunokusumo, Ravindra Aribowo, et al.
Published: (2025)
by: Tarunokusumo, Ravindra Aribowo, et al.
Published: (2025)
Human-Inspired Multi-Level Reinforcement Learning
by: Wu, Mingkang, et al.
Published: (2025)
by: Wu, Mingkang, et al.
Published: (2025)
LLM Augmentations to support Analytical Reasoning over Multiple Documents
by: Yousuf, Raquib Bin, et al.
Published: (2024)
by: Yousuf, Raquib Bin, et al.
Published: (2024)
Human-Inspired Framework to Accelerate Reinforcement Learning
by: Beikmohammadi, Ali, et al.
Published: (2023)
by: Beikmohammadi, Ali, et al.
Published: (2023)
Emotion-Coherent Reasoning for Multimodal LLMs via Emotional Rationale Verifier
by: Rha, Hyeongseop, et al.
Published: (2025)
by: Rha, Hyeongseop, et al.
Published: (2025)
Unlocking Reasoning Capabilities in LLMs via Reinforcement Learning Exploration
by: Deng, Wenhao, et al.
Published: (2025)
by: Deng, Wenhao, et al.
Published: (2025)
Prompted Policy Search: Reinforcement Learning through Linguistic and Numerical Reasoning in LLMs
by: Zhou, Yifan, et al.
Published: (2025)
by: Zhou, Yifan, et al.
Published: (2025)
Reinforcement Learning with Verifiable Rewards Implicitly Incentivizes Correct Reasoning in Base LLMs
by: Wen, Xumeng, et al.
Published: (2025)
by: Wen, Xumeng, et al.
Published: (2025)
ARDNS-FN-Quantum: A Quantum-Enhanced Reinforcement Learning Framework with Cognitive-Inspired Adaptive Exploration for Dynamic Environments
by: de Sousa, Umberto Gonçalves
Published: (2025)
by: de Sousa, Umberto Gonçalves
Published: (2025)
Mathematical Framework for Custom Reward Functions in Job Application Evaluation using Reinforcement Learning
by: Jain, Shreyansh, et al.
Published: (2025)
by: Jain, Shreyansh, et al.
Published: (2025)
Brain-Inspired Planning for Better Generalization in Reinforcement Learning
by: Zhao, Mingde "Harry"
Published: (2025)
by: Zhao, Mingde "Harry"
Published: (2025)
R3-RAG: Learning Step-by-Step Reasoning and Retrieval for LLMs via Reinforcement Learning
by: Li, Yuan, et al.
Published: (2025)
by: Li, Yuan, et al.
Published: (2025)
What's in an embedding? Would a rose by any embedding smell as sweet?
by: Venkatasubramanian, Venkat
Published: (2024)
by: Venkatasubramanian, Venkat
Published: (2024)
Deep Reinforcement Learning with Gradient Eligibility Traces
by: Elelimy, Esraa, et al.
Published: (2025)
by: Elelimy, Esraa, et al.
Published: (2025)
SUPERNOVA: Eliciting General Reasoning in LLMs with Reinforcement Learning on Natural Instructions
by: Suvarna, Ashima, et al.
Published: (2026)
by: Suvarna, Ashima, et al.
Published: (2026)
Similar Items
-
Hybrid Safety Verification of Multi-Agent Systems using $ψ$-Weighted CBFs and PAC Guarantees
by: Margapuri, Venkat, et al.
Published: (2025) -
Diagnosis and Severity Assessment of Ulcerative Colitis using Self Supervised Learning
by: Margapuri, Venkat
Published: (2024) -
Predicting Mortality and Functional Status Scores of Traumatic Brain Injury Patients using Supervised Machine Learning
by: Steinmetz, Lucas, et al.
Published: (2024) -
Seed Kernel Counting using Domain Randomization and Object Tracking Neural Networks
by: Margapuri, Venkat, et al.
Published: (2023) -
Leaf Angle Estimation using Mask R-CNN and LETR Vision Transformer
by: Margapuri, Venkat, et al.
Published: (2024)