:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Margapuri, Venkat, Kazanjian, Garik, Kosaraju, Naren
Format:	Preprint
Published:	2025
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2509.20105
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Hybrid Safety Verification of Multi-Agent Systems using $ψ$-Weighted CBFs and PAC Guarantees
by: Margapuri, Venkat, et al.
Published: (2025)

Diagnosis and Severity Assessment of Ulcerative Colitis using Self Supervised Learning
by: Margapuri, Venkat
Published: (2024)

Predicting Mortality and Functional Status Scores of Traumatic Brain Injury Patients using Supervised Machine Learning
by: Steinmetz, Lucas, et al.
Published: (2024)

Seed Kernel Counting using Domain Randomization and Object Tracking Neural Networks
by: Margapuri, Venkat, et al.
Published: (2023)

Leaf Angle Estimation using Mask R-CNN and LETR Vision Transformer
by: Margapuri, Venkat, et al.
Published: (2024)

Prompt Informed Reinforcement Learning for Visual Coverage Path Planning
by: Margapuri, Venkat
Published: (2025)

TSSR: Two-Stage Swap-Reward-Driven Reinforcement Learning for Character-Level SMILES Generation
by: Levine, Jacob Ede, et al.
Published: (2026)

LLMs as Layout Designers: Enhanced Spatial Reasoning for Content-Aware Layout Generation
by: Li, Sha, et al.
Published: (2025)

BERT Learns (and Teaches) Chemistry
by: Payne, Josh, et al.
Published: (2020)

Agentic Reasoning and Tool Integration for LLMs via Reinforcement Learning
by: Singh, Joykirat, et al.
Published: (2025)

Answering the Wrong Question: Reasoning Trace Inversion for Abstention in LLMs
by: Gourabathina, Abinitha, et al.
Published: (2026)

Emergent Hierarchical Reasoning in LLMs through Reinforcement Learning
by: Wang, Haozhe, et al.
Published: (2025)

Experience as a Compass: Multi-agent RAG with Evolving Orchestration and Agent Prompts
by: Li, Sha, et al.
Published: (2026)

Beyond Accuracy: Dissecting Mathematical Reasoning for LLMs Under Reinforcement Learning
by: Wang, Jiayu, et al.
Published: (2025)

CosmicFish-HRM: Adaptive Reasoning via Hierarchical Recurrent Mechanisms in Compact Language Models
by: Lakkapragada, Venkat Akhil
Published: (2026)

Hidden Thoughts Are Not Secret: Reasoning Trace Exposure in LLMs
by: Lu, Yu-An, et al.
Published: (2026)

ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning
by: Chen, Mingyang, et al.
Published: (2025)

G1: Teaching LLMs to Reason on Graphs with Reinforcement Learning
by: Guo, Xiaojun, et al.
Published: (2025)

SATURN: SAT-based Reinforcement Learning to Unleash LLMs Reasoning
by: Liu, Huanyu, et al.
Published: (2025)

Toward Better EHR Reasoning in LLMs: Reinforcement Learning with Expert Attention Guidance
by: Fang, Yue, et al.
Published: (2025)

The Loupe: A Plug-and-Play Attention Module for Amplifying Discriminative Features in Vision Transformers
by: Sengodan, Naren
Published: (2025)

Strat-Reasoner: Reinforcing Strategic Reasoning of LLMs in Multi-Agent Games
by: He, Yidong, et al.
Published: (2026)

LLMSense: Harnessing LLMs for High-level Reasoning Over Spatiotemporal Sensor Traces
by: Ouyang, Xiaomin, et al.
Published: (2024)

Toward Scientific Reasoning in LLMs: Training from Expert Discussions via Reinforcement Learning
by: Yin, Ming, et al.
Published: (2025)

DRAFT-RL: Multi-Agent Chain-of-Draft Reasoning for Reinforcement Learning-Enhanced LLMs
by: Li, Yuanhao, et al.
Published: (2025)

Boosting Accuracy and Efficiency of Budget Forcing in LLMs via Reinforcement Learning for Mathematical Reasoning
by: Tarunokusumo, Ravindra Aribowo, et al.
Published: (2025)

Human-Inspired Multi-Level Reinforcement Learning
by: Wu, Mingkang, et al.
Published: (2025)

LLM Augmentations to support Analytical Reasoning over Multiple Documents
by: Yousuf, Raquib Bin, et al.
Published: (2024)

Human-Inspired Framework to Accelerate Reinforcement Learning
by: Beikmohammadi, Ali, et al.
Published: (2023)

Emotion-Coherent Reasoning for Multimodal LLMs via Emotional Rationale Verifier
by: Rha, Hyeongseop, et al.
Published: (2025)

Unlocking Reasoning Capabilities in LLMs via Reinforcement Learning Exploration
by: Deng, Wenhao, et al.
Published: (2025)

Prompted Policy Search: Reinforcement Learning through Linguistic and Numerical Reasoning in LLMs
by: Zhou, Yifan, et al.
Published: (2025)

Reinforcement Learning with Verifiable Rewards Implicitly Incentivizes Correct Reasoning in Base LLMs
by: Wen, Xumeng, et al.
Published: (2025)

ARDNS-FN-Quantum: A Quantum-Enhanced Reinforcement Learning Framework with Cognitive-Inspired Adaptive Exploration for Dynamic Environments
by: de Sousa, Umberto Gonçalves
Published: (2025)

Mathematical Framework for Custom Reward Functions in Job Application Evaluation using Reinforcement Learning
by: Jain, Shreyansh, et al.
Published: (2025)

Brain-Inspired Planning for Better Generalization in Reinforcement Learning
by: Zhao, Mingde "Harry"
Published: (2025)

R3-RAG: Learning Step-by-Step Reasoning and Retrieval for LLMs via Reinforcement Learning
by: Li, Yuan, et al.
Published: (2025)

What's in an embedding? Would a rose by any embedding smell as sweet?
by: Venkatasubramanian, Venkat
Published: (2024)

Deep Reinforcement Learning with Gradient Eligibility Traces
by: Elelimy, Esraa, et al.
Published: (2025)

SUPERNOVA: Eliciting General Reasoning in LLMs with Reinforcement Learning on Natural Instructions
by: Suvarna, Ashima, et al.
Published: (2026)