| Main Authors: | Wang, Haonan, Sun, Junfeng, Zhao, Mingjia, Liu, Wei |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Online Access: | https://arxiv.org/abs/2510.01076 |
Similar Items
Design and Optimization of Reinforcement Learning-Based Agents in Text-Based Games
by: Wang, Haonan, et al.
Published: (2025)
Research on geometric figure classification algorithm based on Deep Learning
by: Wang, Ruiyang, et al.
Published: (2024)
ByteSized32Refactored: Towards an Extensible Interactive Text Games Corpus for LLM World Modeling and Evaluation
by: Wang, Haonan, et al.
Published: (2025)
Embodied Web Agents: Bridging Physical-Digital Realms for Integrated Agent Intelligence
by: Hong, Yining, et al.
Published: (2025)
Reinforcement Learning for Tool-Integrated Interleaved Thinking towards Cross-Domain Generalization
by: Chen, Zhengyu, et al.
Published: (2025)
EXPLORER: Exploration-guided Reasoning for Textual Reinforcement Learning
by: Basu, Kinjal, et al.
Published: (2024)
PoLi-RL: A Point-to-List Reinforcement Learning Framework for Conditional Semantic Textual Similarity
by: Song, Zixin, et al.
Published: (2025)
AgentSkiller: Scaling Generalist Agent Intelligence through Semantically Integrated Cross-Domain Data Synthesis
by: Sun, Zexu, et al.
Published: (2026)
Reversible Jump Attack to Textual Classifiers with Modification Reduction
by: Ni, Mingze, et al.
Published: (2024)
Adaptive Federated Distillation for Multi-Domain Non-IID Textual Data
by: Xiao, Jiahao, et al.
Published: (2025)
Unveil: Unified Visual-Textual Integration and Distillation for Multi-modal Document Retrieval
by: Sun, Hao, et al.
Published: (2026)
GITA: Graph to Visual and Textual Integration for Vision-Language Graph Reasoning
by: Wei, Yanbin, et al.
Published: (2024)
Unleashing Embodied Task Planning Ability in LLMs via Reinforcement Learning
by: Fei, Zhaoye, et al.
Published: (2025)
Grid Spatial Understanding: A Dataset for Textual Spatial Reasoning over Grids, Embodied Settings, and Coordinate Structures
by: Sidhu, Risham, et al.
Published: (2026)
More Than Catastrophic Forgetting: Integrating General Capabilities For Domain-Specific LLMs
by: Liu, Chengyuan, et al.
Published: (2024)
MIRL: Mutual Information-Guided Reinforcement Learning for Vision-Language Models
by: Zhang, Yin, et al.
Published: (2026)
Table Transformers for Imputing Textual Attributes
by: Wei, Ting-Ruen, et al.
Published: (2024)
Transforming Surgical Interventions with Embodied Intelligence for Ultrasound Robotics
by: Xu, Huan, et al.
Published: (2024)
Learning to Rewrite: Generalized LLM-Generated Text Detection
by: Li, Ran, et al.
Published: (2024)
True Knowledge Comes from Practice: Aligning LLMs with Embodied Environments via Reinforcement Learning
by: Tan, Weihao, et al.
Published: (2024)
Reinforced Lifelong Editing for Language Models
by: Li, Zherui, et al.
Published: (2025)
DISCO Balances the Scales: Adaptive Domain- and Difficulty-Aware Reinforcement Learning on Imbalanced Data
by: Zhou, Yuhang, et al.
Published: (2025)
HiEdit: Lifelong Model Editing with Hierarchical Reinforcement Learning
by: Wang, Yangfan, et al.
Published: (2026)
Enhancing Surgical Robots with Embodied Intelligence for Autonomous Ultrasound Scanning
by: Xu, Huan, et al.
Published: (2024)
ProMed: Shapley Information Gain Guided Reinforcement Learning for Proactive Medical LLMs
by: Ding, Hongxin, et al.
Published: (2025)
Beyond the Textual: Generating Coherent Visual Options for MCQs
by: Wang, Wanqiang, et al.
Published: (2025)
Textual Similarity as a Key Metric in Machine Translation Quality Estimation
by: Sun, Kun, et al.
Published: (2024)
Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective
by: Cheng, Zhoujun, et al.
Published: (2025)
Looking Beyond Text: Reducing Language bias in Large Vision-Language Models via Multimodal Dual-Attention and Soft-Image Guidance
by: Zhao, Haozhe, et al.
Published: (2024)
Adam's Law: Textual Frequency Law on Large Language Models
by: Lu, Hongyuan Adam, et al.
Published: (2026)
Rubrics as Rewards: Reinforcement Learning Beyond Verifiable Domains
by: Gunjal, Anisha, et al.
Published: (2025)
High Fidelity Textual User Representation over Heterogeneous Sources via Reinforcement Learning
by: Arora, Rajat, et al.
Published: (2026)
LiveThinking: Enabling Real-Time Efficient Reasoning for AI-Powered Livestreaming via Reinforcement Learning
by: Sun, Yuhan, et al.
Published: (2025)
AutoTIR: Autonomous Tools Integrated Reasoning via Reinforcement Learning
by: Wei, Yifan, et al.
Published: (2025)
Graph-Guided Textual Explanation Generation Framework
by: Yuan, Shuzhou, et al.
Published: (2024)
Dr. Assistant: Enhancing Clinical Diagnostic Inquiry via Structured Diagnostic Reasoning Data and Reinforcement Learning
by: Guo, Yue, et al.
Published: (2026)
Textual-to-Visual Iterative Self-Verification for Slide Generation
by: Xu, Yunqing, et al.
Published: (2025)
Integrating Multi-view Analysis: Multi-view Mixture-of-Expert for Textual Personality Detection
by: Zhu, Haohao, et al.
Published: (2024)
Learning From Failure: Integrating Negative Examples when Fine-tuning Large Language Models as Agents
by: Wang, Renxi, et al.
Published: (2024)
EmbodiedEval: Evaluate Multimodal LLMs as Embodied Agents
by: Cheng, Zhili, et al.
Published: (2025)