| Main authors: | Yang, Qi; Ni, Bolin; Xiang, Shiming; Hu, Han; Peng, Houwen; Jiang, Jie |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online access: | https://arxiv.org/abs/2508.21113 |
Similar items
Beyond Sequential Distance: Inter-Modal Distance Invariant Position Encoding
by: Chen, Lin, et al.
Published: (2026)
VideoRFT: Incentivizing Video Reasoning Capability in MLLMs via Reinforced Fine-Tuning
by: Wang, Qi, et al.
Published: (2025)
Bee: A High-Quality Corpus and Full-Stack Suite to Unlock Advanced Fully Open MLLMs
by: Zhang, Yi, et al.
Published: (2025)
R1-Searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning
by: Song, Huatong, et al.
Published: (2025)
Xwin-LM: Strong and Scalable Alignment Practice for LLMs
by: Ni, Bolin, et al.
Published: (2024)
Context Reasoner: Incentivizing Reasoning Capability for Contextualized Privacy and Safety Compliance via Reinforcement Learning
by: Hu, Wenbin, et al.
Published: (2025)
Setting Up General Purpose CD-ROM Workstations.
by: Bolin, Robert L.
Published: (1991)
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
by: DeepSeek-AI, et al.
Published: (2025)
R-Log: Incentivizing Log Analysis Capability in LLMs via Reasoning-based Reinforcement Learning
by: Liu, Yilun, et al.
Published: (2025)
SophiaVL-R1: Reinforcing MLLMs Reasoning with Thinking Reward
by: Fan, Kaixuan, et al.
Published: (2025)
Enhancing Visual Continual Learning with Language-Guided Supervision
by: Ni, Bolin, et al.
Published: (2024)
Common 7B Language Models Already Possess Strong Math Capabilities
by: Li, Chen, et al.
Published: (2024)
Thinking with Deltas: Incentivizing Reinforcement Learning via Differential Visual Reasoning Policy
by: Gao, Shujian, et al.
Published: (2026)
Defying Imbalanced Forgetting in Class Incremental Learning
by: Xu, Shixiong, et al.
Published: (2024)
DeepEyes: Incentivizing "Thinking with Images" via Reinforcement Learning
by: Zheng, Ziwei, et al.
Published: (2025)
Saliency-R1: Incentivizing Unified Saliency Reasoning Capability in MLLM with Confidence-Guided Reinforcement Learning
by: Li, Long, et al.
Published: (2025)
Perception-R1: Advancing Multimodal Reasoning Capabilities of MLLMs via Visual Perception Reward
by: Xiao, Tong, et al.
Published: (2025)
VideoAuto-R1: Video Auto Reasoning via Thinking Once, Answering Twice
by: Liu, Shuming, et al.
Published: (2026)
Think 360°: Evaluating the Width-centric Reasoning Capability of MLLMs Beyond Depth
by: Chen, Mingrui, et al.
Published: (2026)
IF-Bench: Benchmarking and Enhancing MLLMs for Infrared Images with Generative Visual Prompting
by: Zhang, Tao, et al.
Published: (2025)
SpaceR: Reinforcing MLLMs in Video Spatial Reasoning
by: Ouyang, Kun, et al.
Published: (2025)
J1: Incentivizing Thinking in LLM-as-a-Judge via Reinforcement Learning
by: Whitehouse, Chenxi, et al.
Published: (2025)
rSIM: Incentivizing Reasoning Capabilities of LLMs via Reinforced Strategy Injection
by: Chen, Sijia, et al.
Published: (2025)
Video-R1: Reinforcing Video Reasoning in MLLMs
by: Feng, Kaituo, et al.
Published: (2025)
MedVLM-R1: Incentivizing Medical Reasoning Capability of Vision-Language Models (VLMs) via Reinforcement Learning
by: Pan, Jiazhen, et al.
Published: (2025)
WorkForceAgent-R1: Incentivizing Reasoning Capability in LLM-based Web Agents via Reinforcement Learning
by: Zhuang, Yuchen, et al.
Published: (2025)
VideoCap-R1: Enhancing MLLMs for Video Captioning via Structured Thinking
by: Meng, Desen, et al.
Published: (2025)
Urban-R1: Reinforced MLLMs Mitigate Geospatial Biases for Urban General Intelligence
by: Wang, Qiongyan, et al.
Published: (2025)
Incentivizing Tool-augmented Thinking with Images for Medical Image Analysis
by: Jiang, Yankai, et al.
Published: (2025)
ChartEdit: How Far Are MLLMs From Automating Chart Analysis? Evaluating MLLMs' Capability via Chart Editing
by: Zhao, Xuanle, et al.
Published: (2025)
Mitigating Visual Forgetting via Take-along Visual Conditioning for Multi-modal Long CoT Reasoning
by: Sun, Hai-Long, et al.
Published: (2025)
Vision-R1: Incentivizing Reasoning Capability in Multimodal Large Language Models
by: Huang, Wenxuan, et al.
Published: (2025)
Omni-AutoThink: Adaptive Multimodal Reasoning via Reinforcement Learning
by: Yang, Dongchao, et al.
Published: (2025)
R-Bench: Graduate-level Multi-disciplinary Benchmarks for LLM & MLLM Complex Reasoning Evaluation
by: Guo, Meng-Hao, et al.
Published: (2025)
Timelike entanglement entropy in dS$_3$/CFT$_2$
by: Jiang, Xin, et al.
Published: (2023)
How Einstein's Equation Emerges From CFT$_2$
by: Jiang, Xin, et al.
Published: (2024)
Timelike entanglement entropy and $T\bar{T}$ deformation
by: Jiang, Xin, et al.
Published: (2023)
Realization of "ER=EPR"
by: Jiang, Xin, et al.
Published: (2024)
An alternative to purification in CFT
by: Jiang, Xin, et al.
Published: (2024)
Mixed State Entanglement Entropy in CFT
by: Jiang, Xin, et al.
Published: (2025)