| Main authors: | Zhang, Zhixing; Zhang, Jesen; Liu, Hao; Lv, Qinhan; Yang, Jing; Cai, Kaitong; Wang, Keze |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online access: | https://arxiv.org/abs/2602.15325 |
Similar items
CoAgent: Collaborative Planning and Consistency Agent for Coherent Video Generation
by: Zeng, Qinglin, et al.
Published: (2025)
Self-Rewarded Multimodal Coherent Reasoning Across Diverse Visual Domains
by: Zhang, Jesen, et al.
Published: (2025)
STORM: Search-Guided Generative World Models for Robotic Manipulation
by: Lin, Wenjun, et al.
Published: (2025)
RaCoT: Plug-and-Play Contrastive Example Generation Mechanism for Enhanced LLM Reasoning Reliability
by: Cai, Kaitong, et al.
Published: (2025)
SirenPose: Dynamic Scene Reconstruction via Geometric Supervision
by: Cai, Kaitong, et al.
Published: (2025)
Guardian: Decoupling Exploration from Safety in Reinforcement Learning
by: Cai, Kaitong, et al.
Published: (2025)
Learning Dynamics of VLM Finetuning
by: Zhang, Jusheng, et al.
Published: (2025)
HybridToken-VLM: Hybrid Token Compression for Vision-Language Models
by: Zhang, Jusheng, et al.
Published: (2025)
OSC: Cognitive Orchestration through Dynamic Knowledge Alignment in Multi-Agent LLM Collaboration
by: Zhang, Jusheng, et al.
Published: (2025)
RevFFN: Memory-Efficient Full-Parameter Fine-Tuning of Mixture-of-Experts LLMs with Reversible Blocks
by: Liu, Ningyuan, et al.
Published: (2025)
MAT-Agent: Adaptive Multi-Agent Training Optimization
by: Zhang, Jusheng, et al.
Published: (2025)
AgriAgent: Contract-Driven Planning and Capability-Aware Tool Orchestration in Real-World Agriculture
by: Yang, Bo, et al.
Published: (2026)
HiVA: Self-organized Hierarchical Variable Agent via Goal-driven Semantic-Topological Evolution
by: Tang, Jinzhou, et al.
Published: (2025)
MM-CoT: A Benchmark for Probing Visual Chain-of-Thought Reasoning in Multimodal Models
by: Zhang, Jusheng, et al.
Published: (2025)
Causal Invariance and Counterfactual Learning Driven Cooperative Game for Multi-Label Classification
by: Fan, Yijia, et al.
Published: (2025)
PTTA: A Pure Text-to-Animation Framework for High-Quality Creation
by: Chen, Ruiqi, et al.
Published: (2025)
Rational ANOVA Networks
by: Zhang, Jusheng, et al.
Published: (2026)
Cost-Effective Communication: An Auction-based Method for Language Agent Interaction
by: Fan, Yijia, et al.
Published: (2025)
Backward-Friendly Optimization: Training Large Language Models with Approximate Gradients under Memory Constraints
by: Yang, Jing, et al.
Published: (2025)
3DAlign-DAER: Dynamic Attention Policy and Efficient Retrieval Strategy for Fine-grained 3D-Text Alignment at Scale
by: Fan, Yijia, et al.
Published: (2025)
Top-Down Semantic Refinement for Image Captioning
by: Zhang, Jusheng, et al.
Published: (2025)
Process-of-Thought Reasoning for Videos
by: Zhang, Jusheng, et al.
Published: (2026)
Kolmogorov-Arnold Fourier Networks
by: Zhang, Jusheng, et al.
Published: (2025)
CF-VLM: CounterFactual Vision-Language Fine-tuning
by: Zhang, Jusheng, et al.
Published: (2025)
FlashVLM: Text-Guided Visual Token Selection for Large Multimodal Models
by: Cai, Kaitong, et al.
Published: (2025)
Why Keep Your Doubts to Yourself? Trading Visual Uncertainties in Multi-Agent Bandit Systems
by: Zhang, Jusheng, et al.
Published: (2026)
LLM-CAS: Dynamic Neuron Perturbation for Real-Time Hallucination Correction
by: Zhang, Jensen, et al.
Published: (2025)
AgriChain Visually Grounded Expert Verified Reasoning for Interpretable Agricultural Vision Language Models
by: Mahmood, Hazza, et al.
Published: (2026)
Right to History: A Sovereignty Kernel for Verifiable AI Agent Execution
by: Zhang, Jing
Published: (2026)
An Execution-Verified Multi-Language Benchmark for Code Semantic Reasoning
by: Li, Yikun, et al.
Published: (2026)
Failure-Driven Workflow Refinement
by: Zhang, Jusheng, et al.
Published: (2025)
Spectral Gating Networks
by: Zhang, Jusheng, et al.
Published: (2026)
FinMCP-Bench: Benchmarking LLM Agents for Real-World Financial Tool Use under the Model Context Protocol
by: Zhu, Jie, et al.
Published: (2026)
ToolGate: Contract-Grounded and Verified Tool Execution for LLMs
by: Liu, Yanming, et al.
Published: (2026)
Agent-GSPO: Communication-Efficient Multi-Agent Systems via Group Sequence Policy Optimization
by: Fan, Yijia, et al.
Published: (2025)
ExecVerify: White-Box RL with Verifiable Stepwise Rewards for Code Execution Reasoning
by: Tang, Lingxiao, et al.
Published: (2026)
Executable Code Actions Elicit Better LLM Agents
by: Wang, Xingyao, et al.
Published: (2024)
A Scalable Curiosity-Driven Game-Theoretic Framework for Long-Tail Multi-Label Learning in Data Mining
by: Yang, Jing, et al.
Published: (2026)
Agri-R1: Agricultural Reasoning for Disease Diagnosis via Automated-Synthesis and Reinforcement Learning
by: Zhang, Wentao, et al.
Published: (2026)
Beyond Pixels: Introducing Geometric-Semantic World Priors for Video-based Embodied Models via Spatio-temporal Alignment
by: Tang, Jinzhou, et al.
Published: (2025)