| Main Authors: | Chen, Yanyu, Jiang, Jiyue, Liu, Jiahong, Zhang, Yifei, Guo, Xiao, King, Irwin |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.21230 |
Similar Items
LC-ERD: Mining Latent Logic for Self-Evolving Reasoning via Consistency-Regulated Reward Decomposition
by: Chen, Yanyu, et al.
Published: (2026)
Discrete Tokenization for Multimodal LLMs: A Comprehensive Survey
by: Li, Jindong, et al.
Published: (2025)
Implicit Reasoning in Large Language Models: A Comprehensive Survey
by: Li, Jindong, et al.
Published: (2025)
ADRA-Bank: A Modular Benchmark for Academic Deep Research Agents
by: Guo, Zhihan, et al.
Published: (2025)
A Principle-Driven Adaptive Policy for Group Cognitive Stimulation Dialogue for Elderly with Cognitive Impairment
by: Jiang, Jiyue, et al.
Published: (2026)
Following the TRACE: A Structured Path to Empathetic Response Generation with Multi-Agent Models
by: Liu, Ziqi, et al.
Published: (2025)
From General to Targeted Rewards: Surpassing GPT-4 in Open-Ended Long-Context Generation
by: Guo, Zhihan, et al.
Published: (2025)
DeepTRACE: Auditing Deep Research AI Systems for Tracking Reliability Across Citations and Evidence
by: Venkit, Pranav Narayanan, et al.
Published: (2025)
Less is More: Extreme Gradient Boost Rank-1 Adaption for Efficient Finetuning of LLMs
by: Zhang, Yifei, et al.
Published: (2024)
DS-ProGen: A Dual-Structure Deep Language Model for Functional Protein Design
by: Li, Yanting, et al.
Published: (2025)
AgentIR: Reasoning-Aware Retrieval for Deep Research Agents
by: Chen, Zijian, et al.
Published: (2026)
TRACE: Discovering Task-Specific Parameter via Adaptation-Aware Probing for Continual Fine-Tuning
by: Han, Xiaosong, et al.
Published: (2026)
Developing and Utilizing a Large-Scale Cantonese Dataset for Multi-Tasking in Large Language Models
by: Jiang, Jiyue, et al.
Published: (2025)
DeepSearchQA: Bridging the Comprehensiveness Gap for Deep Research Agents
by: Gupta, Nikita, et al.
Published: (2026)
From Evidence to Trajectory: Abductive Reasoning Path Synthesis for Training Retrieval-Augmented Generation Agents
by: Li, Muzhi, et al.
Published: (2025)
TRACE: Trajectory Correction from Cross-layer Evidence for Hallucination Reduction
by: Ranade, Tej Sanibh
Published: (2026)
Minimizing Modality Gap from the Input Side: Your Speech LLM Can Be a Prosody-Aware Text LLM
by: Cui, Wenqian, et al.
Published: (2026)
FinDeepResearch: Evaluating Deep Research Agents in Rigorous Financial Analysis
by: Zhu, Fengbin, et al.
Published: (2025)
DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents
by: Du, Mingxuan, et al.
Published: (2025)
How Far Are We from Genuinely Useful Deep Research Agents?
by: Zhang, Dingling, et al.
Published: (2025)
MTR-DuplexBench: Towards a Comprehensive Evaluation of Multi-Round Conversations for Full-Duplex Speech Language Models
by: Zhang, He, et al.
Published: (2025)
HeLa-Mem: Hebbian Learning and Associative Memory for LLM Agents
by: Zhu, Jinchang, et al.
Published: (2026)
Towards Comprehensive Semantic Speech Embeddings for Chinese Dialects
by: Chang, Kalvin, et al.
Published: (2026)
DR-Arena: an Automated Evaluation Framework for Deep Research Agents
by: Gao, Yiwen, et al.
Published: (2026)
From Web Search towards Agentic Deep Research: Incentivizing Search with Reasoning Agents
by: Zhang, Weizhi, et al.
Published: (2025)
TRACE for Tracking the Emergence of Semantic Representations in Transformers
by: Aljaafari, Nura, et al.
Published: (2025)
CLongEval: A Chinese Benchmark for Evaluating Long-Context Large Language Models
by: Qiu, Zexuan, et al.
Published: (2024)
MedResearcher-R1: Expert-Level Medical Deep Researcher via A Knowledge-Informed Trajectory Synthesis Framework
by: Yu, Ailing, et al.
Published: (2025)
Data Augmentation of Multi-turn Psychological Dialogue via Knowledge-driven Progressive Thought Prompting
by: Jiang, Jiyue, et al.
Published: (2024)
An Entropy-based Text Watermarking Detection Method
by: Lu, Yijian, et al.
Published: (2024)
MUSEG: Reinforcing Video Temporal Understanding via Timestamp-Aware Multi-Segment Grounding
by: Luo, Fuwen, et al.
Published: (2025)
Quantifying the Climate Risk of Generative AI: Region-Aware Carbon Accounting with G-TRACE and the AI Sustainability Pyramid
by: Kausar, Zahida, et al.
Published: (2025)
Large Language Models in Bioinformatics: A Survey
by: Wang, Zhenyu, et al.
Published: (2025)
OpenResearcher: A Fully Open Pipeline for Long-Horizon Deep Research Trajectory Synthesis
by: Li, Zhuofeng, et al.
Published: (2026)
QUEST: Training Frontier Deep Research Agents with Fully Synthetic Tasks
by: Xie, Jian, et al.
Published: (2026)
Watermarking LLM Agent Trajectories
by: Meng, Wenlong, et al.
Published: (2026)
TRACE: Tourism Recommendation with Accountable Citation Evidence
by: Zhao, Zixu, et al.
Published: (2026)
Locomo-Plus: Beyond-Factual Cognitive Memory Evaluation Framework for LLM Agents
by: Li, Yifei, et al.
Published: (2026)
WebCoT: Enhancing Web Agent Reasoning by Reconstructing Chain-of-Thought in Reflection, Branching, and Rollback
by: Hu, Minda, et al.
Published: (2025)
LoRA Meets Dropout under a Unified Framework
by: Wang, Sheng, et al.
Published: (2024)