| Main Authors: | Zhang, Wentao, Zhang, Yutong, Zhu, Yifan, Mo, Wentao |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2605.02591 |
Similar Items
Bridging the Gap between 2D and 3D Visual Question Answering: A Fusion Approach for 3D VQA
by: Mo, Wentao, et al.
Published: (2024)
Distilling Neuro-Symbolic Programs into 3D Multi-modal LLMs
by: Mo, Wentao, et al.
Published: (2026)
VecCity: A Taxonomy-guided Library for Map Entity Representation Learning
by: Zhang, Wentao, et al.
Published: (2024)
Language Models Represent Beliefs of Self and Others
by: Zhu, Wentao, et al.
Published: (2024)
Vote-Tree-Planner: Optimizing Execution Order in LLM-based Task Planning Pipeline via Voting
by: Zhang, Chaoyuan, et al.
Published: (2025)
Deconfounded Time Series Forecasting: A Causal Inference Approach
by: Gao, Wentao, et al.
Published: (2024)
Can LLMs be Good Graph Judge for Knowledge Graph Construction?
by: Huang, Haoyu, et al.
Published: (2024)
TPC-ViT: Token Propagation Controller for Efficient Vision Transformer
by: Zhu, Wentao
Published: (2024)
STRAP: Spatio-Temporal Pattern Retrieval for Out-of-Distribution Generalization
by: Zhang, Haoyu, et al.
Published: (2025)
Efficient Multiscale Multimodal Bottleneck Transformer for Audio-Video Classification
by: Zhu, Wentao
Published: (2024)
Efficient Selective Audio Masked Multimodal Bottleneck Transformer for Audio-Video Classification
by: Zhu, Wentao
Published: (2024)
DataCross: A Unified Benchmark and Agent Framework for Cross-Modal Heterogeneous Data Analysis
by: Qi, Ruyi, et al.
Published: (2026)
Bias in Decision-Making for AI's Ethical Dilemmas: A Comparative Study of ChatGPT and Claude
by: Xu, Wentao, et al.
Published: (2025)
Exponential Approximation Rates and Parameter Efficiency of Learnable Bernstein Activations
by: Albool, Ibrahim, et al.
Published: (2026)
ORACLE: Optimizing Reasoning Abilities of Large Language Models via Constraint-Led Synthetic Data Elicitation
by: Yang, Zhuojie, et al.
Published: (2026)
Rethinking and Accelerating Graph Condensation: A Training-Free Approach with Class Partition
by: Gao, Xinyi, et al.
Published: (2024)
TransDex: Pre-training Visuo-Tactile Policy with Point Cloud Reconstruction for Dexterous Manipulation of Transparent Objects
by: Li, Fengguan, et al.
Published: (2026)
TRACER: Turn-level Regret Matching with Inner Reinforcement Credit for Cooperative Multi-LLM Reasoning
by: Li, Chusen, et al.
Published: (2026)
Seeing the Unseen: Learning Basis Confounder Representations for Robust Traffic Prediction
by: Ji, Jiahao, et al.
Published: (2023)
From Chat Logs to Collective Insights: Aggregative Question Answering
by: Zhang, Wentao, et al.
Published: (2025)
Paper2SysArch: Structure-Constrained System Architecture Generation from Scientific Papers
by: Guo, Ziyi, et al.
Published: (2025)
Certainty-Guided Reasoning in Large Language Models: A Dynamic Thinking Budget Approach
by: Nogueira, João Paulo, et al.
Published: (2025)
The Curse of Helpfulness: Inverse Scaling Law in Robustness to Distractor Instructions via DistractionIF
by: Su, Zeli, et al.
Published: (2026)
Embodied Representation Alignment with Mirror Neurons
by: Zhu, Wentao, et al.
Published: (2025)
VerifyBench: A Systematic Benchmark for Evaluating Reasoning Verifiers Across Domains
by: Li, Xuzhao, et al.
Published: (2025)
Characterization of Political Polarized Users Attacked by Language Toxicity on Twitter
by: Xu, Wentao
Published: (2024)
Budget-aware Query Tuning: An AutoML Perspective
by: Wu, Wentao, et al.
Published: (2024)
The Impact of Big Five Personality Traits on AI Agent Decision-Making in Public Spaces: A Social Simulation Study
by: Ren, Mingjun, et al.
Published: (2025)
A Unified and General Humanoid Whole-Body Controller for Versatile Locomotion
by: Xue, Yufei, et al.
Published: (2025)
Interactive Training: Feedback-Driven Neural Network Optimization
by: Zhang, Wentao, et al.
Published: (2025)
Beyond Explicit Refusals: Soft-Failure Attacks on Retrieval-Augmented Generation
by: Zhang, Wentao, et al.
Published: (2026)
Arbitrary Time Information Modeling via Polynomial Approximation for Temporal Knowledge Graph Embedding
by: Fang, Zhiyu, et al.
Published: (2024)
VisualTrap: A Stealthy Backdoor Attack on GUI Agents via Visual Grounding Manipulation
by: Ye, Ziang, et al.
Published: (2025)
Attention-Guided Patch-Wise Sparse Adversarial Attacks on Vision-Language-Action Models
by: Zhang, Naifu, et al.
Published: (2025)
CapGeo: A Caption-Assisted Approach to Geometric Reasoning
by: Li, Yuying, et al.
Published: (2025)
A Dataset of Open-Domain Question Answering with Multiple-Span Answers
by: Luo, Zhiyi, et al.
Published: (2024)
Leash: Adaptive Length Penalty and Reward Shaping for Efficient Large Reasoning Model
by: Li, Yanhao, et al.
Published: (2025)
Adaptable and Precise: Enterprise-Scenario LLM Function-Calling Capability Training Pipeline
by: Zeng, Guancheng, et al.
Published: (2024)
TGC-Net: A Structure-Aware and Semantically-Aligned Framework for Text-Guided Medical Image Segmentation
by: Lin, Gaoren, et al.
Published: (2025)
Bridging Dual Knowledge Graphs for Multi-Hop Question Answering in Construction Safety
by: Zhang, Yuxin, et al.
Published: (2025)