Saved in:
| Main Authors: | Wu, Hanbing, Jiang, Ping, Su, Anyang, Zhao, Chenxu, Fu, Tianyu, Wu, Minghui, Tan, Beiping, Li, Huiying |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2507.19213 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Hypergraph Multi-modal Large Language Model: Exploiting EEG and Eye-tracking Modalities to Evaluate Heterogeneous Responses for Video Understanding
by: Wu, Minghui, et al.
Published: (2024)
by: Wu, Minghui, et al.
Published: (2024)
Reinforced Domain Selection for Continuous Domain Adaptation
by: Liu, Hanbing, et al.
Published: (2025)
by: Liu, Hanbing, et al.
Published: (2025)
EyeFormer: Predicting Personalized Scanpaths with Transformer-Guided Reinforcement Learning
by: Jiang, Yue, et al.
Published: (2024)
by: Jiang, Yue, et al.
Published: (2024)
Full‐thickness nasolabial facial artery flap: A modified surgical approach for reconstruction of lower lip defects
by: Jia Kang, et al.
Published: (2024)
by: Jia Kang, et al.
Published: (2024)
SEF-MAP: Subspace-Decomposed Expert Fusion for Robust Multimodal HD Map Prediction
by: Fu, Haoxiang, et al.
Published: (2026)
by: Fu, Haoxiang, et al.
Published: (2026)
Boundary-Guided Learning for Gene Expression Prediction in Spatial Transcriptomics
by: Qu, Mingcheng, et al.
Published: (2024)
by: Qu, Mingcheng, et al.
Published: (2024)
Insight-A: Attribution-aware for Multimodal Misinformation Detection
by: Wu, Junjie, et al.
Published: (2025)
by: Wu, Junjie, et al.
Published: (2025)
UPA: Unsupervised Prompt Agent via Tree-Based Search and Selection
by: Peng, Siran, et al.
Published: (2026)
by: Peng, Siran, et al.
Published: (2026)
MAP: Multi-user Personalization with Collaborative LLM-powered Agents
by: Lee, Christine, et al.
Published: (2025)
by: Lee, Christine, et al.
Published: (2025)
MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation
by: Li, Lu, et al.
Published: (2024)
by: Li, Lu, et al.
Published: (2024)
SDGOCC: Semantic and Depth-Guided Bird's-Eye View Transformation for 3D Multimodal Occupancy Prediction
by: Duan, Zaipeng, et al.
Published: (2025)
by: Duan, Zaipeng, et al.
Published: (2025)
Mano Technical Report
by: Fu, Tianyu, et al.
Published: (2025)
by: Fu, Tianyu, et al.
Published: (2025)
Provable Multi-Party Reinforcement Learning with Diverse Human Feedback
by: Zhong, Huiying, et al.
Published: (2024)
by: Zhong, Huiying, et al.
Published: (2024)
Multi-Task Reinforcement Learning for Enhanced Multimodal LLM-as-a-Judge
by: Wu, Junjie, et al.
Published: (2026)
by: Wu, Junjie, et al.
Published: (2026)
Trajectory Entropy Reinforcement Learning for Predictable and Robust Control
by: You, Bang, et al.
Published: (2025)
by: You, Bang, et al.
Published: (2025)
ReMAP-DP: Reprojected Multi-view Aligned PointMaps for Diffusion Policy
by: Yang, Xinzhang, et al.
Published: (2026)
by: Yang, Xinzhang, et al.
Published: (2026)
E2E Learning Massive MIMO for Multimodal Semantic Non-Orthogonal Transmission and Fusion
by: Wu, Minghui, et al.
Published: (2025)
by: Wu, Minghui, et al.
Published: (2025)
DeepRAHT: Learning Predictive RAHT for Point Cloud Attribute Compression
by: Fu, Chunyang, et al.
Published: (2026)
by: Fu, Chunyang, et al.
Published: (2026)
Two-Point Resolution in Spectral Super-Resolution
by: He, Xiaole, et al.
Published: (2026)
by: He, Xiaole, et al.
Published: (2026)
Point Cloud Quantization through Multimodal Prompting for 3D Understanding
by: Li, Hongxuan, et al.
Published: (2025)
by: Li, Hongxuan, et al.
Published: (2025)
Deep Reinforcement Learning with Task-Adaptive Retrieval via Hypernetwork
by: Jin, Yonggang, et al.
Published: (2023)
by: Jin, Yonggang, et al.
Published: (2023)
WorldMAP: Bootstrapping Vision-Language Navigation Trajectory Prediction with Generative World Models
by: Chen, Hongjin, et al.
Published: (2026)
by: Chen, Hongjin, et al.
Published: (2026)
Efficient MAP Estimation of LLM Judgment Performance with Prior Transfer
by: Qu, Huaizhi, et al.
Published: (2025)
by: Qu, Huaizhi, et al.
Published: (2025)
CollabVLA: Self-Reflective Vision-Language-Action Model Dreaming Together with Human
by: Sun, Nan, et al.
Published: (2025)
by: Sun, Nan, et al.
Published: (2025)
Online Self-Calibration Against Hallucination in Vision-Language Models
by: Chen, Minghui, et al.
Published: (2026)
by: Chen, Minghui, et al.
Published: (2026)
Construction of Healthy Liver of Largemouth Bass (Micropterus salmoides) in the Short Term by Steroidal Saponins before Heat Season Comes
by: Tao Cheng, et al.
Published: (2024)
by: Tao Cheng, et al.
Published: (2024)
Tea Saponin Exerts Dose-Dependent Dual Effects on Growth and Hepatic Health in Hybrid Grouper ( ♀ × ♂) Fed a High-Lipid, Low-Protein Diet via Redox-Immune Regulation.
by: Guo, Shengrong, et al.
Published: (2026)
by: Guo, Shengrong, et al.
Published: (2026)
EagleVision: Object-level Attribute Multimodal LLM for Remote Sensing
by: Jiang, Hongxiang, et al.
Published: (2025)
by: Jiang, Hongxiang, et al.
Published: (2025)
SDG-L: A Semiparametric Deep Gaussian Process based Framework for Battery Capacity Prediction
by: Liu, Hanbing, et al.
Published: (2025)
by: Liu, Hanbing, et al.
Published: (2025)
NSF-MAP: Neurosymbolic Multimodal Fusion for Robust and Interpretable Anomaly Prediction in Assembly Pipelines
by: Shyalika, Chathurangi, et al.
Published: (2025)
by: Shyalika, Chathurangi, et al.
Published: (2025)
S2AM3D: Scale-controllable Part Segmentation of 3D Point Clouds
by: Su, Han, et al.
Published: (2025)
by: Su, Han, et al.
Published: (2025)
Unreal-MAP: Unreal-Engine-Based General Platform for Multi-Agent Reinforcement Learning
by: Hu, Tianyi, et al.
Published: (2025)
by: Hu, Tianyi, et al.
Published: (2025)
FingerEye: Continuous and Unified Vision-Tactile Sensing for Dexterous Manipulation
by: Xu, Zhixuan, et al.
Published: (2026)
by: Xu, Zhixuan, et al.
Published: (2026)
Universal Legal Article Prediction via Tight Collaboration between Supervised Classification Model and LLM
by: Chi, Xiao, et al.
Published: (2025)
by: Chi, Xiao, et al.
Published: (2025)
MME-RealWorld: Could Your Multimodal LLM Challenge High-Resolution Real-World Scenarios that are Difficult for Humans?
by: Zhang, Yi-Fan, et al.
Published: (2024)
by: Zhang, Yi-Fan, et al.
Published: (2024)
Res-Bench: Benchmarking the Robustness of Multimodal Large Language Models to Dynamic Resolution Input
by: Li, Chenxu, et al.
Published: (2025)
by: Li, Chenxu, et al.
Published: (2025)
RAG or Learning? Understanding the Limits of LLM Adaptation under Continuous Knowledge Drift in the Real World
by: Liu, Hanbing, et al.
Published: (2026)
by: Liu, Hanbing, et al.
Published: (2026)
From Personal to Collective: On the Role of Local and Global Memory in LLM Personalization
by: Wang, Zehong, et al.
Published: (2025)
by: Wang, Zehong, et al.
Published: (2025)
Look & Mark: Leveraging Radiologist Eye Fixations and Bounding boxes in Multimodal Large Language Models for Chest X-ray Report Generation
by: Kim, Yunsoo, et al.
Published: (2025)
by: Kim, Yunsoo, et al.
Published: (2025)
ConMeC: A Dataset for Metonymy Resolution with Common Nouns
by: Ghosh, Saptarshi, et al.
Published: (2025)
by: Ghosh, Saptarshi, et al.
Published: (2025)
Similar Items
-
Hypergraph Multi-modal Large Language Model: Exploiting EEG and Eye-tracking Modalities to Evaluate Heterogeneous Responses for Video Understanding
by: Wu, Minghui, et al.
Published: (2024) -
Reinforced Domain Selection for Continuous Domain Adaptation
by: Liu, Hanbing, et al.
Published: (2025) -
EyeFormer: Predicting Personalized Scanpaths with Transformer-Guided Reinforcement Learning
by: Jiang, Yue, et al.
Published: (2024) -
Full‐thickness nasolabial facial artery flap: A modified surgical approach for reconstruction of lower lip defects
by: Jia Kang, et al.
Published: (2024) -
SEF-MAP: Subspace-Decomposed Expert Fusion for Robust Multimodal HD Map Prediction
by: Fu, Haoxiang, et al.
Published: (2026)