:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Wu, Hanbing, Jiang, Ping, Su, Anyang, Zhao, Chenxu, Fu, Tianyu, Wu, Minghui, Tan, Beiping, Li, Huiying
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2507.19213
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Hypergraph Multi-modal Large Language Model: Exploiting EEG and Eye-tracking Modalities to Evaluate Heterogeneous Responses for Video Understanding
by: Wu, Minghui, et al.
Published: (2024)

Reinforced Domain Selection for Continuous Domain Adaptation
by: Liu, Hanbing, et al.
Published: (2025)

EyeFormer: Predicting Personalized Scanpaths with Transformer-Guided Reinforcement Learning
by: Jiang, Yue, et al.
Published: (2024)

Full‐thickness nasolabial facial artery flap: A modified surgical approach for reconstruction of lower lip defects
by: Jia Kang, et al.
Published: (2024)

SEF-MAP: Subspace-Decomposed Expert Fusion for Robust Multimodal HD Map Prediction
by: Fu, Haoxiang, et al.
Published: (2026)

Boundary-Guided Learning for Gene Expression Prediction in Spatial Transcriptomics
by: Qu, Mingcheng, et al.
Published: (2024)

Insight-A: Attribution-aware for Multimodal Misinformation Detection
by: Wu, Junjie, et al.
Published: (2025)

UPA: Unsupervised Prompt Agent via Tree-Based Search and Selection
by: Peng, Siran, et al.
Published: (2026)

MAP: Multi-user Personalization with Collaborative LLM-powered Agents
by: Lee, Christine, et al.
Published: (2025)

MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation
by: Li, Lu, et al.
Published: (2024)

SDGOCC: Semantic and Depth-Guided Bird's-Eye View Transformation for 3D Multimodal Occupancy Prediction
by: Duan, Zaipeng, et al.
Published: (2025)

Mano Technical Report
by: Fu, Tianyu, et al.
Published: (2025)

Provable Multi-Party Reinforcement Learning with Diverse Human Feedback
by: Zhong, Huiying, et al.
Published: (2024)

Multi-Task Reinforcement Learning for Enhanced Multimodal LLM-as-a-Judge
by: Wu, Junjie, et al.
Published: (2026)

Trajectory Entropy Reinforcement Learning for Predictable and Robust Control
by: You, Bang, et al.
Published: (2025)

ReMAP-DP: Reprojected Multi-view Aligned PointMaps for Diffusion Policy
by: Yang, Xinzhang, et al.
Published: (2026)

E2E Learning Massive MIMO for Multimodal Semantic Non-Orthogonal Transmission and Fusion
by: Wu, Minghui, et al.
Published: (2025)

DeepRAHT: Learning Predictive RAHT for Point Cloud Attribute Compression
by: Fu, Chunyang, et al.
Published: (2026)

Two-Point Resolution in Spectral Super-Resolution
by: He, Xiaole, et al.
Published: (2026)

Point Cloud Quantization through Multimodal Prompting for 3D Understanding
by: Li, Hongxuan, et al.
Published: (2025)

Deep Reinforcement Learning with Task-Adaptive Retrieval via Hypernetwork
by: Jin, Yonggang, et al.
Published: (2023)

WorldMAP: Bootstrapping Vision-Language Navigation Trajectory Prediction with Generative World Models
by: Chen, Hongjin, et al.
Published: (2026)

Efficient MAP Estimation of LLM Judgment Performance with Prior Transfer
by: Qu, Huaizhi, et al.
Published: (2025)

CollabVLA: Self-Reflective Vision-Language-Action Model Dreaming Together with Human
by: Sun, Nan, et al.
Published: (2025)

Online Self-Calibration Against Hallucination in Vision-Language Models
by: Chen, Minghui, et al.
Published: (2026)

Construction of Healthy Liver of Largemouth Bass (Micropterus salmoides) in the Short Term by Steroidal Saponins before Heat Season Comes
by: Tao Cheng, et al.
Published: (2024)

Tea Saponin Exerts Dose-Dependent Dual Effects on Growth and Hepatic Health in Hybrid Grouper ( ♀ × ♂) Fed a High-Lipid, Low-Protein Diet via Redox-Immune Regulation.
by: Guo, Shengrong, et al.
Published: (2026)

EagleVision: Object-level Attribute Multimodal LLM for Remote Sensing
by: Jiang, Hongxiang, et al.
Published: (2025)

SDG-L: A Semiparametric Deep Gaussian Process based Framework for Battery Capacity Prediction
by: Liu, Hanbing, et al.
Published: (2025)

NSF-MAP: Neurosymbolic Multimodal Fusion for Robust and Interpretable Anomaly Prediction in Assembly Pipelines
by: Shyalika, Chathurangi, et al.
Published: (2025)

S2AM3D: Scale-controllable Part Segmentation of 3D Point Clouds
by: Su, Han, et al.
Published: (2025)

Unreal-MAP: Unreal-Engine-Based General Platform for Multi-Agent Reinforcement Learning
by: Hu, Tianyi, et al.
Published: (2025)

FingerEye: Continuous and Unified Vision-Tactile Sensing for Dexterous Manipulation
by: Xu, Zhixuan, et al.
Published: (2026)

Universal Legal Article Prediction via Tight Collaboration between Supervised Classification Model and LLM
by: Chi, Xiao, et al.
Published: (2025)

MME-RealWorld: Could Your Multimodal LLM Challenge High-Resolution Real-World Scenarios that are Difficult for Humans?
by: Zhang, Yi-Fan, et al.
Published: (2024)

Res-Bench: Benchmarking the Robustness of Multimodal Large Language Models to Dynamic Resolution Input
by: Li, Chenxu, et al.
Published: (2025)

RAG or Learning? Understanding the Limits of LLM Adaptation under Continuous Knowledge Drift in the Real World
by: Liu, Hanbing, et al.
Published: (2026)

From Personal to Collective: On the Role of Local and Global Memory in LLM Personalization
by: Wang, Zehong, et al.
Published: (2025)

Look & Mark: Leveraging Radiologist Eye Fixations and Bounding boxes in Multimodal Large Language Models for Chest X-ray Report Generation
by: Kim, Yunsoo, et al.
Published: (2025)

ConMeC: A Dataset for Metonymy Resolution with Common Nouns
by: Ghosh, Saptarshi, et al.
Published: (2025)