Guardado en:
| Autores principales: | Zhang, Chongsheng, Wu, Shuwen, Chen, Yingqi, Men, Yi, Fan, Gaojuan, Aßenmacher, Matthias, Heumann, Christian, Gama, João |
|---|---|
| Formato: | Preprint |
| Publicado: |
2025
|
| Materias: | |
| Acceso en línea: | https://arxiv.org/abs/2505.03836 |
| Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
Ejemplares similares
Are All Genders Equal in the Eyes of Algorithms? -- Analysing Search and Retrieval Algorithms for Algorithmic Gender Fairness
por: Urchs, Stefanie, et al.
Publicado: (2025)
por: Urchs, Stefanie, et al.
Publicado: (2025)
Boosting Document Parsing Efficiency and Performance with Coarse-to-Fine Visual Processing
por: Cui, Cheng, et al.
Publicado: (2026)
por: Cui, Cheng, et al.
Publicado: (2026)
Decoding Ancient Oracle Bone Script via Generative Dictionary Retrieval
por: Wu, Yin, et al.
Publicado: (2026)
por: Wu, Yin, et al.
Publicado: (2026)
Studying Illustrations in Manuscripts: An Efficient Deep-Learning Approach
por: Evron, Yoav, et al.
Publicado: (2025)
por: Evron, Yoav, et al.
Publicado: (2025)
Low-Data Classification of Historical Music Manuscripts: A Few-Shot Learning Approach
por: Shatri, Elona, et al.
Publicado: (2024)
por: Shatri, Elona, et al.
Publicado: (2024)
CMIE: Combining MLLM Insights with External Evidence for Explainable Out-of-Context Misinformation Detection
por: Li, Fanxiao, et al.
Publicado: (2025)
por: Li, Fanxiao, et al.
Publicado: (2025)
EndoFinder: Online Image Retrieval for Explainable Colorectal Polyp Diagnosis
por: Yang, Ruijie, et al.
Publicado: (2024)
por: Yang, Ruijie, et al.
Publicado: (2024)
Beyond Global Similarity: Towards Fine-Grained, Multi-Condition Multimodal Retrieval
por: Lu, Xuan, et al.
Publicado: (2026)
por: Lu, Xuan, et al.
Publicado: (2026)
TIGER-FG: Text-Guided Implicit Fine-Grained Grounding for E-commerce Retrieval
por: Sun, Xinyu, et al.
Publicado: (2026)
por: Sun, Xinyu, et al.
Publicado: (2026)
MLLM-Guided VLM Fine-Tuning with Joint Inference for Zero-Shot Composed Image Retrieval
por: Tu, Rong-Cheng, et al.
Publicado: (2025)
por: Tu, Rong-Cheng, et al.
Publicado: (2025)
Fine-grained Motion Retrieval via Joint-Angle Motion Images and Token-Patch Late Interaction
por: Zhang, Yao, et al.
Publicado: (2026)
por: Zhang, Yao, et al.
Publicado: (2026)
Knowledge Discovery in Optical Music Recognition: Enhancing Information Retrieval with Instance Segmentation
por: Shatri, Elona, et al.
Publicado: (2024)
por: Shatri, Elona, et al.
Publicado: (2024)
Sustainable transparency in Recommender Systems: Bayesian Ranking of Images for Explainability
por: Paz-Ruza, Jorge, et al.
Publicado: (2023)
por: Paz-Ruza, Jorge, et al.
Publicado: (2023)
Rethinking Composed Image Retrieval Evaluation: A Fine-Grained Benchmark from Image Editing
por: Song, Tingyu, et al.
Publicado: (2026)
por: Song, Tingyu, et al.
Publicado: (2026)
Learning Positional Attention for Sequential Recommendation
por: Luo, Fan, et al.
Publicado: (2024)
por: Luo, Fan, et al.
Publicado: (2024)
DEMO: A Statistical Perspective for Efficient Image-Text Matching
por: Zhang, Fan, et al.
Publicado: (2024)
por: Zhang, Fan, et al.
Publicado: (2024)
Attributes Grouping and Mining Hashing for Fine-Grained Image Retrieval
por: Lu, Xin, et al.
Publicado: (2023)
por: Lu, Xin, et al.
Publicado: (2023)
GraphRevisedIE: Multimodal Information Extraction with Graph-Revised Network
por: Cao, Panfeng, et al.
Publicado: (2024)
por: Cao, Panfeng, et al.
Publicado: (2024)
Prototype-Driven Structure Synergy Network for Remote Sensing Images Segmentation
por: Wang, Junyi, et al.
Publicado: (2025)
por: Wang, Junyi, et al.
Publicado: (2025)
Transfer Learning and Mixup for Fine-Grained Few-Shot Fungi Classification
por: Tam, Jason Kahei, et al.
Publicado: (2025)
por: Tam, Jason Kahei, et al.
Publicado: (2025)
ITSELF: Attention Guided Fine-Grained Alignment for Vision-Language Retrieval
por: Nguyen, Tien-Huy, et al.
Publicado: (2026)
por: Nguyen, Tien-Huy, et al.
Publicado: (2026)
Iterative Optimal Attention and Local Model for Single Image Rain Streak Removal
por: Li, Xiangyu, et al.
Publicado: (2025)
por: Li, Xiangyu, et al.
Publicado: (2025)
CaReBench: A Fine-Grained Benchmark for Video Captioning and Retrieval
por: Xu, Yifan, et al.
Publicado: (2024)
por: Xu, Yifan, et al.
Publicado: (2024)
PHPQ: Pyramid Hybrid Pooling Quantization for Efficient Fine-Grained Image Retrieval
por: Zeng, Ziyun, et al.
Publicado: (2021)
por: Zeng, Ziyun, et al.
Publicado: (2021)
Supervised Fine-Tuning or Contrastive Learning? Towards Better Multimodal LLM Reranking
por: Dai, Ziqi, et al.
Publicado: (2025)
por: Dai, Ziqi, et al.
Publicado: (2025)
Learning Partially-Decorrelated Common Spaces for Ad-hoc Video Search
por: Hu, Fan, et al.
Publicado: (2025)
por: Hu, Fan, et al.
Publicado: (2025)
Automatic Synthetic Data and Fine-grained Adaptive Feature Alignment for Composed Person Retrieval
por: Liu, Delong, et al.
Publicado: (2023)
por: Liu, Delong, et al.
Publicado: (2023)
Compressing then Matching: An Efficient Pre-training Paradigm for Multimodal Embedding
por: Li, Da, et al.
Publicado: (2025)
por: Li, Da, et al.
Publicado: (2025)
Modality-Balanced Learning for Multimedia Recommendation
por: Zhang, Jinghao, et al.
Publicado: (2024)
por: Zhang, Jinghao, et al.
Publicado: (2024)
A Systematic Review on Long-Tailed Learning
por: Zhang, Chongsheng, et al.
Publicado: (2024)
por: Zhang, Chongsheng, et al.
Publicado: (2024)
Chain-of-Thought Re-ranking for Image Retrieval Tasks
por: Wu, Shangrong, et al.
Publicado: (2025)
por: Wu, Shangrong, et al.
Publicado: (2025)
Nested Hash Layer: A Plug-and-play Module for Multiple-length Hash Code Learning
por: He, Liyang, et al.
Publicado: (2024)
por: He, Liyang, et al.
Publicado: (2024)
Composed Multi-modal Retrieval: A Survey of Approaches and Applications
por: Zhang, Kun, et al.
Publicado: (2025)
por: Zhang, Kun, et al.
Publicado: (2025)
Deep learning enables urban change profiling through alignment of historical maps
por: Wu, Sidi, et al.
Publicado: (2026)
por: Wu, Sidi, et al.
Publicado: (2026)
PATFinger: Prompt-Adapted Transferable Fingerprinting against Unauthorized Multimodal Dataset Usage
por: Zhang, Wenyi, et al.
Publicado: (2025)
por: Zhang, Wenyi, et al.
Publicado: (2025)
Uncertainty-aware sign language video retrieval with probability distribution modeling
por: Wu, Xuan, et al.
Publicado: (2024)
por: Wu, Xuan, et al.
Publicado: (2024)
LLM-Enhanced Multimodal Fusion for Cross-Domain Sequential Recommendation
por: Wu, Wangyu, et al.
Publicado: (2025)
por: Wu, Wangyu, et al.
Publicado: (2025)
Multi-Branch Collaborative Learning Network for Video Quality Assessment in Industrial Video Search
por: Tang, Hengzhu, et al.
Publicado: (2025)
por: Tang, Hengzhu, et al.
Publicado: (2025)
Content-based 3D Image Retrieval and a ColBERT-inspired Re-ranking for Tumor Flagging and Staging
por: Jush, Farnaz Khun, et al.
Publicado: (2025)
por: Jush, Farnaz Khun, et al.
Publicado: (2025)
Dynamic Multimodal Fusion via Meta-Learning Towards Micro-Video Recommendation
por: Liu, Han, et al.
Publicado: (2025)
por: Liu, Han, et al.
Publicado: (2025)
Ejemplares similares
-
Are All Genders Equal in the Eyes of Algorithms? -- Analysing Search and Retrieval Algorithms for Algorithmic Gender Fairness
por: Urchs, Stefanie, et al.
Publicado: (2025) -
Boosting Document Parsing Efficiency and Performance with Coarse-to-Fine Visual Processing
por: Cui, Cheng, et al.
Publicado: (2026) -
Decoding Ancient Oracle Bone Script via Generative Dictionary Retrieval
por: Wu, Yin, et al.
Publicado: (2026) -
Studying Illustrations in Manuscripts: An Efficient Deep-Learning Approach
por: Evron, Yoav, et al.
Publicado: (2025) -
Low-Data Classification of Historical Music Manuscripts: A Few-Shot Learning Approach
por: Shatri, Elona, et al.
Publicado: (2024)