:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Menzner, Tim, Leidner, Jochen L., Mittag, Florian
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition Information Retrieval
Online Access:	https://arxiv.org/abs/2406.07227
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

BiasScanner: Automatic Detection and Classification of News Bias to Strengthen Democracy
by: Menzner, Tim, et al.
Published: (2024)

Automatic Creative Selection with Cross-Modal Matching
by: Kim, Alex, et al.
Published: (2024)

MMMORRF: Multimodal Multilingual Modularized Reciprocal Rank Fusion
by: Samuel, Saron, et al.
Published: (2025)

Vote-in-Context: Turning VLMs into Zero-Shot Rank Fusers
by: Eltahir, Mohamed, et al.
Published: (2025)

Automatic Funny Scene Extraction from Long-form Cinematic Videos
by: Paul, Sibendu, et al.
Published: (2026)

UniNote: A Unified Embedding Model for Multimodal Representation and Ranking
by: Zhao, Jinghan, et al.
Published: (2026)

Generative Ghost: Investigating Ranking Bias Hidden in AI-Generated Videos
by: Gao, Haowen, et al.
Published: (2025)

RDP: Ranked Differential Privacy for Facial Feature Protection in Multiscale Sparsified Subspace
by: Ou, Lu, et al.
Published: (2024)

PhotoBench: Beyond Visual Matching Towards Personalized Intent-Driven Photo Retrieval
by: Xu, Tianyi, et al.
Published: (2026)

Advancing Re-Ranking with Multimodal Fusion and Target-Oriented Auxiliary Tasks in E-Commerce Search
by: Xu, Enqiang, et al.
Published: (2024)

MTMD: A Multi-Task Multi-Domain Framework for Unified Ad Lightweight Ranking at Pinterest
by: Yang, Xiao, et al.
Published: (2025)

LSC-ADL: An Activity of Daily Living (ADL)-Annotated Lifelog Dataset Generated via Semi-Automatic Clustering
by: Ho-Le, Minh-Quan, et al.
Published: (2025)

CNN-Based Framework for Pedestrian Age and Gender Classification Using Far-View Surveillance in Mixed-Traffic Intersections
by: Arif, Shisir Shahriar, et al.
Published: (2025)

Generalized Contrastive Learning for Multi-Modal Retrieval and Ranking
by: Zhu, Tianyu, et al.
Published: (2024)

Sustainable transparency in Recommender Systems: Bayesian Ranking of Images for Explainability
by: Paz-Ruza, Jorge, et al.
Published: (2023)

Automatic Synthetic Data and Fine-grained Adaptive Feature Alignment for Composed Person Retrieval
by: Liu, Delong, et al.
Published: (2023)

Image Hashing via Cross-View Code Alignment in the Age of Foundation Models
by: Moummad, Ilyass, et al.
Published: (2025)

Toward Automatic Relevance Judgment using Vision--Language Models for Image--Text Retrieval Evaluation
by: Yang, Jheng-Hong, et al.
Published: (2024)

CollEX -- A Multimodal Agentic RAG System Enabling Interactive Exploration of Scientific Collections
by: Schneider, Florian, et al.
Published: (2025)

Interactive Mars Image Content-Based Search with Interpretable Machine Learning
by: Vasu, Bhavan, et al.
Published: (2024)

A Multi-Stage Hybrid Framework for Automated Interpretation of Multi-View Engineering Drawings Using Vision Language Model
by: Khan, Muhammad Tayyab, et al.
Published: (2025)

Bridge the Gap between Past and Future: Siamese Model Optimization for Context-Aware Document Ranking
by: Wu, Songhao, et al.
Published: (2025)

Leveraging Foundation Models for Content-Based Image Retrieval in Radiology
by: Denner, Stefan, et al.
Published: (2024)

POBEVM: Real-time Video Matting via Progressively Optimize the Target Body and Edge
by: Xian, Jianming
Published: (2024)

Interactive Garment Recommendation with User in the Loop
by: Becattini, Federico, et al.
Published: (2024)

DEMO: A Statistical Perspective for Efficient Image-Text Matching
by: Zhang, Fan, et al.
Published: (2024)

Black carbon plumes from gas flaring in North Africa identified from multi-spectral imagery with deep learning
by: Alexandre, Tuel, et al.
Published: (2024)

YOLO-Vehicle-Pro: A Cloud-Edge Collaborative Framework for Object Detection in Autonomous Driving under Adverse Weather Conditions
by: Li, Xiguang, et al.
Published: (2024)

Cross-Modal Pre-Aligned Method with Global and Local Information for Remote-Sensing Image and Text Retrieval
by: Sun, Zengbao, et al.
Published: (2024)

Heterogeneous Graph-based Framework with Disentangled Representations Learning for Multi-target Cross Domain Recommendation
by: Liu, Xiaopeng, et al.
Published: (2024)

Active Learning via Classifier Impact and Greedy Selection for Interactive Image Retrieval
by: Bar, Leah, et al.
Published: (2024)

Offline Evaluation of Set-Based Text-to-Image Generation
by: Arabzadeh, Negar, et al.
Published: (2024)

Video Editing for Video Retrieval
by: Zhu, Bin, et al.
Published: (2024)

Image-text matching for large-scale book collections
by: Llabrés, Artemis, et al.
Published: (2024)

Improving Video Corpus Moment Retrieval with Partial Relevance Enhancement
by: Hou, Danyang, et al.
Published: (2024)

ViFi-ReID: A Two-Stream Vision-WiFi Multimodal Approach for Person Re-identification
by: Mao, Chen, et al.
Published: (2024)

Unity is Strength: Unifying Convolutional and Transformeral Features for Better Person Re-Identification
by: Wang, Yuhao, et al.
Published: (2024)

Rethinking Sparse Lexical Representations for Image Retrieval in the Age of Rising Multi-Modal Large Language Models
by: Nakata, Kengo, et al.
Published: (2024)

LSVOS Challenge 3rd Place Report: SAM2 and Cutie based VOS
by: Liu, Xinyu, et al.
Published: (2024)

DistillGrasp: Integrating Features Correlation with Knowledge Distillation for Depth Completion of Transparent Objects
by: Huang, Yiheng, et al.
Published: (2024)