Saved in:
| Main Authors: | Menzner, Tim, Leidner, Jochen L., Mittag, Florian |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2406.07227 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
BiasScanner: Automatic Detection and Classification of News Bias to Strengthen Democracy
by: Menzner, Tim, et al.
Published: (2024)
by: Menzner, Tim, et al.
Published: (2024)
Automatic Creative Selection with Cross-Modal Matching
by: Kim, Alex, et al.
Published: (2024)
by: Kim, Alex, et al.
Published: (2024)
MMMORRF: Multimodal Multilingual Modularized Reciprocal Rank Fusion
by: Samuel, Saron, et al.
Published: (2025)
by: Samuel, Saron, et al.
Published: (2025)
Vote-in-Context: Turning VLMs into Zero-Shot Rank Fusers
by: Eltahir, Mohamed, et al.
Published: (2025)
by: Eltahir, Mohamed, et al.
Published: (2025)
Automatic Funny Scene Extraction from Long-form Cinematic Videos
by: Paul, Sibendu, et al.
Published: (2026)
by: Paul, Sibendu, et al.
Published: (2026)
UniNote: A Unified Embedding Model for Multimodal Representation and Ranking
by: Zhao, Jinghan, et al.
Published: (2026)
by: Zhao, Jinghan, et al.
Published: (2026)
Generative Ghost: Investigating Ranking Bias Hidden in AI-Generated Videos
by: Gao, Haowen, et al.
Published: (2025)
by: Gao, Haowen, et al.
Published: (2025)
RDP: Ranked Differential Privacy for Facial Feature Protection in Multiscale Sparsified Subspace
by: Ou, Lu, et al.
Published: (2024)
by: Ou, Lu, et al.
Published: (2024)
PhotoBench: Beyond Visual Matching Towards Personalized Intent-Driven Photo Retrieval
by: Xu, Tianyi, et al.
Published: (2026)
by: Xu, Tianyi, et al.
Published: (2026)
Advancing Re-Ranking with Multimodal Fusion and Target-Oriented Auxiliary Tasks in E-Commerce Search
by: Xu, Enqiang, et al.
Published: (2024)
by: Xu, Enqiang, et al.
Published: (2024)
MTMD: A Multi-Task Multi-Domain Framework for Unified Ad Lightweight Ranking at Pinterest
by: Yang, Xiao, et al.
Published: (2025)
by: Yang, Xiao, et al.
Published: (2025)
LSC-ADL: An Activity of Daily Living (ADL)-Annotated Lifelog Dataset Generated via Semi-Automatic Clustering
by: Ho-Le, Minh-Quan, et al.
Published: (2025)
by: Ho-Le, Minh-Quan, et al.
Published: (2025)
CNN-Based Framework for Pedestrian Age and Gender Classification Using Far-View Surveillance in Mixed-Traffic Intersections
by: Arif, Shisir Shahriar, et al.
Published: (2025)
by: Arif, Shisir Shahriar, et al.
Published: (2025)
Generalized Contrastive Learning for Multi-Modal Retrieval and Ranking
by: Zhu, Tianyu, et al.
Published: (2024)
by: Zhu, Tianyu, et al.
Published: (2024)
Sustainable transparency in Recommender Systems: Bayesian Ranking of Images for Explainability
by: Paz-Ruza, Jorge, et al.
Published: (2023)
by: Paz-Ruza, Jorge, et al.
Published: (2023)
Automatic Synthetic Data and Fine-grained Adaptive Feature Alignment for Composed Person Retrieval
by: Liu, Delong, et al.
Published: (2023)
by: Liu, Delong, et al.
Published: (2023)
Image Hashing via Cross-View Code Alignment in the Age of Foundation Models
by: Moummad, Ilyass, et al.
Published: (2025)
by: Moummad, Ilyass, et al.
Published: (2025)
Toward Automatic Relevance Judgment using Vision--Language Models for Image--Text Retrieval Evaluation
by: Yang, Jheng-Hong, et al.
Published: (2024)
by: Yang, Jheng-Hong, et al.
Published: (2024)
CollEX -- A Multimodal Agentic RAG System Enabling Interactive Exploration of Scientific Collections
by: Schneider, Florian, et al.
Published: (2025)
by: Schneider, Florian, et al.
Published: (2025)
Interactive Mars Image Content-Based Search with Interpretable Machine Learning
by: Vasu, Bhavan, et al.
Published: (2024)
by: Vasu, Bhavan, et al.
Published: (2024)
A Multi-Stage Hybrid Framework for Automated Interpretation of Multi-View Engineering Drawings Using Vision Language Model
by: Khan, Muhammad Tayyab, et al.
Published: (2025)
by: Khan, Muhammad Tayyab, et al.
Published: (2025)
Bridge the Gap between Past and Future: Siamese Model Optimization for Context-Aware Document Ranking
by: Wu, Songhao, et al.
Published: (2025)
by: Wu, Songhao, et al.
Published: (2025)
Leveraging Foundation Models for Content-Based Image Retrieval in Radiology
by: Denner, Stefan, et al.
Published: (2024)
by: Denner, Stefan, et al.
Published: (2024)
POBEVM: Real-time Video Matting via Progressively Optimize the Target Body and Edge
by: Xian, Jianming
Published: (2024)
by: Xian, Jianming
Published: (2024)
Interactive Garment Recommendation with User in the Loop
by: Becattini, Federico, et al.
Published: (2024)
by: Becattini, Federico, et al.
Published: (2024)
DEMO: A Statistical Perspective for Efficient Image-Text Matching
by: Zhang, Fan, et al.
Published: (2024)
by: Zhang, Fan, et al.
Published: (2024)
Black carbon plumes from gas flaring in North Africa identified from multi-spectral imagery with deep learning
by: Alexandre, Tuel, et al.
Published: (2024)
by: Alexandre, Tuel, et al.
Published: (2024)
YOLO-Vehicle-Pro: A Cloud-Edge Collaborative Framework for Object Detection in Autonomous Driving under Adverse Weather Conditions
by: Li, Xiguang, et al.
Published: (2024)
by: Li, Xiguang, et al.
Published: (2024)
Cross-Modal Pre-Aligned Method with Global and Local Information for Remote-Sensing Image and Text Retrieval
by: Sun, Zengbao, et al.
Published: (2024)
by: Sun, Zengbao, et al.
Published: (2024)
Heterogeneous Graph-based Framework with Disentangled Representations Learning for Multi-target Cross Domain Recommendation
by: Liu, Xiaopeng, et al.
Published: (2024)
by: Liu, Xiaopeng, et al.
Published: (2024)
Active Learning via Classifier Impact and Greedy Selection for Interactive Image Retrieval
by: Bar, Leah, et al.
Published: (2024)
by: Bar, Leah, et al.
Published: (2024)
Offline Evaluation of Set-Based Text-to-Image Generation
by: Arabzadeh, Negar, et al.
Published: (2024)
by: Arabzadeh, Negar, et al.
Published: (2024)
Video Editing for Video Retrieval
by: Zhu, Bin, et al.
Published: (2024)
by: Zhu, Bin, et al.
Published: (2024)
Image-text matching for large-scale book collections
by: Llabrés, Artemis, et al.
Published: (2024)
by: Llabrés, Artemis, et al.
Published: (2024)
Improving Video Corpus Moment Retrieval with Partial Relevance Enhancement
by: Hou, Danyang, et al.
Published: (2024)
by: Hou, Danyang, et al.
Published: (2024)
ViFi-ReID: A Two-Stream Vision-WiFi Multimodal Approach for Person Re-identification
by: Mao, Chen, et al.
Published: (2024)
by: Mao, Chen, et al.
Published: (2024)
Unity is Strength: Unifying Convolutional and Transformeral Features for Better Person Re-Identification
by: Wang, Yuhao, et al.
Published: (2024)
by: Wang, Yuhao, et al.
Published: (2024)
Rethinking Sparse Lexical Representations for Image Retrieval in the Age of Rising Multi-Modal Large Language Models
by: Nakata, Kengo, et al.
Published: (2024)
by: Nakata, Kengo, et al.
Published: (2024)
LSVOS Challenge 3rd Place Report: SAM2 and Cutie based VOS
by: Liu, Xinyu, et al.
Published: (2024)
by: Liu, Xinyu, et al.
Published: (2024)
DistillGrasp: Integrating Features Correlation with Knowledge Distillation for Depth Completion of Transparent Objects
by: Huang, Yiheng, et al.
Published: (2024)
by: Huang, Yiheng, et al.
Published: (2024)
Similar Items
-
BiasScanner: Automatic Detection and Classification of News Bias to Strengthen Democracy
by: Menzner, Tim, et al.
Published: (2024) -
Automatic Creative Selection with Cross-Modal Matching
by: Kim, Alex, et al.
Published: (2024) -
MMMORRF: Multimodal Multilingual Modularized Reciprocal Rank Fusion
by: Samuel, Saron, et al.
Published: (2025) -
Vote-in-Context: Turning VLMs into Zero-Shot Rank Fusers
by: Eltahir, Mohamed, et al.
Published: (2025) -
Automatic Funny Scene Extraction from Long-form Cinematic Videos
by: Paul, Sibendu, et al.
Published: (2026)