:: Library Catalog

Bibliographic Details
Main Author:	Lutsai, Kateryna
Format:	Preprint
Published:	2025
Subjects:	Information Retrieval; Artificial Intelligence; Computer Vision and Pattern Recognition; 68T10, 68T09, 62H30; I.7.5; H.3.7
Online Access:	https://arxiv.org/abs/2507.21114

Similar Items

LinkedOut: Linking World Knowledge Representation Out of Video LLM for Next-Generation Video Recommendation
by: Zhang, Haichao, et al.
Published: (2025)

Perception-Aware Bias Detection for Query Suggestions
by: Haak, Fabian, et al.
Published: (2026)

Diverse capability and scaling of diffusion and auto-regressive models when learning abstract rules
by: Wang, Binxu, et al.
Published: (2024)

LLM-supported document separation for printed reviews from zbMATH Open
by: Pluzhnikov, Ivan, et al.
Published: (2026)

Narrative Fingerprints: Multi-Scale Author Identification via Novelty Curve Dynamics
by: Zimmerman, Fred, et al.
Published: (2026)

PlotPick: AI-powered batch extraction of numerical data from scientific figures
by: Carstensen, Tommy
Published: (2026)

Spatially-Grounded Document Retrieval via Patch-to-Region Relevance Propagation
by: Georgiou, Athos
Published: (2025)

Visualizing the Evolution of Twitter (X.com) Conversations: A Comprehensive Methodology Applied to AI Training Discussions on ChatGPT
by: Jess, Nicole, et al.
Published: (2024)

Supervised Dimensionality Reduction Revisited: Why LDA on Frozen CNN Features Deserves a Second Look
by: Kumar, Indar, et al.
Published: (2026)

AVATAAR: Agentic Video Answering via Temporal Adaptive Alignment and Reasoning
by: Patel, Urjitkumar, et al.
Published: (2025)

Unraveling Media Perspectives: A Comprehensive Methodology Combining Large Language Models, Topic Modeling, Sentiment Analysis, and Ontology Learning to Analyse Media Bias
by: Jähde, Orlando, et al.
Published: (2025)

Dense Video Understanding with Gated Residual Tokenization
by: Zhang, Haichao, et al.
Published: (2025)

Enhancing OCR for Sino-Vietnamese Language Processing via Fine-tuned PaddleOCRv5
by: Nguyen, Minh Hoang, et al.
Published: (2025)

Detection of high-frequency oscillations using time-frequency analysis
by: Mohammadpour, Mostafa, et al.
Published: (2025)

Interpretable Machine Learning-Derived Spectral Indices for Vegetation Monitoring
by: Lotfi, Ali, et al.
Published: (2025)

Time Step Generating: A Universal Synthesized Deepfake Image Detector
by: Zeng, Ziyue, et al.
Published: (2024)

A Reproducible, Scalable Pipeline for Synthesizing Autoregressive Model Literature
by: Alpay, Faruk, et al.
Published: (2025)

DPDisc: From Factoid Questions to Data Product Requests for Open-World Data Product Discovery over Tables and Text
by: Zhang, Liangliang, et al.
Published: (2025)

A Practical Synthesis of Detecting AI-Generated Textual, Visual, and Audio Content
by: Cao, Lele
Published: (2025)

A Tractography Analysis Framework Using Diffusion Maps to Study Thalamic Connectivity in Traumatic Brain Injury
by: Sharma, Akul, et al.
Published: (2025)

Gesture Evaluation in Virtual Reality
by: Werner, Axel Wiebe, et al.
Published: (2025)

Hydra: Unifying Document Retrieval and Generation in a Single Vision-Language Model
by: Georgiou, Athos
Published: (2026)

Causal Deep Learning
by: Vasilescu, M. Alex O.
Published: (2023)

A Multimodal Pipeline for Clinical Data Extraction: Applying Vision-Language Models to Scans of Transfusion Reaction Reports
by: Schäfer, Henning, et al.
Published: (2025)

Multi-source Heterogeneous Public Opinion Analysis via Collaborative Reasoning and Adaptive Fusion: A Systematically Integrated Approach
by: Liu, Yi
Published: (2026)

AgenticAI-DialogGen: Topic-Guided Conversation Generation for Fine-Tuning and Evaluating Short- and Long-Term Memories of LLMs
by: Perera, Manoj Madushanka, et al.
Published: (2026)

SQUARE: Semantic Query-Augmented Fusion and Efficient Batch Reranking for Training-free Zero-Shot Composed Image Retrieval
by: Wu, Ren-Di, et al.
Published: (2025)

Autodiscover: A reinforcement learning recommendation system for the cold-start imbalance challenge in active learning, powered by graph-aware thompson sampling
by: Vares, Parsa
Published: (2026)

Deep Outdated Fact Detection in Knowledge Graphs
by: Tu, Huiling, et al.
Published: (2024)

Emerging-properties Mapping Using Spatial Embedding Statistics: EMUSES
by: Foulon, Chris, et al.
Published: (2024)

Deep Global Clustering for Hyperspectral Image Segmentation: Concepts, Applications, and Open Challenges
by: Chang, Yu-Tang, et al.
Published: (2025)

Mapping the Web of Science, a large-scale graph and text-based dataset with LLM embeddings
by: Kunt, Tim, et al.
Published: (2026)

SoccerChat: Integrating Multimodal Data for Enhanced Soccer Game Understanding
by: Gautam, Sushant, et al.
Published: (2025)

FARSIQA: Faithful and Advanced RAG System for Islamic Question Answering
by: Asl, Mohammad Aghajani, et al.
Published: (2025)

Multimodal AI-based visualization of strategic leaders' emotional dynamics: a deep behavioral analysis of Trump's trade war discourse
by: Meng, Wei
Published: (2025)

Predicting the Geolocation of Tweets Using transformer models on Customized Data
by: Lutsai, Kateryna, et al.
Published: (2023)

Advancing Annotat3D with Harpia: A CUDA-Accelerated Library For Large-Scale Volumetric Data Segmentation
by: de Araujo, Camila Machado, et al.
Published: (2025)

Software architecture and manual for novel versatile CT image analysis toolbox -- AnatomyArchive
by: Xu, Lei, et al.
Published: (2025)

TriAlignGR: Triangular Multitask Alignment with Multimodal Deep Interest Mining for Generative Recommendation
by: Zeng, Yangchen, et al.
Published: (2026)

MVTamperBench: Evaluating Robustness of Vision-Language Models
by: Agarwal, Amit, et al.
Published: (2024)