| Main Author: | Lutsai, Kateryna |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Online Access: | https://arxiv.org/abs/2507.21114 |
Similar Items
LinkedOut: Linking World Knowledge Representation Out of Video LLM for Next-Generation Video Recommendation
by: Zhang, Haichao, et al.
Published: (2025)
Perception-Aware Bias Detection for Query Suggestions
by: Haak, Fabian, et al.
Published: (2026)
Diverse capability and scaling of diffusion and auto-regressive models when learning abstract rules
by: Wang, Binxu, et al.
Published: (2024)
LLM-supported document separation for printed reviews from zbMATH Open
by: Pluzhnikov, Ivan, et al.
Published: (2026)
Narrative Fingerprints: Multi-Scale Author Identification via Novelty Curve Dynamics
by: Zimmerman, Fred, et al.
Published: (2026)
PlotPick: AI-powered batch extraction of numerical data from scientific figures
by: Carstensen, Tommy
Published: (2026)
Spatially-Grounded Document Retrieval via Patch-to-Region Relevance Propagation
by: Georgiou, Athos
Published: (2025)
Visualizing the Evolution of Twitter (X.com) Conversations: A Comprehensive Methodology Applied to AI Training Discussions on ChatGPT
by: Jess, Nicole, et al.
Published: (2024)
Supervised Dimensionality Reduction Revisited: Why LDA on Frozen CNN Features Deserves a Second Look
by: Kumar, Indar, et al.
Published: (2026)
AVATAAR: Agentic Video Answering via Temporal Adaptive Alignment and Reasoning
by: Patel, Urjitkumar, et al.
Published: (2025)
Unraveling Media Perspectives: A Comprehensive Methodology Combining Large Language Models, Topic Modeling, Sentiment Analysis, and Ontology Learning to Analyse Media Bias
by: Jähde, Orlando, et al.
Published: (2025)
Dense Video Understanding with Gated Residual Tokenization
by: Zhang, Haichao, et al.
Published: (2025)
Enhancing OCR for Sino-Vietnamese Language Processing via Fine-tuned PaddleOCRv5
by: Nguyen, Minh Hoang, et al.
Published: (2025)
Detection of high-frequency oscillations using time-frequency analysis
by: Mohammadpour, Mostafa, et al.
Published: (2025)
Interpretable Machine Learning-Derived Spectral Indices for Vegetation Monitoring
by: Lotfi, Ali, et al.
Published: (2025)
Time Step Generating: A Universal Synthesized Deepfake Image Detector
by: Zeng, Ziyue, et al.
Published: (2024)
A Reproducible, Scalable Pipeline for Synthesizing Autoregressive Model Literature
by: Alpay, Faruk, et al.
Published: (2025)
DPDisc: From Factoid Questions to Data Product Requests for Open-World Data Product Discovery over Tables and Text
by: Zhang, Liangliang, et al.
Published: (2025)
A Practical Synthesis of Detecting AI-Generated Textual, Visual, and Audio Content
by: Cao, Lele
Published: (2025)
A Tractography Analysis Framework Using Diffusion Maps to Study Thalamic Connectivity in Traumatic Brain Injury
by: Sharma, Akul, et al.
Published: (2025)
Gesture Evaluation in Virtual Reality
by: Werner, Axel Wiebe, et al.
Published: (2025)
Hydra: Unifying Document Retrieval and Generation in a Single Vision-Language Model
by: Georgiou, Athos
Published: (2026)
Causal Deep Learning
by: Vasilescu, M. Alex O.
Published: (2023)
A Multimodal Pipeline for Clinical Data Extraction: Applying Vision-Language Models to Scans of Transfusion Reaction Reports
by: Schäfer, Henning, et al.
Published: (2025)
Multi-source Heterogeneous Public Opinion Analysis via Collaborative Reasoning and Adaptive Fusion: A Systematically Integrated Approach
by: Liu, Yi
Published: (2026)
AgenticAI-DialogGen: Topic-Guided Conversation Generation for Fine-Tuning and Evaluating Short- and Long-Term Memories of LLMs
by: Perera, Manoj Madushanka, et al.
Published: (2026)
SQUARE: Semantic Query-Augmented Fusion and Efficient Batch Reranking for Training-free Zero-Shot Composed Image Retrieval
by: Wu, Ren-Di, et al.
Published: (2025)
Autodiscover: A reinforcement learning recommendation system for the cold-start imbalance challenge in active learning, powered by graph-aware thompson sampling
by: Vares, Parsa
Published: (2026)
Deep Outdated Fact Detection in Knowledge Graphs
by: Tu, Huiling, et al.
Published: (2024)
Emerging-properties Mapping Using Spatial Embedding Statistics: EMUSES
by: Foulon, Chris, et al.
Published: (2024)
Deep Global Clustering for Hyperspectral Image Segmentation: Concepts, Applications, and Open Challenges
by: Chang, Yu-Tang, et al.
Published: (2025)
Mapping the Web of Science, a large-scale graph and text-based dataset with LLM embeddings
by: Kunt, Tim, et al.
Published: (2026)
SoccerChat: Integrating Multimodal Data for Enhanced Soccer Game Understanding
by: Gautam, Sushant, et al.
Published: (2025)
FARSIQA: Faithful and Advanced RAG System for Islamic Question Answering
by: Asl, Mohammad Aghajani, et al.
Published: (2025)
Multimodal AI-based visualization of strategic leaders' emotional dynamics: a deep behavioral analysis of Trump's trade war discourse
by: Meng, Wei
Published: (2025)
Predicting the Geolocation of Tweets Using transformer models on Customized Data
by: Lutsai, Kateryna, et al.
Published: (2023)
Advancing Annotat3D with Harpia: A CUDA-Accelerated Library For Large-Scale Volumetric Data Segmentation
by: de Araujo, Camila Machado, et al.
Published: (2025)
Software architecture and manual for novel versatile CT image analysis toolbox -- AnatomyArchive
by: Xu, Lei, et al.
Published: (2025)
TriAlignGR: Triangular Multitask Alignment with Multimodal Deep Interest Mining for Generative Recommendation
by: Zeng, Yangchen, et al.
Published: (2026)
MVTamperBench: Evaluating Robustness of Vision-Language Models
by: Agarwal, Amit, et al.
Published: (2024)