:: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Oh, Nick, Vrakas, Giorgos D., Brooke, Siân J. M., Morinière, Sasha, Duke, Toju
Format:	Preprint
Published:	2025
Subjects:	Multimedia Artificial Intelligence Databases
Online Access:	https://arxiv.org/abs/2508.09232
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Human-Data Interaction, Exploration, and Visualization in the AI Era: Challenges and Opportunities
by: Fekete, Jean-Daniel, et al.
Published: (2026)

HeGTa: Leveraging Heterogeneous Graph-enhanced Large Language Models for Few-shot Complex Table Understanding
by: Jin, Rihui, et al.
Published: (2024)

Solving Copyright Infringement on Short Video Platforms: Novel Datasets and an Audio Restoration Deep Learning Pipeline
by: Oh, Minwoo, et al.
Published: (2025)

Health+: Empowering Individuals via Unifying Health Data
by: Maiyya, Sujaya, et al.
Published: (2026)

Continual Multimodal Knowledge Graph Construction
by: Chen, Xiang, et al.
Published: (2023)

End-to-End Learning-based Video Streaming Enhancement Pipeline: A Generative AI Approach
by: Artioli, Emanuele, et al.
Published: (2025)

A Review of Media Copyright Management using Blockchain Technologies from the Academic and Business Perspectives
by: García, Roberto, et al.
Published: (2023)

Introduction of a tree-based technique for efficient and real-time label retrieval in the object tracking system
by: Benrazek, Ala-Eddine, et al.
Published: (2022)

LazyVLM: Neuro-Symbolic Approach to Video Analytics
by: Jian, Xiangru, et al.
Published: (2025)

NeedleDB: A Generative-AI Based System for Accurate and Efficient Image Retrieval using Complex Natural Language Queries
by: Erfanian, Mahdi, et al.
Published: (2026)

Sensorium Arc: AI Agent System for Oceanic Data Exploration and Interactive Eco-Art
by: Bissell, Noah, et al.
Published: (2025)

Differential Multimodal Transformers
by: Li, Jerry, et al.
Published: (2025)

The Synthetic Media Shift: Tracking the Rise, Virality, and Detectability of AI-Generated Multimodal Misinformation
by: Chrysidis, Zacharias, et al.
Published: (2026)

PTA: Enhancing Multimodal Sentiment Analysis through Pipelined Prediction and Translation-based Alignment
by: Song, Shezheng, et al.
Published: (2024)

Modeling Human Responses to Multimodal AI Content
by: Shen, Zhiqi, et al.
Published: (2025)

Signals of Provenance: Practices & Challenges of Navigating Indicators in AI-Generated Media for Sighted and Blind Individuals
by: Ide, Ayae, et al.
Published: (2025)

Vidformer: Drop-in Declarative Optimization for Rendering Video-Native Query Results
by: Winecki, Dominik, et al.
Published: (2026)

A Survey on Multimodal Benchmarks: In the Era of Large AI Models
by: Li, Lin, et al.
Published: (2024)

MindFuse: Towards GenAI Explainability in Marketing Strategy Co-Creation
by: Farseev, Aleksandr, et al.
Published: (2025)

A Conceptual Model of Intelligent Multimedia Data Rendered using Flying Light Specks
by: Yazdani, Nima, et al.
Published: (2024)

Large Language Model Based Multi-Agent System Augmented Complex Event Processing Pipeline for Internet of Multimedia Things
by: Zeeshan, Talha, et al.
Published: (2025)

A Survey of Multi-sensor Fusion Perception for Embodied AI: Background, Methods, Challenges and Prospects
by: Ruan, Shulan, et al.
Published: (2025)

The Dream Within Huang Long Cave: AI-Driven Interactive Narrative for Family Storytelling and Emotional Reflection
by: Huang, Jiayang, et al.
Published: (2025)

Synthesizing Sentiment-Controlled Feedback For Multimodal Text and Image Data
by: Kumar, Puneet, et al.
Published: (2024)

Narrative-to-Scene Generation: An LLM-Driven Pipeline for 2D Game Environments
by: Chen, Yi-Chun, et al.
Published: (2025)

PRISM-XR: Empowering Privacy-Aware XR Collaboration with Multimodal Large Language Models
by: Chen, Jiangong, et al.
Published: (2026)

TalkPlayData 2: An Agentic Synthetic Data Pipeline for Multimodal Conversational Music Recommendation
by: Choi, Keunwoo, et al.
Published: (2025)

LLMER: Crafting Interactive Extended Reality Worlds with JSON Data Generated by Large Language Models
by: Chen, Jiangong, et al.
Published: (2025)

SynthGuard: An Open Platform for Detecting AI-Generated Multimedia with Multimodal LLMs
by: Desai, Shail, et al.
Published: (2025)

MetaDesigner: Advancing Artistic Typography Through AI-Driven, User-Centric, and Multilingual WordArt Synthesis
by: He, Jun-Yan, et al.
Published: (2024)

Designing Singing Syllabi with Virtual Avatars: AI-Assisted Syllabus Reauthoring
by: Wu, Xinxing
Published: (2025)

Q-Ponder: A Unified Training Pipeline for Reasoning-based Visual Quality Assessment
by: Cai, Zhuoxuan, et al.
Published: (2025)

Proceedings of The third international workshop on eXplainable AI for the Arts (XAIxArts)
by: Ford, Corey, et al.
Published: (2025)

Dynamic and Super-Personalized Media Ecosystem Driven by Generative AI: Unpredictable Plays Never Repeating The Same
by: Ahn, Sungjun, et al.
Published: (2024)

SMP Challenge: An Overview and Analysis of Social Media Prediction Challenge
by: Wu, Bo, et al.
Published: (2024)

Media Forensics and Deepfake Systematic Survey
by: CH, Nadeem Jabbar, et al.
Published: (2024)

Detecting Multimedia Generated by Large AI Models: A Survey
by: Lin, Li, et al.
Published: (2024)

Manimator: Transforming Research Papers into Visual Explanations
by: P, Samarth, et al.
Published: (2025)

FedVideoMAE: Efficient Privacy-Preserving Federated Video Moderation
by: Tao, Ziyuan, et al.
Published: (2025)

QoS-QoE Translation with Large Language Model
by: Yu, Yingjie, et al.
Published: (2026)