| Main Authors: | Oh, Nick, Vrakas, Giorgos D., Brooke, Siân J. M., Morinière, Sasha, Duke, Toju |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Online Access: | https://arxiv.org/abs/2508.09232 |
Similar Items
Human-Data Interaction, Exploration, and Visualization in the AI Era: Challenges and Opportunities
by: Fekete, Jean-Daniel, et al.
Published: (2026)
HeGTa: Leveraging Heterogeneous Graph-enhanced Large Language Models for Few-shot Complex Table Understanding
by: Jin, Rihui, et al.
Published: (2024)
Solving Copyright Infringement on Short Video Platforms: Novel Datasets and an Audio Restoration Deep Learning Pipeline
by: Oh, Minwoo, et al.
Published: (2025)
Health+: Empowering Individuals via Unifying Health Data
by: Maiyya, Sujaya, et al.
Published: (2026)
Continual Multimodal Knowledge Graph Construction
by: Chen, Xiang, et al.
Published: (2023)
End-to-End Learning-based Video Streaming Enhancement Pipeline: A Generative AI Approach
by: Artioli, Emanuele, et al.
Published: (2025)
A Review of Media Copyright Management using Blockchain Technologies from the Academic and Business Perspectives
by: García, Roberto, et al.
Published: (2023)
Introduction of a tree-based technique for efficient and real-time label retrieval in the object tracking system
by: Benrazek, Ala-Eddine, et al.
Published: (2022)
LazyVLM: Neuro-Symbolic Approach to Video Analytics
by: Jian, Xiangru, et al.
Published: (2025)
NeedleDB: A Generative-AI Based System for Accurate and Efficient Image Retrieval using Complex Natural Language Queries
by: Erfanian, Mahdi, et al.
Published: (2026)
Sensorium Arc: AI Agent System for Oceanic Data Exploration and Interactive Eco-Art
by: Bissell, Noah, et al.
Published: (2025)
Differential Multimodal Transformers
by: Li, Jerry, et al.
Published: (2025)
The Synthetic Media Shift: Tracking the Rise, Virality, and Detectability of AI-Generated Multimodal Misinformation
by: Chrysidis, Zacharias, et al.
Published: (2026)
PTA: Enhancing Multimodal Sentiment Analysis through Pipelined Prediction and Translation-based Alignment
by: Song, Shezheng, et al.
Published: (2024)
Modeling Human Responses to Multimodal AI Content
by: Shen, Zhiqi, et al.
Published: (2025)
Signals of Provenance: Practices & Challenges of Navigating Indicators in AI-Generated Media for Sighted and Blind Individuals
by: Ide, Ayae, et al.
Published: (2025)
Vidformer: Drop-in Declarative Optimization for Rendering Video-Native Query Results
by: Winecki, Dominik, et al.
Published: (2026)
A Survey on Multimodal Benchmarks: In the Era of Large AI Models
by: Li, Lin, et al.
Published: (2024)
MindFuse: Towards GenAI Explainability in Marketing Strategy Co-Creation
by: Farseev, Aleksandr, et al.
Published: (2025)
A Conceptual Model of Intelligent Multimedia Data Rendered using Flying Light Specks
by: Yazdani, Nima, et al.
Published: (2024)
Large Language Model Based Multi-Agent System Augmented Complex Event Processing Pipeline for Internet of Multimedia Things
by: Zeeshan, Talha, et al.
Published: (2025)
A Survey of Multi-sensor Fusion Perception for Embodied AI: Background, Methods, Challenges and Prospects
by: Ruan, Shulan, et al.
Published: (2025)
The Dream Within Huang Long Cave: AI-Driven Interactive Narrative for Family Storytelling and Emotional Reflection
by: Huang, Jiayang, et al.
Published: (2025)
Synthesizing Sentiment-Controlled Feedback For Multimodal Text and Image Data
by: Kumar, Puneet, et al.
Published: (2024)
Narrative-to-Scene Generation: An LLM-Driven Pipeline for 2D Game Environments
by: Chen, Yi-Chun, et al.
Published: (2025)
PRISM-XR: Empowering Privacy-Aware XR Collaboration with Multimodal Large Language Models
by: Chen, Jiangong, et al.
Published: (2026)
TalkPlayData 2: An Agentic Synthetic Data Pipeline for Multimodal Conversational Music Recommendation
by: Choi, Keunwoo, et al.
Published: (2025)
LLMER: Crafting Interactive Extended Reality Worlds with JSON Data Generated by Large Language Models
by: Chen, Jiangong, et al.
Published: (2025)
SynthGuard: An Open Platform for Detecting AI-Generated Multimedia with Multimodal LLMs
by: Desai, Shail, et al.
Published: (2025)
MetaDesigner: Advancing Artistic Typography Through AI-Driven, User-Centric, and Multilingual WordArt Synthesis
by: He, Jun-Yan, et al.
Published: (2024)
Designing Singing Syllabi with Virtual Avatars: AI-Assisted Syllabus Reauthoring
by: Wu, Xinxing
Published: (2025)
Q-Ponder: A Unified Training Pipeline for Reasoning-based Visual Quality Assessment
by: Cai, Zhuoxuan, et al.
Published: (2025)
Proceedings of The third international workshop on eXplainable AI for the Arts (XAIxArts)
by: Ford, Corey, et al.
Published: (2025)
Dynamic and Super-Personalized Media Ecosystem Driven by Generative AI: Unpredictable Plays Never Repeating The Same
by: Ahn, Sungjun, et al.
Published: (2024)
SMP Challenge: An Overview and Analysis of Social Media Prediction Challenge
by: Wu, Bo, et al.
Published: (2024)
Media Forensics and Deepfake Systematic Survey
by: CH, Nadeem Jabbar, et al.
Published: (2024)
Detecting Multimedia Generated by Large AI Models: A Survey
by: Lin, Li, et al.
Published: (2024)
Manimator: Transforming Research Papers into Visual Explanations
by: P, Samarth, et al.
Published: (2025)
FedVideoMAE: Efficient Privacy-Preserving Federated Video Moderation
by: Tao, Ziyuan, et al.
Published: (2025)
QoS-QoE Translation with Large Language Model
by: Yu, Yingjie, et al.
Published: (2026)