:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Kim, Jin, Lee, Byunghwee, You, Taekho, Yun, Jinhyuk
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition Artificial Intelligence Computers and Society Machine Learning
Online Access:	https://arxiv.org/abs/2503.13531
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Investigating the diversity and stylization of contemporary user generated visual arts in the complexity entropy plane
by: Kim, Seunghwan, et al.
Published: (2024)

Social Links vs. Language Barriers: Decoding the Global Spread of Streaming Content
by: Park, Seoyoung, et al.
Published: (2024)

Aligning AI with Public Values: Deliberation and Decision-Making for Governing Multimodal LLMs in Political Video Analysis
by: Sharma, Tanusree, et al.
Published: (2024)

SNIFFER: Multimodal Large Language Model for Explainable Out-of-Context Misinformation Detection
by: Qi, Peng, et al.
Published: (2024)

Direction-Flipped Influence Audits Reveal Hidden Structure in Moral Choices of LLMs
by: Blandfort, Phil, et al.
Published: (2026)

Are Multimodal LLMs Ready for Clinical Dermatology? A Real-World Evaluation in Dermatology
by: Jiang, Roy, et al.
Published: (2026)

Mitigating GenAI-powered Evidence Pollution for Out-of-Context Multimodal Misinformation Detection
by: Yan, Zehong, et al.
Published: (2025)

Watch Video, Catch Keyword: Context-aware Keyword Attention for Moment Retrieval and Highlight Detection
by: Um, Sung Jin, et al.
Published: (2025)

Decoding Tourist Perception in Historic Urban Quarters with Multimodal Social Media Data: An AI-Based Framework and Evidence from Shanghai
by: Tan, Kaizhen, et al.
Published: (2025)

Where Culture Fades: Revealing the Cultural Gap in Text-to-Image Generation
by: Shi, Chuancheng, et al.
Published: (2025)

ArtSplat: Feed-Forward Articulated 3D Gaussian Splatting from Sparse Multi-State Uncalibrated Views
by: Lee, Inseo, et al.
Published: (2026)

Bridging the Gap: Doubles Badminton Analysis with Singles-Trained Models
by: Baek, Seungheon, et al.
Published: (2025)

Multimodal Political Bias Identification and Neutralization
by: Bernard, Cedric, et al.
Published: (2025)

Map the Flow: Revealing Hidden Pathways of Information in VideoLLMs
by: Kim, Minji, et al.
Published: (2025)

Hidden Bias in the Machine: Stereotypes in Text-to-Image Models
by: Porikli, Sedat, et al.
Published: (2025)

AI-based Multimodal Biometrics for Detecting Smartphone Distractions: Application to Online Learning
by: Becerra, Alvaro, et al.
Published: (2025)

Emergent AI Surveillance: Overlearned Person Re-Identification and Its Mitigation in Law Enforcement Context
by: Nguyen, An Thi, et al.
Published: (2025)

Visual Chronicles: Using Multimodal LLMs to Analyze Massive Collections of Images
by: Deng, Boyang, et al.
Published: (2025)

Rainbow Noise: Stress-Testing Multimodal Harmful-Meme Detectors on LGBTQ Content
by: Tong, Ran, et al.
Published: (2025)

From Content to Audience: A Multimodal Annotation Framework for Broadcast Television Analytics
by: Cupini, Paolo, et al.
Published: (2026)

ECMF: Enhanced Cross-Modal Fusion for Multimodal Emotion Recognition in MER-SEMI Challenge
by: Hu, Juewen, et al.
Published: (2025)

Can Multimodal LLMs See Science Instruction? Benchmarking Pedagogical Reasoning in K-12 Classroom Videos
by: Shen, Yixuan, et al.
Published: (2026)

Deepfake-Eval-2024: A Multi-Modal In-the-Wild Benchmark of Deepfakes Circulated in 2024
by: Chandra, Nuria Alina, et al.
Published: (2025)

BuildingView: Constructing Urban Building Exteriors Databases with Street View Imagery and Multimodal Large Language Mode
by: Li, Zongrong, et al.
Published: (2024)

RULE: Reliable Multimodal RAG for Factuality in Medical Vision Language Models
by: Xia, Peng, et al.
Published: (2024)

Intelligent Systems in Neuroimaging: Pioneering AI Techniques for Brain Tumor Detection
by: Islam, Md. Mohaiminul, et al.
Published: (2025)

Safer Prompts: Reducing Risks from Memorization in Visual Generative AI
by: Reissinger, Lena, et al.
Published: (2025)

An AI-Enabled Framework Within Reach for Enhancing Healthcare Sustainability and Fairness
by: Huang, Bin, et al.
Published: (2024)

EDU-CIRCUIT-HW: Evaluating Multimodal Large Language Models on Real-World University-Level STEM Student Handwritten Solutions
by: Sun, Weiyu, et al.
Published: (2026)

Egocentric Co-Pilot: Web-Native Smart-Glasses Agents for Assistive Egocentric AI
by: Yang, Sicheng, et al.
Published: (2026)

Multimodal Learning with Augmentation Techniques for Natural Disaster Assessment
by: Urse, Adrian-Dinu, et al.
Published: (2025)

AI's Blind Spots: Geographic Knowledge and Diversity Deficit in Generated Urban Scenario
by: Beneduce, Ciro, et al.
Published: (2025)

Edge-AI for Agriculture: Lightweight Vision Models for Disease Detection in Resource-Limited Settings
by: Joshi, Harsh
Published: (2024)

Climatic & Anthropogenic Hazards to the Nasca World Heritage: Application of Remote Sensing, AI, and Flood Modelling
by: Sakai, Masato, et al.
Published: (2024)

Silicon Minds versus Human Hearts: The Wisdom of Crowds Beats the Wisdom of AI in Emotion Recognition
by: Akben, Mustafa, et al.
Published: (2025)

Smiling Women Pitching Down: Auditing Representational and Presentational Gender Biases in Image Generative AI
by: Sun, Luhang, et al.
Published: (2023)

Two Stage Context Learning with Large Language Models for Multimodal Stance Detection on Climate Change
by: Pangtey, Lata, et al.
Published: (2025)

Assessing Greenspace Attractiveness with ChatGPT, Claude, and Gemini: Do AI Models Reflect Human Perceptions?
by: Malekzadeh, Milad, et al.
Published: (2025)

AI-generated data contamination erodes pathological variability and diagnostic reliability
by: He, Hongyu, et al.
Published: (2026)

When VLMs 'Fix' Students: Identifying and Penalizing Over-Correction in the Evaluation of Multi-line Handwritten Math OCR
by: Seong, Jin, et al.
Published: (2026)