Saved in:
| Main Authors: | Kim, Jin, Lee, Byunghwee, You, Taekho, Yun, Jinhyuk |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2503.13531 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Investigating the diversity and stylization of contemporary user generated visual arts in the complexity entropy plane
by: Kim, Seunghwan, et al.
Published: (2024)
by: Kim, Seunghwan, et al.
Published: (2024)
Social Links vs. Language Barriers: Decoding the Global Spread of Streaming Content
by: Park, Seoyoung, et al.
Published: (2024)
by: Park, Seoyoung, et al.
Published: (2024)
Aligning AI with Public Values: Deliberation and Decision-Making for Governing Multimodal LLMs in Political Video Analysis
by: Sharma, Tanusree, et al.
Published: (2024)
by: Sharma, Tanusree, et al.
Published: (2024)
SNIFFER: Multimodal Large Language Model for Explainable Out-of-Context Misinformation Detection
by: Qi, Peng, et al.
Published: (2024)
by: Qi, Peng, et al.
Published: (2024)
Direction-Flipped Influence Audits Reveal Hidden Structure in Moral Choices of LLMs
by: Blandfort, Phil, et al.
Published: (2026)
by: Blandfort, Phil, et al.
Published: (2026)
Are Multimodal LLMs Ready for Clinical Dermatology? A Real-World Evaluation in Dermatology
by: Jiang, Roy, et al.
Published: (2026)
by: Jiang, Roy, et al.
Published: (2026)
Mitigating GenAI-powered Evidence Pollution for Out-of-Context Multimodal Misinformation Detection
by: Yan, Zehong, et al.
Published: (2025)
by: Yan, Zehong, et al.
Published: (2025)
Watch Video, Catch Keyword: Context-aware Keyword Attention for Moment Retrieval and Highlight Detection
by: Um, Sung Jin, et al.
Published: (2025)
by: Um, Sung Jin, et al.
Published: (2025)
Decoding Tourist Perception in Historic Urban Quarters with Multimodal Social Media Data: An AI-Based Framework and Evidence from Shanghai
by: Tan, Kaizhen, et al.
Published: (2025)
by: Tan, Kaizhen, et al.
Published: (2025)
Where Culture Fades: Revealing the Cultural Gap in Text-to-Image Generation
by: Shi, Chuancheng, et al.
Published: (2025)
by: Shi, Chuancheng, et al.
Published: (2025)
ArtSplat: Feed-Forward Articulated 3D Gaussian Splatting from Sparse Multi-State Uncalibrated Views
by: Lee, Inseo, et al.
Published: (2026)
by: Lee, Inseo, et al.
Published: (2026)
Bridging the Gap: Doubles Badminton Analysis with Singles-Trained Models
by: Baek, Seungheon, et al.
Published: (2025)
by: Baek, Seungheon, et al.
Published: (2025)
Multimodal Political Bias Identification and Neutralization
by: Bernard, Cedric, et al.
Published: (2025)
by: Bernard, Cedric, et al.
Published: (2025)
Map the Flow: Revealing Hidden Pathways of Information in VideoLLMs
by: Kim, Minji, et al.
Published: (2025)
by: Kim, Minji, et al.
Published: (2025)
Hidden Bias in the Machine: Stereotypes in Text-to-Image Models
by: Porikli, Sedat, et al.
Published: (2025)
by: Porikli, Sedat, et al.
Published: (2025)
AI-based Multimodal Biometrics for Detecting Smartphone Distractions: Application to Online Learning
by: Becerra, Alvaro, et al.
Published: (2025)
by: Becerra, Alvaro, et al.
Published: (2025)
Emergent AI Surveillance: Overlearned Person Re-Identification and Its Mitigation in Law Enforcement Context
by: Nguyen, An Thi, et al.
Published: (2025)
by: Nguyen, An Thi, et al.
Published: (2025)
Visual Chronicles: Using Multimodal LLMs to Analyze Massive Collections of Images
by: Deng, Boyang, et al.
Published: (2025)
by: Deng, Boyang, et al.
Published: (2025)
Rainbow Noise: Stress-Testing Multimodal Harmful-Meme Detectors on LGBTQ Content
by: Tong, Ran, et al.
Published: (2025)
by: Tong, Ran, et al.
Published: (2025)
From Content to Audience: A Multimodal Annotation Framework for Broadcast Television Analytics
by: Cupini, Paolo, et al.
Published: (2026)
by: Cupini, Paolo, et al.
Published: (2026)
ECMF: Enhanced Cross-Modal Fusion for Multimodal Emotion Recognition in MER-SEMI Challenge
by: Hu, Juewen, et al.
Published: (2025)
by: Hu, Juewen, et al.
Published: (2025)
Can Multimodal LLMs See Science Instruction? Benchmarking Pedagogical Reasoning in K-12 Classroom Videos
by: Shen, Yixuan, et al.
Published: (2026)
by: Shen, Yixuan, et al.
Published: (2026)
Deepfake-Eval-2024: A Multi-Modal In-the-Wild Benchmark of Deepfakes Circulated in 2024
by: Chandra, Nuria Alina, et al.
Published: (2025)
by: Chandra, Nuria Alina, et al.
Published: (2025)
BuildingView: Constructing Urban Building Exteriors Databases with Street View Imagery and Multimodal Large Language Mode
by: Li, Zongrong, et al.
Published: (2024)
by: Li, Zongrong, et al.
Published: (2024)
RULE: Reliable Multimodal RAG for Factuality in Medical Vision Language Models
by: Xia, Peng, et al.
Published: (2024)
by: Xia, Peng, et al.
Published: (2024)
Intelligent Systems in Neuroimaging: Pioneering AI Techniques for Brain Tumor Detection
by: Islam, Md. Mohaiminul, et al.
Published: (2025)
by: Islam, Md. Mohaiminul, et al.
Published: (2025)
Safer Prompts: Reducing Risks from Memorization in Visual Generative AI
by: Reissinger, Lena, et al.
Published: (2025)
by: Reissinger, Lena, et al.
Published: (2025)
An AI-Enabled Framework Within Reach for Enhancing Healthcare Sustainability and Fairness
by: Huang, Bin, et al.
Published: (2024)
by: Huang, Bin, et al.
Published: (2024)
EDU-CIRCUIT-HW: Evaluating Multimodal Large Language Models on Real-World University-Level STEM Student Handwritten Solutions
by: Sun, Weiyu, et al.
Published: (2026)
by: Sun, Weiyu, et al.
Published: (2026)
Egocentric Co-Pilot: Web-Native Smart-Glasses Agents for Assistive Egocentric AI
by: Yang, Sicheng, et al.
Published: (2026)
by: Yang, Sicheng, et al.
Published: (2026)
Multimodal Learning with Augmentation Techniques for Natural Disaster Assessment
by: Urse, Adrian-Dinu, et al.
Published: (2025)
by: Urse, Adrian-Dinu, et al.
Published: (2025)
AI's Blind Spots: Geographic Knowledge and Diversity Deficit in Generated Urban Scenario
by: Beneduce, Ciro, et al.
Published: (2025)
by: Beneduce, Ciro, et al.
Published: (2025)
Edge-AI for Agriculture: Lightweight Vision Models for Disease Detection in Resource-Limited Settings
by: Joshi, Harsh
Published: (2024)
by: Joshi, Harsh
Published: (2024)
Climatic & Anthropogenic Hazards to the Nasca World Heritage: Application of Remote Sensing, AI, and Flood Modelling
by: Sakai, Masato, et al.
Published: (2024)
by: Sakai, Masato, et al.
Published: (2024)
Silicon Minds versus Human Hearts: The Wisdom of Crowds Beats the Wisdom of AI in Emotion Recognition
by: Akben, Mustafa, et al.
Published: (2025)
by: Akben, Mustafa, et al.
Published: (2025)
Smiling Women Pitching Down: Auditing Representational and Presentational Gender Biases in Image Generative AI
by: Sun, Luhang, et al.
Published: (2023)
by: Sun, Luhang, et al.
Published: (2023)
Two Stage Context Learning with Large Language Models for Multimodal Stance Detection on Climate Change
by: Pangtey, Lata, et al.
Published: (2025)
by: Pangtey, Lata, et al.
Published: (2025)
Assessing Greenspace Attractiveness with ChatGPT, Claude, and Gemini: Do AI Models Reflect Human Perceptions?
by: Malekzadeh, Milad, et al.
Published: (2025)
by: Malekzadeh, Milad, et al.
Published: (2025)
AI-generated data contamination erodes pathological variability and diagnostic reliability
by: He, Hongyu, et al.
Published: (2026)
by: He, Hongyu, et al.
Published: (2026)
When VLMs 'Fix' Students: Identifying and Penalizing Over-Correction in the Evaluation of Multi-line Handwritten Math OCR
by: Seong, Jin, et al.
Published: (2026)
by: Seong, Jin, et al.
Published: (2026)
Similar Items
-
Investigating the diversity and stylization of contemporary user generated visual arts in the complexity entropy plane
by: Kim, Seunghwan, et al.
Published: (2024) -
Social Links vs. Language Barriers: Decoding the Global Spread of Streaming Content
by: Park, Seoyoung, et al.
Published: (2024) -
Aligning AI with Public Values: Deliberation and Decision-Making for Governing Multimodal LLMs in Political Video Analysis
by: Sharma, Tanusree, et al.
Published: (2024) -
SNIFFER: Multimodal Large Language Model for Explainable Out-of-Context Misinformation Detection
by: Qi, Peng, et al.
Published: (2024) -
Direction-Flipped Influence Audits Reveal Hidden Structure in Moral Choices of LLMs
by: Blandfort, Phil, et al.
Published: (2026)