:: Library Catalog

Bibliographic Details
Main Authors:	Ng, Thye Shan; Han, Caren Soyeon; Holden, Eun-Jung
Format:	Preprint
Published:	2025
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2508.13387

Similar Items

MUSEKG: A Knowledge Graph Over Museum Collections
by: Li, Jinhao, et al.
Published: (2025)

3M: Multi-modal Multi-task Multi-teacher Learning for Game Event Detection
by: Ng, Thye Shan, et al.
Published: (2024)

UniCast: A Unified Framework for Instance-Conditioned Multimodal Time-Series Forecasting
by: Park, Sehyuk, et al.
Published: (2025)

Multimodal Commonsense Knowledge Distillation for Visual Question Answering
by: Yang, Shuo, et al.
Published: (2024)

FiLoRA: Focus-and-Ignore LoRA for Controllable Feature Reliance
by: Chung, Hyunsuk, et al.
Published: (2026)

Diagnosing Causal Reasoning in Vision-Language Models via Structured Relevance Graphs
by: Pratama, Dhita Putri, et al.
Published: (2026)

SCO-VIST: Social Interaction Commonsense Knowledge-based Visual Storytelling
by: Wang, Eileen, et al.
Published: (2024)

A Training-Free Length Extrapolation Approach for LLMs: Greedy Attention Logit Interpolation (GALI)
by: Li, Yan, et al.
Published: (2025)

VRD-IU: Lessons from Visually Rich Document Intelligence and Understanding
by: Ding, Yihao, et al.
Published: (2025)

ToolTree: Efficient LLM Agent Tool Planning via Dual-Feedback Monte Carlo Tree Search and Bidirectional Pruning
by: Yang, Shuo, et al.
Published: (2026)

Inclusion-of-Thoughts: Mitigating Preference Instability via Purifying the Decision Space
by: Madani, Mohammad Reza Ghasemi, et al.
Published: (2026)

HierCon: Hierarchical Contrastive Attention for Audio Deepfake Detection
by: Liang, Zhili Nicholas, et al.
Published: (2026)

Multimodal Forecasting for Commodity Prices Using Spectrogram-Based and Time Series Representations
by: Park, Soyeon, et al.
Published: (2026)

EvoTool: Self-Evolving Tool-Use Policy Optimization in LLM Agents via Blame-Aware Mutation and Diversity-Aware Selection
by: Yang, Shuo, et al.
Published: (2026)

OntoAligner Meets Knowledge Graph Embedding Aligners
by: Giglou, Hamed Babaei, et al.
Published: (2025)

Physics-based phenomenological characterization of cross-modal bias in multimodal models
by: Kim, Hyeongmo, et al.
Published: (2026)

GeoChemAD: Benchmarking Unsupervised Geochemical Anomaly Detection for Mineral Exploration
by: Ding, Yihao, et al.
Published: (2026)

ArcAligner: Adaptive Recursive Aligner for Compressed Context Embeddings in RAG
by: Li, Jianbo, et al.
Published: (2026)

Impact Measures for Gradual Argumentation Semantics
by: Anaissy, Caren Al, et al.
Published: (2024)

'No' Matters: Out-of-Distribution Detection in Multimodality Long Dialogue
by: Gao, Rena, et al.
Published: (2024)

Aligners: Decoupling LLMs and Alignment
by: Ngweta, Lilian, et al.
Published: (2024)

SemShareKV: Efficient KVCache Sharing for Semantically Similar Prompts via Token-Level LSH Matching
by: Zhao, Xinye, et al.
Published: (2025)

Loop Neural Networks for Parameter Sharing
by: Ng, Kei-Sing, et al.
Published: (2024)

Multimodal Representation Learning Conditioned on Semantic Relations
by: Qiao, Yang, et al.
Published: (2025)

Representation Magnitude has a Liability to Privacy Vulnerability
by: Fang, Xingli, et al.
Published: (2024)

Aligner: Efficient Alignment by Learning to Correct
by: Ji, Jiaming, et al.
Published: (2024)

K-MetBench: A Multi-Dimensional Benchmark for Fine-Grained Evaluation of Expert Reasoning, Locality, and Multimodality in Meteorology
by: Kim, Soyeon, et al.
Published: (2026)

Semantics as a Shield: Label Disguise Defense (LDD) against Prompt Injection in LLM Sentiment Classification
by: Li, Yanxi, et al.
Published: (2025)

MMCOMET: A Large-Scale Multimodal Commonsense Knowledge Graph for Contextual Reasoning
by: Wang, Eileen, et al.
Published: (2026)

Location-Aware Pretraining for Medical Difference Visual Question Answering
by: Musinguzi, Denis, et al.
Published: (2026)

Aligner-Guided Training Paradigm: Advancing Text-to-Speech Models with Aligner Guided Duration
by: Lou, Haowei, et al.
Published: (2024)

MSG-Chart: Multimodal Scene Graph for ChartQA
by: Dai, Yue, et al.
Published: (2024)

Semantic Generative Tuning for Unified Multimodal Models
by: Yu, Songsong, et al.
Published: (2026)

Graph-Based Multimodal Contrastive Learning for Chart Question Answering
by: Dai, Yue, et al.
Published: (2025)

Aggregate Representation Measure for Predictive Model Reusability
by: Sangarya, Vishwesh, et al.
Published: (2024)

ARCHI-TTS: A flow-matching-based Text-to-Speech Model with Self-supervised Semantic Aligner and Accelerated Inference
by: Wu, Chunyat, et al.
Published: (2026)

Multiverse: Language-Conditioned Multi-Game Level Blending via Shared Representation
by: Baek, In-Chang, et al.
Published: (2026)

Sharing State Between Prompts and Programs
by: Cheng, Ellie Y., et al.
Published: (2025)

Probing Multimodal Large Language Models for Global and Local Semantic Representations
by: Tao, Mingxu, et al.
Published: (2024)

In-game Toxic Language Detection: Shared Task and Attention Residuals
by: Jia, Yuanzhe, et al.
Published: (2022)