| Main Authors: | Ng, Thye Shan, Han, Caren Soyeon, Holden, Eun-Jung |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2508.13387 |
Similar Items
MUSEKG: A Knowledge Graph Over Museum Collections
by: Li, Jinhao, et al.
Published: (2025)
3M: Multi-modal Multi-task Multi-teacher Learning for Game Event Detection
by: Ng, Thye Shan, et al.
Published: (2024)
UniCast: A Unified Framework for Instance-Conditioned Multimodal Time-Series Forecasting
by: Park, Sehyuk, et al.
Published: (2025)
Multimodal Commonsense Knowledge Distillation for Visual Question Answering
by: Yang, Shuo, et al.
Published: (2024)
FiLoRA: Focus-and-Ignore LoRA for Controllable Feature Reliance
by: Chung, Hyunsuk, et al.
Published: (2026)
Diagnosing Causal Reasoning in Vision-Language Models via Structured Relevance Graphs
by: Pratama, Dhita Putri, et al.
Published: (2026)
SCO-VIST: Social Interaction Commonsense Knowledge-based Visual Storytelling
by: Wang, Eileen, et al.
Published: (2024)
A Training-Free Length Extrapolation Approach for LLMs: Greedy Attention Logit Interpolation (GALI)
by: Li, Yan, et al.
Published: (2025)
VRD-IU: Lessons from Visually Rich Document Intelligence and Understanding
by: Ding, Yihao, et al.
Published: (2025)
ToolTree: Efficient LLM Agent Tool Planning via Dual-Feedback Monte Carlo Tree Search and Bidirectional Pruning
by: Yang, Shuo, et al.
Published: (2026)
Inclusion-of-Thoughts: Mitigating Preference Instability via Purifying the Decision Space
by: Madani, Mohammad Reza Ghasemi, et al.
Published: (2026)
HierCon: Hierarchical Contrastive Attention for Audio Deepfake Detection
by: Liang, Zhili Nicholas, et al.
Published: (2026)
Multimodal Forecasting for Commodity Prices Using Spectrogram-Based and Time Series Representations
by: Park, Soyeon, et al.
Published: (2026)
EvoTool: Self-Evolving Tool-Use Policy Optimization in LLM Agents via Blame-Aware Mutation and Diversity-Aware Selection
by: Yang, Shuo, et al.
Published: (2026)
OntoAligner Meets Knowledge Graph Embedding Aligners
by: Giglou, Hamed Babaei, et al.
Published: (2025)
Physics-based phenomenological characterization of cross-modal bias in multimodal models
by: Kim, Hyeongmo, et al.
Published: (2026)
GeoChemAD: Benchmarking Unsupervised Geochemical Anomaly Detection for Mineral Exploration
by: Ding, Yihao, et al.
Published: (2026)
ArcAligner: Adaptive Recursive Aligner for Compressed Context Embeddings in RAG
by: Li, Jianbo, et al.
Published: (2026)
Impact Measures for Gradual Argumentation Semantics
by: Anaissy, Caren Al, et al.
Published: (2024)
'No' Matters: Out-of-Distribution Detection in Multimodality Long Dialogue
by: Gao, Rena, et al.
Published: (2024)
Aligners: Decoupling LLMs and Alignment
by: Ngweta, Lilian, et al.
Published: (2024)
SemShareKV: Efficient KVCache Sharing for Semantically Similar Prompts via Token-Level LSH Matching
by: Zhao, Xinye, et al.
Published: (2025)
Loop Neural Networks for Parameter Sharing
by: Ng, Kei-Sing, et al.
Published: (2024)
Multimodal Representation Learning Conditioned on Semantic Relations
by: Qiao, Yang, et al.
Published: (2025)
Representation Magnitude has a Liability to Privacy Vulnerability
by: Fang, Xingli, et al.
Published: (2024)
Aligner: Efficient Alignment by Learning to Correct
by: Ji, Jiaming, et al.
Published: (2024)
K-MetBench: A Multi-Dimensional Benchmark for Fine-Grained Evaluation of Expert Reasoning, Locality, and Multimodality in Meteorology
by: Kim, Soyeon, et al.
Published: (2026)
Semantics as a Shield: Label Disguise Defense (LDD) against Prompt Injection in LLM Sentiment Classification
by: Li, Yanxi, et al.
Published: (2025)
MMCOMET: A Large-Scale Multimodal Commonsense Knowledge Graph for Contextual Reasoning
by: Wang, Eileen, et al.
Published: (2026)
Location-Aware Pretraining for Medical Difference Visual Question Answering
by: Musinguzi, Denis, et al.
Published: (2026)
Aligner-Guided Training Paradigm: Advancing Text-to-Speech Models with Aligner Guided Duration
by: Lou, Haowei, et al.
Published: (2024)
MSG-Chart: Multimodal Scene Graph for ChartQA
by: Dai, Yue, et al.
Published: (2024)
Semantic Generative Tuning for Unified Multimodal Models
by: Yu, Songsong, et al.
Published: (2026)
Graph-Based Multimodal Contrastive Learning for Chart Question Answering
by: Dai, Yue, et al.
Published: (2025)
Aggregate Representation Measure for Predictive Model Reusability
by: Sangarya, Vishwesh, et al.
Published: (2024)
ARCHI-TTS: A flow-matching-based Text-to-Speech Model with Self-supervised Semantic Aligner and Accelerated Inference
by: Wu, Chunyat, et al.
Published: (2026)
Multiverse: Language-Conditioned Multi-Game Level Blending via Shared Representation
by: Baek, In-Chang, et al.
Published: (2026)
Sharing State Between Prompts and Programs
by: Cheng, Ellie Y., et al.
Published: (2025)
Probing Multimodal Large Language Models for Global and Local Semantic Representations
by: Tao, Mingxu, et al.
Published: (2024)
In-game Toxic Language Detection: Shared Task and Attention Residuals
by: Jia, Yuanzhe, et al.
Published: (2022)