Saved in:
| Main Authors: | Kim, Jiyeon, Lee, Hyunji, Zhou, Dylan, Park, Sue Hyun, Yoon, Seunghyun, Bui, Trung, Dernoncourt, Franck, Cha, Sungmin, Seo, Minjoon |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2603.07392 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
CORG: Generating Answers from Complex, Interrelated Contexts
by: Lee, Hyunji, et al.
Published: (2025)
by: Lee, Hyunji, et al.
Published: (2025)
How Does Vision-Language Adaptation Impact the Safety of Vision Language Models?
by: Lee, Seongyun, et al.
Published: (2024)
by: Lee, Seongyun, et al.
Published: (2024)
Instruction Tuning with and without Context: Behavioral Shifts and Downstream Impact
by: Lee, Hyunji, et al.
Published: (2025)
by: Lee, Hyunji, et al.
Published: (2025)
Towards Enhancing Coherence in Extractive Summarization: Dataset and Experiments with LLMs
by: Parmar, Mihir, et al.
Published: (2024)
by: Parmar, Mihir, et al.
Published: (2024)
StreamGaze: Gaze-Guided Temporal Reasoning and Proactive Understanding in Streaming Videos
by: Lee, Daeun, et al.
Published: (2025)
by: Lee, Daeun, et al.
Published: (2025)
Hierarchical Deconstruction of LLM Reasoning: A Graph-Based Framework for Analyzing Knowledge Utilization
by: Ko, Miyoung, et al.
Published: (2024)
by: Ko, Miyoung, et al.
Published: (2024)
Scaling Up Video Summarization Pretraining with Large Language Models
by: Argaw, Dawit Mureja, et al.
Published: (2024)
by: Argaw, Dawit Mureja, et al.
Published: (2024)
Fine-tuning CLIP Text Encoders with Two-step Paraphrasing
by: Kim, Hyunjae, et al.
Published: (2024)
by: Kim, Hyunjae, et al.
Published: (2024)
SlimLM: An Efficient Small Language Model for On-Device Document Assistance
by: Pham, Thang M., et al.
Published: (2024)
by: Pham, Thang M., et al.
Published: (2024)
ViT-AdaLA: Adapting Vision Transformers with Linear Attention
by: Li, Yifan, et al.
Published: (2026)
by: Li, Yifan, et al.
Published: (2026)
NoLiMa: Long-Context Evaluation Beyond Literal Matching
by: Modarressi, Ali, et al.
Published: (2025)
by: Modarressi, Ali, et al.
Published: (2025)
MS4UI: A Dataset for Multi-modal Summarization of User Interface Instructional Videos
by: Zang, Yuan, et al.
Published: (2025)
by: Zang, Yuan, et al.
Published: (2025)
Retrieval Augmented Generation for Domain-specific Question Answering
by: Sharma, Sanat, et al.
Published: (2024)
by: Sharma, Sanat, et al.
Published: (2024)
Knowledge Entropy Decay during Language Model Pretraining Hinders New Knowledge Acquisition
by: Kim, Jiyeon, et al.
Published: (2024)
by: Kim, Jiyeon, et al.
Published: (2024)
Volcano: Mitigating Multimodal Hallucination through Self-Feedback Guided Revision
by: Lee, Seongyun, et al.
Published: (2023)
by: Lee, Seongyun, et al.
Published: (2023)
Aligning to Thousands of Preferences via System Message Generalization
by: Lee, Seongyun, et al.
Published: (2024)
by: Lee, Seongyun, et al.
Published: (2024)
Understanding and Enhancing Mamba-Transformer Hybrids for Memory Recall and Language Modeling
by: Lee, Hyunji, et al.
Published: (2025)
by: Lee, Hyunji, et al.
Published: (2025)
KTRL+F: Knowledge-Augmented In-Document Search
by: Oh, Hanseok, et al.
Published: (2023)
by: Oh, Hanseok, et al.
Published: (2023)
Identifying Speakers in Dialogue Transcripts: A Text-based Approach Using Pretrained Language Models
by: Nguyen, Minh, et al.
Published: (2024)
by: Nguyen, Minh, et al.
Published: (2024)
Differential Information Distribution: A Bayesian Perspective on Direct Preference Optimization
by: Won, Yunjae, et al.
Published: (2025)
by: Won, Yunjae, et al.
Published: (2025)
Prometheus-Vision: Vision-Language Model as a Judge for Fine-Grained Evaluation
by: Lee, Seongyun, et al.
Published: (2024)
by: Lee, Seongyun, et al.
Published: (2024)
Exploring the Practicality of Generative Retrieval on Dynamic Corpora
by: Kim, Chaeeun, et al.
Published: (2023)
by: Kim, Chaeeun, et al.
Published: (2023)
Toward Robust Hyper-Detailed Image Captioning: A Multiagent Approach and Dual Evaluation Metrics for Factuality and Coverage
by: Lee, Saehyung, et al.
Published: (2024)
by: Lee, Saehyung, et al.
Published: (2024)
mSCoRe: a $M$ultilingual and Scalable Benchmark for $S$kill-based $Co$mmonsense $Re$asoning
by: Ngo, Nghia Trung, et al.
Published: (2025)
by: Ngo, Nghia Trung, et al.
Published: (2025)
INSTRUCTIR: A Benchmark for Instruction Following of Information Retrieval Models
by: Oh, Hanseok, et al.
Published: (2024)
by: Oh, Hanseok, et al.
Published: (2024)
Agentic Planning with Reasoning for Image Styling via Offline RL
by: Mukherjee, Subhojyoti, et al.
Published: (2026)
by: Mukherjee, Subhojyoti, et al.
Published: (2026)
Hyperparameters in Continual Learning: A Reality Check
by: Cha, Sungmin, et al.
Published: (2024)
by: Cha, Sungmin, et al.
Published: (2024)
RouterRetriever: Routing over a Mixture of Expert Embedding Models
by: Lee, Hyunji, et al.
Published: (2024)
by: Lee, Hyunji, et al.
Published: (2024)
FIFA: Unified Faithfulness Evaluation Framework for Text-to-Video and Video-to-Text Generation
by: Jing, Liqiang, et al.
Published: (2025)
by: Jing, Liqiang, et al.
Published: (2025)
Why Knowledge Distillation Works in Generative Models: A Minimal Working Explanation
by: Cha, Sungmin, et al.
Published: (2025)
by: Cha, Sungmin, et al.
Published: (2025)
Early Decisions Matter: Proximity Bias and Initial Trajectory Shaping in Non-Autoregressive Diffusion Language Models
by: Kim, Jiyeon, et al.
Published: (2026)
by: Kim, Jiyeon, et al.
Published: (2026)
Cross-Modal Watermarking for Authentic Audio Recovery and Tamper Localization in Synthesized Audiovisual Forgeries
by: Kim, Minyoung, et al.
Published: (2025)
by: Kim, Minyoung, et al.
Published: (2025)
Do Modern Video-LLMs Need to Listen? A Benchmark Audit and Scalable Remedy
by: Kim, Geewook, et al.
Published: (2025)
by: Kim, Geewook, et al.
Published: (2025)
DynaSaur: Large Language Agents Beyond Predefined Actions
by: Nguyen, Dang, et al.
Published: (2024)
by: Nguyen, Dang, et al.
Published: (2024)
Improving Probability-based Prompt Selection Through Unified Evaluation and Analysis
by: Yang, Sohee, et al.
Published: (2023)
by: Yang, Sohee, et al.
Published: (2023)
Hypothetical Documents or Knowledge Leakage? Rethinking LLM-based Query Expansion
by: Yoon, Yejun, et al.
Published: (2025)
by: Yoon, Yejun, et al.
Published: (2025)
Regularizing with Pseudo-Negatives for Continual Self-Supervised Learning
by: Cha, Sungmin, et al.
Published: (2023)
by: Cha, Sungmin, et al.
Published: (2023)
KaPQA: Knowledge-Augmented Product Question-Answering
by: Eppalapally, Swetha, et al.
Published: (2024)
by: Eppalapally, Swetha, et al.
Published: (2024)
Domain-specific Question Answering with Hybrid Search
by: Sultania, Dewang, et al.
Published: (2024)
by: Sultania, Dewang, et al.
Published: (2024)
Steering MoE LLMs via Expert (De)Activation
by: Fayyaz, Mohsen, et al.
Published: (2025)
by: Fayyaz, Mohsen, et al.
Published: (2025)
Similar Items
-
CORG: Generating Answers from Complex, Interrelated Contexts
by: Lee, Hyunji, et al.
Published: (2025) -
How Does Vision-Language Adaptation Impact the Safety of Vision Language Models?
by: Lee, Seongyun, et al.
Published: (2024) -
Instruction Tuning with and without Context: Behavioral Shifts and Downstream Impact
by: Lee, Hyunji, et al.
Published: (2025) -
Towards Enhancing Coherence in Extractive Summarization: Dataset and Experiments with LLMs
by: Parmar, Mihir, et al.
Published: (2024) -
StreamGaze: Gaze-Guided Temporal Reasoning and Proactive Understanding in Streaming Videos
by: Lee, Daeun, et al.
Published: (2025)