:: Library Catalog

Bibliographic Details
Main Authors:	Yu, Seungjun; Lee, Seonho; Kim, Namho; Shin, Jaeyo; Park, Junsung; Ryu, Wonjeong; Jung, Raehyuk; Shim, Hyunjung
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition; Artificial Intelligence
Online Access:	https://arxiv.org/abs/2511.20022

Similar Items

SUPER-AD: Semantic Uncertainty-aware Planning for End-to-End Robust Autonomous Driving
by: Ryu, Wonjeong, et al.
Published: (2025)

Robust Driving QA through Metadata-Grounded Context and Task-Specific Prompts
by: Yu, Seungjun, et al.
Published: (2025)

Prompt the Unseen: Evaluating Visual-Language Alignment Beyond Supervision
by: Jung, Raehyuk, et al.
Published: (2025)

DreamCatalyst: Fast and High-Quality 3D Editing via Controlling Editability and Identity Preservation
by: Kim, Jiwook, et al.
Published: (2024)

Grounding Driving VLA via Inverse Kinematics
by: Park, Junsung, et al.
Published: (2026)

Representation Alignment for Just Image Transformers is not Easier than You Think
by: Shin, Jaeyo, et al.
Published: (2026)

Real-Time Long Horizon Air Quality Forecasting via Group-Relative Policy Optimization
by: Kang, Inha, et al.
Published: (2025)

Precision matters: Precision-aware ensemble for weakly supervised semantic segmentation
by: Park, Junsung, et al.
Published: (2024)

No Thing, Nothing: Highlighting Safety-Critical Classes for Robust LiDAR Semantic Segmentation in Adverse Weather
by: Park, Junsung, et al.
Published: (2025)

3D-Aware Vision-Language Models Fine-Tuning with Geometric Distillation
by: Lee, Seonho, et al.
Published: (2025)

CoT-PL: Chain-of-Thought Pseudo-Labeling for Open-Vocabulary Object Detection
by: Choi, Hojun, et al.
Published: (2025)

Rethinking Data Augmentation for Robust LiDAR Semantic Segmentation in Adverse Weather
by: Park, Junsung, et al.
Published: (2024)

Evaluating Image Hallucination in Text-to-Image Generation with Question-Answering
by: Lim, Youngsun, et al.
Published: (2024)

LingoQA: Visual Question Answering for Autonomous Driving
by: Marcu, Ana-Maria, et al.
Published: (2023)

What "Not" to Detect: Negation-Aware VLMs via Structured Reasoning and Token Merging
by: Kang, Inha, et al.
Published: (2025)

GRS-QA -- Graph Reasoning-Structured Question Answering Dataset
by: Pahilajani, Anish, et al.
Published: (2024)

DGQ: Distribution-Aware Group Quantization for Text-to-Image Diffusion Models
by: Ryu, Hyogon, et al.
Published: (2025)

STRIDE-QA: Visual Question Answering Dataset for Spatiotemporal Reasoning in Urban Driving Scenes
by: Ishihara, Keishi, et al.
Published: (2025)

Understanding Multi-Granularity for Open-Vocabulary Part Segmentation
by: Choi, Jiho, et al.
Published: (2024)

Scribble-Guided Diffusion for Training-free Text-to-Image Generation
by: Lee, Seonho, et al.
Published: (2024)

Fine-Grained Image-Text Correspondence with Cost Aggregation for Open-Vocabulary Part Segmentation
by: Choi, Jiho, et al.
Published: (2025)

QPaug: Question and Passage Augmentation for Open-Domain Question Answering of LLMs
by: Kim, Minsang, et al.
Published: (2024)

Too Many Frames, Not All Useful: Efficient Strategies for Long-Form Video QA
by: Park, Jongwoo, et al.
Published: (2024)

Memory-Efficient Fine-Tuning for Quantized Diffusion Model
by: Ryu, Hyogon, et al.
Published: (2024)

ROAD-Waymo: Action Awareness at Scale for Autonomous Driving
by: Khan, Salman, et al.
Published: (2024)

PolQA: Polish Question Answering Dataset
by: Rybak, Piotr, et al.
Published: (2022)

Sampling Bag of Views for Open-Vocabulary Object Detection
by: Choi, Hojun, et al.
Published: (2024)

Label-Augmented Dataset Distillation
by: Kang, Seoungyoon, et al.
Published: (2024)

FoQA: A Faroese Question-Answering Dataset
by: Simonsen, Annika, et al.
Published: (2025)

HiKEY: Hierarchical Multimodal Retrieval for Open-Domain Document Question Answering
by: Shin, Joongmin, et al.
Published: (2026)

NuScenes-QA: A Multi-modal Visual Question Answering Benchmark for Autonomous Driving Scenario
by: Qian, Tianwen, et al.
Published: (2023)

AdvisorQA: Towards Helpful and Harmless Advice-seeking Question Answering with Collective Intelligence
by: Kim, Minbeom, et al.
Published: (2024)

SyllabusQA: A Course Logistics Question Answering Dataset
by: Fernandez, Nigel, et al.
Published: (2024)

ArabicaQA: A Comprehensive Dataset for Arabic Question Answering
by: Abdallah, Abdelrahman, et al.
Published: (2024)

Weakly Supervised Semantic Segmentation for Driving Scenes
by: Kim, Dongseob, et al.
Published: (2023)

Upright adjustment with graph convolutional networks
by: Jung, Raehyuk, et al.
Published: (2024)

MagiCapture: High-Resolution Multi-Concept Portrait Customization
by: Hyung, Junha, et al.
Published: (2023)

Risk-Based Filtering of Valuable Driving Situations in the Waymo Open Motion Dataset
by: Puphal, Tim, et al.
Published: (2025)

Jamendo-QA: A Large-Scale Music Question Answering Dataset
by: Koh, Junyoung, et al.
Published: (2025)

KET-QA: A Dataset for Knowledge Enhanced Table Question Answering
by: Hu, Mengkang, et al.
Published: (2024)