Saved in:
| Main Authors: | Yu, Seungjun, Lee, Seonho, Kim, Namho, Shin, Jaeyo, Park, Junsung, Ryu, Wonjeong, Jung, Raehyuk, Shim, Hyunjung |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2511.20022 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
SUPER-AD: Semantic Uncertainty-aware Planning for End-to-End Robust Autonomous Driving
by: Ryu, Wonjeong, et al.
Published: (2025)
by: Ryu, Wonjeong, et al.
Published: (2025)
Robust Driving QA through Metadata-Grounded Context and Task-Specific Prompts
by: Yu, Seungjun, et al.
Published: (2025)
by: Yu, Seungjun, et al.
Published: (2025)
Prompt the Unseen: Evaluating Visual-Language Alignment Beyond Supervision
by: Jung, Raehyuk, et al.
Published: (2025)
by: Jung, Raehyuk, et al.
Published: (2025)
DreamCatalyst: Fast and High-Quality 3D Editing via Controlling Editability and Identity Preservation
by: Kim, Jiwook, et al.
Published: (2024)
by: Kim, Jiwook, et al.
Published: (2024)
Grounding Driving VLA via Inverse Kinematics
by: Park, Junsung, et al.
Published: (2026)
by: Park, Junsung, et al.
Published: (2026)
Representation Alignment for Just Image Transformers is not Easier than You Think
by: Shin, Jaeyo, et al.
Published: (2026)
by: Shin, Jaeyo, et al.
Published: (2026)
Real-Time Long Horizon Air Quality Forecasting via Group-Relative Policy Optimization
by: Kang, Inha, et al.
Published: (2025)
by: Kang, Inha, et al.
Published: (2025)
Precision matters: Precision-aware ensemble for weakly supervised semantic segmentation
by: Park, Junsung, et al.
Published: (2024)
by: Park, Junsung, et al.
Published: (2024)
No Thing, Nothing: Highlighting Safety-Critical Classes for Robust LiDAR Semantic Segmentation in Adverse Weather
by: Park, Junsung, et al.
Published: (2025)
by: Park, Junsung, et al.
Published: (2025)
3D-Aware Vision-Language Models Fine-Tuning with Geometric Distillation
by: Lee, Seonho, et al.
Published: (2025)
by: Lee, Seonho, et al.
Published: (2025)
CoT-PL: Chain-of-Thought Pseudo-Labeling for Open-Vocabulary Object Detection
by: Choi, Hojun, et al.
Published: (2025)
by: Choi, Hojun, et al.
Published: (2025)
Rethinking Data Augmentation for Robust LiDAR Semantic Segmentation in Adverse Weather
by: Park, Junsung, et al.
Published: (2024)
by: Park, Junsung, et al.
Published: (2024)
Evaluating Image Hallucination in Text-to-Image Generation with Question-Answering
by: Lim, Youngsun, et al.
Published: (2024)
by: Lim, Youngsun, et al.
Published: (2024)
LingoQA: Visual Question Answering for Autonomous Driving
by: Marcu, Ana-Maria, et al.
Published: (2023)
by: Marcu, Ana-Maria, et al.
Published: (2023)
What "Not" to Detect: Negation-Aware VLMs via Structured Reasoning and Token Merging
by: Kang, Inha, et al.
Published: (2025)
by: Kang, Inha, et al.
Published: (2025)
GRS-QA -- Graph Reasoning-Structured Question Answering Dataset
by: Pahilajani, Anish, et al.
Published: (2024)
by: Pahilajani, Anish, et al.
Published: (2024)
DGQ: Distribution-Aware Group Quantization for Text-to-Image Diffusion Models
by: Ryu, Hyogon, et al.
Published: (2025)
by: Ryu, Hyogon, et al.
Published: (2025)
STRIDE-QA: Visual Question Answering Dataset for Spatiotemporal Reasoning in Urban Driving Scenes
by: Ishihara, Keishi, et al.
Published: (2025)
by: Ishihara, Keishi, et al.
Published: (2025)
Understanding Multi-Granularity for Open-Vocabulary Part Segmentation
by: Choi, Jiho, et al.
Published: (2024)
by: Choi, Jiho, et al.
Published: (2024)
Scribble-Guided Diffusion for Training-free Text-to-Image Generation
by: Lee, Seonho, et al.
Published: (2024)
by: Lee, Seonho, et al.
Published: (2024)
Fine-Grained Image-Text Correspondence with Cost Aggregation for Open-Vocabulary Part Segmentation
by: Choi, Jiho, et al.
Published: (2025)
by: Choi, Jiho, et al.
Published: (2025)
QPaug: Question and Passage Augmentation for Open-Domain Question Answering of LLMs
by: Kim, Minsang, et al.
Published: (2024)
by: Kim, Minsang, et al.
Published: (2024)
Too Many Frames, Not All Useful: Efficient Strategies for Long-Form Video QA
by: Park, Jongwoo, et al.
Published: (2024)
by: Park, Jongwoo, et al.
Published: (2024)
Memory-Efficient Fine-Tuning for Quantized Diffusion Model
by: Ryu, Hyogon, et al.
Published: (2024)
by: Ryu, Hyogon, et al.
Published: (2024)
ROAD-Waymo: Action Awareness at Scale for Autonomous Driving
by: Khan, Salman, et al.
Published: (2024)
by: Khan, Salman, et al.
Published: (2024)
PolQA: Polish Question Answering Dataset
by: Rybak, Piotr, et al.
Published: (2022)
by: Rybak, Piotr, et al.
Published: (2022)
Sampling Bag of Views for Open-Vocabulary Object Detection
by: Choi, Hojun, et al.
Published: (2024)
by: Choi, Hojun, et al.
Published: (2024)
Label-Augmented Dataset Distillation
by: Kang, Seoungyoon, et al.
Published: (2024)
by: Kang, Seoungyoon, et al.
Published: (2024)
FoQA: A Faroese Question-Answering Dataset
by: Simonsen, Annika, et al.
Published: (2025)
by: Simonsen, Annika, et al.
Published: (2025)
HiKEY: Hierarchical Multimodal Retrieval for Open-Domain Document Question Answering
by: Shin, Joongmin, et al.
Published: (2026)
by: Shin, Joongmin, et al.
Published: (2026)
NuScenes-QA: A Multi-modal Visual Question Answering Benchmark for Autonomous Driving Scenario
by: Qian, Tianwen, et al.
Published: (2023)
by: Qian, Tianwen, et al.
Published: (2023)
AdvisorQA: Towards Helpful and Harmless Advice-seeking Question Answering with Collective Intelligence
by: Kim, Minbeom, et al.
Published: (2024)
by: Kim, Minbeom, et al.
Published: (2024)
SyllabusQA: A Course Logistics Question Answering Dataset
by: Fernandez, Nigel, et al.
Published: (2024)
by: Fernandez, Nigel, et al.
Published: (2024)
ArabicaQA: A Comprehensive Dataset for Arabic Question Answering
by: Abdallah, Abdelrahman, et al.
Published: (2024)
by: Abdallah, Abdelrahman, et al.
Published: (2024)
Weakly Supervised Semantic Segmentation for Driving Scenes
by: Kim, Dongseob, et al.
Published: (2023)
by: Kim, Dongseob, et al.
Published: (2023)
Upright adjustment with graph convolutional networks
by: Jung, Raehyuk, et al.
Published: (2024)
by: Jung, Raehyuk, et al.
Published: (2024)
MagiCapture: High-Resolution Multi-Concept Portrait Customization
by: Hyung, Junha, et al.
Published: (2023)
by: Hyung, Junha, et al.
Published: (2023)
Risk-Based Filtering of Valuable Driving Situations in the Waymo Open Motion Dataset
by: Puphal, Tim, et al.
Published: (2025)
by: Puphal, Tim, et al.
Published: (2025)
Jamendo-QA: A Large-Scale Music Question Answering Dataset
by: Koh, Junyoung, et al.
Published: (2025)
by: Koh, Junyoung, et al.
Published: (2025)
KET-QA: A Dataset for Knowledge Enhanced Table Question Answering
by: Hu, Mengkang, et al.
Published: (2024)
by: Hu, Mengkang, et al.
Published: (2024)
Similar Items
-
SUPER-AD: Semantic Uncertainty-aware Planning for End-to-End Robust Autonomous Driving
by: Ryu, Wonjeong, et al.
Published: (2025) -
Robust Driving QA through Metadata-Grounded Context and Task-Specific Prompts
by: Yu, Seungjun, et al.
Published: (2025) -
Prompt the Unseen: Evaluating Visual-Language Alignment Beyond Supervision
by: Jung, Raehyuk, et al.
Published: (2025) -
DreamCatalyst: Fast and High-Quality 3D Editing via Controlling Editability and Identity Preservation
by: Kim, Jiwook, et al.
Published: (2024) -
Grounding Driving VLA via Inverse Kinematics
by: Park, Junsung, et al.
Published: (2026)