Saved in:
| Main Authors: | Nguyen, Hoang C., Lee, Haeil, Kim, Junmo |
|---|---|
| Format: | Preprint |
| Published: |
2023
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2311.11378 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
The Effects of Mixed Sample Data Augmentation are Class Dependent
by: Lee, Haeil, et al.
Published: (2023)
by: Lee, Haeil, et al.
Published: (2023)
Beta Sampling is All You Need: Efficient Image Generation Strategy for Diffusion Models using Stepwise Spectral Analysis
by: Lee, Haeil, et al.
Published: (2024)
by: Lee, Haeil, et al.
Published: (2024)
Test-Time Mixup Augmentation for Data and Class-Specific Uncertainty Estimation in Deep Learning Image Classification
by: Lee, Hansang, et al.
Published: (2022)
by: Lee, Hansang, et al.
Published: (2022)
Do Vision Models Encode Object-Level Semantic Relatedness? A Cognitive Psychology-Inspired Benchmark
by: Lee, Hansang, et al.
Published: (2017)
by: Lee, Hansang, et al.
Published: (2017)
Noisy Label Classification using Label Noise Selection with Test-Time Augmentation Cross-Entropy and NoiseMix Learning
by: Lee, Hansang, et al.
Published: (2022)
by: Lee, Hansang, et al.
Published: (2022)
IWP: Token Pruning as Implicit Weight Pruning in Large Vision Language Models
by: Lee, Dong-Jae, et al.
Published: (2026)
by: Lee, Dong-Jae, et al.
Published: (2026)
Frequency-Aware Token Reduction for Efficient Vision Transformer
by: Lee, Dong-Jae, et al.
Published: (2025)
by: Lee, Dong-Jae, et al.
Published: (2025)
Cross-Axis Feature Fusion with Joint-Wise Motion Difference Prediction for Text-Based 3D Human Motion Editing
by: Han, Gyojin, et al.
Published: (2026)
by: Han, Gyojin, et al.
Published: (2026)
VLM's Eye Examination: Instruct and Inspect Visual Competency of Vision Language Models
by: Hyeon-Woo, Nam, et al.
Published: (2024)
by: Hyeon-Woo, Nam, et al.
Published: (2024)
AH-OCDA: Amplitude-based Curriculum Learning and Hopfield Segmentation Model for Open Compound Domain Adaptation
by: Choi, Jaehyun, et al.
Published: (2024)
by: Choi, Jaehyun, et al.
Published: (2024)
Pygmalion Effect in Vision: Image-to-Clay Translation for Reflective Geometry Reconstruction
by: Lee, Gayoung, et al.
Published: (2025)
by: Lee, Gayoung, et al.
Published: (2025)
Learning Question-Aware Keyframe Selection with Synthetic Supervision for Video Question Answering
by: Kwon, Minchan, et al.
Published: (2026)
by: Kwon, Minchan, et al.
Published: (2026)
Self-supervised Transformation Learning for Equivariant Representations
by: Yu, Jaemyung, et al.
Published: (2025)
by: Yu, Jaemyung, et al.
Published: (2025)
DMQ: Dissecting Outliers of Diffusion Models for Post-Training Quantization
by: Lee, Dongyeun, et al.
Published: (2025)
by: Lee, Dongyeun, et al.
Published: (2025)
MATE: Meet At The Embedding -- Connecting Images with Long Texts
by: Jang, Young Kyun, et al.
Published: (2024)
by: Jang, Young Kyun, et al.
Published: (2024)
Video Diffusion Models Excel at Tracking Similar-Looking Objects Without Supervision
by: Zhang, Chenshuang, et al.
Published: (2025)
by: Zhang, Chenshuang, et al.
Published: (2025)
SFLD: Reducing the content bias for AI-generated Image Detection
by: Gye, Seoyeon, et al.
Published: (2025)
by: Gye, Seoyeon, et al.
Published: (2025)
IMSE: Intrinsic Mixture of Spectral Experts Fine-tuning for Test-Time Adaptation
by: Baek, Sunghyun, et al.
Published: (2026)
by: Baek, Sunghyun, et al.
Published: (2026)
ARGOS: Who, Where, and When in Agentic Multi-Camera Person Search
by: Kim, Myungchul, et al.
Published: (2026)
by: Kim, Myungchul, et al.
Published: (2026)
Text-to-image Diffusion Models in Generative AI: A Survey
by: Zhang, Chenshuang, et al.
Published: (2023)
by: Zhang, Chenshuang, et al.
Published: (2023)
DAM: Domain-Aware Module for Multi-Domain Dataset Condensation
by: Choi, Jaehyun, et al.
Published: (2025)
by: Choi, Jaehyun, et al.
Published: (2025)
Refining Visual Artifacts in Diffusion Models via Explainable AI-based Flaw Activation Maps
by: Lee, Seoyeon, et al.
Published: (2025)
by: Lee, Seoyeon, et al.
Published: (2025)
Preserving Multi-Modal Capabilities of Pre-trained VLMs for Improving Vision-Linguistic Compositionality
by: Oh, Youngtaek, et al.
Published: (2024)
by: Oh, Youngtaek, et al.
Published: (2024)
Exploring the Spectrum of Visio-Linguistic Compositionality and Recognition
by: Oh, Youngtaek, et al.
Published: (2024)
by: Oh, Youngtaek, et al.
Published: (2024)
Transferring Visual Explainability of Self-Explaining Models to Prediction-Only Models without Additional Training
by: Yoshikawa, Yuya, et al.
Published: (2025)
by: Yoshikawa, Yuya, et al.
Published: (2025)
Rethinking Top Probability from Multi-view for Distracted Driver Behaviour Localization
by: Nguyen, Quang Vinh, et al.
Published: (2024)
by: Nguyen, Quang Vinh, et al.
Published: (2024)
Enhancing the Fairness and Performance of Edge Cameras with Explainable AI
by: Nguyen, Truong Thanh Hung, et al.
Published: (2024)
by: Nguyen, Truong Thanh Hung, et al.
Published: (2024)
Adaptive Knowledge Distillation for Classification of Hand Images using Explainable Vision Transformers
by: Nguyen, Thanh Thi, et al.
Published: (2024)
by: Nguyen, Thanh Thi, et al.
Published: (2024)
InfoDisent: Explainability of Image Classification Models by Information Disentanglement
by: Struski, Łukasz, et al.
Published: (2024)
by: Struski, Łukasz, et al.
Published: (2024)
ImageNet-D: Benchmarking Neural Network Robustness on Diffusion Synthetic Object
by: Zhang, Chenshuang, et al.
Published: (2024)
by: Zhang, Chenshuang, et al.
Published: (2024)
PRISM: Video Dataset Condensation with Progressive Refinement and Insertion for Sparse Motion
by: Choi, Jaehyun, et al.
Published: (2025)
by: Choi, Jaehyun, et al.
Published: (2025)
LangXAI: Integrating Large Vision Models for Generating Textual Explanations to Enhance Explainability in Visual Perception Tasks
by: Nguyen, Truong Thanh Hung, et al.
Published: (2024)
by: Nguyen, Truong Thanh Hung, et al.
Published: (2024)
Brain Stroke Detection and Classification Using CT Imaging with Transformer Models and Explainable AI
by: Qari, Shomukh, et al.
Published: (2025)
by: Qari, Shomukh, et al.
Published: (2025)
Explainable Adversarial-Robust Vision-Language-Action Model for Robotic Manipulation
by: Kim, Ju-Young, et al.
Published: (2025)
by: Kim, Ju-Young, et al.
Published: (2025)
IPTQ-ViT: Post-Training Quantization of Non-linear Functions for Integer-only Vision Transformers
by: Kim, Gihwan, et al.
Published: (2025)
by: Kim, Gihwan, et al.
Published: (2025)
SwiftBrush v2: Make Your One-step Diffusion Model Better Than Its Teacher
by: Dao, Trung, et al.
Published: (2024)
by: Dao, Trung, et al.
Published: (2024)
Mixed Non-linear Quantization for Vision Transformers
by: Kim, Gihwan, et al.
Published: (2024)
by: Kim, Gihwan, et al.
Published: (2024)
Knowledge-Guided Textual Reasoning for Explainable Video Anomaly Detection via LLMs
by: Lee, Hari
Published: (2025)
by: Lee, Hari
Published: (2025)
ConPro: Learning Severity Representation for Medical Images using Contrastive Learning and Preference Optimization
by: Nguyen, Hong, et al.
Published: (2024)
by: Nguyen, Hong, et al.
Published: (2024)
Explainable Parkinsons Disease Gait Recognition Using Multimodal RGB-D Fusion and Large Language Models
by: Alnaasan, Manar, et al.
Published: (2025)
by: Alnaasan, Manar, et al.
Published: (2025)
Similar Items
-
The Effects of Mixed Sample Data Augmentation are Class Dependent
by: Lee, Haeil, et al.
Published: (2023) -
Beta Sampling is All You Need: Efficient Image Generation Strategy for Diffusion Models using Stepwise Spectral Analysis
by: Lee, Haeil, et al.
Published: (2024) -
Test-Time Mixup Augmentation for Data and Class-Specific Uncertainty Estimation in Deep Learning Image Classification
by: Lee, Hansang, et al.
Published: (2022) -
Do Vision Models Encode Object-Level Semantic Relatedness? A Cognitive Psychology-Inspired Benchmark
by: Lee, Hansang, et al.
Published: (2017) -
Noisy Label Classification using Label Noise Selection with Test-Time Augmentation Cross-Entropy and NoiseMix Learning
by: Lee, Hansang, et al.
Published: (2022)