:: Library Catalog

Bibliographic Details
Main Authors:	Moon, Jiyong; Lee, Junseok; Lee, Yunju; Park, Seongsik
Format:	Preprint
Published:	2023
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2308.02161

Similar Items

PlaceFormer: Transformer-based Visual Place Recognition using Multi-Scale Patch Selection and Fusion
by: Kannan, Shyam Sundar, et al.
Published: (2024)

ABM-LoRA: Activation Boundary Matching for Fast Convergence in Low-Rank Adaptation
by: Lee, Dongha, et al.
Published: (2025)

M2SFormer: Multi-Spectral and Multi-Scale Attention with Edge-Aware Difficulty Guidance for Image Forgery Localization
by: Nam, Ju-Hyeon, et al.
Published: (2025)

Lightweight Wasserstein Audio-Visual Model for Unified Speech Enhancement and Separation
by: Park, Jisoo, et al.
Published: (2025)

PCBEAR: Pose Concept Bottleneck for Explainable Action Recognition
by: Lee, Jongseo, et al.
Published: (2025)

Free-Grained Hierarchical Visual Recognition
by: Park, Seulki, et al.
Published: (2025)

Emotion Recognition Using Transformers with Masked Learning
by: Min, Seongjae, et al.
Published: (2024)

Advancing Fine-Grained Visual Understanding with Multi-Scale Alignment in Multi-Modal Models
by: Wang, Wei, et al.
Published: (2024)

Disentangled Concepts Speak Louder Than Words: Explainable Video Action Recognition
by: Lee, Jongseo, et al.
Published: (2025)

Patch-Wise Self-Supervised Visual Representation Learning: A Fine-Grained Approach
by: Javidani, Ali, et al.
Published: (2023)

Localization Meets Uncertainty: Uncertainty-Aware Multi-Modal Localization
by: Won, Hye-Min, et al.
Published: (2025)

Mosaic: Compositional Multi-Concept Erasure via Vector Field Blending
by: Ko, Junseok, et al.
Published: (2026)

MATANet: A Multi-context Attention and Taxonomy-Aware Network for Fine-Grained Underwater Recognition of Marine Species
by: Lee, Donghwan, et al.
Published: (2026)

CD-FKD: Cross-Domain Feature Knowledge Distillation for Robust Single-Domain Generalization in Object Detection
by: Lee, Junseok, et al.
Published: (2026)

UniFormer: Unifying Convolution and Self-attention for Visual Recognition
by: Li, Kunchang, et al.
Published: (2022)

XR-VLM: Cross-Relationship Modeling with Multi-part Prompts and Visual Features for Fine-Grained Recognition
by: Wang, Chuanming, et al.
Published: (2025)

SafeDrive: Fine-Grained Safety Reasoning for End-to-End Driving in a Sparse World
by: Kim, Jungho, et al.
Published: (2026)

Global Context-aware Representation Learning for Spatially Resolved Transcriptomics
by: Oh, Yunhak, et al.
Published: (2025)

SeaFormer++: Squeeze-enhanced Axial Transformer for Mobile Visual Recognition
by: Wan, Qiang, et al.
Published: (2023)

DEAL: Decoupled Classifier with Adaptive Linear Modulation for Group Robust Early Diagnosis of MCI to AD Conversion
by: Lee, Donggyu, et al.
Published: (2024)

Fine-R1: Make Multi-modal LLMs Excel in Fine-Grained Visual Recognition by Chain-of-Thought Reasoning
by: He, Hulingxiao, et al.
Published: (2026)

Diversify, Don't Fine-Tune: Scaling Up Visual Recognition Training with Synthetic Images
by: Yu, Zhuoran, et al.
Published: (2023)

M^3-GloDets: Multi-Region and Multi-Scale Analysis of Fine-Grained Diseased Glomerular Detection
by: Shi, Tianyu, et al.
Published: (2025)

Fine-Grained Image-Text Correspondence with Cost Aggregation for Open-Vocabulary Part Segmentation
by: Choi, Jiho, et al.
Published: (2025)

On Learning Discriminative Features from Synthesized Data for Self-Supervised Fine-Grained Visual Recognition
by: Wang, Zihu, et al.
Published: (2024)

Extract More from Less: Efficient Fine-Grained Visual Recognition in Low-Data Regimes
by: Demidov, Dmitry, et al.
Published: (2024)

Learning Temporal Cues by Predicting Objects Move for Multi-camera 3D Object Detection
by: Moon, Seokha, et al.
Published: (2024)

ShadowMaskFormer: Mask Augmented Patch Embeddings for Shadow Removal
by: Li, Zhuohao, et al.
Published: (2024)

NavFormer: IGRF Forecasting in Moving Coordinate Frames
by: Hwang, Yoontae, et al.
Published: (2026)

Car-1000: A New Large Scale Fine-Grained Visual Categorization Dataset
by: Hu, Yutao, et al.
Published: (2025)

VideoPatchCore: An Effective Method to Memorize Normality for Video Anomaly Detection
by: Ahn, Sunghyun, et al.
Published: (2024)

Holistic Unlearning Benchmark: A Multi-Faceted Evaluation for Text-to-Image Diffusion Model Unlearning
by: Moon, Saemi, et al.
Published: (2024)

DiverseVAR: Balancing Diversity and Quality of Next-Scale Visual Autoregressive Models
by: Park, Mingue, et al.
Published: (2025)

LoDisc: Learning Global-Local Discriminative Features for Self-Supervised Fine-Grained Visual Recognition
by: Shi, Jialu, et al.
Published: (2024)

Efficient Vocabulary-Free Fine-Grained Visual Recognition in the Age of Multimodal LLMs
by: Kuchibhotla, Hari Chandana, et al.
Published: (2025)

Self-Supervised Pretraining for Fine-Grained Plankton Recognition
by: Kareinen, Joona, et al.
Published: (2025)

AREPAS: Anomaly Detection in Fine-Grained Anatomy with Reconstruction-Based Semantic Patch-Scoring
by: Mitic, Branko, et al.
Published: (2025)

Finer: Investigating and Enhancing Fine-Grained Visual Concept Recognition in Large Vision Language Models
by: Kim, Jeonghwan, et al.
Published: (2024)

PAtt-Lite: Lightweight Patch and Attention MobileNet for Challenging Facial Expression Recognition
by: Ngwe, Jia Le, et al.
Published: (2023)

Real-Aware Residual Model Merging for Deepfake Detection
by: Park, Jinhee, et al.
Published: (2025)