Saved in:
| Main Authors: | Moon, Jiyong, Lee, Junseok, Lee, Yunju, Park, Seongsik |
|---|---|
| Format: | Preprint |
| Published: |
2023
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2308.02161 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
PlaceFormer: Transformer-based Visual Place Recognition using Multi-Scale Patch Selection and Fusion
by: Kannan, Shyam Sundar, et al.
Published: (2024)
by: Kannan, Shyam Sundar, et al.
Published: (2024)
ABM-LoRA: Activation Boundary Matching for Fast Convergence in Low-Rank Adaptation
by: Lee, Dongha, et al.
Published: (2025)
by: Lee, Dongha, et al.
Published: (2025)
M2SFormer: Multi-Spectral and Multi-Scale Attention with Edge-Aware Difficulty Guidance for Image Forgery Localization
by: Nam, Ju-Hyeon, et al.
Published: (2025)
by: Nam, Ju-Hyeon, et al.
Published: (2025)
Lightweight Wasserstein Audio-Visual Model for Unified Speech Enhancement and Separation
by: Park, Jisoo, et al.
Published: (2025)
by: Park, Jisoo, et al.
Published: (2025)
PCBEAR: Pose Concept Bottleneck for Explainable Action Recognition
by: Lee, Jongseo, et al.
Published: (2025)
by: Lee, Jongseo, et al.
Published: (2025)
Free-Grained Hierarchical Visual Recognition
by: Park, Seulki, et al.
Published: (2025)
by: Park, Seulki, et al.
Published: (2025)
Emotion Recognition Using Transformers with Masked Learning
by: Min, Seongjae, et al.
Published: (2024)
by: Min, Seongjae, et al.
Published: (2024)
Advancing Fine-Grained Visual Understanding with Multi-Scale Alignment in Multi-Modal Models
by: Wang, Wei, et al.
Published: (2024)
by: Wang, Wei, et al.
Published: (2024)
Disentangled Concepts Speak Louder Than Words: Explainable Video Action Recognition
by: Lee, Jongseo, et al.
Published: (2025)
by: Lee, Jongseo, et al.
Published: (2025)
Patch-Wise Self-Supervised Visual Representation Learning: A Fine-Grained Approach
by: Javidani, Ali, et al.
Published: (2023)
by: Javidani, Ali, et al.
Published: (2023)
Localization Meets Uncertainty: Uncertainty-Aware Multi-Modal Localization
by: Won, Hye-Min, et al.
Published: (2025)
by: Won, Hye-Min, et al.
Published: (2025)
Mosaic: Compositional Multi-Concept Erasure via Vector Field Blending
by: Ko, Junseok, et al.
Published: (2026)
by: Ko, Junseok, et al.
Published: (2026)
MATANet: A Multi-context Attention and Taxonomy-Aware Network for Fine-Grained Underwater Recognition of Marine Species
by: Lee, Donghwan, et al.
Published: (2026)
by: Lee, Donghwan, et al.
Published: (2026)
CD-FKD: Cross-Domain Feature Knowledge Distillation for Robust Single-Domain Generalization in Object Detection
by: Lee, Junseok, et al.
Published: (2026)
by: Lee, Junseok, et al.
Published: (2026)
UniFormer: Unifying Convolution and Self-attention for Visual Recognition
by: Li, Kunchang, et al.
Published: (2022)
by: Li, Kunchang, et al.
Published: (2022)
XR-VLM: Cross-Relationship Modeling with Multi-part Prompts and Visual Features for Fine-Grained Recognition
by: Wang, Chuanming, et al.
Published: (2025)
by: Wang, Chuanming, et al.
Published: (2025)
SafeDrive: Fine-Grained Safety Reasoning for End-to-End Driving in a Sparse World
by: Kim, Jungho, et al.
Published: (2026)
by: Kim, Jungho, et al.
Published: (2026)
Global Context-aware Representation Learning for Spatially Resolved Transcriptomics
by: Oh, Yunhak, et al.
Published: (2025)
by: Oh, Yunhak, et al.
Published: (2025)
SeaFormer++: Squeeze-enhanced Axial Transformer for Mobile Visual Recognition
by: Wan, Qiang, et al.
Published: (2023)
by: Wan, Qiang, et al.
Published: (2023)
DEAL: Decoupled Classifier with Adaptive Linear Modulation for Group Robust Early Diagnosis of MCI to AD Conversion
by: Lee, Donggyu, et al.
Published: (2024)
by: Lee, Donggyu, et al.
Published: (2024)
Fine-R1: Make Multi-modal LLMs Excel in Fine-Grained Visual Recognition by Chain-of-Thought Reasoning
by: He, Hulingxiao, et al.
Published: (2026)
by: He, Hulingxiao, et al.
Published: (2026)
Diversify, Don't Fine-Tune: Scaling Up Visual Recognition Training with Synthetic Images
by: Yu, Zhuoran, et al.
Published: (2023)
by: Yu, Zhuoran, et al.
Published: (2023)
M^3-GloDets: Multi-Region and Multi-Scale Analysis of Fine-Grained Diseased Glomerular Detection
by: Shi, Tianyu, et al.
Published: (2025)
by: Shi, Tianyu, et al.
Published: (2025)
Fine-Grained Image-Text Correspondence with Cost Aggregation for Open-Vocabulary Part Segmentation
by: Choi, Jiho, et al.
Published: (2025)
by: Choi, Jiho, et al.
Published: (2025)
On Learning Discriminative Features from Synthesized Data for Self-Supervised Fine-Grained Visual Recognition
by: Wang, Zihu, et al.
Published: (2024)
by: Wang, Zihu, et al.
Published: (2024)
Extract More from Less: Efficient Fine-Grained Visual Recognition in Low-Data Regimes
by: Demidov, Dmitry, et al.
Published: (2024)
by: Demidov, Dmitry, et al.
Published: (2024)
Learning Temporal Cues by Predicting Objects Move for Multi-camera 3D Object Detection
by: Moon, Seokha, et al.
Published: (2024)
by: Moon, Seokha, et al.
Published: (2024)
ShadowMaskFormer: Mask Augmented Patch Embeddings for Shadow Removal
by: Li, Zhuohao, et al.
Published: (2024)
by: Li, Zhuohao, et al.
Published: (2024)
NavFormer: IGRF Forecasting in Moving Coordinate Frames
by: Hwang, Yoontae, et al.
Published: (2026)
by: Hwang, Yoontae, et al.
Published: (2026)
Car-1000: A New Large Scale Fine-Grained Visual Categorization Dataset
by: Hu, Yutao, et al.
Published: (2025)
by: Hu, Yutao, et al.
Published: (2025)
VideoPatchCore: An Effective Method to Memorize Normality for Video Anomaly Detection
by: Ahn, Sunghyun, et al.
Published: (2024)
by: Ahn, Sunghyun, et al.
Published: (2024)
Holistic Unlearning Benchmark: A Multi-Faceted Evaluation for Text-to-Image Diffusion Model Unlearning
by: Moon, Saemi, et al.
Published: (2024)
by: Moon, Saemi, et al.
Published: (2024)
DiverseVAR: Balancing Diversity and Quality of Next-Scale Visual Autoregressive Models
by: Park, Mingue, et al.
Published: (2025)
by: Park, Mingue, et al.
Published: (2025)
LoDisc: Learning Global-Local Discriminative Features for Self-Supervised Fine-Grained Visual Recognition
by: Shi, Jialu, et al.
Published: (2024)
by: Shi, Jialu, et al.
Published: (2024)
Efficient Vocabulary-Free Fine-Grained Visual Recognition in the Age of Multimodal LLMs
by: Kuchibhotla, Hari Chandana, et al.
Published: (2025)
by: Kuchibhotla, Hari Chandana, et al.
Published: (2025)
Self-Supervised Pretraining for Fine-Grained Plankton Recognition
by: Kareinen, Joona, et al.
Published: (2025)
by: Kareinen, Joona, et al.
Published: (2025)
AREPAS: Anomaly Detection in Fine-Grained Anatomy with Reconstruction-Based Semantic Patch-Scoring
by: Mitic, Branko, et al.
Published: (2025)
by: Mitic, Branko, et al.
Published: (2025)
Finer: Investigating and Enhancing Fine-Grained Visual Concept Recognition in Large Vision Language Models
by: Kim, Jeonghwan, et al.
Published: (2024)
by: Kim, Jeonghwan, et al.
Published: (2024)
PAtt-Lite: Lightweight Patch and Attention MobileNet for Challenging Facial Expression Recognition
by: Ngwe, Jia Le, et al.
Published: (2023)
by: Ngwe, Jia Le, et al.
Published: (2023)
Real-Aware Residual Model Merging for Deepfake Detection
by: Park, Jinhee, et al.
Published: (2025)
by: Park, Jinhee, et al.
Published: (2025)
Similar Items
-
PlaceFormer: Transformer-based Visual Place Recognition using Multi-Scale Patch Selection and Fusion
by: Kannan, Shyam Sundar, et al.
Published: (2024) -
ABM-LoRA: Activation Boundary Matching for Fast Convergence in Low-Rank Adaptation
by: Lee, Dongha, et al.
Published: (2025) -
M2SFormer: Multi-Spectral and Multi-Scale Attention with Edge-Aware Difficulty Guidance for Image Forgery Localization
by: Nam, Ju-Hyeon, et al.
Published: (2025) -
Lightweight Wasserstein Audio-Visual Model for Unified Speech Enhancement and Separation
by: Park, Jisoo, et al.
Published: (2025) -
PCBEAR: Pose Concept Bottleneck for Explainable Action Recognition
by: Lee, Jongseo, et al.
Published: (2025)