:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Lee, Jaewook, Park, Yoel, Lee, Seulki
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2408.03663
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

On-device Sora: Enabling Training-Free Diffusion-based Text-to-Video Generation for Mobile Devices
by: Kim, Bosung, et al.
Published: (2025)

On-device Sora: Enabling Training-Free Diffusion-based Text-to-Video Generation for Mobile Devices
by: Kim, Bosung, et al.
Published: (2025)

Partial Large Kernel CNNs for Efficient Super-Resolution
by: Lee, Dongheon, et al.
Published: (2024)

Free-Grained Hierarchical Visual Recognition
by: Park, Seulki, et al.
Published: (2025)

Leveraging Programmatically Generated Synthetic Data for Differentially Private Diffusion Training
by: Choi, Yujin, et al.
Published: (2024)

Investigating and unmasking feature-level vulnerabilities of CNNs to adversarial perturbations
by: Coppola, Davide, et al.
Published: (2024)

Lightweight Channel Attention for Efficient CNNs
by: Kanaparthi, Prem Babu, et al.
Published: (2026)

Visually Consistent Hierarchical Image Classification
by: Park, Seulki, et al.
Published: (2024)

XStreamVGGT: Extremely Memory-Efficient Streaming Vision Geometry Grounded Transformer with KV Cache Compression
by: Su, Zunhai, et al.
Published: (2026)

XStreamVGGT: Extremely Memory-Efficient Streaming Vision Geometry Grounded Transformer with KV Cache Compression
by: Su, Zunhai, et al.
Published: (2026)

ViscNet: Vision-Based In-line Viscometry for Fluid Mixing Process
by: Sohn, Jongwon, et al.
Published: (2025)

ExBluRF: Efficient Radiance Fields for Extreme Motion Blurred Images
by: Lee, Dongwoo, et al.
Published: (2023)

SHViT: Single-Head Vision Transformer with Memory Efficient Macro Design
by: Yun, Seokju, et al.
Published: (2024)

B-cos Alignment for Inherently Interpretable CNNs and Vision Transformers
by: Böhle, Moritz, et al.
Published: (2023)

ZIP: An Efficient Zeroth-order Prompt Tuning for Black-box Vision-Language Models
by: Park, Seonghwan, et al.
Published: (2025)

Parameter-Efficient Architectural Modifications for Translation-Invariant CNNs
by: Alabau-Bosque, Nuria, et al.
Published: (2026)

CNNs, Transformers, Hybrid, and Vision Language Models for Skin Cancer Detection
by: Dey, Durjoy, et al.
Published: (2026)

KOALA: Empirical Lessons Toward Memory-Efficient and Fast Diffusion Models for Text-to-Image Synthesis
by: Lee, Youngwan, et al.
Published: (2023)

MEIL-NeRF: Memory-Efficient Incremental Learning of Neural Radiance Fields
by: Chung, Jaeyoung, et al.
Published: (2022)

TADFormer : Task-Adaptive Dynamic Transformer for Efficient Multi-Task Learning
by: Baek, Seungmin, et al.
Published: (2025)

BEM: Training-Free Background Embedding Memory for False-Positive Suppression in Real-Time Fixed-Background Camera
by: Park, Junwoo, et al.
Published: (2026)

OA-CNNs: Omni-Adaptive Sparse CNNs for 3D Semantic Segmentation
by: Peng, Bohao, et al.
Published: (2024)

Vehicle Classification under Extreme Imbalance: A Comparative Study of Ensemble Learning and CNNs
by: Syarubany, Abu Hanif Muhammad
Published: (2025)

ReDistill: Residual Encoded Distillation for Peak Memory Reduction of CNNs
by: Chen, Fang, et al.
Published: (2024)

CAD: Memory Efficient Convolutional Adapter for Segment Anything
by: Kim, Joohyeok, et al.
Published: (2024)

Extreme Point Supervised Instance Segmentation
by: Lee, Hyeonjun, et al.
Published: (2024)

MoRGS: Efficient Per-Gaussian Motion Reasoning for Streamable Dynamic 3D Scenes
by: Lee, Wonjoon, et al.
Published: (2026)

Promoting CNNs with Cross-Architecture Knowledge Distillation for Efficient Monocular Depth Estimation
by: Zheng, Zhimeng, et al.
Published: (2024)

How Does Vision-Language Adaptation Impact the Safety of Vision Language Models?
by: Lee, Seongyun, et al.
Published: (2024)

ACM-UNet: Adaptive Integration of CNNs and Mamba for Efficient Medical Image Segmentation
by: Huang, Jing, et al.
Published: (2025)

Combining Transformers and CNNs for Efficient Object Detection in High-Resolution Satellite Imagery
by: Drapier, Nicolas, et al.
Published: (2025)

Continuous Memory Representation for Anomaly Detection
by: Lee, Joo Chan, et al.
Published: (2024)

Self-Supervised Vision Transformers Are Efficient Segmentation Learners for Imperfect Labels
by: Lee, Seungho, et al.
Published: (2024)

Efficient Hyperparameter Importance Assessment for CNNs
by: Wang, Ruinan, et al.
Published: (2024)

The Role of Masking for Efficient Supervised Knowledge Distillation of Vision Transformers
by: Son, Seungwoo, et al.
Published: (2023)

EfficientViM: Efficient Vision Mamba with Hidden State Mixer based State Space Duality
by: Lee, Sanghyeok, et al.
Published: (2024)

RoCOCO: Robustness Benchmark of MS-COCO to Stress-test Image-Text Matching Models
by: Park, Seulki, et al.
Published: (2023)

Bridging Vision and Language Spaces with Assignment Prediction
by: Park, Jungin, et al.
Published: (2024)

MoE-GRPO: Optimizing Mixture-of-Experts via Reinforcement Learning in Vision-Language Models
by: Ko, Dohwan, et al.
Published: (2026)

Exploring Synergistic Ensemble Learning: Uniting CNNs, MLP-Mixers, and Vision Transformers to Enhance Image Classification
by: Bashar, Mk, et al.
Published: (2025)