Saved in:
| Main Authors: | Lee, Jaewook, Park, Yoel, Lee, Seulki |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2408.03663 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
On-device Sora: Enabling Training-Free Diffusion-based Text-to-Video Generation for Mobile Devices
by: Kim, Bosung, et al.
Published: (2025)
by: Kim, Bosung, et al.
Published: (2025)
On-device Sora: Enabling Training-Free Diffusion-based Text-to-Video Generation for Mobile Devices
by: Kim, Bosung, et al.
Published: (2025)
by: Kim, Bosung, et al.
Published: (2025)
Partial Large Kernel CNNs for Efficient Super-Resolution
by: Lee, Dongheon, et al.
Published: (2024)
by: Lee, Dongheon, et al.
Published: (2024)
Free-Grained Hierarchical Visual Recognition
by: Park, Seulki, et al.
Published: (2025)
by: Park, Seulki, et al.
Published: (2025)
Leveraging Programmatically Generated Synthetic Data for Differentially Private Diffusion Training
by: Choi, Yujin, et al.
Published: (2024)
by: Choi, Yujin, et al.
Published: (2024)
Investigating and unmasking feature-level vulnerabilities of CNNs to adversarial perturbations
by: Coppola, Davide, et al.
Published: (2024)
by: Coppola, Davide, et al.
Published: (2024)
Lightweight Channel Attention for Efficient CNNs
by: Kanaparthi, Prem Babu, et al.
Published: (2026)
by: Kanaparthi, Prem Babu, et al.
Published: (2026)
Visually Consistent Hierarchical Image Classification
by: Park, Seulki, et al.
Published: (2024)
by: Park, Seulki, et al.
Published: (2024)
XStreamVGGT: Extremely Memory-Efficient Streaming Vision Geometry Grounded Transformer with KV Cache Compression
by: Su, Zunhai, et al.
Published: (2026)
by: Su, Zunhai, et al.
Published: (2026)
XStreamVGGT: Extremely Memory-Efficient Streaming Vision Geometry Grounded Transformer with KV Cache Compression
by: Su, Zunhai, et al.
Published: (2026)
by: Su, Zunhai, et al.
Published: (2026)
ViscNet: Vision-Based In-line Viscometry for Fluid Mixing Process
by: Sohn, Jongwon, et al.
Published: (2025)
by: Sohn, Jongwon, et al.
Published: (2025)
ExBluRF: Efficient Radiance Fields for Extreme Motion Blurred Images
by: Lee, Dongwoo, et al.
Published: (2023)
by: Lee, Dongwoo, et al.
Published: (2023)
SHViT: Single-Head Vision Transformer with Memory Efficient Macro Design
by: Yun, Seokju, et al.
Published: (2024)
by: Yun, Seokju, et al.
Published: (2024)
B-cos Alignment for Inherently Interpretable CNNs and Vision Transformers
by: Böhle, Moritz, et al.
Published: (2023)
by: Böhle, Moritz, et al.
Published: (2023)
ZIP: An Efficient Zeroth-order Prompt Tuning for Black-box Vision-Language Models
by: Park, Seonghwan, et al.
Published: (2025)
by: Park, Seonghwan, et al.
Published: (2025)
Parameter-Efficient Architectural Modifications for Translation-Invariant CNNs
by: Alabau-Bosque, Nuria, et al.
Published: (2026)
by: Alabau-Bosque, Nuria, et al.
Published: (2026)
CNNs, Transformers, Hybrid, and Vision Language Models for Skin Cancer Detection
by: Dey, Durjoy, et al.
Published: (2026)
by: Dey, Durjoy, et al.
Published: (2026)
KOALA: Empirical Lessons Toward Memory-Efficient and Fast Diffusion Models for Text-to-Image Synthesis
by: Lee, Youngwan, et al.
Published: (2023)
by: Lee, Youngwan, et al.
Published: (2023)
MEIL-NeRF: Memory-Efficient Incremental Learning of Neural Radiance Fields
by: Chung, Jaeyoung, et al.
Published: (2022)
by: Chung, Jaeyoung, et al.
Published: (2022)
TADFormer : Task-Adaptive Dynamic Transformer for Efficient Multi-Task Learning
by: Baek, Seungmin, et al.
Published: (2025)
by: Baek, Seungmin, et al.
Published: (2025)
BEM: Training-Free Background Embedding Memory for False-Positive Suppression in Real-Time Fixed-Background Camera
by: Park, Junwoo, et al.
Published: (2026)
by: Park, Junwoo, et al.
Published: (2026)
OA-CNNs: Omni-Adaptive Sparse CNNs for 3D Semantic Segmentation
by: Peng, Bohao, et al.
Published: (2024)
by: Peng, Bohao, et al.
Published: (2024)
Vehicle Classification under Extreme Imbalance: A Comparative Study of Ensemble Learning and CNNs
by: Syarubany, Abu Hanif Muhammad
Published: (2025)
by: Syarubany, Abu Hanif Muhammad
Published: (2025)
ReDistill: Residual Encoded Distillation for Peak Memory Reduction of CNNs
by: Chen, Fang, et al.
Published: (2024)
by: Chen, Fang, et al.
Published: (2024)
CAD: Memory Efficient Convolutional Adapter for Segment Anything
by: Kim, Joohyeok, et al.
Published: (2024)
by: Kim, Joohyeok, et al.
Published: (2024)
Extreme Point Supervised Instance Segmentation
by: Lee, Hyeonjun, et al.
Published: (2024)
by: Lee, Hyeonjun, et al.
Published: (2024)
MoRGS: Efficient Per-Gaussian Motion Reasoning for Streamable Dynamic 3D Scenes
by: Lee, Wonjoon, et al.
Published: (2026)
by: Lee, Wonjoon, et al.
Published: (2026)
Promoting CNNs with Cross-Architecture Knowledge Distillation for Efficient Monocular Depth Estimation
by: Zheng, Zhimeng, et al.
Published: (2024)
by: Zheng, Zhimeng, et al.
Published: (2024)
How Does Vision-Language Adaptation Impact the Safety of Vision Language Models?
by: Lee, Seongyun, et al.
Published: (2024)
by: Lee, Seongyun, et al.
Published: (2024)
ACM-UNet: Adaptive Integration of CNNs and Mamba for Efficient Medical Image Segmentation
by: Huang, Jing, et al.
Published: (2025)
by: Huang, Jing, et al.
Published: (2025)
Combining Transformers and CNNs for Efficient Object Detection in High-Resolution Satellite Imagery
by: Drapier, Nicolas, et al.
Published: (2025)
by: Drapier, Nicolas, et al.
Published: (2025)
Continuous Memory Representation for Anomaly Detection
by: Lee, Joo Chan, et al.
Published: (2024)
by: Lee, Joo Chan, et al.
Published: (2024)
Self-Supervised Vision Transformers Are Efficient Segmentation Learners for Imperfect Labels
by: Lee, Seungho, et al.
Published: (2024)
by: Lee, Seungho, et al.
Published: (2024)
Efficient Hyperparameter Importance Assessment for CNNs
by: Wang, Ruinan, et al.
Published: (2024)
by: Wang, Ruinan, et al.
Published: (2024)
The Role of Masking for Efficient Supervised Knowledge Distillation of Vision Transformers
by: Son, Seungwoo, et al.
Published: (2023)
by: Son, Seungwoo, et al.
Published: (2023)
EfficientViM: Efficient Vision Mamba with Hidden State Mixer based State Space Duality
by: Lee, Sanghyeok, et al.
Published: (2024)
by: Lee, Sanghyeok, et al.
Published: (2024)
RoCOCO: Robustness Benchmark of MS-COCO to Stress-test Image-Text Matching Models
by: Park, Seulki, et al.
Published: (2023)
by: Park, Seulki, et al.
Published: (2023)
Bridging Vision and Language Spaces with Assignment Prediction
by: Park, Jungin, et al.
Published: (2024)
by: Park, Jungin, et al.
Published: (2024)
MoE-GRPO: Optimizing Mixture-of-Experts via Reinforcement Learning in Vision-Language Models
by: Ko, Dohwan, et al.
Published: (2026)
by: Ko, Dohwan, et al.
Published: (2026)
Exploring Synergistic Ensemble Learning: Uniting CNNs, MLP-Mixers, and Vision Transformers to Enhance Image Classification
by: Bashar, Mk, et al.
Published: (2025)
by: Bashar, Mk, et al.
Published: (2025)
Similar Items
-
On-device Sora: Enabling Training-Free Diffusion-based Text-to-Video Generation for Mobile Devices
by: Kim, Bosung, et al.
Published: (2025) -
On-device Sora: Enabling Training-Free Diffusion-based Text-to-Video Generation for Mobile Devices
by: Kim, Bosung, et al.
Published: (2025) -
Partial Large Kernel CNNs for Efficient Super-Resolution
by: Lee, Dongheon, et al.
Published: (2024) -
Free-Grained Hierarchical Visual Recognition
by: Park, Seulki, et al.
Published: (2025) -
Leveraging Programmatically Generated Synthetic Data for Differentially Private Diffusion Training
by: Choi, Yujin, et al.
Published: (2024)