| Main Authors: | Lahoti, Aakash, Karp, Stefani, Winston, Ezra, Singh, Aarti, Li, Yuanzhi |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2403.15707 |
Similar Items
Detection Limits and Statistical Separability of Tree Ring Watermarks in Rectified Flow-based Text-to-Image Generation Models
by: Umrajkar, Ved, et al.
Published: (2025)
Pixels to Prose: Understanding the art of Image Captioning
by: Singh, Hrishikesh, et al.
Published: (2024)
MRI Volume-Based Robust Brain Age Estimation Using Weight-Shared Spatial Attention in 3D CNNs
by: Kancharla, Vamshi Krishna, et al.
Published: (2024)
Automatic Complementary Separation Pruning Toward Lightweight CNNs
by: Levin, David, et al.
Published: (2025)
A Hybrid Transformer-Sequencer approach for Age and Gender classification from in-wild facial images
by: Singh, Aakash, et al.
Published: (2024)
Synthesizer Based Efficient Self-Attention for Vision Tasks
by: Zhu, Guangyang, et al.
Published: (2022)
SpatialLock: Precise Spatial Control in Text-to-Image Synthesis
by: Liu, Biao, et al.
Published: (2025)
Designing Extremely Memory-Efficient CNNs for On-device Vision Tasks
by: Lee, Jaewook, et al.
Published: (2024)
Regularizing CNNs using Confusion Penalty Based Label Smoothing for Histopathology Images
by: Kuiry, Somenath, et al.
Published: (2024)
Automated Image Captioning with CNNs and Transformers
by: Cahyono, Joshua Adrian, et al.
Published: (2024)
Understanding and Improving CNNs with Complex Structure Tensor: A Biometrics Study
by: Hernandez-Diaz, Kevin, et al.
Published: (2024)
B-cos Alignment for Inherently Interpretable CNNs and Vision Transformers
by: Böhle, Moritz, et al.
Published: (2023)
Integrative CAM: Adaptive Layer Fusion for Comprehensive Interpretation of CNNs
by: Singh, Aniket K., et al.
Published: (2024)
SceneFunRI: Reasoning the Invisible for Task-Driven Functional Object Localization
by: Chen, Posheng, et al.
Published: (2026)
Investigating Market Strength Prediction with CNNs on Candlestick Chart Images
by: Duong, Thanh Nam, et al.
Published: (2025)
OA-CNNs: Omni-Adaptive Sparse CNNs for 3D Semantic Segmentation
by: Peng, Bohao, et al.
Published: (2024)
From CNNs to Shift-Invariant Twin Models Based on Complex Wavelets
by: Leterme, Hubert, et al.
Published: (2022)
SeaS: Few-shot Industrial Anomaly Image Generation with Separation and Sharing Fine-tuning
by: Dai, Zhewei, et al.
Published: (2024)
Data-Agnostic Face Image Synthesis Detection Using Bayesian CNNs
by: Leyva, Roberto, et al.
Published: (2024)
Efficient CNNs via Passive Filter Pruning
by: Singh, Arshdeep, et al.
Published: (2023)
VistaFormer: Scalable Vision Transformers for Satellite Image Time Series Segmentation
by: MacDonald, Ezra, et al.
Published: (2024)
Neural Bloom: A Deep Learning Approach to Real-Time Lighting
by: Karp, Rafal, et al.
Published: (2025)
Rethinking Where to Edit: Task-Aware Localization for Instruction-Based Image Editing
by: He, Jingxuan, et al.
Published: (2026)
Systematic Integration of Attention Modules into CNNs for Accurate and Generalizable Medical Image Diagnosis
by: Ullah, Zahid, et al.
Published: (2025)
ACM-UNet: Adaptive Integration of CNNs and Mamba for Efficient Medical Image Segmentation
by: Huang, Jing, et al.
Published: (2025)
Breaking Shallow Limits: Task-Driven Pixel Fusion for Gap-free RGBT Tracking
by: Lu, Andong, et al.
Published: (2025)
Uncertainty-Aware Dual-Student Knowledge Distillation for Efficient Image Classification
by: Gore, Aakash, et al.
Published: (2025)
Reliable or Deceptive? Investigating Gated Features for Smooth Visual Explanations in CNNs
by: Mitra, Soham, et al.
Published: (2024)
Revisiting Disentanglement in Downstream Tasks: A Study on Its Necessity for Abstract Visual Reasoning
by: Nai, Ruiqian, et al.
Published: (2024)
Underwater Image Restoration via Polymorphic Large Kernel CNNs
by: Guo, Xiaojiao, et al.
Published: (2024)
Lightweight Channel Attention for Efficient CNNs
by: Kanaparthi, Prem Babu, et al.
Published: (2026)
CAM Back Again: Large Kernel CNNs from a Weakly Supervised Object Localization Perspective
by: Yasuki, Shunsuke, et al.
Published: (2024)
Seg-LSTM: Performance of xLSTM for Semantic Segmentation of Remotely Sensed Images
by: Zhu, Qinfeng, et al.
Published: (2024)
Perceiving Beyond Language Priors: Enhancing Visual Comprehension and Attention in Multimodal Models
by: Ghatkesar, Aarti, et al.
Published: (2025)
On the universality of neural encodings in CNNs
by: Guth, Florentin, et al.
Published: (2024)
Understanding CNNs from excitations
by: Ying, Zijian, et al.
Published: (2022)
Seeing Beyond Redundancy: Task Complexity's Role in Vision Token Specialization in VLLMs
by: Hannan, Darryl, et al.
Published: (2026)
The Effective Depth Paradox: Evaluating the Relationship between Architectural Topology and Trainability in Deep CNNs
by: Fischer, Manfred M., et al.
Published: (2026)
Polar Separable Transform for Efficient Orthogonal Rotation-Invariant Image Representation
by: Singh, Satya P., et al.
Published: (2025)
Bioinspired CNNs for border completion in occluded images
by: Coutinho, Catarina P., et al.
Published: (2026)