Library Catalog

Bibliographic Details
Main Authors:	Lahoti, Aakash; Karp, Stefani; Winston, Ezra; Singh, Aarti; Li, Yuanzhi
Format:	Preprint
Published:	2024
Subjects:	Machine Learning; Artificial Intelligence; Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2403.15707

Similar Items

Detection Limits and Statistical Separability of Tree Ring Watermarks in Rectified Flow-based Text-to-Image Generation Models
by: Umrajkar, Ved, et al.
Published: (2025)

Pixels to Prose: Understanding the art of Image Captioning
by: Singh, Hrishikesh, et al.
Published: (2024)

MRI Volume-Based Robust Brain Age Estimation Using Weight-Shared Spatial Attention in 3D CNNs
by: Kancharla, Vamshi Krishna, et al.
Published: (2024)

Automatic Complementary Separation Pruning Toward Lightweight CNNs
by: Levin, David, et al.
Published: (2025)

A Hybrid Transformer-Sequencer approach for Age and Gender classification from in-wild facial images
by: Singh, Aakash, et al.
Published: (2024)

Synthesizer Based Efficient Self-Attention for Vision Tasks
by: Zhu, Guangyang, et al.
Published: (2022)

SpatialLock: Precise Spatial Control in Text-to-Image Synthesis
by: Liu, Biao, et al.
Published: (2025)

Designing Extremely Memory-Efficient CNNs for On-device Vision Tasks
by: Lee, Jaewook, et al.
Published: (2024)

Regularizing CNNs using Confusion Penalty Based Label Smoothing for Histopathology Images
by: Kuiry, Somenath, et al.
Published: (2024)

Automated Image Captioning with CNNs and Transformers
by: Cahyono, Joshua Adrian, et al.
Published: (2024)

Understanding and Improving CNNs with Complex Structure Tensor: A Biometrics Study
by: Hernandez-Diaz, Kevin, et al.
Published: (2024)

B-cos Alignment for Inherently Interpretable CNNs and Vision Transformers
by: Böhle, Moritz, et al.
Published: (2023)

Integrative CAM: Adaptive Layer Fusion for Comprehensive Interpretation of CNNs
by: Singh, Aniket K., et al.
Published: (2024)

SceneFunRI: Reasoning the Invisible for Task-Driven Functional Object Localization
by: Chen, Posheng, et al.
Published: (2026)

Investigating Market Strength Prediction with CNNs on Candlestick Chart Images
by: Duong, Thanh Nam, et al.
Published: (2025)

OA-CNNs: Omni-Adaptive Sparse CNNs for 3D Semantic Segmentation
by: Peng, Bohao, et al.
Published: (2024)

From CNNs to Shift-Invariant Twin Models Based on Complex Wavelets
by: Leterme, Hubert, et al.
Published: (2022)

SeaS: Few-shot Industrial Anomaly Image Generation with Separation and Sharing Fine-tuning
by: Dai, Zhewei, et al.
Published: (2024)

Data-Agnostic Face Image Synthesis Detection Using Bayesian CNNs
by: Leyva, Roberto, et al.
Published: (2024)

Efficient CNNs via Passive Filter Pruning
by: Singh, Arshdeep, et al.
Published: (2023)

VistaFormer: Scalable Vision Transformers for Satellite Image Time Series Segmentation
by: MacDonald, Ezra, et al.
Published: (2024)

Neural Bloom: A Deep Learning Approach to Real-Time Lighting
by: Karp, Rafal, et al.
Published: (2025)

Rethinking Where to Edit: Task-Aware Localization for Instruction-Based Image Editing
by: He, Jingxuan, et al.
Published: (2026)

Systematic Integration of Attention Modules into CNNs for Accurate and Generalizable Medical Image Diagnosis
by: Ullah, Zahid, et al.
Published: (2025)

ACM-UNet: Adaptive Integration of CNNs and Mamba for Efficient Medical Image Segmentation
by: Huang, Jing, et al.
Published: (2025)

Breaking Shallow Limits: Task-Driven Pixel Fusion for Gap-free RGBT Tracking
by: Lu, Andong, et al.
Published: (2025)

Uncertainty-Aware Dual-Student Knowledge Distillation for Efficient Image Classification
by: Gore, Aakash, et al.
Published: (2025)

Reliable or Deceptive? Investigating Gated Features for Smooth Visual Explanations in CNNs
by: Mitra, Soham, et al.
Published: (2024)

Revisiting Disentanglement in Downstream Tasks: A Study on Its Necessity for Abstract Visual Reasoning
by: Nai, Ruiqian, et al.
Published: (2024)

Underwater Image Restoration via Polymorphic Large Kernel CNNs
by: Guo, Xiaojiao, et al.
Published: (2024)

Lightweight Channel Attention for Efficient CNNs
by: Kanaparthi, Prem Babu, et al.
Published: (2026)

CAM Back Again: Large Kernel CNNs from a Weakly Supervised Object Localization Perspective
by: Yasuki, Shunsuke, et al.
Published: (2024)

Seg-LSTM: Performance of xLSTM for Semantic Segmentation of Remotely Sensed Images
by: Zhu, Qinfeng, et al.
Published: (2024)

Perceiving Beyond Language Priors: Enhancing Visual Comprehension and Attention in Multimodal Models
by: Ghatkesar, Aarti, et al.
Published: (2025)

On the universality of neural encodings in CNNs
by: Guth, Florentin, et al.
Published: (2024)

Understanding CNNs from excitations
by: Ying, Zijian, et al.
Published: (2022)

Seeing Beyond Redundancy: Task Complexity's Role in Vision Token Specialization in VLLMs
by: Hannan, Darryl, et al.
Published: (2026)

The Effective Depth Paradox: Evaluating the Relationship between Architectural Topology and Trainability in Deep CNNs
by: Fischer, Manfred M., et al.
Published: (2026)

Polar Separable Transform for Efficient Orthogonal Rotation-Invariant Image Representation
by: Singh, Satya P., et al.
Published: (2025)

Bioinspired CNNs for border completion in occluded images
by: Coutinho, Catarina P., et al.
Published: (2026)