:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Heng, Yuwen, Dasmahapatra, Srinandan, Kim, Hansung
Format:	Preprint
Published:	2023
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2305.03919
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Understanding Imbalanced Forgetting in Rehearsal-Based Class-Incremental Learning
by: Tamajo, Alberto, et al.
Published: (2026)

Unsupervised Multi-Person 3D Human Pose Estimation From 2D Poses Alone
by: Hardy, Peter, et al.
Published: (2023)

Ultra-High Resolution Segmentation via Boundary-Enhanced Patch-Merging Transformer
by: Sun, Haopeng, et al.
Published: (2024)

Semantic Scene Completion with Multi-Feature Data Balancing Network
by: Alawadh, Mona, et al.
Published: (2024)

MedSAD-CLIP: Supervised CLIP with Token-Patch Cross-Attention for Medical Anomaly Detection and Segmentation
by: Tran, Thuy Truong, et al.
Published: (2026)

Adaptive Morph-Patch Transformer for Aortic Vessel Segmentation
by: Zhang, Zhenxi, et al.
Published: (2025)

Camera-Aware Cross-View Alignment for Referring 3D Gaussian Splatting Segmentation
by: Tao, Yuwen, et al.
Published: (2025)

Low-Resolution Self-Attention for Semantic Segmentation
by: Wu, Yu-Huan, et al.
Published: (2023)

MUJICA: Reforming SISR Models for PBR Material Super-Resolution via Cross-Map Attention
by: Du, Xin, et al.
Published: (2025)

DDiT: Dynamic Patch Scheduling for Efficient Diffusion Transformers
by: Kim, Dahye, et al.
Published: (2026)

MOGO: Residual Quantized Hierarchical Causal Transformer for High-Quality and Real-Time 3D Human Motion Generation
by: Fu, Dongjie, et al.
Published: (2025)

CRAB: Camera-Radar Fusion for Reducing Depth Ambiguity in Backward Projection based View Transformation
by: Lee, In-Jae, et al.
Published: (2025)

Entropy Guided Dynamic Patch Segmentation for Time Series Transformers
by: Abeywickrama, Sachith, et al.
Published: (2025)

MSPE: Multi-Scale Patch Embedding Prompts Vision Transformers to Any Resolution
by: Liu, Wenzhuo, et al.
Published: (2024)

PatchVSR: Breaking Video Diffusion Resolution Limits with Patch-wise Video Super-Resolution
by: Du, Shian, et al.
Published: (2025)

MoRe: Class Patch Attention Needs Regularization for Weakly Supervised Semantic Segmentation
by: Yang, Zhiwei, et al.
Published: (2024)

Towards Robust Semantic Segmentation against Patch-based Attack via Attention Refinement
by: Yuan, Zheng, et al.
Published: (2024)

Dynamic Texture Transfer using PatchMatch and Transformers
by: Pu, Guo, et al.
Published: (2024)

ROI-Aware Multiscale Cross-Attention Vision Transformer for Pest Image Identification
by: Kim, Ga-Eun, et al.
Published: (2023)

Improving Real-Time Omnidirectional 3D Multi-Person Human Pose Estimation with People Matching and Unsupervised 2D-3D Lifting
by: Knap, Pawel, et al.
Published: (2024)

Cross Resolution Encoding-Decoding For Detection Transformers
by: Kumar, Ashish, et al.
Published: (2024)

The Collapse of Patches
by: Guo, Wei, et al.
Published: (2025)

PatchScaler: An Efficient Patch-Independent Diffusion Model for Image Super-Resolution
by: Liu, Yong, et al.
Published: (2024)

Terrain-Enhanced Resolution-aware Refinement Attention for Off-Road Segmentation
by: Choi, Seongkyu, et al.
Published: (2025)

SCASeg: Strip Cross-Attention for Efficient Semantic Segmentation
by: Xu, Guoan, et al.
Published: (2024)

Cross-Stage Attention Propagation for Efficient Semantic Segmentation
by: Kang, Beoungwoo
Published: (2026)

PC-SAM: Patch-Constrained Fine-Grained Interactive Road Segmentation in High-Resolution Remote Sensing Images
by: Lv, Chengcheng, et al.
Published: (2026)

Backward-Compatible Aligned Representations via an Orthogonal Transformation Layer
by: Ricci, Simone, et al.
Published: (2024)

Cross-modulated Attention Transformer for RGBT Tracking
by: Xiao, Yun, et al.
Published: (2024)

Adversarial Manhole: Challenging Monocular Depth Estimation and Semantic Segmentation Models with Patch Attack
by: Suryanto, Naufal, et al.
Published: (2024)

Patch Pruning Strategy Based on Robust Statistical Measures of Attention Weight Diversity in Vision Transformers
by: Igaue, Yuki, et al.
Published: (2025)

MATIS: Masked-Attention Transformers for Surgical Instrument Segmentation
by: Ayobi, Nicolás, et al.
Published: (2023)

Dynamic Patch-aware Enrichment Transformer for Occluded Person Re-Identification
by: Zhang, Xin, et al.
Published: (2024)

GPAFormer: Graph-guided Patch Aggregation Transformer for Efficient 3D Medical Image Segmentation
by: Lo, Chung-Ming, et al.
Published: (2026)

Video Super-Resolution Transformer with Masked Inter&Intra-Frame Attention
by: Zhou, Xingyu, et al.
Published: (2024)

SEGA: Spectral-Energy Guided Attention for Resolution Extrapolation in Diffusion Transformers
by: Rajabi, Javad, et al.
Published: (2026)

MAT: Multi-Range Attention Transformer for Efficient Image Super-Resolution
by: Xie, Chengxing, et al.
Published: (2024)

Crafting Query-Aware Selective Attention for Single Image Super-Resolution
by: Kim, Junyoung, et al.
Published: (2025)

Integrating Query-aware Segmentation and Cross-Attention for Robust VQA
by: Choi, Wonjun, et al.
Published: (2024)

LoReTrack: Efficient and Accurate Low-Resolution Transformer Tracking
by: Dong, Shaohua, et al.
Published: (2024)