Saved in:
| Main Authors: | Heng, Yuwen, Dasmahapatra, Srinandan, Kim, Hansung |
|---|---|
| Format: | Preprint |
| Published: |
2023
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2305.03919 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Understanding Imbalanced Forgetting in Rehearsal-Based Class-Incremental Learning
by: Tamajo, Alberto, et al.
Published: (2026)
by: Tamajo, Alberto, et al.
Published: (2026)
Unsupervised Multi-Person 3D Human Pose Estimation From 2D Poses Alone
by: Hardy, Peter, et al.
Published: (2023)
by: Hardy, Peter, et al.
Published: (2023)
Ultra-High Resolution Segmentation via Boundary-Enhanced Patch-Merging Transformer
by: Sun, Haopeng, et al.
Published: (2024)
by: Sun, Haopeng, et al.
Published: (2024)
Semantic Scene Completion with Multi-Feature Data Balancing Network
by: Alawadh, Mona, et al.
Published: (2024)
by: Alawadh, Mona, et al.
Published: (2024)
MedSAD-CLIP: Supervised CLIP with Token-Patch Cross-Attention for Medical Anomaly Detection and Segmentation
by: Tran, Thuy Truong, et al.
Published: (2026)
by: Tran, Thuy Truong, et al.
Published: (2026)
Adaptive Morph-Patch Transformer for Aortic Vessel Segmentation
by: Zhang, Zhenxi, et al.
Published: (2025)
by: Zhang, Zhenxi, et al.
Published: (2025)
Camera-Aware Cross-View Alignment for Referring 3D Gaussian Splatting Segmentation
by: Tao, Yuwen, et al.
Published: (2025)
by: Tao, Yuwen, et al.
Published: (2025)
Low-Resolution Self-Attention for Semantic Segmentation
by: Wu, Yu-Huan, et al.
Published: (2023)
by: Wu, Yu-Huan, et al.
Published: (2023)
MUJICA: Reforming SISR Models for PBR Material Super-Resolution via Cross-Map Attention
by: Du, Xin, et al.
Published: (2025)
by: Du, Xin, et al.
Published: (2025)
DDiT: Dynamic Patch Scheduling for Efficient Diffusion Transformers
by: Kim, Dahye, et al.
Published: (2026)
by: Kim, Dahye, et al.
Published: (2026)
MOGO: Residual Quantized Hierarchical Causal Transformer for High-Quality and Real-Time 3D Human Motion Generation
by: Fu, Dongjie, et al.
Published: (2025)
by: Fu, Dongjie, et al.
Published: (2025)
CRAB: Camera-Radar Fusion for Reducing Depth Ambiguity in Backward Projection based View Transformation
by: Lee, In-Jae, et al.
Published: (2025)
by: Lee, In-Jae, et al.
Published: (2025)
Entropy Guided Dynamic Patch Segmentation for Time Series Transformers
by: Abeywickrama, Sachith, et al.
Published: (2025)
by: Abeywickrama, Sachith, et al.
Published: (2025)
MSPE: Multi-Scale Patch Embedding Prompts Vision Transformers to Any Resolution
by: Liu, Wenzhuo, et al.
Published: (2024)
by: Liu, Wenzhuo, et al.
Published: (2024)
PatchVSR: Breaking Video Diffusion Resolution Limits with Patch-wise Video Super-Resolution
by: Du, Shian, et al.
Published: (2025)
by: Du, Shian, et al.
Published: (2025)
MoRe: Class Patch Attention Needs Regularization for Weakly Supervised Semantic Segmentation
by: Yang, Zhiwei, et al.
Published: (2024)
by: Yang, Zhiwei, et al.
Published: (2024)
Towards Robust Semantic Segmentation against Patch-based Attack via Attention Refinement
by: Yuan, Zheng, et al.
Published: (2024)
by: Yuan, Zheng, et al.
Published: (2024)
Dynamic Texture Transfer using PatchMatch and Transformers
by: Pu, Guo, et al.
Published: (2024)
by: Pu, Guo, et al.
Published: (2024)
ROI-Aware Multiscale Cross-Attention Vision Transformer for Pest Image Identification
by: Kim, Ga-Eun, et al.
Published: (2023)
by: Kim, Ga-Eun, et al.
Published: (2023)
Improving Real-Time Omnidirectional 3D Multi-Person Human Pose Estimation with People Matching and Unsupervised 2D-3D Lifting
by: Knap, Pawel, et al.
Published: (2024)
by: Knap, Pawel, et al.
Published: (2024)
Cross Resolution Encoding-Decoding For Detection Transformers
by: Kumar, Ashish, et al.
Published: (2024)
by: Kumar, Ashish, et al.
Published: (2024)
The Collapse of Patches
by: Guo, Wei, et al.
Published: (2025)
by: Guo, Wei, et al.
Published: (2025)
PatchScaler: An Efficient Patch-Independent Diffusion Model for Image Super-Resolution
by: Liu, Yong, et al.
Published: (2024)
by: Liu, Yong, et al.
Published: (2024)
Terrain-Enhanced Resolution-aware Refinement Attention for Off-Road Segmentation
by: Choi, Seongkyu, et al.
Published: (2025)
by: Choi, Seongkyu, et al.
Published: (2025)
SCASeg: Strip Cross-Attention for Efficient Semantic Segmentation
by: Xu, Guoan, et al.
Published: (2024)
by: Xu, Guoan, et al.
Published: (2024)
Cross-Stage Attention Propagation for Efficient Semantic Segmentation
by: Kang, Beoungwoo
Published: (2026)
by: Kang, Beoungwoo
Published: (2026)
PC-SAM: Patch-Constrained Fine-Grained Interactive Road Segmentation in High-Resolution Remote Sensing Images
by: Lv, Chengcheng, et al.
Published: (2026)
by: Lv, Chengcheng, et al.
Published: (2026)
Backward-Compatible Aligned Representations via an Orthogonal Transformation Layer
by: Ricci, Simone, et al.
Published: (2024)
by: Ricci, Simone, et al.
Published: (2024)
Cross-modulated Attention Transformer for RGBT Tracking
by: Xiao, Yun, et al.
Published: (2024)
by: Xiao, Yun, et al.
Published: (2024)
Adversarial Manhole: Challenging Monocular Depth Estimation and Semantic Segmentation Models with Patch Attack
by: Suryanto, Naufal, et al.
Published: (2024)
by: Suryanto, Naufal, et al.
Published: (2024)
Patch Pruning Strategy Based on Robust Statistical Measures of Attention Weight Diversity in Vision Transformers
by: Igaue, Yuki, et al.
Published: (2025)
by: Igaue, Yuki, et al.
Published: (2025)
MATIS: Masked-Attention Transformers for Surgical Instrument Segmentation
by: Ayobi, Nicolás, et al.
Published: (2023)
by: Ayobi, Nicolás, et al.
Published: (2023)
Dynamic Patch-aware Enrichment Transformer for Occluded Person Re-Identification
by: Zhang, Xin, et al.
Published: (2024)
by: Zhang, Xin, et al.
Published: (2024)
GPAFormer: Graph-guided Patch Aggregation Transformer for Efficient 3D Medical Image Segmentation
by: Lo, Chung-Ming, et al.
Published: (2026)
by: Lo, Chung-Ming, et al.
Published: (2026)
Video Super-Resolution Transformer with Masked Inter&Intra-Frame Attention
by: Zhou, Xingyu, et al.
Published: (2024)
by: Zhou, Xingyu, et al.
Published: (2024)
SEGA: Spectral-Energy Guided Attention for Resolution Extrapolation in Diffusion Transformers
by: Rajabi, Javad, et al.
Published: (2026)
by: Rajabi, Javad, et al.
Published: (2026)
MAT: Multi-Range Attention Transformer for Efficient Image Super-Resolution
by: Xie, Chengxing, et al.
Published: (2024)
by: Xie, Chengxing, et al.
Published: (2024)
Crafting Query-Aware Selective Attention for Single Image Super-Resolution
by: Kim, Junyoung, et al.
Published: (2025)
by: Kim, Junyoung, et al.
Published: (2025)
Integrating Query-aware Segmentation and Cross-Attention for Robust VQA
by: Choi, Wonjun, et al.
Published: (2024)
by: Choi, Wonjun, et al.
Published: (2024)
LoReTrack: Efficient and Accurate Low-Resolution Transformer Tracking
by: Dong, Shaohua, et al.
Published: (2024)
by: Dong, Shaohua, et al.
Published: (2024)
Similar Items
-
Understanding Imbalanced Forgetting in Rehearsal-Based Class-Incremental Learning
by: Tamajo, Alberto, et al.
Published: (2026) -
Unsupervised Multi-Person 3D Human Pose Estimation From 2D Poses Alone
by: Hardy, Peter, et al.
Published: (2023) -
Ultra-High Resolution Segmentation via Boundary-Enhanced Patch-Merging Transformer
by: Sun, Haopeng, et al.
Published: (2024) -
Semantic Scene Completion with Multi-Feature Data Balancing Network
by: Alawadh, Mona, et al.
Published: (2024) -
MedSAD-CLIP: Supervised CLIP with Token-Patch Cross-Attention for Medical Anomaly Detection and Segmentation
by: Tran, Thuy Truong, et al.
Published: (2026)