:: Library Catalog

Bibliographic Details
Main Authors:	Mkrtchyan, Rafayel; Manukyan, Armen; Khachatrian, Hrant; Raptis, Theofanis P.
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition; Artificial Intelligence
Online Access:	https://arxiv.org/abs/2508.03736

Similar Items

Vision Transformers for Efficient Indoor Pathloss Radio Map Prediction
by: Mkrtchyan, Rafayel, et al.
Published: (2024)

Outdoor Environment Reconstruction with Deep Learning on Radio Propagation Paths
by: Khachatrian, Hrant, et al.
Published: (2024)

ML-based Approaches for Wireless NLOS Localization: Input Representations and Uncertainty Estimation
by: Darbinyan, Rafayel, et al.
Published: (2023)

On the Limitations of Ray-Tracing for Learning-Based RF Tasks in Urban Environments
by: Manukyan, Armen, et al.
Published: (2025)

Analyzing Local Representations of Self-supervised Vision Transformers
by: Vanyan, Ani, et al.
Published: (2023)

Do Satellite Tasks Need Special Pretraining?
by: Vanyan, Ani, et al.
Published: (2025)

Channel Vision Transformers: An Image Is Worth 1 x 16 x 16 Words
by: Bao, Yujia, et al.
Published: (2023)

In-context Learning in Presence of Spurious Correlations
by: Harutyunyan, Hrayr, et al.
Published: (2024)

Advancing Vision Transformer with Enhanced Spatial Priors
by: Fan, Qihang, et al.
Published: (2026)

Zero-Painter: Training-Free Layout Control for Text-to-Image Synthesis
by: Ohanyan, Marianna, et al.
Published: (2024)

FSATFusion: Frequency-Spatial Attention Transformer for Infrared and Visible Image Fusion
by: Zhang, Tianpei, et al.
Published: (2025)

Image Fusion via Vision-Language Model
by: Zhao, Zixiang, et al.
Published: (2024)

Spatial-Frequency Enhanced Mamba for Multi-Modal Image Fusion
by: Sun, Hui, et al.
Published: (2025)

Arbitrary Data as Images: Fusion of Patient Data Across Modalities and Irregular Intervals with Vision Transformers
by: Tölle, Malte, et al.
Published: (2025)

Fusion of Foundation and Vision Transformer Model Features for Dermatoscopic Image Classification
by: Mahbod, Amirreza, et al.
Published: (2025)

MapRF: Weakly Supervised Online HD Map Construction via NeRF-Guided Self-Training
by: Lyu, Hongyu, et al.
Published: (2025)

HD-Painter: High-Resolution and Prompt-Faithful Text-Guided Image Inpainting with Diffusion Models
by: Manukyan, Hayk, et al.
Published: (2023)

NeRF-DetS: Enhanced Adaptive Spatial-wise Sampling and View-wise Fusion Strategies for NeRF-based Indoor Multi-view 3D Object Detection
by: Huang, Chi, et al.
Published: (2024)

Lightweight Vision Transformer with Window and Spatial Attention for Food Image Classification
by: Gao, Xinle, et al.
Published: (2025)

PreSight: Enhancing Autonomous Vehicle Perception with City-Scale NeRF Priors
by: Yuan, Tianyuan, et al.
Published: (2024)

Learning Spatial Decay for Vision Transformers
by: Mao, Yuxin, et al.
Published: (2025)

EVCC: Enhanced Vision Transformer-ConvNeXt-CoAtNet Fusion for Classification
by: Hasan, Kazi Reyazul, et al.
Published: (2025)

Fusion of regional and sparse attention in Vision Transformers
by: Ibtehaz, Nabil, et al.
Published: (2024)

GeminiFusion: Efficient Pixel-wise Multimodal Fusion for Vision Transformer
by: Jia, Ding, et al.
Published: (2024)

VSFormer: Visual-Spatial Fusion Transformer for Correspondence Pruning
by: Liao, Tangfei, et al.
Published: (2023)

Empirical Application Insights on Industrial Data and Service Aspects of Digital Twin Networks
by: Becattini, Marco, et al.
Published: (2024)

CityRAG: Stepping Into a City via Spatially-Grounded Video Generation
by: Chou, Gene, et al.
Published: (2026)

HyFusion: Enhanced Reception Field Transformer for Hyperspectral Image Fusion
by: Lee, Chia-Ming, et al.
Published: (2025)

A Light and Smart Wearable Platform with Multimodal Foundation Model for Enhanced Spatial Reasoning in People with Blindness and Low Vision
by: Magay, Alexey, et al.
Published: (2025)

Multi-Modal Vision Transformers for Crop Mapping from Satellite Image Time Series
by: Follath, Theresa, et al.
Published: (2024)

A Saccade-inspired Approach to Image Classification using Vision Transformer Attention Maps
by: Dallain, Matthis, et al.
Published: (2026)

Tri-FusionNet: Enhancing Image Description Generation with Transformer-based Fusion Network and Dual Attention Mechanism
by: Agarwal, Lakshita, et al.
Published: (2025)

Interpretable Vision Transformers in Image Classification via SVDA
by: Arampatzakis, Vasileios, et al.
Published: (2026)

Transformer-based RGB-T Tracking with Channel and Spatial Feature Fusion
by: Li, Yunfeng, et al.
Published: (2024)

SATA: Spatial Autocorrelation Token Analysis for Enhancing the Robustness of Vision Transformers
by: Nikzad, Nick, et al.
Published: (2024)

Texture Image Synthesis Using Spatial GAN Based on Vision Transformers
by: Salari, Elahe, et al.
Published: (2025)

Omni-Fusion of Spatial and Spectral for Hyperspectral Image Segmentation
by: Zhang, Qing, et al.
Published: (2025)

Interactive Spatial-Frequency Fusion Mamba for Multi-Modal Image Fusion
by: Zhu, Yixin, et al.
Published: (2026)

LoRA-Enhanced Vision Transformer for Single Image based Morphing Attack Detection via Knowledge Distillation from EfficientNet
by: Shekhawat, Ria, et al.
Published: (2025)

SFDFusion: An Efficient Spatial-Frequency Domain Fusion Network for Infrared and Visible Image Fusion
by: Hu, Kun, et al.
Published: (2024)