Saved in:
| Main Authors: | Mkrtchyan, Rafayel, Manukyan, Armen, Khachatrian, Hrant, Raptis, Theofanis P. |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2508.03736 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Vision Transformers for Efficient Indoor Pathloss Radio Map Prediction
by: Mkrtchyan, Rafayel, et al.
Published: (2024)
by: Mkrtchyan, Rafayel, et al.
Published: (2024)
Outdoor Environment Reconstruction with Deep Learning on Radio Propagation Paths
by: Khachatrian, Hrant, et al.
Published: (2024)
by: Khachatrian, Hrant, et al.
Published: (2024)
ML-based Approaches for Wireless NLOS Localization: Input Representations and Uncertainty Estimation
by: Darbinyan, Rafayel, et al.
Published: (2023)
by: Darbinyan, Rafayel, et al.
Published: (2023)
On the Limitations of Ray-Tracing for Learning-Based RF Tasks in Urban Environments
by: Manukyan, Armen, et al.
Published: (2025)
by: Manukyan, Armen, et al.
Published: (2025)
Analyzing Local Representations of Self-supervised Vision Transformers
by: Vanyan, Ani, et al.
Published: (2023)
by: Vanyan, Ani, et al.
Published: (2023)
Do Satellite Tasks Need Special Pretraining?
by: Vanyan, Ani, et al.
Published: (2025)
by: Vanyan, Ani, et al.
Published: (2025)
Channel Vision Transformers: An Image Is Worth 1 x 16 x 16 Words
by: Bao, Yujia, et al.
Published: (2023)
by: Bao, Yujia, et al.
Published: (2023)
In-context Learning in Presence of Spurious Correlations
by: Harutyunyan, Hrayr, et al.
Published: (2024)
by: Harutyunyan, Hrayr, et al.
Published: (2024)
Advancing Vision Transformer with Enhanced Spatial Priors
by: Fan, Qihang, et al.
Published: (2026)
by: Fan, Qihang, et al.
Published: (2026)
Zero-Painter: Training-Free Layout Control for Text-to-Image Synthesis
by: Ohanyan, Marianna, et al.
Published: (2024)
by: Ohanyan, Marianna, et al.
Published: (2024)
FSATFusion: Frequency-Spatial Attention Transformer for Infrared and Visible Image Fusion
by: Zhang, Tianpei, et al.
Published: (2025)
by: Zhang, Tianpei, et al.
Published: (2025)
Image Fusion via Vision-Language Model
by: Zhao, Zixiang, et al.
Published: (2024)
by: Zhao, Zixiang, et al.
Published: (2024)
Spatial-Frequency Enhanced Mamba for Multi-Modal Image Fusion
by: Sun, Hui, et al.
Published: (2025)
by: Sun, Hui, et al.
Published: (2025)
Arbitrary Data as Images: Fusion of Patient Data Across Modalities and Irregular Intervals with Vision Transformers
by: Tölle, Malte, et al.
Published: (2025)
by: Tölle, Malte, et al.
Published: (2025)
Fusion of Foundation and Vision Transformer Model Features for Dermatoscopic Image Classification
by: Mahbod, Amirreza, et al.
Published: (2025)
by: Mahbod, Amirreza, et al.
Published: (2025)
MapRF: Weakly Supervised Online HD Map Construction via NeRF-Guided Self-Training
by: Lyu, Hongyu, et al.
Published: (2025)
by: Lyu, Hongyu, et al.
Published: (2025)
HD-Painter: High-Resolution and Prompt-Faithful Text-Guided Image Inpainting with Diffusion Models
by: Manukyan, Hayk, et al.
Published: (2023)
by: Manukyan, Hayk, et al.
Published: (2023)
NeRF-DetS: Enhanced Adaptive Spatial-wise Sampling and View-wise Fusion Strategies for NeRF-based Indoor Multi-view 3D Object Detection
by: Huang, Chi, et al.
Published: (2024)
by: Huang, Chi, et al.
Published: (2024)
Lightweight Vision Transformer with Window and Spatial Attention for Food Image Classification
by: Gao, Xinle, et al.
Published: (2025)
by: Gao, Xinle, et al.
Published: (2025)
PreSight: Enhancing Autonomous Vehicle Perception with City-Scale NeRF Priors
by: Yuan, Tianyuan, et al.
Published: (2024)
by: Yuan, Tianyuan, et al.
Published: (2024)
Learning Spatial Decay for Vision Transformers
by: Mao, Yuxin, et al.
Published: (2025)
by: Mao, Yuxin, et al.
Published: (2025)
EVCC: Enhanced Vision Transformer-ConvNeXt-CoAtNet Fusion for Classification
by: Hasan, Kazi Reyazul, et al.
Published: (2025)
by: Hasan, Kazi Reyazul, et al.
Published: (2025)
Fusion of regional and sparse attention in Vision Transformers
by: Ibtehaz, Nabil, et al.
Published: (2024)
by: Ibtehaz, Nabil, et al.
Published: (2024)
GeminiFusion: Efficient Pixel-wise Multimodal Fusion for Vision Transformer
by: Jia, Ding, et al.
Published: (2024)
by: Jia, Ding, et al.
Published: (2024)
VSFormer: Visual-Spatial Fusion Transformer for Correspondence Pruning
by: Liao, Tangfei, et al.
Published: (2023)
by: Liao, Tangfei, et al.
Published: (2023)
Empirical Application Insights on Industrial Data and Service Aspects of Digital Twin Networks
by: Becattini, Marco, et al.
Published: (2024)
by: Becattini, Marco, et al.
Published: (2024)
CityRAG: Stepping Into a City via Spatially-Grounded Video Generation
by: Chou, Gene, et al.
Published: (2026)
by: Chou, Gene, et al.
Published: (2026)
HyFusion: Enhanced Reception Field Transformer for Hyperspectral Image Fusion
by: Lee, Chia-Ming, et al.
Published: (2025)
by: Lee, Chia-Ming, et al.
Published: (2025)
A Light and Smart Wearable Platform with Multimodal Foundation Model for Enhanced Spatial Reasoning in People with Blindness and Low Vision
by: Magay, Alexey, et al.
Published: (2025)
by: Magay, Alexey, et al.
Published: (2025)
Multi-Modal Vision Transformers for Crop Mapping from Satellite Image Time Series
by: Follath, Theresa, et al.
Published: (2024)
by: Follath, Theresa, et al.
Published: (2024)
A Saccade-inspired Approach to Image Classification using Vision Transformer Attention Maps
by: Dallain, Matthis, et al.
Published: (2026)
by: Dallain, Matthis, et al.
Published: (2026)
Tri-FusionNet: Enhancing Image Description Generation with Transformer-based Fusion Network and Dual Attention Mechanism
by: Agarwal, Lakshita, et al.
Published: (2025)
by: Agarwal, Lakshita, et al.
Published: (2025)
Interpretable Vision Transformers in Image Classification via SVDA
by: Arampatzakis, Vasileios, et al.
Published: (2026)
by: Arampatzakis, Vasileios, et al.
Published: (2026)
Transformer-based RGB-T Tracking with Channel and Spatial Feature Fusion
by: Li, Yunfeng, et al.
Published: (2024)
by: Li, Yunfeng, et al.
Published: (2024)
SATA: Spatial Autocorrelation Token Analysis for Enhancing the Robustness of Vision Transformers
by: Nikzad, Nick, et al.
Published: (2024)
by: Nikzad, Nick, et al.
Published: (2024)
Texture Image Synthesis Using Spatial GAN Based on Vision Transformers
by: Salari, Elahe, et al.
Published: (2025)
by: Salari, Elahe, et al.
Published: (2025)
Omni-Fusion of Spatial and Spectral for Hyperspectral Image Segmentation
by: Zhang, Qing, et al.
Published: (2025)
by: Zhang, Qing, et al.
Published: (2025)
Interactive Spatial-Frequency Fusion Mamba for Multi-Modal Image Fusion
by: Zhu, Yixin, et al.
Published: (2026)
by: Zhu, Yixin, et al.
Published: (2026)
LoRA-Enhanced Vision Transformer for Single Image based Morphing Attack Detection via Knowledge Distillation from EfficientNet
by: Shekhawat, Ria, et al.
Published: (2025)
by: Shekhawat, Ria, et al.
Published: (2025)
SFDFusion: An Efficient Spatial-Frequency Domain Fusion Network for Infrared and Visible Image Fusion
by: Hu, Kun, et al.
Published: (2024)
by: Hu, Kun, et al.
Published: (2024)
Similar Items
-
Vision Transformers for Efficient Indoor Pathloss Radio Map Prediction
by: Mkrtchyan, Rafayel, et al.
Published: (2024) -
Outdoor Environment Reconstruction with Deep Learning on Radio Propagation Paths
by: Khachatrian, Hrant, et al.
Published: (2024) -
ML-based Approaches for Wireless NLOS Localization: Input Representations and Uncertainty Estimation
by: Darbinyan, Rafayel, et al.
Published: (2023) -
On the Limitations of Ray-Tracing for Learning-Based RF Tasks in Urban Environments
by: Manukyan, Armen, et al.
Published: (2025) -
Analyzing Local Representations of Self-supervised Vision Transformers
by: Vanyan, Ani, et al.
Published: (2023)