Saved in:
| Main Authors: | Zhang, Hongying, Ma, ShuaiShuai |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2603.02726 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Cross-view geo-localization: a survey
by: Durgam, Abhilash, et al.
Published: (2024)
by: Durgam, Abhilash, et al.
Published: (2024)
Cross-view image geo-localization with Panorama-BEV Co-Retrieval Network
by: Ye, Junyan, et al.
Published: (2024)
by: Ye, Junyan, et al.
Published: (2024)
Revisiting Continuity of Image Tokens for Cross-domain Few-shot Learning
by: Yi, Shuai, et al.
Published: (2025)
by: Yi, Shuai, et al.
Published: (2025)
REVERSE: Reinforcing Evidence Verification and Search for Agentic Image geo-localization
by: Li, Yong, et al.
Published: (2026)
by: Li, Yong, et al.
Published: (2026)
DeCo: Frequency-Decoupled Pixel Diffusion for End-to-End Image Generation
by: Ma, Zehong, et al.
Published: (2025)
by: Ma, Zehong, et al.
Published: (2025)
Cross-view and Cross-domain Underwater Localization based on Optical Aerial and Acoustic Underwater Images
by: Santos, Matheus M. Dos, et al.
Published: (2022)
by: Santos, Matheus M. Dos, et al.
Published: (2022)
FAMNet: Frequency-aware Matching Network for Cross-domain Few-shot Medical Image Segmentation
by: Bo, Yuntian, et al.
Published: (2024)
by: Bo, Yuntian, et al.
Published: (2024)
Rethinking Generalizable Infrared Small Target Detection: A Real-scene Benchmark and Cross-view Representation Learning
by: Lu, Yahao, et al.
Published: (2025)
by: Lu, Yahao, et al.
Published: (2025)
Multi-scale Semantic Prior Features Guided Deep Neural Network for Urban Street-view Image
by: Zeng, Jianshun, et al.
Published: (2024)
by: Zeng, Jianshun, et al.
Published: (2024)
Anchor-free Cross-view Object Geo-localization with Gaussian Position Encoding and Cross-view Association
by: Ling, Xingtao, et al.
Published: (2025)
by: Ling, Xingtao, et al.
Published: (2025)
FreeDiff: Progressive Frequency Truncation for Image Editing with Diffusion Models
by: Wu, Wei, et al.
Published: (2024)
by: Wu, Wei, et al.
Published: (2024)
Geometry-guided Cross-view Diffusion for One-to-many Cross-view Image Synthesis
by: Lin, Tao Jun, et al.
Published: (2024)
by: Lin, Tao Jun, et al.
Published: (2024)
VICI: VLM-Instructed Cross-view Image-localisation
by: Zhang, Xiaohan, et al.
Published: (2025)
by: Zhang, Xiaohan, et al.
Published: (2025)
Frequency-domain Learning with Kernel Prior for Blind Image Deblurring
by: Sun, Jixiang, et al.
Published: (2025)
by: Sun, Jixiang, et al.
Published: (2025)
Harnessing Frequency Spectrum Insights for Image Copyright Protection Against Diffusion Models
by: Liu, Zhenguang, et al.
Published: (2025)
by: Liu, Zhenguang, et al.
Published: (2025)
Forgedit: Text Guided Image Editing via Learning and Forgetting
by: Zhang, Shiwen, et al.
Published: (2023)
by: Zhang, Shiwen, et al.
Published: (2023)
Cross-view Masked Diffusion Transformers for Person Image Synthesis
by: Pham, Trung X., et al.
Published: (2024)
by: Pham, Trung X., et al.
Published: (2024)
EEPNet-V2: Patch-to-Pixel Solution for Efficient Cross-Modal Registration between LiDAR Point Cloud and Camera Image
by: Yue, Yuanchao, et al.
Published: (2025)
by: Yue, Yuanchao, et al.
Published: (2025)
MindShot: A Few-Shot Brain Decoding Framework via Transferring Cross-Subject Prior and Distilling Frequency Domain Knowledge
by: Jiang, Shuai, et al.
Published: (2024)
by: Jiang, Shuai, et al.
Published: (2024)
MDAFNet: Multiscale Differential Edge and Adaptive Frequency Guided Network for Infrared Small Target Detection
by: Li, Shuying, et al.
Published: (2026)
by: Li, Shuying, et al.
Published: (2026)
MAD: Makeup All-in-One with Cross-Domain Diffusion Model
by: Ruan, Bo-Kai, et al.
Published: (2025)
by: Ruan, Bo-Kai, et al.
Published: (2025)
PanoFree: Tuning-Free Holistic Multi-view Image Generation with Cross-view Self-Guidance
by: Liu, Aoming, et al.
Published: (2024)
by: Liu, Aoming, et al.
Published: (2024)
PdfTable: A Unified Toolkit for Deep Learning-Based Table Extraction
by: Sheng, Lei, et al.
Published: (2024)
by: Sheng, Lei, et al.
Published: (2024)
Improving Cross-view Object Geo-localization: A Dual Attention Approach with Cross-view Interaction and Multi-Scale Spatial Features
by: Zhu, Xingtao Ling Yingying
Published: (2025)
by: Zhu, Xingtao Ling Yingying
Published: (2025)
Learning Cross-view Visual Geo-localization without Ground Truth
by: Li, Haoyuan, et al.
Published: (2024)
by: Li, Haoyuan, et al.
Published: (2024)
VMambaCC: A Visual State Space Model for Crowd Counting
by: Ma, Hao-Yuan, et al.
Published: (2024)
by: Ma, Hao-Yuan, et al.
Published: (2024)
FD-LSCIC: Frequency Decomposition-based Learned Screen Content Image Compression
by: Jiang, Shiqi, et al.
Published: (2025)
by: Jiang, Shiqi, et al.
Published: (2025)
Frequency-domain Event-based Imaging for Selective Surveillance
by: Birch, Megan, et al.
Published: (2026)
by: Birch, Megan, et al.
Published: (2026)
GeoDTR+: Toward generic cross-view geolocalization via geometric disentanglement
by: Zhang, Xiaohan, et al.
Published: (2023)
by: Zhang, Xiaohan, et al.
Published: (2023)
Personalized Federated Learning for Cross-view Geo-localization
by: Anagnostopoulos, Christos, et al.
Published: (2024)
by: Anagnostopoulos, Christos, et al.
Published: (2024)
Random Registers for Cross-Domain Few-Shot Learning
by: Yi, Shuai, et al.
Published: (2025)
by: Yi, Shuai, et al.
Published: (2025)
Language-based Image Colorization: A Benchmark and Beyond
by: Li, Yifan, et al.
Published: (2025)
by: Li, Yifan, et al.
Published: (2025)
Geo$^\textbf{2}$: Geometry-Guided Cross-view Geo-Localization and Image Synthesis
by: Zhang, Yancheng, et al.
Published: (2026)
by: Zhang, Yancheng, et al.
Published: (2026)
ROI-Aware Multiscale Cross-Attention Vision Transformer for Pest Image Identification
by: Kim, Ga-Eun, et al.
Published: (2023)
by: Kim, Ga-Eun, et al.
Published: (2023)
JRN-Geo: A Joint Perception Network based on RGB and Normal images for Cross-view Geo-localization
by: Zhou, Hongyu, et al.
Published: (2025)
by: Zhou, Hongyu, et al.
Published: (2025)
DCDet: Dynamic Cross-based 3D Object Detector
by: Liu, Shuai, et al.
Published: (2024)
by: Liu, Shuai, et al.
Published: (2024)
A Survey of Multimodal-Guided Image Editing with Text-to-Image Diffusion Models
by: Shuai, Xincheng, et al.
Published: (2024)
by: Shuai, Xincheng, et al.
Published: (2024)
OctGPT: Octree-based Multiscale Autoregressive Models for 3D Shape Generation
by: Wei, Si-Tong, et al.
Published: (2025)
by: Wei, Si-Tong, et al.
Published: (2025)
BiSegMamba: Efficient Bidirectional Tri-Oriented Mamba for 3D Medical Image Segmentation
by: Zada, Bakht, et al.
Published: (2026)
by: Zada, Bakht, et al.
Published: (2026)
ConGeo: Robust Cross-view Geo-localization across Ground View Variations
by: Mi, Li, et al.
Published: (2024)
by: Mi, Li, et al.
Published: (2024)
Similar Items
-
Cross-view geo-localization: a survey
by: Durgam, Abhilash, et al.
Published: (2024) -
Cross-view image geo-localization with Panorama-BEV Co-Retrieval Network
by: Ye, Junyan, et al.
Published: (2024) -
Revisiting Continuity of Image Tokens for Cross-domain Few-shot Learning
by: Yi, Shuai, et al.
Published: (2025) -
REVERSE: Reinforcing Evidence Verification and Search for Agentic Image geo-localization
by: Li, Yong, et al.
Published: (2026) -
DeCo: Frequency-Decoupled Pixel Diffusion for End-to-End Image Generation
by: Ma, Zehong, et al.
Published: (2025)