Saved in:
| Main Authors: | Wang, Hai, Yang, Xiaochen, Dong, Mingzhi, Xue, Jing-Hao |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2604.24642 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
360PanT: Training-Free Text-Driven 360-Degree Panorama-to-Panorama Translation
by: Wang, Hai, et al.
Published: (2024)
by: Wang, Hai, et al.
Published: (2024)
A Survey on Text-Driven 360-Degree Panorama Generation
by: Wang, Hai, et al.
Published: (2025)
by: Wang, Hai, et al.
Published: (2025)
360DVO: Deep Visual Odometry for Monocular 360-Degree Camera
by: Guo, Xiaopeng, et al.
Published: (2026)
by: Guo, Xiaopeng, et al.
Published: (2026)
Grad-ECLIP: Gradient-based Visual and Textual Explanations for CLIP
by: Zhao, Chenyang, et al.
Published: (2025)
by: Zhao, Chenyang, et al.
Published: (2025)
Render-of-Thought: Rendering Textual Chain-of-Thought as Images for Visual Latent Reasoning
by: Wang, Yifan, et al.
Published: (2026)
by: Wang, Yifan, et al.
Published: (2026)
FG-CLIP: Fine-Grained Visual and Textual Alignment
by: Xie, Chunyu, et al.
Published: (2025)
by: Xie, Chunyu, et al.
Published: (2025)
CLIPErase: Efficient Unlearning of Visual-Textual Associations in CLIP
by: Yang, Tianyu, et al.
Published: (2024)
by: Yang, Tianyu, et al.
Published: (2024)
Harnessing Textual Semantic Priors for Knowledge Transfer and Refinement in CLIP-Driven Continual Learning
by: He, Lingfeng, et al.
Published: (2025)
by: He, Lingfeng, et al.
Published: (2025)
Continual Learning on CLIP via Incremental Prompt Tuning with Intrinsic Textual Anchors
by: Lu, Haodong, et al.
Published: (2025)
by: Lu, Haodong, et al.
Published: (2025)
Spherical Vision Transformers for Audio-Visual Saliency Prediction in 360-Degree Videos
by: Cokelek, Mert, et al.
Published: (2025)
by: Cokelek, Mert, et al.
Published: (2025)
Anomaly Detection for People with Visual Impairments Using an Egocentric 360-Degree Camera
by: Song, Inpyo, et al.
Published: (2024)
by: Song, Inpyo, et al.
Published: (2024)
360DVD: Controllable Panorama Video Generation with 360-Degree Video Diffusion Model
by: Wang, Qian, et al.
Published: (2024)
by: Wang, Qian, et al.
Published: (2024)
Elite360D: Towards Efficient 360 Depth Estimation via Semantic- and Distance-Aware Bi-Projection Fusion
by: Ai, Hao, et al.
Published: (2024)
by: Ai, Hao, et al.
Published: (2024)
Video Question Answering for People with Visual Impairments Using an Egocentric 360-Degree Camera
by: Song, Inpyo, et al.
Published: (2024)
by: Song, Inpyo, et al.
Published: (2024)
Adaptive Score Alignment Learning for Continual Perceptual Quality Assessment of 360-Degree Videos in Virtual Reality
by: Zhou, Kanglei, et al.
Published: (2025)
by: Zhou, Kanglei, et al.
Published: (2025)
PathoSCOPE: Few-Shot Pathology Detection via Self-Supervised Contrastive Learning and Pathology-Informed Synthetic Embeddings
by: Chin, Sinchee, et al.
Published: (2025)
by: Chin, Sinchee, et al.
Published: (2025)
CLIP-AGIQA: Boosting the Performance of AI-Generated Image Quality Assessment with CLIP
by: Tang, Zhenchen, et al.
Published: (2024)
by: Tang, Zhenchen, et al.
Published: (2024)
Exploring Textual Semantics Diversity for Image Transmission in Semantic Communication Systems using Visual Language Model
by: Huang, Peishan, et al.
Published: (2025)
by: Huang, Peishan, et al.
Published: (2025)
TSalV360: A Method and Dataset for Text-driven Saliency Detection in 360-Degrees Videos
by: Kontostathis, Ioannis, et al.
Published: (2025)
by: Kontostathis, Ioannis, et al.
Published: (2025)
MIGA: Mutual Information-Guided Attack on Denoising Models for Semantic Manipulation
by: Li, Guanghao, et al.
Published: (2025)
by: Li, Guanghao, et al.
Published: (2025)
CLIP-Driven Semantic Discovery Network for Visible-Infrared Person Re-Identification
by: Yu, Xiaoyan, et al.
Published: (2024)
by: Yu, Xiaoyan, et al.
Published: (2024)
CLIP-MUSED: CLIP-Guided Multi-Subject Visual Neural Information Semantic Decoding
by: Zhou, Qiongyi, et al.
Published: (2024)
by: Zhou, Qiongyi, et al.
Published: (2024)
PanoDreamer: Consistent Text to 360-Degree Scene Generation
by: Xiong, Zhexiao, et al.
Published: (2025)
by: Xiong, Zhexiao, et al.
Published: (2025)
GazeTarget360: Towards Gaze Target Estimation in 360-Degree for Robot Perception
by: Dai, Zhuangzhuang, et al.
Published: (2025)
by: Dai, Zhuangzhuang, et al.
Published: (2025)
CLIP-VG: Self-paced Curriculum Adapting of CLIP for Visual Grounding
by: Xiao, Linhui, et al.
Published: (2023)
by: Xiao, Linhui, et al.
Published: (2023)
ReCLIP++: Learn to Rectify the Bias of CLIP for Unsupervised Semantic Segmentation
by: Wang, Jingyun, et al.
Published: (2024)
by: Wang, Jingyun, et al.
Published: (2024)
Free360: Layered Gaussian Splatting for Unbounded 360-Degree View Synthesis from Extremely Sparse and Unposed Views
by: Bao, Chong, et al.
Published: (2025)
by: Bao, Chong, et al.
Published: (2025)
ACT360: An Efficient 360-Degree Action Detection and Summarization Framework for Mission-Critical Training and Debriefing
by: Tiwari, Aditi, et al.
Published: (2025)
by: Tiwari, Aditi, et al.
Published: (2025)
Thinking in 360°: Humanoid Visual Search in the Wild
by: Yu, Heyang, et al.
Published: (2025)
by: Yu, Heyang, et al.
Published: (2025)
Rethinking Visual Content Refinement in Low-Shot CLIP Adaptation
by: Lu, Jinda, et al.
Published: (2024)
by: Lu, Jinda, et al.
Published: (2024)
DiCLIP: Diffusion Model Enhances CLIP's Dense Knowledge for Weakly Supervised Semantic Segmentation
by: Yang, Zhiwei, et al.
Published: (2026)
by: Yang, Zhiwei, et al.
Published: (2026)
Pose-Free Omnidirectional Gaussian Splatting for 360-Degree Videos with Consistent Depth Priors
by: Zhuang, Chuanqing, et al.
Published: (2026)
by: Zhuang, Chuanqing, et al.
Published: (2026)
Show or Tell? A Benchmark To Evaluate Visual and Textual Prompts in Semantic Segmentation
by: Rosi, Gabriele, et al.
Published: (2025)
by: Rosi, Gabriele, et al.
Published: (2025)
EdgeRelight360: Text-Conditioned 360-Degree HDR Image Generation for Real-Time On-Device Video Portrait Relighting
by: Lin, Min-Hui, et al.
Published: (2024)
by: Lin, Min-Hui, et al.
Published: (2024)
Imagine360: Immersive 360 Video Generation from Perspective Anchor
by: Tan, Jing, et al.
Published: (2024)
by: Tan, Jing, et al.
Published: (2024)
OmniAudio: Generating Spatial Audio from 360-Degree Video
by: Liu, Huadai, et al.
Published: (2025)
by: Liu, Huadai, et al.
Published: (2025)
Improving Geometric Consistency for 360-Degree Neural Radiance Fields in Indoor Scenarios
by: Repinetska, Iryna, et al.
Published: (2025)
by: Repinetska, Iryna, et al.
Published: (2025)
CLIP-SENet: CLIP-based Semantic Enhancement Network for Vehicle Re-identification
by: Lu, Liping, et al.
Published: (2025)
by: Lu, Liping, et al.
Published: (2025)
WP-CLIP: Leveraging CLIP to Predict Wölfflin's Principles in Visual Art
by: Ghildyal, Abhijay, et al.
Published: (2025)
by: Ghildyal, Abhijay, et al.
Published: (2025)
SE360: Semantic Edit in 360$^\circ$ Panoramas via Hierarchical Data Construction
by: Zhong, Haoyi, et al.
Published: (2025)
by: Zhong, Haoyi, et al.
Published: (2025)
Similar Items
-
360PanT: Training-Free Text-Driven 360-Degree Panorama-to-Panorama Translation
by: Wang, Hai, et al.
Published: (2024) -
A Survey on Text-Driven 360-Degree Panorama Generation
by: Wang, Hai, et al.
Published: (2025) -
360DVO: Deep Visual Odometry for Monocular 360-Degree Camera
by: Guo, Xiaopeng, et al.
Published: (2026) -
Grad-ECLIP: Gradient-based Visual and Textual Explanations for CLIP
by: Zhao, Chenyang, et al.
Published: (2025) -
Render-of-Thought: Rendering Textual Chain-of-Thought as Images for Visual Latent Reasoning
by: Wang, Yifan, et al.
Published: (2026)