:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Wang, Hai, Yang, Xiaochen, Dong, Mingzhi, Xue, Jing-Hao
Format:	Preprint
Published:	2026
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2604.24642
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

360PanT: Training-Free Text-Driven 360-Degree Panorama-to-Panorama Translation
by: Wang, Hai, et al.
Published: (2024)

A Survey on Text-Driven 360-Degree Panorama Generation
by: Wang, Hai, et al.
Published: (2025)

360DVO: Deep Visual Odometry for Monocular 360-Degree Camera
by: Guo, Xiaopeng, et al.
Published: (2026)

Grad-ECLIP: Gradient-based Visual and Textual Explanations for CLIP
by: Zhao, Chenyang, et al.
Published: (2025)

Render-of-Thought: Rendering Textual Chain-of-Thought as Images for Visual Latent Reasoning
by: Wang, Yifan, et al.
Published: (2026)

FG-CLIP: Fine-Grained Visual and Textual Alignment
by: Xie, Chunyu, et al.
Published: (2025)

CLIPErase: Efficient Unlearning of Visual-Textual Associations in CLIP
by: Yang, Tianyu, et al.
Published: (2024)

Harnessing Textual Semantic Priors for Knowledge Transfer and Refinement in CLIP-Driven Continual Learning
by: He, Lingfeng, et al.
Published: (2025)

Continual Learning on CLIP via Incremental Prompt Tuning with Intrinsic Textual Anchors
by: Lu, Haodong, et al.
Published: (2025)

Spherical Vision Transformers for Audio-Visual Saliency Prediction in 360-Degree Videos
by: Cokelek, Mert, et al.
Published: (2025)

Anomaly Detection for People with Visual Impairments Using an Egocentric 360-Degree Camera
by: Song, Inpyo, et al.
Published: (2024)

360DVD: Controllable Panorama Video Generation with 360-Degree Video Diffusion Model
by: Wang, Qian, et al.
Published: (2024)

Elite360D: Towards Efficient 360 Depth Estimation via Semantic- and Distance-Aware Bi-Projection Fusion
by: Ai, Hao, et al.
Published: (2024)

Video Question Answering for People with Visual Impairments Using an Egocentric 360-Degree Camera
by: Song, Inpyo, et al.
Published: (2024)

Adaptive Score Alignment Learning for Continual Perceptual Quality Assessment of 360-Degree Videos in Virtual Reality
by: Zhou, Kanglei, et al.
Published: (2025)

PathoSCOPE: Few-Shot Pathology Detection via Self-Supervised Contrastive Learning and Pathology-Informed Synthetic Embeddings
by: Chin, Sinchee, et al.
Published: (2025)

CLIP-AGIQA: Boosting the Performance of AI-Generated Image Quality Assessment with CLIP
by: Tang, Zhenchen, et al.
Published: (2024)

Exploring Textual Semantics Diversity for Image Transmission in Semantic Communication Systems using Visual Language Model
by: Huang, Peishan, et al.
Published: (2025)

TSalV360: A Method and Dataset for Text-driven Saliency Detection in 360-Degrees Videos
by: Kontostathis, Ioannis, et al.
Published: (2025)

MIGA: Mutual Information-Guided Attack on Denoising Models for Semantic Manipulation
by: Li, Guanghao, et al.
Published: (2025)

CLIP-Driven Semantic Discovery Network for Visible-Infrared Person Re-Identification
by: Yu, Xiaoyan, et al.
Published: (2024)

CLIP-MUSED: CLIP-Guided Multi-Subject Visual Neural Information Semantic Decoding
by: Zhou, Qiongyi, et al.
Published: (2024)

PanoDreamer: Consistent Text to 360-Degree Scene Generation
by: Xiong, Zhexiao, et al.
Published: (2025)

GazeTarget360: Towards Gaze Target Estimation in 360-Degree for Robot Perception
by: Dai, Zhuangzhuang, et al.
Published: (2025)

CLIP-VG: Self-paced Curriculum Adapting of CLIP for Visual Grounding
by: Xiao, Linhui, et al.
Published: (2023)

ReCLIP++: Learn to Rectify the Bias of CLIP for Unsupervised Semantic Segmentation
by: Wang, Jingyun, et al.
Published: (2024)

Free360: Layered Gaussian Splatting for Unbounded 360-Degree View Synthesis from Extremely Sparse and Unposed Views
by: Bao, Chong, et al.
Published: (2025)

ACT360: An Efficient 360-Degree Action Detection and Summarization Framework for Mission-Critical Training and Debriefing
by: Tiwari, Aditi, et al.
Published: (2025)

Thinking in 360°: Humanoid Visual Search in the Wild
by: Yu, Heyang, et al.
Published: (2025)

Rethinking Visual Content Refinement in Low-Shot CLIP Adaptation
by: Lu, Jinda, et al.
Published: (2024)

DiCLIP: Diffusion Model Enhances CLIP's Dense Knowledge for Weakly Supervised Semantic Segmentation
by: Yang, Zhiwei, et al.
Published: (2026)

Pose-Free Omnidirectional Gaussian Splatting for 360-Degree Videos with Consistent Depth Priors
by: Zhuang, Chuanqing, et al.
Published: (2026)

Show or Tell? A Benchmark To Evaluate Visual and Textual Prompts in Semantic Segmentation
by: Rosi, Gabriele, et al.
Published: (2025)

EdgeRelight360: Text-Conditioned 360-Degree HDR Image Generation for Real-Time On-Device Video Portrait Relighting
by: Lin, Min-Hui, et al.
Published: (2024)

Imagine360: Immersive 360 Video Generation from Perspective Anchor
by: Tan, Jing, et al.
Published: (2024)

OmniAudio: Generating Spatial Audio from 360-Degree Video
by: Liu, Huadai, et al.
Published: (2025)

Improving Geometric Consistency for 360-Degree Neural Radiance Fields in Indoor Scenarios
by: Repinetska, Iryna, et al.
Published: (2025)

CLIP-SENet: CLIP-based Semantic Enhancement Network for Vehicle Re-identification
by: Lu, Liping, et al.
Published: (2025)

WP-CLIP: Leveraging CLIP to Predict Wölfflin's Principles in Visual Art
by: Ghildyal, Abhijay, et al.
Published: (2025)

SE360: Semantic Edit in 360$^\circ$ Panoramas via Hierarchical Data Construction
by: Zhong, Haoyi, et al.
Published: (2025)