Bibliographic Details
Main Author:	Li, Xin
Format:	Preprint
Published:	2018
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/1802.04723

Similar Items

CauSight: Learning to Supersense for Visual Causal Discovery
by: Zhang, Yize, et al.
Published: (2025)

What is Memory? A Homological Perspective
by: Li, Xin
Published: (2023)

MITO: A Millimeter-Wave Dataset and Simulator for Non-Line-of-Sight Perception
by: Dodds, Laura, et al.
Published: (2025)

AV-Unified: A Unified Framework for Audio-visual Scene Understanding
by: Li, Guangyao, et al.
Published: (2026)

Panther: Illuminate the Sight of Multimodal LLMs with Instruction-Guided Visual Prompts
by: Li, Honglin, et al.
Published: (2024)

Hidden in Plain Sight: Evaluating Abstract Shape Recognition in Vision-Language Models
by: Hemmat, Arshia, et al.
Published: (2024)

Adaptive Perception for Unified Visual Multi-modal Object Tracking
by: Hu, Xiantao, et al.
Published: (2025)

EchoSight: Advancing Visual-Language Models with Wiki Knowledge
by: Yan, Yibin, et al.
Published: (2024)

PreSight: Enhancing Autonomous Vehicle Perception with City-Scale NeRF Priors
by: Yuan, Tianyuan, et al.
Published: (2024)

Bridging Generative and Discriminative Models for Unified Visual Perception with Diffusion Priors
by: Dong, Shiyin, et al.
Published: (2024)

Antidote: A Unified Framework for Mitigating LVLM Hallucinations in Counterfactual Presupposition and Object Perception
by: Wu, Yuanchen, et al.
Published: (2025)

Knowledge to Sight: Reasoning over Visual Attributes via Knowledge Decomposition for Abnormality Grounding
by: Li, Jun, et al.
Published: (2025)

Sight Over Site: Perception-Aware Reinforcement Learning for Efficient Robotic Inspection
by: Kuhlmann, Richard, et al.
Published: (2025)

UFO: A Unified Approach to Fine-grained Visual Perception via Open-ended Language Interface
by: Tang, Hao, et al.
Published: (2025)

A Unified and Controllable Framework for Layered Image Generation with Visual Effects
by: Yang, Jinrui, et al.
Published: (2026)

UniModel: A Visual-Only Framework for Unified Multimodal Understanding and Generation
by: Zhang, Chi, et al.
Published: (2025)

UniVision: A Unified Framework for Vision-Centric 3D Perception
by: Hong, Yu, et al.
Published: (2024)

VisionReasoner: Unified Reasoning-Integrated Visual Perception via Reinforcement Learning
by: Liu, Yuqi, et al.
Published: (2025)

Hidden Meanings in Plain Sight: RebusBench for Evaluating Cognitive Visual Reasoning
by: Kasaei, Seyed Amir, et al.
Published: (2026)

FutureSightDrive: Thinking Visually with Spatio-Temporal CoT for Autonomous Driving
by: Zeng, Shuang, et al.
Published: (2025)

MMTL-UniAD: A Unified Framework for Multimodal and Multi-Task Learning in Assistive Driving Perception
by: Liu, Wenzhuo, et al.
Published: (2025)

SuperEx: Enhancing Indoor Mapping and Exploration using Non-Line-of-Sight Perception
by: Garg, Kush, et al.
Published: (2025)

MergeMix: A Unified Augmentation Paradigm for Visual and Multi-Modal Understanding
by: Jin, Xin, et al.
Published: (2025)

ImagiDrive: A Unified Imagination-and-Planning Framework for Autonomous Driving
by: Li, Jingyu, et al.
Published: (2025)

A Unified Framework for Semi-Supervised Image Segmentation and Registration
by: Li, Ruizhe, et al.
Published: (2025)

A Unified 3D Object Perception Framework for Real-Time Outside-In Multi-Camera Systems
by: Wang, Yizhou, et al.
Published: (2026)

Visual Bridge: Universal Visual Perception Representations Generating
by: Gao, Yilin, et al.
Published: (2025)

UniDGF: A Unified Detection-to-Generation Framework for Hierarchical Object Visual Recognition
by: Nan, Xinyu, et al.
Published: (2025)

Active Visual Perception: Opportunities and Challenges
by: Li, Yian, et al.
Published: (2025)

OwlSight: A Robust Illumination Adaptation Framework for Dark Video Human Action Recognition
by: Cheng, Shihao, et al.
Published: (2025)

COOPER: A Unified Model for Cooperative Perception and Reasoning in Spatial Intelligence
by: Zhang, Zefeng, et al.
Published: (2025)

Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception
by: Pang, Ziqi, et al.
Published: (2025)

StableIdentity: Inserting Anybody into Anywhere at First Sight
by: Wang, Qinghe, et al.
Published: (2024)

VL-UniTrack: A Unified Framework with Visual-Language Prompts for UAV-Ground Visual Tracking
by: Xu, Boyue, et al.
Published: (2026)

StoryImager: A Unified and Efficient Framework for Coherent Story Visualization and Completion
by: Tao, Ming, et al.
Published: (2024)

A Unified Framework for 3D Scene Understanding
by: Xu, Wei, et al.
Published: (2024)

CoopDETR: A Unified Cooperative Perception Framework for 3D Detection via Object Query
by: Wang, Zhe, et al.
Published: (2025)

UV-M3TL: A Unified and Versatile Multimodal Multi-Task Learning Framework for Assistive Driving Perception
by: Liu, Wenzhuo, et al.
Published: (2026)

Audio-Visual World Models: Towards Multisensory Imagination in Sight and Sound
by: Wang, Jiahua, et al.
Published: (2025)

ViP$^2$-CLIP: Visual-Perception Prompting with Unified Alignment for Zero-Shot Anomaly Detection
by: Yang, Ziteng, et al.
Published: (2025)