Saved in:
| Main Author: | Li, Xin |
|---|---|
| Format: | Preprint |
| Published: |
2018
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/1802.04723 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
CauSight: Learning to Supersense for Visual Causal Discovery
by: Zhang, Yize, et al.
Published: (2025)
by: Zhang, Yize, et al.
Published: (2025)
What is Memory? A Homological Perspective
by: Li, Xin
Published: (2023)
by: Li, Xin
Published: (2023)
MITO: A Millimeter-Wave Dataset and Simulator for Non-Line-of-Sight Perception
by: Dodds, Laura, et al.
Published: (2025)
by: Dodds, Laura, et al.
Published: (2025)
AV-Unified: A Unified Framework for Audio-visual Scene Understanding
by: Li, Guangyao, et al.
Published: (2026)
by: Li, Guangyao, et al.
Published: (2026)
Panther: Illuminate the Sight of Multimodal LLMs with Instruction-Guided Visual Prompts
by: Li, Honglin, et al.
Published: (2024)
by: Li, Honglin, et al.
Published: (2024)
Hidden in Plain Sight: Evaluating Abstract Shape Recognition in Vision-Language Models
by: Hemmat, Arshia, et al.
Published: (2024)
by: Hemmat, Arshia, et al.
Published: (2024)
Adaptive Perception for Unified Visual Multi-modal Object Tracking
by: Hu, Xiantao, et al.
Published: (2025)
by: Hu, Xiantao, et al.
Published: (2025)
EchoSight: Advancing Visual-Language Models with Wiki Knowledge
by: Yan, Yibin, et al.
Published: (2024)
by: Yan, Yibin, et al.
Published: (2024)
PreSight: Enhancing Autonomous Vehicle Perception with City-Scale NeRF Priors
by: Yuan, Tianyuan, et al.
Published: (2024)
by: Yuan, Tianyuan, et al.
Published: (2024)
Bridging Generative and Discriminative Models for Unified Visual Perception with Diffusion Priors
by: Dong, Shiyin, et al.
Published: (2024)
by: Dong, Shiyin, et al.
Published: (2024)
Antidote: A Unified Framework for Mitigating LVLM Hallucinations in Counterfactual Presupposition and Object Perception
by: Wu, Yuanchen, et al.
Published: (2025)
by: Wu, Yuanchen, et al.
Published: (2025)
Knowledge to Sight: Reasoning over Visual Attributes via Knowledge Decomposition for Abnormality Grounding
by: Li, Jun, et al.
Published: (2025)
by: Li, Jun, et al.
Published: (2025)
Sight Over Site: Perception-Aware Reinforcement Learning for Efficient Robotic Inspection
by: Kuhlmann, Richard, et al.
Published: (2025)
by: Kuhlmann, Richard, et al.
Published: (2025)
UFO: A Unified Approach to Fine-grained Visual Perception via Open-ended Language Interface
by: Tang, Hao, et al.
Published: (2025)
by: Tang, Hao, et al.
Published: (2025)
A Unified and Controllable Framework for Layered Image Generation with Visual Effects
by: Yang, Jinrui, et al.
Published: (2026)
by: Yang, Jinrui, et al.
Published: (2026)
UniModel: A Visual-Only Framework for Unified Multimodal Understanding and Generation
by: Zhang, Chi, et al.
Published: (2025)
by: Zhang, Chi, et al.
Published: (2025)
UniVision: A Unified Framework for Vision-Centric 3D Perception
by: Hong, Yu, et al.
Published: (2024)
by: Hong, Yu, et al.
Published: (2024)
VisionReasoner: Unified Reasoning-Integrated Visual Perception via Reinforcement Learning
by: Liu, Yuqi, et al.
Published: (2025)
by: Liu, Yuqi, et al.
Published: (2025)
Hidden Meanings in Plain Sight: RebusBench for Evaluating Cognitive Visual Reasoning
by: Kasaei, Seyed Amir, et al.
Published: (2026)
by: Kasaei, Seyed Amir, et al.
Published: (2026)
FutureSightDrive: Thinking Visually with Spatio-Temporal CoT for Autonomous Driving
by: Zeng, Shuang, et al.
Published: (2025)
by: Zeng, Shuang, et al.
Published: (2025)
MMTL-UniAD: A Unified Framework for Multimodal and Multi-Task Learning in Assistive Driving Perception
by: Liu, Wenzhuo, et al.
Published: (2025)
by: Liu, Wenzhuo, et al.
Published: (2025)
SuperEx: Enhancing Indoor Mapping and Exploration using Non-Line-of-Sight Perception
by: Garg, Kush, et al.
Published: (2025)
by: Garg, Kush, et al.
Published: (2025)
MergeMix: A Unified Augmentation Paradigm for Visual and Multi-Modal Understanding
by: Jin, Xin, et al.
Published: (2025)
by: Jin, Xin, et al.
Published: (2025)
ImagiDrive: A Unified Imagination-and-Planning Framework for Autonomous Driving
by: Li, Jingyu, et al.
Published: (2025)
by: Li, Jingyu, et al.
Published: (2025)
A Unified Framework for Semi-Supervised Image Segmentation and Registration
by: Li, Ruizhe, et al.
Published: (2025)
by: Li, Ruizhe, et al.
Published: (2025)
A Unified 3D Object Perception Framework for Real-Time Outside-In Multi-Camera Systems
by: Wang, Yizhou, et al.
Published: (2026)
by: Wang, Yizhou, et al.
Published: (2026)
Visual Bridge: Universal Visual Perception Representations Generating
by: Gao, Yilin, et al.
Published: (2025)
by: Gao, Yilin, et al.
Published: (2025)
UniDGF: A Unified Detection-to-Generation Framework for Hierarchical Object Visual Recognition
by: Nan, Xinyu, et al.
Published: (2025)
by: Nan, Xinyu, et al.
Published: (2025)
Active Visual Perception: Opportunities and Challenges
by: Li, Yian, et al.
Published: (2025)
by: Li, Yian, et al.
Published: (2025)
OwlSight: A Robust Illumination Adaptation Framework for Dark Video Human Action Recognition
by: Cheng, Shihao, et al.
Published: (2025)
by: Cheng, Shihao, et al.
Published: (2025)
COOPER: A Unified Model for Cooperative Perception and Reasoning in Spatial Intelligence
by: Zhang, Zefeng, et al.
Published: (2025)
by: Zhang, Zefeng, et al.
Published: (2025)
Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception
by: Pang, Ziqi, et al.
Published: (2025)
by: Pang, Ziqi, et al.
Published: (2025)
StableIdentity: Inserting Anybody into Anywhere at First Sight
by: Wang, Qinghe, et al.
Published: (2024)
by: Wang, Qinghe, et al.
Published: (2024)
VL-UniTrack: A Unified Framework with Visual-Language Prompts for UAV-Ground Visual Tracking
by: Xu, Boyue, et al.
Published: (2026)
by: Xu, Boyue, et al.
Published: (2026)
StoryImager: A Unified and Efficient Framework for Coherent Story Visualization and Completion
by: Tao, Ming, et al.
Published: (2024)
by: Tao, Ming, et al.
Published: (2024)
A Unified Framework for 3D Scene Understanding
by: Xu, Wei, et al.
Published: (2024)
by: Xu, Wei, et al.
Published: (2024)
CoopDETR: A Unified Cooperative Perception Framework for 3D Detection via Object Query
by: Wang, Zhe, et al.
Published: (2025)
by: Wang, Zhe, et al.
Published: (2025)
UV-M3TL: A Unified and Versatile Multimodal Multi-Task Learning Framework for Assistive Driving Perception
by: Liu, Wenzhuo, et al.
Published: (2026)
by: Liu, Wenzhuo, et al.
Published: (2026)
Audio-Visual World Models: Towards Multisensory Imagination in Sight and Sound
by: Wang, Jiahua, et al.
Published: (2025)
by: Wang, Jiahua, et al.
Published: (2025)
ViP$^2$-CLIP: Visual-Perception Prompting with Unified Alignment for Zero-Shot Anomaly Detection
by: Yang, Ziteng, et al.
Published: (2025)
by: Yang, Ziteng, et al.
Published: (2025)
Similar Items
-
CauSight: Learning to Supersense for Visual Causal Discovery
by: Zhang, Yize, et al.
Published: (2025) -
What is Memory? A Homological Perspective
by: Li, Xin
Published: (2023) -
MITO: A Millimeter-Wave Dataset and Simulator for Non-Line-of-Sight Perception
by: Dodds, Laura, et al.
Published: (2025) -
AV-Unified: A Unified Framework for Audio-visual Scene Understanding
by: Li, Guangyao, et al.
Published: (2026) -
Panther: Illuminate the Sight of Multimodal LLMs with Instruction-Guided Visual Prompts
by: Li, Honglin, et al.
Published: (2024)