:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Lai, Qiuxia, Li, Yu, Zeng, Ailing, Liu, Minhao, Sun, Hanqiu, Xu, Qiang
Format:	Preprint
Published:	2021
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2108.03418
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Vector Quantization Prompting for Continual Learning
by: Jiao, Li, et al.
Published: (2024)

Learning 3D Representations for Spatial Intelligence from Unposed Multi-View Images
by: Zhou, Bo, et al.
Published: (2026)

Structural Teacher-Student Normality Learning for Multi-Class Anomaly Detection and Localization
by: Deng, Hanqiu, et al.
Published: (2024)

TEASER: Token Enhanced Spatial Modeling for Expressions Reconstruction
by: Liu, Yunfei, et al.
Published: (2025)

Partially Shared Concept Bottleneck Models
by: Zhao, Delong, et al.
Published: (2025)

Spatial Information Bottleneck for Interpretable Visual Recognition
by: Shu, Kaixiang, et al.
Published: (2025)

SeCG: Semantic-Enhanced 3D Visual Grounding via Cross-modal Graph Attention
by: Xiao, Feng, et al.
Published: (2024)

MotionCraft: Crafting Whole-Body Motion with Plug-and-Play Multimodal Controls
by: Bian, Yuxuan, et al.
Published: (2024)

Information Bottleneck-based Causal Attention for Multi-label Medical Image Recognition
by: Cui, Xiaoxiao, et al.
Published: (2025)

OmniMotion-X: Versatile Multimodal Whole-Body Motion Generation
by: Xu, Guowei, et al.
Published: (2025)

Temporal Consistency-Aware Text-to-Motion Generation
by: Wang, Hongsong, et al.
Published: (2026)

STAA-SNN: Spatial-Temporal Attention Aggregator for Spiking Neural Networks
by: Zhang, Tianqing, et al.
Published: (2025)

The Dawn of Video Generation: Preliminary Explorations with SORA-like Models
by: Zeng, Ailing, et al.
Published: (2024)

Poly Kernel Inception Network for Remote Sensing Detection
by: Cai, Xinhao, et al.
Published: (2024)

Evaluating Text-to-Image Generative Models: An Empirical Study on Human Image Synthesis
by: Chen, Muxi, et al.
Published: (2024)

P3P: Pseudo-3D Pre-training for Scaling 3D Voxel-based Masked Autoencoders
by: Chen, Xuechao, et al.
Published: (2024)

Bootstrap Fine-Grained Vision-Language Alignment for Unified Zero-Shot Anomaly Localization
by: Deng, Hanqiu, et al.
Published: (2023)

Dual-Image Enhanced CLIP for Zero-Shot Anomaly Detection
by: Zhang, Zhaoxiang, et al.
Published: (2024)

Information Bottleneck-Guided Heterogeneous Graph Learning for Interpretable Neurodevelopmental Disorder Diagnosis
by: Li, Yueyang, et al.
Published: (2025)

MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions
by: Ju, Xuan, et al.
Published: (2024)

SpatialFormer: Semantic and Target Aware Attentions for Few-Shot Learning
by: Lai, Jinxiang, et al.
Published: (2023)

GPAvatar: Generalizable and Precise Head Avatar from Image(s)
by: Chu, Xuangeng, et al.
Published: (2024)

X-Pose: Detecting Any Keypoints
by: Yang, Jie, et al.
Published: (2023)

Language Guided Concept Bottleneck Models for Interpretable Continual Learning
by: Yu, Lu, et al.
Published: (2025)

A Conditional Probability Framework for Compositional Zero-shot Learning
by: Wu, Peng, et al.
Published: (2025)

Open-World Human-Object Interaction Detection via Multi-modal Prompts
by: Yang, Jie, et al.
Published: (2024)

Why Multimodal In-Context Learning Lags Behind? Unveiling the Inner Mechanisms and Bottlenecks
by: Wang, Yu, et al.
Published: (2026)

SC-Net: Robust Correspondence Learning via Spatial and Cross-Channel Context
by: Lin, Shuyuan, et al.
Published: (2025)

Learning Unsupervised Gaze Representation via Eye Mask Driven Information Bottleneck
by: Jiang, Yangzhou, et al.
Published: (2024)

Concept-wise Attention for Fine-grained Concept Bottleneck Models
by: Zhong, Minghong, et al.
Published: (2026)

On the Perception Bottleneck of VLMs for Chart Understanding
by: Liu, Junteng, et al.
Published: (2025)

Prototypical Information Bottlenecking and Disentangling for Multimodal Cancer Survival Prediction
by: Zhang, Yilan, et al.
Published: (2024)

A Training-Free Approach for Multi-ID Customization via Attention Adjustment and Spatial Control
by: Lin, Jiawei, et al.
Published: (2025)

TAB: Transformer Attention Bottlenecks enable User Intervention and Debugging in Vision-Language Models
by: Rahmanzadehgervi, Pooyan, et al.
Published: (2024)

Attend to Evidence: Evidence-Anchored Spatial Attention Supervision for Multimodal RLVR
by: Hu, Ruina, et al.
Published: (2026)

Cell Variational Information Bottleneck Network
by: Zhai, Zhonghua, et al.
Published: (2024)

Information-Bottleneck Driven Binary Neural Network for Change Detection
by: Yin, Kaijie, et al.
Published: (2025)

PAMD: Plausibility-Aware Motion Diffusion Model for Long Dance Generation
by: Wang, Hongsong, et al.
Published: (2025)

Disentangled Representation Learning with Transmitted Information Bottleneck
by: Dang, Zhuohang, et al.
Published: (2023)

RGB-Sonar Tracking Benchmark and Spatial Cross-Attention Transformer Tracker
by: Li, Yunfeng, et al.
Published: (2024)