Saved in:
| Main Authors: | Lai, Qiuxia, Li, Yu, Zeng, Ailing, Liu, Minhao, Sun, Hanqiu, Xu, Qiang |
|---|---|
| Format: | Preprint |
| Published: |
2021
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2108.03418 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Vector Quantization Prompting for Continual Learning
by: Jiao, Li, et al.
Published: (2024)
by: Jiao, Li, et al.
Published: (2024)
Learning 3D Representations for Spatial Intelligence from Unposed Multi-View Images
by: Zhou, Bo, et al.
Published: (2026)
by: Zhou, Bo, et al.
Published: (2026)
Structural Teacher-Student Normality Learning for Multi-Class Anomaly Detection and Localization
by: Deng, Hanqiu, et al.
Published: (2024)
by: Deng, Hanqiu, et al.
Published: (2024)
TEASER: Token Enhanced Spatial Modeling for Expressions Reconstruction
by: Liu, Yunfei, et al.
Published: (2025)
by: Liu, Yunfei, et al.
Published: (2025)
Partially Shared Concept Bottleneck Models
by: Zhao, Delong, et al.
Published: (2025)
by: Zhao, Delong, et al.
Published: (2025)
Spatial Information Bottleneck for Interpretable Visual Recognition
by: Shu, Kaixiang, et al.
Published: (2025)
by: Shu, Kaixiang, et al.
Published: (2025)
SeCG: Semantic-Enhanced 3D Visual Grounding via Cross-modal Graph Attention
by: Xiao, Feng, et al.
Published: (2024)
by: Xiao, Feng, et al.
Published: (2024)
MotionCraft: Crafting Whole-Body Motion with Plug-and-Play Multimodal Controls
by: Bian, Yuxuan, et al.
Published: (2024)
by: Bian, Yuxuan, et al.
Published: (2024)
Information Bottleneck-based Causal Attention for Multi-label Medical Image Recognition
by: Cui, Xiaoxiao, et al.
Published: (2025)
by: Cui, Xiaoxiao, et al.
Published: (2025)
OmniMotion-X: Versatile Multimodal Whole-Body Motion Generation
by: Xu, Guowei, et al.
Published: (2025)
by: Xu, Guowei, et al.
Published: (2025)
Temporal Consistency-Aware Text-to-Motion Generation
by: Wang, Hongsong, et al.
Published: (2026)
by: Wang, Hongsong, et al.
Published: (2026)
STAA-SNN: Spatial-Temporal Attention Aggregator for Spiking Neural Networks
by: Zhang, Tianqing, et al.
Published: (2025)
by: Zhang, Tianqing, et al.
Published: (2025)
The Dawn of Video Generation: Preliminary Explorations with SORA-like Models
by: Zeng, Ailing, et al.
Published: (2024)
by: Zeng, Ailing, et al.
Published: (2024)
Poly Kernel Inception Network for Remote Sensing Detection
by: Cai, Xinhao, et al.
Published: (2024)
by: Cai, Xinhao, et al.
Published: (2024)
Evaluating Text-to-Image Generative Models: An Empirical Study on Human Image Synthesis
by: Chen, Muxi, et al.
Published: (2024)
by: Chen, Muxi, et al.
Published: (2024)
P3P: Pseudo-3D Pre-training for Scaling 3D Voxel-based Masked Autoencoders
by: Chen, Xuechao, et al.
Published: (2024)
by: Chen, Xuechao, et al.
Published: (2024)
Bootstrap Fine-Grained Vision-Language Alignment for Unified Zero-Shot Anomaly Localization
by: Deng, Hanqiu, et al.
Published: (2023)
by: Deng, Hanqiu, et al.
Published: (2023)
Dual-Image Enhanced CLIP for Zero-Shot Anomaly Detection
by: Zhang, Zhaoxiang, et al.
Published: (2024)
by: Zhang, Zhaoxiang, et al.
Published: (2024)
Information Bottleneck-Guided Heterogeneous Graph Learning for Interpretable Neurodevelopmental Disorder Diagnosis
by: Li, Yueyang, et al.
Published: (2025)
by: Li, Yueyang, et al.
Published: (2025)
MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions
by: Ju, Xuan, et al.
Published: (2024)
by: Ju, Xuan, et al.
Published: (2024)
SpatialFormer: Semantic and Target Aware Attentions for Few-Shot Learning
by: Lai, Jinxiang, et al.
Published: (2023)
by: Lai, Jinxiang, et al.
Published: (2023)
GPAvatar: Generalizable and Precise Head Avatar from Image(s)
by: Chu, Xuangeng, et al.
Published: (2024)
by: Chu, Xuangeng, et al.
Published: (2024)
X-Pose: Detecting Any Keypoints
by: Yang, Jie, et al.
Published: (2023)
by: Yang, Jie, et al.
Published: (2023)
Language Guided Concept Bottleneck Models for Interpretable Continual Learning
by: Yu, Lu, et al.
Published: (2025)
by: Yu, Lu, et al.
Published: (2025)
A Conditional Probability Framework for Compositional Zero-shot Learning
by: Wu, Peng, et al.
Published: (2025)
by: Wu, Peng, et al.
Published: (2025)
Open-World Human-Object Interaction Detection via Multi-modal Prompts
by: Yang, Jie, et al.
Published: (2024)
by: Yang, Jie, et al.
Published: (2024)
Why Multimodal In-Context Learning Lags Behind? Unveiling the Inner Mechanisms and Bottlenecks
by: Wang, Yu, et al.
Published: (2026)
by: Wang, Yu, et al.
Published: (2026)
SC-Net: Robust Correspondence Learning via Spatial and Cross-Channel Context
by: Lin, Shuyuan, et al.
Published: (2025)
by: Lin, Shuyuan, et al.
Published: (2025)
Learning Unsupervised Gaze Representation via Eye Mask Driven Information Bottleneck
by: Jiang, Yangzhou, et al.
Published: (2024)
by: Jiang, Yangzhou, et al.
Published: (2024)
Concept-wise Attention for Fine-grained Concept Bottleneck Models
by: Zhong, Minghong, et al.
Published: (2026)
by: Zhong, Minghong, et al.
Published: (2026)
On the Perception Bottleneck of VLMs for Chart Understanding
by: Liu, Junteng, et al.
Published: (2025)
by: Liu, Junteng, et al.
Published: (2025)
Prototypical Information Bottlenecking and Disentangling for Multimodal Cancer Survival Prediction
by: Zhang, Yilan, et al.
Published: (2024)
by: Zhang, Yilan, et al.
Published: (2024)
A Training-Free Approach for Multi-ID Customization via Attention Adjustment and Spatial Control
by: Lin, Jiawei, et al.
Published: (2025)
by: Lin, Jiawei, et al.
Published: (2025)
TAB: Transformer Attention Bottlenecks enable User Intervention and Debugging in Vision-Language Models
by: Rahmanzadehgervi, Pooyan, et al.
Published: (2024)
by: Rahmanzadehgervi, Pooyan, et al.
Published: (2024)
Attend to Evidence: Evidence-Anchored Spatial Attention Supervision for Multimodal RLVR
by: Hu, Ruina, et al.
Published: (2026)
by: Hu, Ruina, et al.
Published: (2026)
Cell Variational Information Bottleneck Network
by: Zhai, Zhonghua, et al.
Published: (2024)
by: Zhai, Zhonghua, et al.
Published: (2024)
Information-Bottleneck Driven Binary Neural Network for Change Detection
by: Yin, Kaijie, et al.
Published: (2025)
by: Yin, Kaijie, et al.
Published: (2025)
PAMD: Plausibility-Aware Motion Diffusion Model for Long Dance Generation
by: Wang, Hongsong, et al.
Published: (2025)
by: Wang, Hongsong, et al.
Published: (2025)
Disentangled Representation Learning with Transmitted Information Bottleneck
by: Dang, Zhuohang, et al.
Published: (2023)
by: Dang, Zhuohang, et al.
Published: (2023)
RGB-Sonar Tracking Benchmark and Spatial Cross-Attention Transformer Tracker
by: Li, Yunfeng, et al.
Published: (2024)
by: Li, Yunfeng, et al.
Published: (2024)
Similar Items
-
Vector Quantization Prompting for Continual Learning
by: Jiao, Li, et al.
Published: (2024) -
Learning 3D Representations for Spatial Intelligence from Unposed Multi-View Images
by: Zhou, Bo, et al.
Published: (2026) -
Structural Teacher-Student Normality Learning for Multi-Class Anomaly Detection and Localization
by: Deng, Hanqiu, et al.
Published: (2024) -
TEASER: Token Enhanced Spatial Modeling for Expressions Reconstruction
by: Liu, Yunfei, et al.
Published: (2025) -
Partially Shared Concept Bottleneck Models
by: Zhao, Delong, et al.
Published: (2025)