:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Zhang, Xuyang, Zhang, Xi, Chen, Liang, Shi, Hao, Guo, Qingshan
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2505.22226
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

MSCGC-KAN: Multi-scale Causal Graph Convolution and Kolmogorov-Arnold Feature Mapping for EEG Emotion Recognition
by: Gong, Haoliang, et al.
Published: (2026)

Learning Expressive And Generalizable Motion Features For Face Forgery Detection
by: Zhang, Jingyi, et al.
Published: (2024)

T2VWorldBench: A Benchmark for Evaluating World Knowledge in Text-to-Video Generation
by: Chen, Yubin, et al.
Published: (2025)

SAGA: Selective Adaptive Gating for Efficient and Expressive Linear Attention
by: Cao, Yuan, et al.
Published: (2025)

Efficient RGB-D Scene Understanding via Multi-task Adaptive Learning and Cross-dimensional Feature Guidance
by: Sun, Guodong, et al.
Published: (2026)

FilterPrompt: A Simple yet Efficient Approach to Guide Image Appearance Transfer in Diffusion Models
by: Wang, Xi, et al.
Published: (2024)

PG-NeuS: Robust and Efficient Point Guidance for Multi-View Neural Surface Reconstruction
by: Zhang, Chen, et al.
Published: (2023)

Efficient Reinforcement Learning Through Adaptively Pretrained Visual Encoder
by: Zhang, Yuhan, et al.
Published: (2025)

PGAHum: Prior-Guided Geometry and Appearance Learning for High-Fidelity Animatable Human Reconstruction
by: Wang, Hao, et al.
Published: (2024)

Few-shot NeRF by Adaptive Rendering Loss Regularization
by: Xu, Qingshan, et al.
Published: (2024)

PSDF: Prior-Driven Neural Implicit Surface Learning for Multi-view Reconstruction
by: Su, Wanjuan, et al.
Published: (2024)

VGGT-Motion: Motion-Aware Calibration-Free Monocular SLAM for Long-Range Consistency
by: Xiong, Zhuang, et al.
Published: (2026)

$E^{3}$Gen: Efficient, Expressive and Editable Avatars Generation
by: Zhang, Weitian, et al.
Published: (2024)

Improving 3D Gaussian Splatting Compression by Scene-Adaptive Lattice Vector Quantization
by: Xu, Hao, et al.
Published: (2025)

Shifting Spotlight for Co-supervision: A Simple yet Efficient Single-branch Network to See Through Camouflage
by: Hu, Yang, et al.
Published: (2024)

ExpPortrait: Expressive Portrait Generation via Personalized Representation
by: Wang, Junyi, et al.
Published: (2026)

EchoingPixels: Cross-Modal Adaptive Token Reduction for Efficient Audio-Visual LLMs
by: Gong, Chao, et al.
Published: (2025)

EDM: Efficient Deep Feature Matching
by: Li, Xi, et al.
Published: (2025)

Expressive Speech-driven Facial Animation with controllable emotions
by: Chen, Yutong, et al.
Published: (2023)

Your Vision-Language Model Can't Even Count to 20: Exposing the Failures of VLMs in Compositional Counting
by: Guo, Xuyang, et al.
Published: (2025)

Practical Video Object Detection via Feature Selection and Aggregation
by: Shi, Yuheng, et al.
Published: (2024)

CLIDD: Cross-Layer Independent Deformable Description for Efficient and Discriminative Local Feature Representation
by: Yao, Haodi, et al.
Published: (2026)

Compact Hadamard Latent Codes for Efficient Spectral Rendering
by: Yu, Jiaqi, et al.
Published: (2026)

Joint Co-Speech Gesture and Expressive Talking Face Generation using Diffusion with Adapters
by: Hogue, Steven, et al.
Published: (2024)

Distribution-Aware Hadamard Quantization for Hardware-Efficient Implicit Neural Representations
by: Zhou, Wenyong, et al.
Published: (2025)

Hadamard Attention Recurrent Transformer: A Strong Baseline for Stereo Matching Transformer
by: Chen, Ziyang, et al.
Published: (2025)

ARM: A Learnable, Plug-and-Play Module for CLIP-based Open-vocabulary Semantic Segmentation
by: Liu, Ziquan, et al.
Published: (2025)

Let Storytelling Tell Vivid Stories: An Expressive and Fluent Multimodal Storyteller
by: Zang, Chuanqi, et al.
Published: (2024)

CIBR: Cross-modal Information Bottleneck Regularization for Robust CLIP Generalization
by: Ji, Yingrui, et al.
Published: (2025)

X-Dyna: Expressive Dynamic Human Image Animation
by: Chang, Di, et al.
Published: (2025)

CEDex: Cross-Embodiment Dexterous Grasp Generation at Scale from Human-like Contact Representations
by: Wu, Zhiyuan, et al.
Published: (2025)

Controllable and Expressive One-Shot Video Head Swapping
by: Ji, Chaonan, et al.
Published: (2025)

Drone-assisted Road Gaussian Splatting with Cross-view Uncertainty
by: Zhang, Saining, et al.
Published: (2024)

MEMO: Memory-Guided Diffusion for Expressive Talking Video Generation
by: Zheng, Longtao, et al.
Published: (2024)

Towards Diverse Binary Segmentation via A Simple yet General Gated Network
by: Zhao, Xiaoqi, et al.
Published: (2023)

DeX-Portrait: Disentangled and Expressive Portrait Animation via Explicit and Latent Motion Representations
by: Shi, Yuxiang, et al.
Published: (2025)

Revisiting Efficient Semantic Segmentation: Learning Offsets for Better Spatial and Class Feature Alignment
by: Zhang, Shi-Chen, et al.
Published: (2025)

Camera-LiDAR Cross-modality Gait Recognition
by: Guo, Wenxuan, et al.
Published: (2024)

Follow-Your-Emoji-Faster: Towards Efficient, Fine-Controllable, and Expressive Freestyle Portrait Animation
by: Ma, Yue, et al.
Published: (2025)

SPEAL: Skeletal Prior Embedded Attention Learning for Cross-Source Point Cloud Registration
by: Xiong, Kezheng, et al.
Published: (2023)