Saved in:
| Main Authors: | Zhang, Xuyang, Zhang, Xi, Chen, Liang, Shi, Hao, Guo, Qingshan |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2505.22226 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
MSCGC-KAN: Multi-scale Causal Graph Convolution and Kolmogorov-Arnold Feature Mapping for EEG Emotion Recognition
by: Gong, Haoliang, et al.
Published: (2026)
by: Gong, Haoliang, et al.
Published: (2026)
Learning Expressive And Generalizable Motion Features For Face Forgery Detection
by: Zhang, Jingyi, et al.
Published: (2024)
by: Zhang, Jingyi, et al.
Published: (2024)
T2VWorldBench: A Benchmark for Evaluating World Knowledge in Text-to-Video Generation
by: Chen, Yubin, et al.
Published: (2025)
by: Chen, Yubin, et al.
Published: (2025)
SAGA: Selective Adaptive Gating for Efficient and Expressive Linear Attention
by: Cao, Yuan, et al.
Published: (2025)
by: Cao, Yuan, et al.
Published: (2025)
Efficient RGB-D Scene Understanding via Multi-task Adaptive Learning and Cross-dimensional Feature Guidance
by: Sun, Guodong, et al.
Published: (2026)
by: Sun, Guodong, et al.
Published: (2026)
FilterPrompt: A Simple yet Efficient Approach to Guide Image Appearance Transfer in Diffusion Models
by: Wang, Xi, et al.
Published: (2024)
by: Wang, Xi, et al.
Published: (2024)
PG-NeuS: Robust and Efficient Point Guidance for Multi-View Neural Surface Reconstruction
by: Zhang, Chen, et al.
Published: (2023)
by: Zhang, Chen, et al.
Published: (2023)
Efficient Reinforcement Learning Through Adaptively Pretrained Visual Encoder
by: Zhang, Yuhan, et al.
Published: (2025)
by: Zhang, Yuhan, et al.
Published: (2025)
PGAHum: Prior-Guided Geometry and Appearance Learning for High-Fidelity Animatable Human Reconstruction
by: Wang, Hao, et al.
Published: (2024)
by: Wang, Hao, et al.
Published: (2024)
Few-shot NeRF by Adaptive Rendering Loss Regularization
by: Xu, Qingshan, et al.
Published: (2024)
by: Xu, Qingshan, et al.
Published: (2024)
PSDF: Prior-Driven Neural Implicit Surface Learning for Multi-view Reconstruction
by: Su, Wanjuan, et al.
Published: (2024)
by: Su, Wanjuan, et al.
Published: (2024)
VGGT-Motion: Motion-Aware Calibration-Free Monocular SLAM for Long-Range Consistency
by: Xiong, Zhuang, et al.
Published: (2026)
by: Xiong, Zhuang, et al.
Published: (2026)
$E^{3}$Gen: Efficient, Expressive and Editable Avatars Generation
by: Zhang, Weitian, et al.
Published: (2024)
by: Zhang, Weitian, et al.
Published: (2024)
Improving 3D Gaussian Splatting Compression by Scene-Adaptive Lattice Vector Quantization
by: Xu, Hao, et al.
Published: (2025)
by: Xu, Hao, et al.
Published: (2025)
Shifting Spotlight for Co-supervision: A Simple yet Efficient Single-branch Network to See Through Camouflage
by: Hu, Yang, et al.
Published: (2024)
by: Hu, Yang, et al.
Published: (2024)
ExpPortrait: Expressive Portrait Generation via Personalized Representation
by: Wang, Junyi, et al.
Published: (2026)
by: Wang, Junyi, et al.
Published: (2026)
EchoingPixels: Cross-Modal Adaptive Token Reduction for Efficient Audio-Visual LLMs
by: Gong, Chao, et al.
Published: (2025)
by: Gong, Chao, et al.
Published: (2025)
EDM: Efficient Deep Feature Matching
by: Li, Xi, et al.
Published: (2025)
by: Li, Xi, et al.
Published: (2025)
Expressive Speech-driven Facial Animation with controllable emotions
by: Chen, Yutong, et al.
Published: (2023)
by: Chen, Yutong, et al.
Published: (2023)
Your Vision-Language Model Can't Even Count to 20: Exposing the Failures of VLMs in Compositional Counting
by: Guo, Xuyang, et al.
Published: (2025)
by: Guo, Xuyang, et al.
Published: (2025)
Practical Video Object Detection via Feature Selection and Aggregation
by: Shi, Yuheng, et al.
Published: (2024)
by: Shi, Yuheng, et al.
Published: (2024)
CLIDD: Cross-Layer Independent Deformable Description for Efficient and Discriminative Local Feature Representation
by: Yao, Haodi, et al.
Published: (2026)
by: Yao, Haodi, et al.
Published: (2026)
Compact Hadamard Latent Codes for Efficient Spectral Rendering
by: Yu, Jiaqi, et al.
Published: (2026)
by: Yu, Jiaqi, et al.
Published: (2026)
Joint Co-Speech Gesture and Expressive Talking Face Generation using Diffusion with Adapters
by: Hogue, Steven, et al.
Published: (2024)
by: Hogue, Steven, et al.
Published: (2024)
Distribution-Aware Hadamard Quantization for Hardware-Efficient Implicit Neural Representations
by: Zhou, Wenyong, et al.
Published: (2025)
by: Zhou, Wenyong, et al.
Published: (2025)
Hadamard Attention Recurrent Transformer: A Strong Baseline for Stereo Matching Transformer
by: Chen, Ziyang, et al.
Published: (2025)
by: Chen, Ziyang, et al.
Published: (2025)
ARM: A Learnable, Plug-and-Play Module for CLIP-based Open-vocabulary Semantic Segmentation
by: Liu, Ziquan, et al.
Published: (2025)
by: Liu, Ziquan, et al.
Published: (2025)
Let Storytelling Tell Vivid Stories: An Expressive and Fluent Multimodal Storyteller
by: Zang, Chuanqi, et al.
Published: (2024)
by: Zang, Chuanqi, et al.
Published: (2024)
CIBR: Cross-modal Information Bottleneck Regularization for Robust CLIP Generalization
by: Ji, Yingrui, et al.
Published: (2025)
by: Ji, Yingrui, et al.
Published: (2025)
X-Dyna: Expressive Dynamic Human Image Animation
by: Chang, Di, et al.
Published: (2025)
by: Chang, Di, et al.
Published: (2025)
CEDex: Cross-Embodiment Dexterous Grasp Generation at Scale from Human-like Contact Representations
by: Wu, Zhiyuan, et al.
Published: (2025)
by: Wu, Zhiyuan, et al.
Published: (2025)
Controllable and Expressive One-Shot Video Head Swapping
by: Ji, Chaonan, et al.
Published: (2025)
by: Ji, Chaonan, et al.
Published: (2025)
Drone-assisted Road Gaussian Splatting with Cross-view Uncertainty
by: Zhang, Saining, et al.
Published: (2024)
by: Zhang, Saining, et al.
Published: (2024)
MEMO: Memory-Guided Diffusion for Expressive Talking Video Generation
by: Zheng, Longtao, et al.
Published: (2024)
by: Zheng, Longtao, et al.
Published: (2024)
Towards Diverse Binary Segmentation via A Simple yet General Gated Network
by: Zhao, Xiaoqi, et al.
Published: (2023)
by: Zhao, Xiaoqi, et al.
Published: (2023)
DeX-Portrait: Disentangled and Expressive Portrait Animation via Explicit and Latent Motion Representations
by: Shi, Yuxiang, et al.
Published: (2025)
by: Shi, Yuxiang, et al.
Published: (2025)
Revisiting Efficient Semantic Segmentation: Learning Offsets for Better Spatial and Class Feature Alignment
by: Zhang, Shi-Chen, et al.
Published: (2025)
by: Zhang, Shi-Chen, et al.
Published: (2025)
Camera-LiDAR Cross-modality Gait Recognition
by: Guo, Wenxuan, et al.
Published: (2024)
by: Guo, Wenxuan, et al.
Published: (2024)
Follow-Your-Emoji-Faster: Towards Efficient, Fine-Controllable, and Expressive Freestyle Portrait Animation
by: Ma, Yue, et al.
Published: (2025)
by: Ma, Yue, et al.
Published: (2025)
SPEAL: Skeletal Prior Embedded Attention Learning for Cross-Source Point Cloud Registration
by: Xiong, Kezheng, et al.
Published: (2023)
by: Xiong, Kezheng, et al.
Published: (2023)
Similar Items
-
MSCGC-KAN: Multi-scale Causal Graph Convolution and Kolmogorov-Arnold Feature Mapping for EEG Emotion Recognition
by: Gong, Haoliang, et al.
Published: (2026) -
Learning Expressive And Generalizable Motion Features For Face Forgery Detection
by: Zhang, Jingyi, et al.
Published: (2024) -
T2VWorldBench: A Benchmark for Evaluating World Knowledge in Text-to-Video Generation
by: Chen, Yubin, et al.
Published: (2025) -
SAGA: Selective Adaptive Gating for Efficient and Expressive Linear Attention
by: Cao, Yuan, et al.
Published: (2025) -
Efficient RGB-D Scene Understanding via Multi-task Adaptive Learning and Cross-dimensional Feature Guidance
by: Sun, Guodong, et al.
Published: (2026)