Saved in:
| Main Authors: | Wu, Longhuang, Tian, Shangxuan, Wang, Youxin, Xiong, Pengfei |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2402.11540 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Multi-modal Collaborative Optimization and Expansion Network for Event-assisted Single-eye Expression Recognition
by: Han, Runduo, et al.
Published: (2025)
by: Han, Runduo, et al.
Published: (2025)
Rethinking Detecting Salient and Camouflaged Objects in Unconstrained Scenes
by: Zhou, Zhangjun, et al.
Published: (2024)
by: Zhou, Zhangjun, et al.
Published: (2024)
MonoCD: Monocular 3D Object Detection with Complementary Depths
by: Yan, Longfei, et al.
Published: (2024)
by: Yan, Longfei, et al.
Published: (2024)
Detecting Deepfakes with Multivariate Soft Blending and CLIP-based Image-Text Alignment
by: Li, Jingwei, et al.
Published: (2026)
by: Li, Jingwei, et al.
Published: (2026)
Enhancing Fruit and Vegetable Detection in Unconstrained Environment with a Novel Dataset
by: Khanna, Sandeep, et al.
Published: (2024)
by: Khanna, Sandeep, et al.
Published: (2024)
Generating Adversarial Events: A Motion-Aware Point Cloud Framework
by: Ren, Hongwei, et al.
Published: (2026)
by: Ren, Hongwei, et al.
Published: (2026)
HiH: A Multi-modal Hierarchy in Hierarchy Network for Unconstrained Gait Recognition
by: Wang, Lei, et al.
Published: (2023)
by: Wang, Lei, et al.
Published: (2023)
Hunyuan-GameCraft: High-dynamic Interactive Game Video Generation with Hybrid History Condition
by: Li, Jiaqi, et al.
Published: (2025)
by: Li, Jiaqi, et al.
Published: (2025)
CCF: Complementary Collaborative Fusion for Domain Generalized Multi-Modal 3D Object Detection
by: Wu, Yuchen, et al.
Published: (2026)
by: Wu, Yuchen, et al.
Published: (2026)
Complementary Text-Guided Attention for Zero-Shot Adversarial Robustness
by: Yu, Lu, et al.
Published: (2026)
by: Yu, Lu, et al.
Published: (2026)
Towards Unconstrained Human-Object Interaction
by: Tonini, Francesco, et al.
Published: (2026)
by: Tonini, Francesco, et al.
Published: (2026)
Text Region Multiple Information Perception Network for Scene Text Detection
by: Zheng, Jinzhi, et al.
Published: (2024)
by: Zheng, Jinzhi, et al.
Published: (2024)
SCLNet: A Scale-Robust Complementary Learning Network for Object Detection in UAV Images
by: Li, Xuexue
Published: (2024)
by: Li, Xuexue
Published: (2024)
TextDiff: Mask-Guided Residual Diffusion Models for Scene Text Image Super-Resolution
by: Liu, Baolin, et al.
Published: (2023)
by: Liu, Baolin, et al.
Published: (2023)
DevilSight: Augmenting Monocular Human Avatar Reconstruction through a Virtual Perspective
by: Chen, Yushuo, et al.
Published: (2025)
by: Chen, Yushuo, et al.
Published: (2025)
Gaussian in the Wild: 3D Gaussian Splatting for Unconstrained Image Collections
by: Zhang, Dongbin, et al.
Published: (2024)
by: Zhang, Dongbin, et al.
Published: (2024)
DreamScene360: Unconstrained Text-to-3D Scene Generation with Panoramic Gaussian Splatting
by: Zhou, Shijie, et al.
Published: (2024)
by: Zhou, Shijie, et al.
Published: (2024)
Temporally Grounding Instructional Diagrams in Unconstrained Videos
by: Zhang, Jiahao, et al.
Published: (2024)
by: Zhang, Jiahao, et al.
Published: (2024)
LV-OSD: Language-Vision-Complementary Open-Set Object Detection
by: Zhang, Yupeng, et al.
Published: (2026)
by: Zhang, Yupeng, et al.
Published: (2026)
Human4DiT: 360-degree Human Video Generation with 4D Diffusion Transformer
by: Shao, Ruizhi, et al.
Published: (2024)
by: Shao, Ruizhi, et al.
Published: (2024)
Towards Unconstrained Audio Splicing Detection and Localization with Neural Networks
by: Moussa, Denise, et al.
Published: (2022)
by: Moussa, Denise, et al.
Published: (2022)
Heterogeneous Complementary Distillation
by: Xu, Liuchi, et al.
Published: (2025)
by: Xu, Liuchi, et al.
Published: (2025)
How to Utilize Complementary Vision-Text Information for 2D Structure Understanding
by: Dong, Jiancheng, et al.
Published: (2026)
by: Dong, Jiancheng, et al.
Published: (2026)
Finger in Camera Speaks Everything: Unconstrained Air-Writing for Real-World
by: Wu, Meiqi, et al.
Published: (2024)
by: Wu, Meiqi, et al.
Published: (2024)
Hunyuan-GameCraft-2: Instruction-following Interactive Game World Model
by: Tang, Junshu, et al.
Published: (2025)
by: Tang, Junshu, et al.
Published: (2025)
HAVE-FUN: Human Avatar Reconstruction from Few-Shot Unconstrained Images
by: Yang, Xihe, et al.
Published: (2023)
by: Yang, Xihe, et al.
Published: (2023)
Generalizable Sparse-View 3D Reconstruction from Unconstrained Images
by: Gupta, Vinayak, et al.
Published: (2026)
by: Gupta, Vinayak, et al.
Published: (2026)
Rotate Your Character: Revisiting Video Diffusion Models for High-Quality 3D Character Generation
by: Wang, Jin, et al.
Published: (2026)
by: Wang, Jin, et al.
Published: (2026)
Semi-Supervised Unconstrained Head Pose Estimation in the Wild
by: Zhou, Huayi, et al.
Published: (2024)
by: Zhou, Huayi, et al.
Published: (2024)
Distillation-guided Representation Learning for Unconstrained Gait Recognition
by: Guo, Yuxiang, et al.
Published: (2023)
by: Guo, Yuxiang, et al.
Published: (2023)
WildActor: Unconstrained Identity-Preserving Video Generation
by: Guo, Qin, et al.
Published: (2026)
by: Guo, Qin, et al.
Published: (2026)
Label-Efficient Object Detection via Region Proposal Network Pre-Training
by: Dong, Nanqing, et al.
Published: (2022)
by: Dong, Nanqing, et al.
Published: (2022)
WE-GS: An In-the-wild Efficient 3D Gaussian Representation for Unconstrained Photo Collections
by: Wang, Yuze, et al.
Published: (2024)
by: Wang, Yuze, et al.
Published: (2024)
Domain-Invariant Proposals based on a Balanced Domain Classifier for Object Detection
by: Wu, Zhize, et al.
Published: (2022)
by: Wu, Zhize, et al.
Published: (2022)
Text-Phase Synergy Network with Dual Priors for Unsupervised Cross-Domain Image Retrieval
by: Yang, Jing, et al.
Published: (2026)
by: Yang, Jing, et al.
Published: (2026)
SupScene: Scene-Structured Overlap Supervision for Image Retrieval in Unconstrained SfM
by: Shi, Xulei, et al.
Published: (2026)
by: Shi, Xulei, et al.
Published: (2026)
Deep Fourier-embedded Network for RGB and Thermal Salient Object Detection
by: Lyu, Pengfei, et al.
Published: (2024)
by: Lyu, Pengfei, et al.
Published: (2024)
Tri-path DINO: Feature Complementary Learning for Remote Sensing Multi-Class Change Detection
by: Zheng, Kai, et al.
Published: (2026)
by: Zheng, Kai, et al.
Published: (2026)
UP2You: Fast Reconstruction of Yourself from Unconstrained Photo Collections
by: Cai, Zeyu, et al.
Published: (2025)
by: Cai, Zeyu, et al.
Published: (2025)
SmileSplat: Generalizable Gaussian Splats for Unconstrained Sparse Images
by: Li, Yanyan, et al.
Published: (2024)
by: Li, Yanyan, et al.
Published: (2024)
Similar Items
-
Multi-modal Collaborative Optimization and Expansion Network for Event-assisted Single-eye Expression Recognition
by: Han, Runduo, et al.
Published: (2025) -
Rethinking Detecting Salient and Camouflaged Objects in Unconstrained Scenes
by: Zhou, Zhangjun, et al.
Published: (2024) -
MonoCD: Monocular 3D Object Detection with Complementary Depths
by: Yan, Longfei, et al.
Published: (2024) -
Detecting Deepfakes with Multivariate Soft Blending and CLIP-based Image-Text Alignment
by: Li, Jingwei, et al.
Published: (2026) -
Enhancing Fruit and Vegetable Detection in Unconstrained Environment with a Novel Dataset
by: Khanna, Sandeep, et al.
Published: (2024)