:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Wu, Longhuang, Tian, Shangxuan, Wang, Youxin, Xiong, Pengfei
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2402.11540
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Multi-modal Collaborative Optimization and Expansion Network for Event-assisted Single-eye Expression Recognition
by: Han, Runduo, et al.
Published: (2025)

Rethinking Detecting Salient and Camouflaged Objects in Unconstrained Scenes
by: Zhou, Zhangjun, et al.
Published: (2024)

MonoCD: Monocular 3D Object Detection with Complementary Depths
by: Yan, Longfei, et al.
Published: (2024)

Detecting Deepfakes with Multivariate Soft Blending and CLIP-based Image-Text Alignment
by: Li, Jingwei, et al.
Published: (2026)

Enhancing Fruit and Vegetable Detection in Unconstrained Environment with a Novel Dataset
by: Khanna, Sandeep, et al.
Published: (2024)

Generating Adversarial Events: A Motion-Aware Point Cloud Framework
by: Ren, Hongwei, et al.
Published: (2026)

HiH: A Multi-modal Hierarchy in Hierarchy Network for Unconstrained Gait Recognition
by: Wang, Lei, et al.
Published: (2023)

Hunyuan-GameCraft: High-dynamic Interactive Game Video Generation with Hybrid History Condition
by: Li, Jiaqi, et al.
Published: (2025)

CCF: Complementary Collaborative Fusion for Domain Generalized Multi-Modal 3D Object Detection
by: Wu, Yuchen, et al.
Published: (2026)

Complementary Text-Guided Attention for Zero-Shot Adversarial Robustness
by: Yu, Lu, et al.
Published: (2026)

Towards Unconstrained Human-Object Interaction
by: Tonini, Francesco, et al.
Published: (2026)

Text Region Multiple Information Perception Network for Scene Text Detection
by: Zheng, Jinzhi, et al.
Published: (2024)

SCLNet: A Scale-Robust Complementary Learning Network for Object Detection in UAV Images
by: Li, Xuexue
Published: (2024)

TextDiff: Mask-Guided Residual Diffusion Models for Scene Text Image Super-Resolution
by: Liu, Baolin, et al.
Published: (2023)

DevilSight: Augmenting Monocular Human Avatar Reconstruction through a Virtual Perspective
by: Chen, Yushuo, et al.
Published: (2025)

Gaussian in the Wild: 3D Gaussian Splatting for Unconstrained Image Collections
by: Zhang, Dongbin, et al.
Published: (2024)

DreamScene360: Unconstrained Text-to-3D Scene Generation with Panoramic Gaussian Splatting
by: Zhou, Shijie, et al.
Published: (2024)

Temporally Grounding Instructional Diagrams in Unconstrained Videos
by: Zhang, Jiahao, et al.
Published: (2024)

LV-OSD: Language-Vision-Complementary Open-Set Object Detection
by: Zhang, Yupeng, et al.
Published: (2026)

Human4DiT: 360-degree Human Video Generation with 4D Diffusion Transformer
by: Shao, Ruizhi, et al.
Published: (2024)

Towards Unconstrained Audio Splicing Detection and Localization with Neural Networks
by: Moussa, Denise, et al.
Published: (2022)

Heterogeneous Complementary Distillation
by: Xu, Liuchi, et al.
Published: (2025)

How to Utilize Complementary Vision-Text Information for 2D Structure Understanding
by: Dong, Jiancheng, et al.
Published: (2026)

Finger in Camera Speaks Everything: Unconstrained Air-Writing for Real-World
by: Wu, Meiqi, et al.
Published: (2024)

Hunyuan-GameCraft-2: Instruction-following Interactive Game World Model
by: Tang, Junshu, et al.
Published: (2025)

HAVE-FUN: Human Avatar Reconstruction from Few-Shot Unconstrained Images
by: Yang, Xihe, et al.
Published: (2023)

Generalizable Sparse-View 3D Reconstruction from Unconstrained Images
by: Gupta, Vinayak, et al.
Published: (2026)

Rotate Your Character: Revisiting Video Diffusion Models for High-Quality 3D Character Generation
by: Wang, Jin, et al.
Published: (2026)

Semi-Supervised Unconstrained Head Pose Estimation in the Wild
by: Zhou, Huayi, et al.
Published: (2024)

Distillation-guided Representation Learning for Unconstrained Gait Recognition
by: Guo, Yuxiang, et al.
Published: (2023)

WildActor: Unconstrained Identity-Preserving Video Generation
by: Guo, Qin, et al.
Published: (2026)

Label-Efficient Object Detection via Region Proposal Network Pre-Training
by: Dong, Nanqing, et al.
Published: (2022)

WE-GS: An In-the-wild Efficient 3D Gaussian Representation for Unconstrained Photo Collections
by: Wang, Yuze, et al.
Published: (2024)

Domain-Invariant Proposals based on a Balanced Domain Classifier for Object Detection
by: Wu, Zhize, et al.
Published: (2022)

Text-Phase Synergy Network with Dual Priors for Unsupervised Cross-Domain Image Retrieval
by: Yang, Jing, et al.
Published: (2026)

SupScene: Scene-Structured Overlap Supervision for Image Retrieval in Unconstrained SfM
by: Shi, Xulei, et al.
Published: (2026)

Deep Fourier-embedded Network for RGB and Thermal Salient Object Detection
by: Lyu, Pengfei, et al.
Published: (2024)

Tri-path DINO: Feature Complementary Learning for Remote Sensing Multi-Class Change Detection
by: Zheng, Kai, et al.
Published: (2026)

UP2You: Fast Reconstruction of Yourself from Unconstrained Photo Collections
by: Cai, Zeyu, et al.
Published: (2025)

SmileSplat: Generalizable Gaussian Splats for Unconstrained Sparse Images
by: Li, Yanyan, et al.
Published: (2024)