:: Library Catalog

Copertina

Salvato in:

Dettagli Bibliografici
Autori principali:	Zhang, Zongye, Kong, Bohan, Liu, Qingjie, Wang, Yunhong
Natura:	Preprint
Pubblicazione:	2025
Soggetti:	Computer Vision and Pattern Recognition Multimedia I.3.8
Accesso online:	https://arxiv.org/abs/2505.11013
Tags:	Aggiungi Tag Nessun Tag, puoi essere il primo ad aggiungerne!!

Documenti analoghi

StyleMM: Stylized 3D Morphable Face Model via Text-Driven Aligned Image Translation
di: Lee, Seungmi, et al.
Pubblicazione: (2025)

StyleID: A Perception-Aware Dataset and Metric for Stylization-Agnostic Facial Identity Recognition
di: Yun, Kwan, et al.
Pubblicazione: (2026)

Sketch2Manga: Shaded Manga Screening from Sketch with Diffusion Models
di: Lin, Jian, et al.
Pubblicazione: (2024)

SkeletonX: Data-Efficient Skeleton-based Action Recognition via Cross-sample Feature Aggregation
di: Zhang, Zongye, et al.
Pubblicazione: (2025)

GSEdit: Efficient Text-Guided Editing of 3D Objects via Gaussian Splatting
di: Palandra, Francesco, et al.
Pubblicazione: (2024)

From Dead Pixels to Editable Slides: Infographic Reconstruction into Native Google Slides via Vision-Language Region Understanding
di: Gonzalez, Leonardo
Pubblicazione: (2026)

Seeing The Words: Evaluating AI-generated Biblical Art
di: Makimei, Hidde, et al.
Pubblicazione: (2025)

SHREC 2025: Protein surface shape retrieval including electrostatic potential
di: Yacoub, Taher, et al.
Pubblicazione: (2025)

Light Future: Multimodal Action Frame Prediction via InstructPix2Pix
di: Zhong, Zesen, et al.
Pubblicazione: (2025)

Quantized FCA: Efficient Zero-Shot Texture Anomaly Detection
di: Ardelean, Andrei-Timotei, et al.
Pubblicazione: (2025)

Patient-Specific Dynamic Digital-Physical Twin for Coronary Intervention Training: An Integrated Mixed Reality Approach
di: Wang, Shuo, et al.
Pubblicazione: (2025)

A Scalable System for Visual Analysis of Ocean Data
di: Jain, Toshit, et al.
Pubblicazione: (2025)

DeepTaxon: An Interpretable Retrieval-Augmented Multimodal Framework for Unified Species Identification and Discovery
di: Wang, Jiawei, et al.
Pubblicazione: (2026)

TAG-WM: Tamper-Aware Generative Image Watermarking via Diffusion Inversion Sensitivity
di: Chen, Yuzhuo, et al.
Pubblicazione: (2025)

Multi-scale Cycle Tracking in Dynamic Planar Graphs
di: Rasheed, Farhan, et al.
Pubblicazione: (2024)

Innovative Integration of 4D Cardiovascular Reconstruction and Hologram: A New Visualization Tool for Coronary Artery Bypass Grafting Planning
di: Wang, Shuo, et al.
Pubblicazione: (2025)

Pinching Visuo-haptic Display: Investigating Cross-Modal Effects of Visual Textures on Electrostatic Cloth Tactile Sensations
di: Kitagishi, Takekazu, et al.
Pubblicazione: (2025)

Two-step Authentication: Multi-biometric System Using Voice and Facial Recognition
di: Chen, Kuan Wei, et al.
Pubblicazione: (2026)

A Hybrid Deterministic Framework for Named Entity Extraction in Broadcast News Video
di: Lucas, Andrea Filiberto, et al.
Pubblicazione: (2026)

Gaussian Splatting: 3D Reconstruction and Novel View Synthesis, a Review
di: Dalal, Anurag, et al.
Pubblicazione: (2024)

Differentiable Hierarchical Visual Tokenization
di: Aasan, Marius, et al.
Pubblicazione: (2025)

FFaceNeRF: Few-shot Face Editing in Neural Radiance Fields
di: Yun, Kwan, et al.
Pubblicazione: (2025)

Experimental Evaluation of Static Image Sub-Region-Based Search Models Using CLIP
di: Jäckl, Bastian, et al.
Pubblicazione: (2025)

EditP23: 3D Editing via Propagation of Image Prompts to Multi-View
di: Bar-On, Roi, et al.
Pubblicazione: (2025)

Scene Detection Policies and Keyframe Extraction Strategies for Large-Scale Video Analysis
di: Korolkov, Vasilii
Pubblicazione: (2025)

Understanding Identity Continuity in Thermal Video through Scene-Level Consistency
di: Sun, Wei-Chieh, et al.
Pubblicazione: (2026)

StableMoFusion: Towards Robust and Efficient Diffusion-based Motion Generation Framework
di: Huang, Yiheng, et al.
Pubblicazione: (2024)

Lens Distortion Encoding System Version 1.0
di: Fober, Jakub Maksymilian
Pubblicazione: (2024)

Implicit Location-Caption Alignment via Complementary Masking for Weakly-Supervised Dense Video Captioning
di: Ge, Shiping, et al.
Pubblicazione: (2024)

Engineering Mythology: A Digital-Physical Framework for Culturally-Inspired Public Art
di: Das, Jnaneshwar, et al.
Pubblicazione: (2026)

Motion Attribution for Video Generation
di: Wu, Xindi, et al.
Pubblicazione: (2026)

Visual Style Prompt Learning Using Diffusion Models for Blind Face Restoration
di: Lu, Wanglong, et al.
Pubblicazione: (2024)

Hierarchical Masked Autoregressive Models with Low-Resolution Token Pivots
di: Zheng, Guangting, et al.
Pubblicazione: (2025)

Semantic2Graph: Graph-based Multi-modal Feature Fusion for Action Segmentation in Videos
di: Zhang, Junbin, et al.
Pubblicazione: (2022)

Gear-NeRF: Free-Viewpoint Rendering and Tracking with Motion-aware Spatio-Temporal Sampling
di: Liu, Xinhang, et al.
Pubblicazione: (2024)

AnyMoLe: Any Character Motion In-betweening Leveraging Video Diffusion Models
di: Yun, Kwan, et al.
Pubblicazione: (2025)

HeadEvolver: Text to Head Avatars via Expressive and Attribute-Preserving Mesh Deformation
di: Wang, Duotun, et al.
Pubblicazione: (2024)

Text2VDM: Text to Vector Displacement Maps for Expressive and Interactive 3D Sculpting
di: Meng, Hengyu, et al.
Pubblicazione: (2025)

3DGesPolicy: Phoneme-Aware Holistic Co-Speech Gesture Generation Based on Action Control
di: Sha, Xuanmeng, et al.
Pubblicazione: (2026)

Saliency-Aware Diffusion Reconstruction for Effective Invisible Watermark Removal
di: Alam, Inzamamul, et al.
Pubblicazione: (2025)