Saved in:
| Main Authors: | Dai, Jiayue, Wang, Yunya, Fang, Yihan, Chen, Yuetong, Xiong, Butian |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2410.15060 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Leveraging Segment Anything Model in Identifying Buildings within Refugee Camps (SAM4Refugee) from Satellite Imagery for Humanitarian Operations
by: Gao, Yunya
Published: (2024)
by: Gao, Yunya
Published: (2024)
MotionLCM: Real-time Controllable Motion Generation via Latent Consistency Model
by: Dai, Wenxun, et al.
Published: (2024)
by: Dai, Wenxun, et al.
Published: (2024)
SHMT: Self-supervised Hierarchical Makeup Transfer via Latent Diffusion Models
by: Sun, Zhaoyang, et al.
Published: (2024)
by: Sun, Zhaoyang, et al.
Published: (2024)
Reward Guided Latent Consistency Distillation
by: Li, Jiachen, et al.
Published: (2024)
by: Li, Jiachen, et al.
Published: (2024)
BYOM: Building Your Own Multi-Task Model For Free
by: Jiang, Weisen, et al.
Published: (2023)
by: Jiang, Weisen, et al.
Published: (2023)
GauU-Scene: A Scene Reconstruction Benchmark on Large Scale 3D Reconstruction Dataset Using Gaussian Splatting
by: Xiong, Butian, et al.
Published: (2024)
by: Xiong, Butian, et al.
Published: (2024)
LatentEdit: Adaptive Latent Control for Consistent Semantic Editing
by: Liu, Siyi, et al.
Published: (2025)
by: Liu, Siyi, et al.
Published: (2025)
Self-Consistent Latent Reasoning: Long Latent Sequence Reasoning for Vision-Language Model
by: Wang, Chenfeng, et al.
Published: (2026)
by: Wang, Chenfeng, et al.
Published: (2026)
GauU-Scene V2: Assessing the Reliability of Image-Based Metrics with Expansive Lidar Image Dataset Using 3DGS and NeRF
by: Xiong, Butian, et al.
Published: (2024)
by: Xiong, Butian, et al.
Published: (2024)
Splat Feature Solver
by: Xiong, Butian, et al.
Published: (2025)
by: Xiong, Butian, et al.
Published: (2025)
Consistency^2: Consistent and Fast 3D Painting with Latent Consistency Models
by: Wang, Tianfu, et al.
Published: (2024)
by: Wang, Tianfu, et al.
Published: (2024)
LatentGeo: Learnable Auxiliary Constructions in Latent Space for Multimodal Geometric Reasoning
by: Xu, Haiying, et al.
Published: (2026)
by: Xu, Haiying, et al.
Published: (2026)
Fuse Your Latents: Video Editing with Multi-source Latent Diffusion Models
by: Lu, Tianyi, et al.
Published: (2023)
by: Lu, Tianyi, et al.
Published: (2023)
Trajectory Consistency Distillation: Improved Latent Consistency Distillation by Semi-Linear Consistency Function with Trajectory Mapping
by: Zheng, Jianbin, et al.
Published: (2024)
by: Zheng, Jianbin, et al.
Published: (2024)
Graph Your Own Prompt
by: Ding, Xi, et al.
Published: (2025)
by: Ding, Xi, et al.
Published: (2025)
REGLUE Your Latents with Global and Local Semantics for Entangled Diffusion
by: Petsangourakis, Giorgos, et al.
Published: (2025)
by: Petsangourakis, Giorgos, et al.
Published: (2025)
One Small Step in Latent, One Giant Leap for Pixels: Fast Latent Upscale Adapter for Your Diffusion Models
by: Razin, Aleksandr, et al.
Published: (2025)
by: Razin, Aleksandr, et al.
Published: (2025)
Your Latent Mask is Wrong: Pixel-Equivalent Latent Compositing for Diffusion Models
by: Bradbury, Rowan, et al.
Published: (2025)
by: Bradbury, Rowan, et al.
Published: (2025)
PIXART-δ: Fast and Controllable Image Generation with Latent Consistency Models
by: Chen, Junsong, et al.
Published: (2024)
by: Chen, Junsong, et al.
Published: (2024)
Customize Your Own Paired Data via Few-shot Way
by: Chen, Jinshu, et al.
Published: (2024)
by: Chen, Jinshu, et al.
Published: (2024)
Nested Diffusion Models Using Hierarchical Latent Priors
by: Zhang, Xiao, et al.
Published: (2024)
by: Zhang, Xiao, et al.
Published: (2024)
Representing Beauty: Towards a Participatory but Objective Latent Aesthetics
by: Rusnak, Alexander Michael
Published: (2025)
by: Rusnak, Alexander Michael
Published: (2025)
VALA: Learning Latent Anchors for Training-Free and Temporally Consistent
by: Wu, Zhangkai, et al.
Published: (2025)
by: Wu, Zhangkai, et al.
Published: (2025)
Latte: Latent Diffusion Transformer for Video Generation
by: Ma, Xin, et al.
Published: (2024)
by: Ma, Xin, et al.
Published: (2024)
From Pixels to Tokens: A Systematic Study of Latent Action Supervision for Vision-Language-Action Models
by: Lin, Yihan, et al.
Published: (2026)
by: Lin, Yihan, et al.
Published: (2026)
Denoising Reuse: Exploiting Inter-frame Motion Consistency for Efficient Video Latent Generation
by: Wang, Chenyu, et al.
Published: (2024)
by: Wang, Chenyu, et al.
Published: (2024)
LEO: Generative Latent Image Animator for Human Video Synthesis
by: Wang, Yaohui, et al.
Published: (2023)
by: Wang, Yaohui, et al.
Published: (2023)
Improved Training Technique for Latent Consistency Models
by: Dao, Quan, et al.
Published: (2025)
by: Dao, Quan, et al.
Published: (2025)
Class Relevance Learning For Out-of-distribution Detection
by: Xiong, Butian, et al.
Published: (2023)
by: Xiong, Butian, et al.
Published: (2023)
Latent Space Consistency for Sparse-View CT Reconstruction
by: Chen, Duoyou, et al.
Published: (2025)
by: Chen, Duoyou, et al.
Published: (2025)
Calibrating Biased Distribution in VFM-derived Latent Space via Cross-Domain Geometric Consistency
by: Ma, Yanbiao, et al.
Published: (2025)
by: Ma, Yanbiao, et al.
Published: (2025)
NanoGS: Training-Free Gaussian Splat Simplification
by: Xiong, Butian, et al.
Published: (2026)
by: Xiong, Butian, et al.
Published: (2026)
Multimodal Latent Reasoning via Hierarchical Visual Cues Injection
by: Zhang, Yiming, et al.
Published: (2026)
by: Zhang, Yiming, et al.
Published: (2026)
LatentUMM: Dual Latent Alignment for Unified Multimodal Models
by: Luo, Yinyi, et al.
Published: (2026)
by: Luo, Yinyi, et al.
Published: (2026)
GRIP: Generating Interaction Poses Using Spatial Cues and Latent Consistency
by: Taheri, Omid, et al.
Published: (2023)
by: Taheri, Omid, et al.
Published: (2023)
LaMo: Self-Supervised Latent Motion Priors for Physical Realism in Video Generation
by: Jiang, Bo, et al.
Published: (2026)
by: Jiang, Bo, et al.
Published: (2026)
Fine-Grained VLM Fine-tuning via Latent Hierarchical Adapter Learning
by: Zhao, Yumiao, et al.
Published: (2025)
by: Zhao, Yumiao, et al.
Published: (2025)
Latent Embedding Clustering for Occlusion Robust Head Pose Estimation
by: Celestino, José, et al.
Published: (2024)
by: Celestino, José, et al.
Published: (2024)
BYO-Eval: Build Your Own Dataset for Fine-Grained Visual Assessment of Multimodal Language Models
by: Arnould, Ludovic, et al.
Published: (2025)
by: Arnould, Ludovic, et al.
Published: (2025)
Adaptive Calibration: A Unified Conversion Framework of Spiking Neural Networks
by: Wang, Ziqing, et al.
Published: (2023)
by: Wang, Ziqing, et al.
Published: (2023)
Similar Items
-
Leveraging Segment Anything Model in Identifying Buildings within Refugee Camps (SAM4Refugee) from Satellite Imagery for Humanitarian Operations
by: Gao, Yunya
Published: (2024) -
MotionLCM: Real-time Controllable Motion Generation via Latent Consistency Model
by: Dai, Wenxun, et al.
Published: (2024) -
SHMT: Self-supervised Hierarchical Makeup Transfer via Latent Diffusion Models
by: Sun, Zhaoyang, et al.
Published: (2024) -
Reward Guided Latent Consistency Distillation
by: Li, Jiachen, et al.
Published: (2024) -
BYOM: Building Your Own Multi-Task Model For Free
by: Jiang, Weisen, et al.
Published: (2023)