:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Peng, Yansong, Zhu, Kai, Liu, Yu, Wu, Pingyu, Li, Hebei, Sun, Xiaoyan, Wu, Feng
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2507.03738
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Scene Adaptive Sparse Transformer for Event-based Object Detection
by: Peng, Yansong, et al.
Published: (2024)

D-FINE: Redefine Regression Task in DETRs as Fine-grained Distribution Refinement
by: Peng, Yansong, et al.
Published: (2024)

Create Anything Anywhere: Layout-Controllable Personalized Diffusion Model for Multiple Subjects
by: Li, Wei, et al.
Published: (2025)

Efficient Event-Based Semantic Segmentation via Exploiting Frame-Event Fusion: A Hybrid Neural Network Approach
by: Li, Hebei, et al.
Published: (2025)

Dome-DETR: DETR with Density-Oriented Feature-Query Manipulation for Efficient Tiny Object Detection
by: Hu, Zhangchi, et al.
Published: (2025)

Towards Sequence Modeling Alignment between Tokenizer and Autoregressive Model
by: Wu, Pingyu, et al.
Published: (2025)

DASH: 4D Hash Encoding with Self-Supervised Decomposition for Real-Time Dynamic Scene Rendering
by: Chen, Jie, et al.
Published: (2025)

Event-assisted Low-Light Video Object Segmentation
by: Li, Hebei, et al.
Published: (2024)

RiO-DETR: DETR for Real-time Oriented Object Detection
by: Hu, Zhangchi, et al.
Published: (2026)

LLaDA-VLA: Vision Language Diffusion Action Models
by: Wen, Yuqing, et al.
Published: (2025)

Deep Multi-Threshold Spiking-UNet for Image Processing
by: Li, Hebei, et al.
Published: (2023)

Enhanced Object Detection: A Study on Vast Vocabulary Object Detection Track for V3Det Challenge 2024
by: Wu, Peixi, et al.
Published: (2024)

Enhancing Object Discovery for Unsupervised Instance Segmentation and Object Detection
by: Feng, Xingyu, et al.
Published: (2025)

Efficient Spiking Point Mamba for Point Cloud Analysis
by: Wu, Peixi, et al.
Published: (2025)

EE-MLLM: A Data-Efficient and Compute-Efficient Multimodal Large Language Model
by: Ma, Feipeng, et al.
Published: (2024)

Improved Video VAE for Latent Video Diffusion Model
by: Wu, Pingyu, et al.
Published: (2024)

SSCM: A Spatial-Semantic Consistent Model for Multi-Contrast MRI Super-Resolution
by: Wu, Xiaoman, et al.
Published: (2025)

Memorize-and-Generate: Towards Long-Term Consistency in Real-Time Video Generation
by: Zhu, Tianrui, et al.
Published: (2025)

FlowConsist: Make Your Flow Consistent with Real Trajectory
by: Zhang, Tianyi, et al.
Published: (2026)

Beyond Chain-of-Thought: Rewrite as a Universal Interface for Generative Multimodal Embeddings
by: Wu, Peixi, et al.
Published: (2026)

SCoT: Unifying Consistency Models and Rectified Flows via Straight-Consistent Trajectories
by: Wu, Zhangkai, et al.
Published: (2025)

DreamLight: Towards Harmonious and Consistent Image Relighting
by: Liu, Yong, et al.
Published: (2025)

UVCG: Leveraging Temporal Consistency for Universal Video Protection
by: Li, KaiZhou, et al.
Published: (2024)

FlowIE: Efficient Image Enhancement via Rectified Flow
by: Zhu, Yixuan, et al.
Published: (2024)

AMR-CCR: Anchored Modular Retrieval for Continual Chinese Character Recognition
by: Wu, Yuchuan, et al.
Published: (2026)

FlowVid: Taming Imperfect Optical Flows for Consistent Video-to-Video Synthesis
by: Liang, Feng, et al.
Published: (2023)

MotionLCM: Real-time Controllable Motion Generation via Latent Consistency Model
by: Dai, Wenxun, et al.
Published: (2024)

Beyond Sliders: Mastering the Art of Diffusion-based Image Manipulation
by: Tang, Yufei, et al.
Published: (2025)

Uncertainty-Aware Pedestrian Attribute Recognition via Evidential Deep Learning
by: Lou, Zhuofan, et al.
Published: (2026)

GoMVS: Geometrically Consistent Cost Aggregation for Multi-View Stereo
by: Wu, Jiang, et al.
Published: (2024)

NovelGS: Consistent Novel-view Denoising via Large Gaussian Reconstruction Model
by: Liu, Jinpeng, et al.
Published: (2024)

From Contrast to Consistency: Rethinking Event-based Continuous-Time Optical Flow Estimation
by: Hu, Rui, et al.
Published: (2026)

Dream4D: Lifting Camera-Controlled I2V towards Spatiotemporally Consistent 4D Generation
by: Liu, Xiaoyan, et al.
Published: (2025)

Consistency Flow Matching: Defining Straight Flows with Velocity Consistency
by: Yang, Ling, et al.
Published: (2024)

Text as Any-Modality for Zero-Shot Classification by Consistent Prompt Tuning
by: Wu, Xiangyu, et al.
Published: (2025)

Fast Image Super-Resolution via Consistency Rectified Flow
by: Xu, Jiaqi, et al.
Published: (2026)

FVAR: Visual Autoregressive Modeling via Next Focus Prediction
by: Li, Xiaofan, et al.
Published: (2025)

ConsistencyTrack: A Robust Multi-Object Tracker with a Generation Strategy of Consistency Model
by: Jiang, Lifan, et al.
Published: (2024)

ScoreHOI: Physically Plausible Reconstruction of Human-Object Interaction via Score-Guided Diffusion
by: Li, Ao, et al.
Published: (2025)

Beyond Deceptive Flatness: Dual-Order Solution for Strengthening Adversarial Transferability
by: Zhang, Zhixuan, et al.
Published: (2025)