Saved in:
| Main Authors: | Peng, Yansong, Zhu, Kai, Liu, Yu, Wu, Pingyu, Li, Hebei, Sun, Xiaoyan, Wu, Feng |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2507.03738 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Scene Adaptive Sparse Transformer for Event-based Object Detection
by: Peng, Yansong, et al.
Published: (2024)
by: Peng, Yansong, et al.
Published: (2024)
D-FINE: Redefine Regression Task in DETRs as Fine-grained Distribution Refinement
by: Peng, Yansong, et al.
Published: (2024)
by: Peng, Yansong, et al.
Published: (2024)
Create Anything Anywhere: Layout-Controllable Personalized Diffusion Model for Multiple Subjects
by: Li, Wei, et al.
Published: (2025)
by: Li, Wei, et al.
Published: (2025)
Efficient Event-Based Semantic Segmentation via Exploiting Frame-Event Fusion: A Hybrid Neural Network Approach
by: Li, Hebei, et al.
Published: (2025)
by: Li, Hebei, et al.
Published: (2025)
Dome-DETR: DETR with Density-Oriented Feature-Query Manipulation for Efficient Tiny Object Detection
by: Hu, Zhangchi, et al.
Published: (2025)
by: Hu, Zhangchi, et al.
Published: (2025)
Towards Sequence Modeling Alignment between Tokenizer and Autoregressive Model
by: Wu, Pingyu, et al.
Published: (2025)
by: Wu, Pingyu, et al.
Published: (2025)
DASH: 4D Hash Encoding with Self-Supervised Decomposition for Real-Time Dynamic Scene Rendering
by: Chen, Jie, et al.
Published: (2025)
by: Chen, Jie, et al.
Published: (2025)
Event-assisted Low-Light Video Object Segmentation
by: Li, Hebei, et al.
Published: (2024)
by: Li, Hebei, et al.
Published: (2024)
RiO-DETR: DETR for Real-time Oriented Object Detection
by: Hu, Zhangchi, et al.
Published: (2026)
by: Hu, Zhangchi, et al.
Published: (2026)
LLaDA-VLA: Vision Language Diffusion Action Models
by: Wen, Yuqing, et al.
Published: (2025)
by: Wen, Yuqing, et al.
Published: (2025)
Deep Multi-Threshold Spiking-UNet for Image Processing
by: Li, Hebei, et al.
Published: (2023)
by: Li, Hebei, et al.
Published: (2023)
Enhanced Object Detection: A Study on Vast Vocabulary Object Detection Track for V3Det Challenge 2024
by: Wu, Peixi, et al.
Published: (2024)
by: Wu, Peixi, et al.
Published: (2024)
Enhancing Object Discovery for Unsupervised Instance Segmentation and Object Detection
by: Feng, Xingyu, et al.
Published: (2025)
by: Feng, Xingyu, et al.
Published: (2025)
Efficient Spiking Point Mamba for Point Cloud Analysis
by: Wu, Peixi, et al.
Published: (2025)
by: Wu, Peixi, et al.
Published: (2025)
EE-MLLM: A Data-Efficient and Compute-Efficient Multimodal Large Language Model
by: Ma, Feipeng, et al.
Published: (2024)
by: Ma, Feipeng, et al.
Published: (2024)
Improved Video VAE for Latent Video Diffusion Model
by: Wu, Pingyu, et al.
Published: (2024)
by: Wu, Pingyu, et al.
Published: (2024)
SSCM: A Spatial-Semantic Consistent Model for Multi-Contrast MRI Super-Resolution
by: Wu, Xiaoman, et al.
Published: (2025)
by: Wu, Xiaoman, et al.
Published: (2025)
Memorize-and-Generate: Towards Long-Term Consistency in Real-Time Video Generation
by: Zhu, Tianrui, et al.
Published: (2025)
by: Zhu, Tianrui, et al.
Published: (2025)
FlowConsist: Make Your Flow Consistent with Real Trajectory
by: Zhang, Tianyi, et al.
Published: (2026)
by: Zhang, Tianyi, et al.
Published: (2026)
Beyond Chain-of-Thought: Rewrite as a Universal Interface for Generative Multimodal Embeddings
by: Wu, Peixi, et al.
Published: (2026)
by: Wu, Peixi, et al.
Published: (2026)
SCoT: Unifying Consistency Models and Rectified Flows via Straight-Consistent Trajectories
by: Wu, Zhangkai, et al.
Published: (2025)
by: Wu, Zhangkai, et al.
Published: (2025)
DreamLight: Towards Harmonious and Consistent Image Relighting
by: Liu, Yong, et al.
Published: (2025)
by: Liu, Yong, et al.
Published: (2025)
UVCG: Leveraging Temporal Consistency for Universal Video Protection
by: Li, KaiZhou, et al.
Published: (2024)
by: Li, KaiZhou, et al.
Published: (2024)
FlowIE: Efficient Image Enhancement via Rectified Flow
by: Zhu, Yixuan, et al.
Published: (2024)
by: Zhu, Yixuan, et al.
Published: (2024)
AMR-CCR: Anchored Modular Retrieval for Continual Chinese Character Recognition
by: Wu, Yuchuan, et al.
Published: (2026)
by: Wu, Yuchuan, et al.
Published: (2026)
FlowVid: Taming Imperfect Optical Flows for Consistent Video-to-Video Synthesis
by: Liang, Feng, et al.
Published: (2023)
by: Liang, Feng, et al.
Published: (2023)
MotionLCM: Real-time Controllable Motion Generation via Latent Consistency Model
by: Dai, Wenxun, et al.
Published: (2024)
by: Dai, Wenxun, et al.
Published: (2024)
Beyond Sliders: Mastering the Art of Diffusion-based Image Manipulation
by: Tang, Yufei, et al.
Published: (2025)
by: Tang, Yufei, et al.
Published: (2025)
Uncertainty-Aware Pedestrian Attribute Recognition via Evidential Deep Learning
by: Lou, Zhuofan, et al.
Published: (2026)
by: Lou, Zhuofan, et al.
Published: (2026)
GoMVS: Geometrically Consistent Cost Aggregation for Multi-View Stereo
by: Wu, Jiang, et al.
Published: (2024)
by: Wu, Jiang, et al.
Published: (2024)
NovelGS: Consistent Novel-view Denoising via Large Gaussian Reconstruction Model
by: Liu, Jinpeng, et al.
Published: (2024)
by: Liu, Jinpeng, et al.
Published: (2024)
From Contrast to Consistency: Rethinking Event-based Continuous-Time Optical Flow Estimation
by: Hu, Rui, et al.
Published: (2026)
by: Hu, Rui, et al.
Published: (2026)
Dream4D: Lifting Camera-Controlled I2V towards Spatiotemporally Consistent 4D Generation
by: Liu, Xiaoyan, et al.
Published: (2025)
by: Liu, Xiaoyan, et al.
Published: (2025)
Consistency Flow Matching: Defining Straight Flows with Velocity Consistency
by: Yang, Ling, et al.
Published: (2024)
by: Yang, Ling, et al.
Published: (2024)
Text as Any-Modality for Zero-Shot Classification by Consistent Prompt Tuning
by: Wu, Xiangyu, et al.
Published: (2025)
by: Wu, Xiangyu, et al.
Published: (2025)
Fast Image Super-Resolution via Consistency Rectified Flow
by: Xu, Jiaqi, et al.
Published: (2026)
by: Xu, Jiaqi, et al.
Published: (2026)
FVAR: Visual Autoregressive Modeling via Next Focus Prediction
by: Li, Xiaofan, et al.
Published: (2025)
by: Li, Xiaofan, et al.
Published: (2025)
ConsistencyTrack: A Robust Multi-Object Tracker with a Generation Strategy of Consistency Model
by: Jiang, Lifan, et al.
Published: (2024)
by: Jiang, Lifan, et al.
Published: (2024)
ScoreHOI: Physically Plausible Reconstruction of Human-Object Interaction via Score-Guided Diffusion
by: Li, Ao, et al.
Published: (2025)
by: Li, Ao, et al.
Published: (2025)
Beyond Deceptive Flatness: Dual-Order Solution for Strengthening Adversarial Transferability
by: Zhang, Zhixuan, et al.
Published: (2025)
by: Zhang, Zhixuan, et al.
Published: (2025)
Similar Items
-
Scene Adaptive Sparse Transformer for Event-based Object Detection
by: Peng, Yansong, et al.
Published: (2024) -
D-FINE: Redefine Regression Task in DETRs as Fine-grained Distribution Refinement
by: Peng, Yansong, et al.
Published: (2024) -
Create Anything Anywhere: Layout-Controllable Personalized Diffusion Model for Multiple Subjects
by: Li, Wei, et al.
Published: (2025) -
Efficient Event-Based Semantic Segmentation via Exploiting Frame-Event Fusion: A Hybrid Neural Network Approach
by: Li, Hebei, et al.
Published: (2025) -
Dome-DETR: DETR with Density-Oriented Feature-Query Manipulation for Efficient Tiny Object Detection
by: Hu, Zhangchi, et al.
Published: (2025)