| Main Authors: | Xu, Ruihao, Liu, Yong, Tang, Yansong |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Online Access: | https://arxiv.org/abs/2605.24508 |
Similar Items
Fully Aligned Network for Referring Image Segmentation
by: Liu, Yong, et al.
Published: (2024)
FDDet: Frequency-Decoupling for Boundary Refinement in Temporal Action Detection
by: Zhu, Xinnan, et al.
Published: (2025)
Flash-VStream: Efficient Real-Time Understanding for Long Video Streams
by: Zhang, Haoji, et al.
Published: (2025)
From Ideal to Real: Unified and Data-Efficient Dense Prediction for Real-World Scenarios
by: Xia, Changliang, et al.
Published: (2025)
Towards Natural Image Matting in the Wild via Real-Scenario Prior
by: Xia, Ruihao, et al.
Published: (2024)
SAM2-LOVE: Segment Anything Model 2 in Language-aided Audio-Visual Scenes
by: Wang, Yuji, et al.
Published: (2025)
DVD: A Comprehensive Dataset for Advancing Violence Detection in Real-World Scenarios
by: Kollias, Dimitrios, et al.
Published: (2025)
Flash-VStream: Memory-Based Real-Time Understanding for Long Video Streams
by: Zhang, Haoji, et al.
Published: (2024)
UniTalk: Towards Universal Active Speaker Detection in Real World Scenarios
by: Nguyen, Le Thien Phuc, et al.
Published: (2025)
Learning Dual-Level Deformable Implicit Representation for Real-World Scale Arbitrary Super-Resolution
by: Li, Zhiheng, et al.
Published: (2024)
Exploring Efficient Asymmetric Blind-Spots for Self-Supervised Denoising in Real-World Scenarios
by: Chen, Shiyan, et al.
Published: (2023)
DDL: A Large-Scale Datasets for Deepfake Detection and Localization in Diversified Real-World Scenarios
by: Miao, Changtao, et al.
Published: (2025)
Network Knowledge Prior Guided Learning for Data-Efficient Surface Defect Detection
by: Dong, Hang-Cheng, et al.
Published: (2026)
Zero-Shot Head Swapping in Real-World Scenarios
by: Kang, Taewoong, et al.
Published: (2025)
Remote Photoplethysmography in Real-World and Extreme Lighting Scenarios
by: Shao, Hang, et al.
Published: (2025)
A Comprehensive Survey for Real-World Industrial Defect Detection: Challenges, Approaches, and Prospects
by: Cheng, Yuqi, et al.
Published: (2025)
Open-Vocabulary Segmentation with Semantic-Assisted Calibration
by: Liu, Yong, et al.
Published: (2023)
IteRPrimE: Zero-shot Referring Image Segmentation with Iterative Grad-CAM Refinement and Primary Word Emphasis
by: Wang, Yuji, et al.
Published: (2025)
ISP-AD: A Large-Scale Real-World Dataset for Advancing Industrial Anomaly Detection with Synthetic and Real Defects
by: Krassnig, Paul J., et al.
Published: (2025)
Towards Scalable and Consistent 3D Editing
by: Xia, Ruihao, et al.
Published: (2025)
FlexiAct: Towards Flexible Action Control in Heterogeneous Scenarios
by: Zhang, Shiyi, et al.
Published: (2025)
Are Foundation Models Ready for Industrial Defect Recognition? A Reality Check on Real-World Data
by: Baeuerle, Simon, et al.
Published: (2025)
Boosting Diffusion-Based Text Image Super-Resolution Model Towards Generalized Real-World Scenarios
by: Pan, Chenglu, et al.
Published: (2025)
Memorize-and-Generate: Towards Long-Term Consistency in Real-Time Video Generation
by: Zhu, Tianrui, et al.
Published: (2025)
MME-RealWorld: Could Your Multimodal LLM Challenge High-Resolution Real-World Scenarios that are Difficult for Humans?
by: Zhang, Yi-Fan, et al.
Published: (2024)
Learning Object-Centric Representations Based on Slots in Real World Scenarios
by: Akan, Adil Kaan
Published: (2025)
Your Super Resolution Model is not Enough for Tackling Real-World Scenarios
by: Yoon, Dongsik, et al.
Published: (2025)
PhoStream: Benchmarking Real-World Streaming for Omnimodal Assistants in Mobile Scenarios
by: Lu, Xudong, et al.
Published: (2026)
CoSTL: Comprehensive Spatial-Temporal Representation Learning for Moment Retrieval and Highlight Detection
by: Dong, Xin, et al.
Published: (2026)
Dual-Imbalance Continual Learning for Real-World Food Recognition
by: Zhang, Xiaoyan, et al.
Published: (2026)
MotionLCM: Real-time Controllable Motion Generation via Latent Consistency Model
by: Dai, Wenxun, et al.
Published: (2024)
Toward Generalizable Deblurring: Leveraging Massive Blur Priors with Linear Attention for Real-World Scenarios
by: Gao, Yuanting, et al.
Published: (2026)
MDPBench: A Benchmark for Multilingual Document Parsing in Real-World Scenarios
by: Li, Zhang, et al.
Published: (2026)
A Multilevel Strategy to Improve People Tracking in a Real-World Scenario
by: de Oliveira, Cristiano B., et al.
Published: (2024)
MFFI: Multi-Dimensional Face Forgery Image Dataset for Real-World Scenarios
by: Miao, Changtao, et al.
Published: (2025)
Universal Segmentation at Arbitrary Granularity with Language Instruction
by: Liu, Yong, et al.
Published: (2023)
synth-dacl: Does Synthetic Defect Data Enhance Segmentation Accuracy and Robustness for Real-World Bridge Inspections?
by: Flotzinger, Johannes, et al.
Published: (2025)
Online Class-Incremental Learning For Real-World Food Image Classification
by: Raghavan, Siddeshwar, et al.
Published: (2023)
Real-Time Deepfake Detection in the Real-World
by: Cavia, Bar, et al.
Published: (2024)
YOLO-ELA: Efficient Local Attention Modeling for High-Performance Real-Time Insulator Defect Detection
by: Akindele, Olalekan, et al.
Published: (2024)