Saved in:
| Main Authors: | Zhu, Boyuan, Liu, Fagui, Chen, Xi, Tang, Quan |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2401.11704 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
CorrCLIP: Reconstructing Patch Correlations in CLIP for Open-Vocabulary Semantic Segmentation
by: Zhang, Dengke, et al.
Published: (2024)
by: Zhang, Dengke, et al.
Published: (2024)
ModuSeg: Decoupling Object Discovery and Semantic Retrieval for Training-Free Weakly Supervised Segmentation
by: He, Qingze, et al.
Published: (2026)
by: He, Qingze, et al.
Published: (2026)
Exploring Token-Level Augmentation in Vision Transformer for Semi-Supervised Semantic Segmentation
by: Zhang, Dengke, et al.
Published: (2025)
by: Zhang, Dengke, et al.
Published: (2025)
Dynamic Scene Reconstruction: Recent Advance in Real-time Rendering and Streaming
by: Zhu, Jiaxuan, et al.
Published: (2025)
by: Zhu, Jiaxuan, et al.
Published: (2025)
SELECT: Detecting Label Errors in Real-world Scene Text Data
by: Liu, Wenjun, et al.
Published: (2025)
by: Liu, Wenjun, et al.
Published: (2025)
CBNet: A Plug-and-Play Network for Segmentation-Based Scene Text Detection
by: Zhao, Xi, et al.
Published: (2022)
by: Zhao, Xi, et al.
Published: (2022)
Real-Time Text Detection with Similar Mask in Traffic, Industrial, and Natural Scenes
by: Han, Xu, et al.
Published: (2024)
by: Han, Xu, et al.
Published: (2024)
Aggregated Text Transformer for Scene Text Detection
by: Zhou, Zhao, et al.
Published: (2022)
by: Zhou, Zhao, et al.
Published: (2022)
CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model
by: Zhao, Shuai, et al.
Published: (2023)
by: Zhao, Shuai, et al.
Published: (2023)
Text Region Multiple Information Perception Network for Scene Text Detection
by: Zheng, Jinzhi, et al.
Published: (2024)
by: Zheng, Jinzhi, et al.
Published: (2024)
Periodic Vibration Gaussian: Dynamic Urban Scene Reconstruction and Real-time Rendering
by: Chen, Yurui, et al.
Published: (2023)
by: Chen, Yurui, et al.
Published: (2023)
UniConvNet: Expanding Effective Receptive Field while Maintaining Asymptotically Gaussian Distribution for ConvNets of Any Scale
by: Wang, Yuhao, et al.
Published: (2025)
by: Wang, Yuhao, et al.
Published: (2025)
Text-IRSTD: Leveraging Semantic Text to Promote Infrared Small Target Detection in Complex Scenes
by: Huang, Feng, et al.
Published: (2025)
by: Huang, Feng, et al.
Published: (2025)
Beyond Detection: A Structure-Aware Framework for Scene Text Tracking
by: Yu, Chenmin, et al.
Published: (2026)
by: Yu, Chenmin, et al.
Published: (2026)
Explicit Relational Reasoning Network for Scene Text Detection
by: Su, Yuchen, et al.
Published: (2024)
by: Su, Yuchen, et al.
Published: (2024)
EndoGaussian: Real-time Gaussian Splatting for Dynamic Endoscopic Scene Reconstruction
by: Liu, Yifan, et al.
Published: (2024)
by: Liu, Yifan, et al.
Published: (2024)
Fast Kernel Scene Flow
by: Li, Xueqian, et al.
Published: (2024)
by: Li, Xueqian, et al.
Published: (2024)
MOSE: Boosting Vision-based Roadside 3D Object Detection with Scene Cues
by: Chen, Xiahan, et al.
Published: (2024)
by: Chen, Xiahan, et al.
Published: (2024)
The First Swahili Language Scene Text Detection and Recognition Dataset
by: Douamba, Fadila Wendigoundi, et al.
Published: (2024)
by: Douamba, Fadila Wendigoundi, et al.
Published: (2024)
4DRadar-GS: Self-Supervised Dynamic Driving Scene Reconstruction with 4D Radar
by: Tang, Xiao, et al.
Published: (2025)
by: Tang, Xiao, et al.
Published: (2025)
StyleTextGen: Style-Conditioned Multilingual Scene Text Generation
by: Chen, Zeyu, et al.
Published: (2026)
by: Chen, Zeyu, et al.
Published: (2026)
HUNTER: Unsupervised Human-centric 3D Detection via Transferring Knowledge from Synthetic Instances to Real Scenes
by: Yao, Yichen, et al.
Published: (2024)
by: Yao, Yichen, et al.
Published: (2024)
SceneExpander: Expanding 3D Scenes with Free-Form Inserted Views
by: He, Zijian, et al.
Published: (2026)
by: He, Zijian, et al.
Published: (2026)
Ray-Distance Volume Rendering for Neural Scene Reconstruction
by: Yin, Ruihong, et al.
Published: (2024)
by: Yin, Ruihong, et al.
Published: (2024)
Bridging Synthetic and Real Worlds for Pre-training Scene Text Detectors
by: Guan, Tongkun, et al.
Published: (2023)
by: Guan, Tongkun, et al.
Published: (2023)
Research on Multilingual Natural Scene Text Detection Algorithm
by: Wang, Tao
Published: (2023)
by: Wang, Tao
Published: (2023)
Multiview Scene Graph
by: Zhang, Juexiao, et al.
Published: (2024)
by: Zhang, Juexiao, et al.
Published: (2024)
ODM: A Text-Image Further Alignment Pre-training Approach for Scene Text Detection and Spotting
by: Duan, Chen, et al.
Published: (2024)
by: Duan, Chen, et al.
Published: (2024)
BPDO:Boundary Points Dynamic Optimization for Arbitrary Shape Scene Text Detection
by: Zheng, Jinzhi, et al.
Published: (2024)
by: Zheng, Jinzhi, et al.
Published: (2024)
IMDPrompter: Adapting SAM to Image Manipulation Detection by Cross-View Automated Prompt Learning
by: Zhang, Quan, et al.
Published: (2025)
by: Zhang, Quan, et al.
Published: (2025)
Partial Scene Text Retrieval
by: Wang, Hao, et al.
Published: (2024)
by: Wang, Hao, et al.
Published: (2024)
SceneVTG++: Controllable Multilingual Visual Text Generation in the Wild
by: Liu, Jiawei, et al.
Published: (2025)
by: Liu, Jiawei, et al.
Published: (2025)
OV-DEIM: Real-time DETR-Style Open-Vocabulary Object Detection with GridSynthetic Augmentation
by: Wang, Leilei, et al.
Published: (2026)
by: Wang, Leilei, et al.
Published: (2026)
PAT3D: Physics-Augmented Text-to-3D Scene Generation
by: Lin, Guying, et al.
Published: (2025)
by: Lin, Guying, et al.
Published: (2025)
STELLAR: Scene Text Editor for Low-Resource Languages and Real-World Data
by: Seo, Yongdeuk, et al.
Published: (2025)
by: Seo, Yongdeuk, et al.
Published: (2025)
Revisiting Tampered Scene Text Detection in the Era of Generative AI
by: Qu, Chenfan, et al.
Published: (2024)
by: Qu, Chenfan, et al.
Published: (2024)
RMK RetinaNet: Rotated Multi-Kernel RetinaNet for Robust Oriented Object Detection in Remote Sensing Imagery
by: Sun, Huiran
Published: (2026)
by: Sun, Huiran
Published: (2026)
HTR-VT: Handwritten Text Recognition with Vision Transformer
by: Li, Yuting, et al.
Published: (2024)
by: Li, Yuting, et al.
Published: (2024)
Autonomous Character-Scene Interaction Synthesis from Text Instruction
by: Jiang, Nan, et al.
Published: (2024)
by: Jiang, Nan, et al.
Published: (2024)
TextSculptor: Training and Benchmarking Scene Text Editing
by: Lin, Yiheng, et al.
Published: (2026)
by: Lin, Yiheng, et al.
Published: (2026)
Similar Items
-
CorrCLIP: Reconstructing Patch Correlations in CLIP for Open-Vocabulary Semantic Segmentation
by: Zhang, Dengke, et al.
Published: (2024) -
ModuSeg: Decoupling Object Discovery and Semantic Retrieval for Training-Free Weakly Supervised Segmentation
by: He, Qingze, et al.
Published: (2026) -
Exploring Token-Level Augmentation in Vision Transformer for Semi-Supervised Semantic Segmentation
by: Zhang, Dengke, et al.
Published: (2025) -
Dynamic Scene Reconstruction: Recent Advance in Real-time Rendering and Streaming
by: Zhu, Jiaxuan, et al.
Published: (2025) -
SELECT: Detecting Label Errors in Real-world Scene Text Data
by: Liu, Wenjun, et al.
Published: (2025)