Saved in:
| Main Authors: | Xiao, Anyi, Yang, Cihui |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2504.17522 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
OG-HFYOLO :Orientation gradient guidance and heterogeneous feature fusion for deformation table cell instance segmentation
by: Liu, Long, et al.
Published: (2025)
by: Liu, Long, et al.
Published: (2025)
ClusterTabNet: Supervised clustering method for table detection and table structure recognition
by: Polewczyk, Marek, et al.
Published: (2024)
by: Polewczyk, Marek, et al.
Published: (2024)
EDM: Efficient Deep Feature Matching
by: Li, Xi, et al.
Published: (2025)
by: Li, Xi, et al.
Published: (2025)
A large-scale dataset for end-to-end table recognition in the wild
by: Yang, Fan, et al.
Published: (2023)
by: Yang, Fan, et al.
Published: (2023)
Multilevel neural networks with dual-stage feature fusion for human activity recognition
by: Brery, Abeer FathAllah, et al.
Published: (2026)
by: Brery, Abeer FathAllah, et al.
Published: (2026)
DP-Net: Learning Discriminative Parts for image recognition
by: Sicre, Ronan, et al.
Published: (2024)
by: Sicre, Ronan, et al.
Published: (2024)
A cross-modal network for facial expression recognition
by: Tian, Chunwei, et al.
Published: (2026)
by: Tian, Chunwei, et al.
Published: (2026)
Dense Semantic Matching with VGGT Prior
by: Yang, Songlin, et al.
Published: (2025)
by: Yang, Songlin, et al.
Published: (2025)
Pseudo-Unification: Entropy Probing Reveals Divergent Information Patterns in Unified Multimodal Models
by: Yang, Songlin, et al.
Published: (2026)
by: Yang, Songlin, et al.
Published: (2026)
HCR-Net: A deep learning based script independent handwritten character recognition network
by: Chauhan, Vinod Kumar, et al.
Published: (2021)
by: Chauhan, Vinod Kumar, et al.
Published: (2021)
TraceNet: Segment one thing efficiently
by: Wu, Mingyuan, et al.
Published: (2024)
by: Wu, Mingyuan, et al.
Published: (2024)
An Appearance Defect Detection Method for Cigarettes Based on C-CenterNet
by: Liu, Hongyu, et al.
Published: (2025)
by: Liu, Hongyu, et al.
Published: (2025)
SesaHand: Enhancing 3D Hand Reconstruction via Controllable Generation with Semantic and Structural Alignment
by: Zhao, Zhuoran, et al.
Published: (2026)
by: Zhao, Zhuoran, et al.
Published: (2026)
Continual-learning-based framework for structural damage recognition
by: Shu, Jiangpeng, et al.
Published: (2024)
by: Shu, Jiangpeng, et al.
Published: (2024)
Cracking the neural code for word recognition in convolutional neural networks
by: Agrawal, Aakash, et al.
Published: (2024)
by: Agrawal, Aakash, et al.
Published: (2024)
Controllable Text-to-Motion Generation via Modular Body-Part Phase Control
by: Dai, Minyue, et al.
Published: (2026)
by: Dai, Minyue, et al.
Published: (2026)
Link prediction Graph Neural Networks for structure recognition of Handwritten Mathematical Expressions
by: Nguyen, Cuong Tuan, et al.
Published: (2025)
by: Nguyen, Cuong Tuan, et al.
Published: (2025)
Enhancing point cloud analysis via neighbor aggregation correction based on cross-stage structure correlation
by: Shi, Jiaqi, et al.
Published: (2025)
by: Shi, Jiaqi, et al.
Published: (2025)
HadaSmileNet: Hadamard fusion of handcrafted and deep-learning features for enhancing facial emotion recognition of genuine smiles
by: Hasan, Mohammad Junayed, et al.
Published: (2025)
by: Hasan, Mohammad Junayed, et al.
Published: (2025)
CrossGET: Cross-Guided Ensemble of Tokens for Accelerating Vision-Language Transformers
by: Shi, Dachuan, et al.
Published: (2023)
by: Shi, Dachuan, et al.
Published: (2023)
Adversarial Masking Contrastive Learning for vein recognition
by: Qin, Huafeng, et al.
Published: (2024)
by: Qin, Huafeng, et al.
Published: (2024)
Community-aware evaluation and threshold calibration for open-set plankton image recognition
by: Chen, Xi, et al.
Published: (2026)
by: Chen, Xi, et al.
Published: (2026)
Configural processing as an optimized strategy for robust object recognition in neural networks
by: Jang, Hojin, et al.
Published: (2024)
by: Jang, Hojin, et al.
Published: (2024)
Single-stage Multi-human Parsing via Point Sets and Center-based Offsets
by: Chu, Jiaming, et al.
Published: (2023)
by: Chu, Jiaming, et al.
Published: (2023)
LaverNet: Lightweight All-in-one Video Restoration via Selective Propagation
by: Zhao, Haiyu, et al.
Published: (2025)
by: Zhao, Haiyu, et al.
Published: (2025)
Three-stage binarization of color document images based on discrete wavelet transform and generative adversarial networks
by: Ju, Rui-Yang, et al.
Published: (2022)
by: Ju, Rui-Yang, et al.
Published: (2022)
Emotion recognition in talking-face videos using persistent entropy and neural networks
by: Paluzo-Hidalgo, Eduardo, et al.
Published: (2021)
by: Paluzo-Hidalgo, Eduardo, et al.
Published: (2021)
MSPT: A Lightweight Face Image Quality Assessment Method with Multi-stage Progressive Training
by: Xiao, Xiongwei, et al.
Published: (2025)
by: Xiao, Xiongwei, et al.
Published: (2025)
Mini-ResEmoteNet: Leveraging Knowledge Distillation for Human-Centered Design
by: Murtada, Amna, et al.
Published: (2025)
by: Murtada, Amna, et al.
Published: (2025)
A survey of facial recognition techniques
by: Bahjat, Aya Kaysan
Published: (2026)
by: Bahjat, Aya Kaysan
Published: (2026)
Cross-modal learning for plankton recognition
by: Kareinen, Joona, et al.
Published: (2026)
by: Kareinen, Joona, et al.
Published: (2026)
BCFPL: Binary classification ConvNet based Fast Parking space recognition with Low resolution image
by: Zhang, Shuo, et al.
Published: (2024)
by: Zhang, Shuo, et al.
Published: (2024)
ScriptViz: A Visualization Tool to Aid Scriptwriting based on a Large Movie Database
by: Rao, Anyi, et al.
Published: (2024)
by: Rao, Anyi, et al.
Published: (2024)
A document is worth a structured record: Principled inductive bias design for document recognition
by: Meyer, Benjamin, et al.
Published: (2025)
by: Meyer, Benjamin, et al.
Published: (2025)
OmniControlNet: Dual-stage Integration for Conditional Image Generation
by: Wang, Yilin, et al.
Published: (2024)
by: Wang, Yilin, et al.
Published: (2024)
A multi-task neural network for atypical mitosis recognition under domain shift
by: Percannella, Gennaro, et al.
Published: (2025)
by: Percannella, Gennaro, et al.
Published: (2025)
LRD-Net: A Lightweight Real-Centered Detection Network for Cross-Domain Face Forgery Detection
by: Zhang, Xuecen, et al.
Published: (2026)
by: Zhang, Xuecen, et al.
Published: (2026)
UniVidX: A Unified Multimodal Framework for Versatile Video Generation via Diffusion Priors
by: Chen, Houyuan, et al.
Published: (2026)
by: Chen, Houyuan, et al.
Published: (2026)
ECC-PolypDet: Enhanced CenterNet with Contrastive Learning for Automatic Polyp Detection
by: Jiang, Yuncheng, et al.
Published: (2024)
by: Jiang, Yuncheng, et al.
Published: (2024)
Wearable-based behaviour interpolation for semi-supervised human activity recognition
by: Duan, Haoran, et al.
Published: (2024)
by: Duan, Haoran, et al.
Published: (2024)
Similar Items
-
OG-HFYOLO :Orientation gradient guidance and heterogeneous feature fusion for deformation table cell instance segmentation
by: Liu, Long, et al.
Published: (2025) -
ClusterTabNet: Supervised clustering method for table detection and table structure recognition
by: Polewczyk, Marek, et al.
Published: (2024) -
EDM: Efficient Deep Feature Matching
by: Li, Xi, et al.
Published: (2025) -
A large-scale dataset for end-to-end table recognition in the wild
by: Yang, Fan, et al.
Published: (2023) -
Multilevel neural networks with dual-stage feature fusion for human activity recognition
by: Brery, Abeer FathAllah, et al.
Published: (2026)