:: Library Catalog

Bibliographic Details
Main Authors:	Xiao, Anyi; Yang, Cihui
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2504.17522

Similar Items

OG-HFYOLO: Orientation gradient guidance and heterogeneous feature fusion for deformation table cell instance segmentation
by: Liu, Long, et al.
Published: (2025)

ClusterTabNet: Supervised clustering method for table detection and table structure recognition
by: Polewczyk, Marek, et al.
Published: (2024)

EDM: Efficient Deep Feature Matching
by: Li, Xi, et al.
Published: (2025)

A large-scale dataset for end-to-end table recognition in the wild
by: Yang, Fan, et al.
Published: (2023)

Multilevel neural networks with dual-stage feature fusion for human activity recognition
by: Brery, Abeer FathAllah, et al.
Published: (2026)

DP-Net: Learning Discriminative Parts for image recognition
by: Sicre, Ronan, et al.
Published: (2024)

A cross-modal network for facial expression recognition
by: Tian, Chunwei, et al.
Published: (2026)

Dense Semantic Matching with VGGT Prior
by: Yang, Songlin, et al.
Published: (2025)

Pseudo-Unification: Entropy Probing Reveals Divergent Information Patterns in Unified Multimodal Models
by: Yang, Songlin, et al.
Published: (2026)

HCR-Net: A deep learning based script independent handwritten character recognition network
by: Chauhan, Vinod Kumar, et al.
Published: (2021)

TraceNet: Segment one thing efficiently
by: Wu, Mingyuan, et al.
Published: (2024)

An Appearance Defect Detection Method for Cigarettes Based on C-CenterNet
by: Liu, Hongyu, et al.
Published: (2025)

SesaHand: Enhancing 3D Hand Reconstruction via Controllable Generation with Semantic and Structural Alignment
by: Zhao, Zhuoran, et al.
Published: (2026)

Continual-learning-based framework for structural damage recognition
by: Shu, Jiangpeng, et al.
Published: (2024)

Cracking the neural code for word recognition in convolutional neural networks
by: Agrawal, Aakash, et al.
Published: (2024)

Controllable Text-to-Motion Generation via Modular Body-Part Phase Control
by: Dai, Minyue, et al.
Published: (2026)

Link prediction Graph Neural Networks for structure recognition of Handwritten Mathematical Expressions
by: Nguyen, Cuong Tuan, et al.
Published: (2025)

Enhancing point cloud analysis via neighbor aggregation correction based on cross-stage structure correlation
by: Shi, Jiaqi, et al.
Published: (2025)

HadaSmileNet: Hadamard fusion of handcrafted and deep-learning features for enhancing facial emotion recognition of genuine smiles
by: Hasan, Mohammad Junayed, et al.
Published: (2025)

CrossGET: Cross-Guided Ensemble of Tokens for Accelerating Vision-Language Transformers
by: Shi, Dachuan, et al.
Published: (2023)

Adversarial Masking Contrastive Learning for vein recognition
by: Qin, Huafeng, et al.
Published: (2024)

Community-aware evaluation and threshold calibration for open-set plankton image recognition
by: Chen, Xi, et al.
Published: (2026)

Configural processing as an optimized strategy for robust object recognition in neural networks
by: Jang, Hojin, et al.
Published: (2024)

Single-stage Multi-human Parsing via Point Sets and Center-based Offsets
by: Chu, Jiaming, et al.
Published: (2023)

LaverNet: Lightweight All-in-one Video Restoration via Selective Propagation
by: Zhao, Haiyu, et al.
Published: (2025)

Three-stage binarization of color document images based on discrete wavelet transform and generative adversarial networks
by: Ju, Rui-Yang, et al.
Published: (2022)

Emotion recognition in talking-face videos using persistent entropy and neural networks
by: Paluzo-Hidalgo, Eduardo, et al.
Published: (2021)

MSPT: A Lightweight Face Image Quality Assessment Method with Multi-stage Progressive Training
by: Xiao, Xiongwei, et al.
Published: (2025)

Mini-ResEmoteNet: Leveraging Knowledge Distillation for Human-Centered Design
by: Murtada, Amna, et al.
Published: (2025)

A survey of facial recognition techniques
by: Bahjat, Aya Kaysan
Published: (2026)

Cross-modal learning for plankton recognition
by: Kareinen, Joona, et al.
Published: (2026)

BCFPL: Binary classification ConvNet based Fast Parking space recognition with Low resolution image
by: Zhang, Shuo, et al.
Published: (2024)

ScriptViz: A Visualization Tool to Aid Scriptwriting based on a Large Movie Database
by: Rao, Anyi, et al.
Published: (2024)

A document is worth a structured record: Principled inductive bias design for document recognition
by: Meyer, Benjamin, et al.
Published: (2025)

OmniControlNet: Dual-stage Integration for Conditional Image Generation
by: Wang, Yilin, et al.
Published: (2024)

A multi-task neural network for atypical mitosis recognition under domain shift
by: Percannella, Gennaro, et al.
Published: (2025)

LRD-Net: A Lightweight Real-Centered Detection Network for Cross-Domain Face Forgery Detection
by: Zhang, Xuecen, et al.
Published: (2026)

UniVidX: A Unified Multimodal Framework for Versatile Video Generation via Diffusion Priors
by: Chen, Houyuan, et al.
Published: (2026)

ECC-PolypDet: Enhanced CenterNet with Contrastive Learning for Automatic Polyp Detection
by: Jiang, Yuncheng, et al.
Published: (2024)

Wearable-based behaviour interpolation for semi-supervised human activity recognition
by: Duan, Haoran, et al.
Published: (2024)