Saved in:
Bibliographic Details
Main Authors: Zheng, Jian-jie, Yang, Chih-kai, Chen, Po-han, Chen, Lyn Chao-ling
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2510.04190
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866916990821597184
author Zheng, Jian-jie
Yang, Chih-kai
Chen, Po-han
Chen, Lyn Chao-ling
author_facet Zheng, Jian-jie
Yang, Chih-kai
Chen, Po-han
Chen, Lyn Chao-ling
contents In the study, the social robot act as a patrol to recognize and notify illegal parking in real-time. Dual-model pipeline method and large multimodal model were compared, and the GPT-4o multimodal model was adopted in license plate recognition without preprocessing. For moving smoothly on a flat ground, the robot navigated in a simulated parking lot in the experiments. The robot changes angle view of the camera automatically to capture the images around with the format of license plate number. From the captured images of the robot, the numbers on the plate are recognized through the GPT-4o model, and identifies legality of the numbers. When an illegal parking is detected, the robot sends Line messages to the system manager immediately. The contribution of the work is that a novel multimodal deep learning method has validated with high accuracy in license plate recognition, and a social assistive robot is also provided for solving problems in a real scenario, and can be applied in an indoor parking lot.
format Preprint
id arxiv_https___arxiv_org_abs_2510_04190
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Zenbo Patrol: A Social Assistive Robot Based on Multimodal Deep Learning for Real-time Illegal Parking Recognition and Notification
Zheng, Jian-jie
Yang, Chih-kai
Chen, Po-han
Chen, Lyn Chao-ling
Robotics
In the study, the social robot act as a patrol to recognize and notify illegal parking in real-time. Dual-model pipeline method and large multimodal model were compared, and the GPT-4o multimodal model was adopted in license plate recognition without preprocessing. For moving smoothly on a flat ground, the robot navigated in a simulated parking lot in the experiments. The robot changes angle view of the camera automatically to capture the images around with the format of license plate number. From the captured images of the robot, the numbers on the plate are recognized through the GPT-4o model, and identifies legality of the numbers. When an illegal parking is detected, the robot sends Line messages to the system manager immediately. The contribution of the work is that a novel multimodal deep learning method has validated with high accuracy in license plate recognition, and a social assistive robot is also provided for solving problems in a real scenario, and can be applied in an indoor parking lot.
title Zenbo Patrol: A Social Assistive Robot Based on Multimodal Deep Learning for Real-time Illegal Parking Recognition and Notification
topic Robotics
url https://arxiv.org/abs/2510.04190