Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Author:	Sulake, Nikhileswara Rao
Format:	Preprint
Published:	2026
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2604.03349
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866917383682129920
author	Sulake, Nikhileswara Rao
author_facet	Sulake, Nikhileswara Rao
contents	YOLOv11 is the latest iteration in the You Only Look Once (YOLO) series of real-time object detectors, introducing novel architectural modules to improve feature extraction and small-object detection. In this paper, we present a detailed analysis of YOLOv11, including its backbone, neck, and head components. The model key innovations, the C3K2 blocks, Spatial Pyramid Pooling - Fast (SPPF), and C2PSA (Cross Stage Partial with Spatial Attention) modules enhance spatial feature processing while preserving speed. We compare YOLOv11 performance to prior YOLO versions on standard benchmarks, highlighting improvements in mean Average Precision (mAP) and inference speed. Our results demonstrate that YOLOv11 achieves superior accuracy without sacrificing real-time capabilities, making it well-suited for applications in autonomous driving, surveillance, and video analytics.This work formalizes YOLOv11 in a research context, providing a clear reference for future studies.
format	Preprint
id	arxiv_https___arxiv_org_abs_2604_03349
institution	arXiv
publishDate	2026
record_format	arxiv
spellingShingle	YOLOv11 Demystified: A Practical Guide to High-Performance Object Detection Sulake, Nikhileswara Rao Computer Vision and Pattern Recognition YOLOv11 is the latest iteration in the You Only Look Once (YOLO) series of real-time object detectors, introducing novel architectural modules to improve feature extraction and small-object detection. In this paper, we present a detailed analysis of YOLOv11, including its backbone, neck, and head components. The model key innovations, the C3K2 blocks, Spatial Pyramid Pooling - Fast (SPPF), and C2PSA (Cross Stage Partial with Spatial Attention) modules enhance spatial feature processing while preserving speed. We compare YOLOv11 performance to prior YOLO versions on standard benchmarks, highlighting improvements in mean Average Precision (mAP) and inference speed. Our results demonstrate that YOLOv11 achieves superior accuracy without sacrificing real-time capabilities, making it well-suited for applications in autonomous driving, surveillance, and video analytics.This work formalizes YOLOv11 in a research context, providing a clear reference for future studies.
title	YOLOv11 Demystified: A Practical Guide to High-Performance Object Detection
topic	Computer Vision and Pattern Recognition
url	https://arxiv.org/abs/2604.03349

Similar Items