Saved in:
Bibliographic Details
Main Author: Sulake, Nikhileswara Rao
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2604.03349
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866917383682129920
author Sulake, Nikhileswara Rao
author_facet Sulake, Nikhileswara Rao
contents YOLOv11 is the latest iteration in the You Only Look Once (YOLO) series of real-time object detectors, introducing novel architectural modules to improve feature extraction and small-object detection. In this paper, we present a detailed analysis of YOLOv11, including its backbone, neck, and head components. The model key innovations, the C3K2 blocks, Spatial Pyramid Pooling - Fast (SPPF), and C2PSA (Cross Stage Partial with Spatial Attention) modules enhance spatial feature processing while preserving speed. We compare YOLOv11 performance to prior YOLO versions on standard benchmarks, highlighting improvements in mean Average Precision (mAP) and inference speed. Our results demonstrate that YOLOv11 achieves superior accuracy without sacrificing real-time capabilities, making it well-suited for applications in autonomous driving, surveillance, and video analytics.This work formalizes YOLOv11 in a research context, providing a clear reference for future studies.
format Preprint
id arxiv_https___arxiv_org_abs_2604_03349
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle YOLOv11 Demystified: A Practical Guide to High-Performance Object Detection
Sulake, Nikhileswara Rao
Computer Vision and Pattern Recognition
YOLOv11 is the latest iteration in the You Only Look Once (YOLO) series of real-time object detectors, introducing novel architectural modules to improve feature extraction and small-object detection. In this paper, we present a detailed analysis of YOLOv11, including its backbone, neck, and head components. The model key innovations, the C3K2 blocks, Spatial Pyramid Pooling - Fast (SPPF), and C2PSA (Cross Stage Partial with Spatial Attention) modules enhance spatial feature processing while preserving speed. We compare YOLOv11 performance to prior YOLO versions on standard benchmarks, highlighting improvements in mean Average Precision (mAP) and inference speed. Our results demonstrate that YOLOv11 achieves superior accuracy without sacrificing real-time capabilities, making it well-suited for applications in autonomous driving, surveillance, and video analytics.This work formalizes YOLOv11 in a research context, providing a clear reference for future studies.
title YOLOv11 Demystified: A Practical Guide to High-Performance Object Detection
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2604.03349