Saved in:
Bibliographic Details
Main Authors: Farkya, Saurabh, Daniels, Zachary Alan, Raghavan, Aswin, van der Wal, Gooitzen, Isnardi, Michael, Piacentino, Michael, Zhang, David
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2408.04767
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866911982691549184
author Farkya, Saurabh
Daniels, Zachary Alan
Raghavan, Aswin
van der Wal, Gooitzen
Isnardi, Michael
Piacentino, Michael
Zhang, David
author_facet Farkya, Saurabh
Daniels, Zachary Alan
Raghavan, Aswin
van der Wal, Gooitzen
Isnardi, Michael
Piacentino, Michael
Zhang, David
contents Recent advancements in sensors have led to high resolution and high data throughput at the pixel level. Simultaneously, the adoption of increasingly large (deep) neural networks (NNs) has lead to significant progress in computer vision. Currently, visual intelligence comes at increasingly high computational complexity, energy, and latency. We study a data-driven system that combines dynamic sensing at the pixel level with computer vision analytics at the video level and propose a feedback control loop to minimize data movement between the sensor front-end and computational back-end without compromising detection and tracking precision. Our contributions are threefold: (1) We introduce anticipatory attention and show that it leads to high precision prediction with sparse activation of pixels; (2) Leveraging the feedback control, we show that the dimensionality of learned feature vectors can be significantly reduced with increased sparsity; and (3) We emulate analog design choices (such as varying RGB or Bayer pixel format and analog noise) and study their impact on the key metrics of the data-driven system. Comparative analysis with traditional pixel and deep learning models shows significant performance enhancements. Our system achieves a 10X reduction in bandwidth and a 15-30X improvement in Energy-Delay Product (EDP) when activating only 30% of pixels, with a minor reduction in object detection and tracking precision. Based on analog emulation, our system can achieve a throughput of 205 megapixels/sec (MP/s) with a power consumption of only 110 mW per MP, i.e., a theoretical improvement of ~30X in EDP.
format Preprint
id arxiv_https___arxiv_org_abs_2408_04767
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Data-Driven Pixel Control: Challenges and Prospects
Farkya, Saurabh
Daniels, Zachary Alan
Raghavan, Aswin
van der Wal, Gooitzen
Isnardi, Michael
Piacentino, Michael
Zhang, David
Computer Vision and Pattern Recognition
Artificial Intelligence
Systems and Control
Recent advancements in sensors have led to high resolution and high data throughput at the pixel level. Simultaneously, the adoption of increasingly large (deep) neural networks (NNs) has lead to significant progress in computer vision. Currently, visual intelligence comes at increasingly high computational complexity, energy, and latency. We study a data-driven system that combines dynamic sensing at the pixel level with computer vision analytics at the video level and propose a feedback control loop to minimize data movement between the sensor front-end and computational back-end without compromising detection and tracking precision. Our contributions are threefold: (1) We introduce anticipatory attention and show that it leads to high precision prediction with sparse activation of pixels; (2) Leveraging the feedback control, we show that the dimensionality of learned feature vectors can be significantly reduced with increased sparsity; and (3) We emulate analog design choices (such as varying RGB or Bayer pixel format and analog noise) and study their impact on the key metrics of the data-driven system. Comparative analysis with traditional pixel and deep learning models shows significant performance enhancements. Our system achieves a 10X reduction in bandwidth and a 15-30X improvement in Energy-Delay Product (EDP) when activating only 30% of pixels, with a minor reduction in object detection and tracking precision. Based on analog emulation, our system can achieve a throughput of 205 megapixels/sec (MP/s) with a power consumption of only 110 mW per MP, i.e., a theoretical improvement of ~30X in EDP.
title Data-Driven Pixel Control: Challenges and Prospects
topic Computer Vision and Pattern Recognition
Artificial Intelligence
Systems and Control
url https://arxiv.org/abs/2408.04767