| Main Authors: | Ganesha Steevan, Rodrigues C., Niranjan U. |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | Hardware Architecture |
| Online Access: | https://arxiv.org/abs/2505.11525 |
| _version_ | 1866908368534241280 |
|---|---|
| author | Ganesha Steevan, Rodrigues C., Niranjan U. |
| author_facet | Ganesha Steevan, Rodrigues C., Niranjan U. |
| contents | Software configurable processors (SCPs) implement compute-intensive applications very efficiently on special on-chip configurable hardware. The SCP by Stretch Inc. converts compute-heavy algorithms into custom instructions, called extension instructions (EIs), which run on the on-chip logic. The processor interleaves the EIs between regular instructions, and the on-chip hardware executes the algorithm in parallel, accelerating the application. This results in a performance gain of more than an order of magnitude over an unaccelerated processor. This paper explains the implementation of two compute-intensive algorithms on the Stretch SCP, namely (i) colour space conversion and (ii) histogram equalisation. The repeated processing required by these algorithms is made easier by the SCP, which allows multiple pixels to be packed into a vector. This vector processing lets the SCP achieve high throughput. Profiling an application identifies the compute-intensive spots in the program, which are then computed on the on-chip hardware by issuing EIs. |
| format | Preprint |
| id | arxiv_https___arxiv_org_abs_2505_11525 |
| institution | arXiv |
| publishDate | 2025 |
| record_format | arxiv |
| title | Implementation of Compute Intensive Algorithms on Software Configurable Processor |
| topic | Hardware Architecture |
| url | https://arxiv.org/abs/2505.11525 |
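
For orientation, the two algorithms named in the abstract can be sketched in plain Python. This is a minimal illustration and not the paper's Stretch implementation. The record does not say which colour spaces are involved, so the RGB to YCbCr conversion with BT.601 full-range (JFIF) coefficients is an assumption, and the function names `rgb_to_ycbcr` and `equalise_histogram` are hypothetical. The per-pixel loops stand in for the repeated processing that the abstract says the SCP handles by packing several pixels into a vector.

```python
def rgb_to_ycbcr(pixels):
    """Convert (R, G, B) tuples to (Y, Cb, Cr) tuples.

    Assumption: BT.601 full-range coefficients as used by JFIF; the
    catalog record does not name the colour spaces the paper uses.
    """
    converted = []
    for r, g, b in pixels:
        y = 0.299 * r + 0.587 * g + 0.114 * b
        cb = 128 - 0.168736 * r - 0.331264 * g + 0.5 * b
        cr = 128 + 0.5 * r - 0.418688 * g - 0.081312 * b
        # Round each channel and clamp it to the 8-bit range.
        converted.append(
            tuple(min(255, max(0, round(c))) for c in (y, cb, cr))
        )
    return converted


def equalise_histogram(gray, levels=256):
    """Equalise a flat list of grey levels using the cumulative histogram."""
    # Count how often each grey level occurs.
    hist = [0] * levels
    for value in gray:
        hist[value] += 1

    # Build the cumulative distribution function (running total).
    cdf = []
    running = 0
    for count in hist:
        running += count
        cdf.append(running)

    n = len(gray)
    cdf_min = next(c for c in cdf if c > 0)
    if n == cdf_min:
        # Every pixel has the same level, so there is nothing to spread.
        return list(gray)

    # Lookup table that stretches the CDF over the full output range.
    lut = [
        max(0, round((c - cdf_min) / (n - cdf_min) * (levels - 1)))
        for c in cdf
    ]
    return [lut[value] for value in gray]


if __name__ == "__main__":
    print(rgb_to_ycbcr([(255, 255, 255), (0, 0, 0)]))
    print(equalise_histogram([10, 20, 30, 40]))
```

Both routines touch every pixel independently (or through a small lookup table), which is the property that makes them candidates for the vectorised extension instructions the abstract describes.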