Low-latency hierarchical routing of reconfigurable neuromorphic systems

A reconfigurable hardware accelerator implementation for spiking neural network (SNN) simulation using field-programmable gate arrays (FPGAs) is promising and attractive research because massive parallelism results in better execution speed. For large-scale SNN simulations, a large number of FPGAs a...

Full description

Saved in:
Bibliographic Details
Main Authors: Samalika Perera, Ying Xu, André van Schaik, Runchun Wang
Format: Article
Language:English
Published: Frontiers Media S.A. 2025-02-01
Series:Frontiers in Neuroscience
Subjects:
Online Access:https://www.frontiersin.org/articles/10.3389/fnins.2025.1493623/full
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832542285871448064
author Samalika Perera
Ying Xu
André van Schaik
Runchun Wang
author_facet Samalika Perera
Ying Xu
André van Schaik
Runchun Wang
author_sort Samalika Perera
collection DOAJ
description A reconfigurable hardware accelerator implementation for spiking neural network (SNN) simulation using field-programmable gate arrays (FPGAs) is promising and attractive research because massive parallelism results in better execution speed. For large-scale SNN simulations, a large number of FPGAs are needed. However, inter-FPGA communication bottlenecks cause congestion, data losses, and latency inefficiencies. In this work, we employed a hierarchical tree-based interconnection architecture for multi-FPGAs. This architecture is scalable as new branches can be added to a tree, maintaining a constant local bandwidth. The tree-based approach contrasts with linear Network on Chip (NoC), where congestion can arise from numerous connections. We propose a routing architecture that introduces an arbiter mechanism by employing stochastic arbitration considering data level queues of First In, First Out (FIFO) buffers. This mechanism effectively reduces the bottleneck caused by FIFO congestion, resulting in improved overall latency. Results present measurement data collected for performance analysis of latency. We compared the performance of the design using our proposed stochastic routing scheme to a traditional round-robin architecture. The results demonstrate that the stochastic arbiters achieve lower worst-case latency and improved overall performance compared to the round-robin arbiters.
format Article
id doaj-art-0e09a00a403342169380bcf534530d26
institution Kabale University
issn 1662-453X
language English
publishDate 2025-02-01
publisher Frontiers Media S.A.
record_format Article
series Frontiers in Neuroscience
spelling doaj-art-0e09a00a403342169380bcf534530d262025-02-04T06:32:02ZengFrontiers Media S.A.Frontiers in Neuroscience1662-453X2025-02-011910.3389/fnins.2025.14936231493623Low-latency hierarchical routing of reconfigurable neuromorphic systemsSamalika PereraYing XuAndré van SchaikRunchun WangA reconfigurable hardware accelerator implementation for spiking neural network (SNN) simulation using field-programmable gate arrays (FPGAs) is promising and attractive research because massive parallelism results in better execution speed. For large-scale SNN simulations, a large number of FPGAs are needed. However, inter-FPGA communication bottlenecks cause congestion, data losses, and latency inefficiencies. In this work, we employed a hierarchical tree-based interconnection architecture for multi-FPGAs. This architecture is scalable as new branches can be added to a tree, maintaining a constant local bandwidth. The tree-based approach contrasts with linear Network on Chip (NoC), where congestion can arise from numerous connections. We propose a routing architecture that introduces an arbiter mechanism by employing stochastic arbitration considering data level queues of First In, First Out (FIFO) buffers. This mechanism effectively reduces the bottleneck caused by FIFO congestion, resulting in improved overall latency. Results present measurement data collected for performance analysis of latency. We compared the performance of the design using our proposed stochastic routing scheme to a traditional round-robin architecture. The results demonstrate that the stochastic arbiters achieve lower worst-case latency and improved overall performance compared to the round-robin arbiters.https://www.frontiersin.org/articles/10.3389/fnins.2025.1493623/fullneuromorphic engineeringFPGA accelerationmulti-FPGAnetworks on chiptransceiversSNN
spellingShingle Samalika Perera
Ying Xu
André van Schaik
Runchun Wang
Low-latency hierarchical routing of reconfigurable neuromorphic systems
Frontiers in Neuroscience
neuromorphic engineering
FPGA acceleration
multi-FPGA
networks on chip
transceivers
SNN
title Low-latency hierarchical routing of reconfigurable neuromorphic systems
title_full Low-latency hierarchical routing of reconfigurable neuromorphic systems
title_fullStr Low-latency hierarchical routing of reconfigurable neuromorphic systems
title_full_unstemmed Low-latency hierarchical routing of reconfigurable neuromorphic systems
title_short Low-latency hierarchical routing of reconfigurable neuromorphic systems
title_sort low latency hierarchical routing of reconfigurable neuromorphic systems
topic neuromorphic engineering
FPGA acceleration
multi-FPGA
networks on chip
transceivers
SNN
url https://www.frontiersin.org/articles/10.3389/fnins.2025.1493623/full
work_keys_str_mv AT samalikaperera lowlatencyhierarchicalroutingofreconfigurableneuromorphicsystems
AT yingxu lowlatencyhierarchicalroutingofreconfigurableneuromorphicsystems
AT andrevanschaik lowlatencyhierarchicalroutingofreconfigurableneuromorphicsystems
AT runchunwang lowlatencyhierarchicalroutingofreconfigurableneuromorphicsystems