Acceleration of spiking neural network based pattern recognition on NVIDIA graphics processors

Bing Han; Tarek M. Taha

doi:10.1364/AO.49.000B83

Applied Optics
Vol. 49, Issue 10, pp. B83-B91 (2010)
https://doi.org/10.1364/AO.49.000B83

Bing Han and Tarek M. Taha, "Acceleration of spiking neural network based pattern recognition on NVIDIA graphics processors," Appl. Opt. 49, B83-B91 (2010)


About this Article
History
- Original Manuscript: November 3, 2009
- Revised Manuscript: February 8, 2010
- Manuscript Accepted: February 26, 2010
- Published: March 17, 2010
Virtual Issues
Virtual Journal for Biomedical Optics Vol. 5, Iss. 8

Abstract

There is currently a strong push in the research community to develop biological-scale implementations of neuron-based vision models. Systems at this scale are computationally demanding and generally use more accurate neuron models, such as the Izhikevich and Hodgkin–Huxley models, rather than the more popular integrate-and-fire model. We examine the feasibility of using graphics processing units (GPUs) to accelerate a spiking neural network based character recognition network to enable such large-scale systems. Two versions of the network, using the Izhikevich and Hodgkin–Huxley models, are implemented. Three NVIDIA general-purpose (GP) GPU platforms are examined: the GeForce 9800 GX2, the Tesla C1060, and the Tesla S1070. Our results show that GPGPUs can provide significant speedup over conventional processors. In particular, the fastest GPGPU used, the Tesla S1070, provided speedups of 5.6 and 84.4 over highly optimized implementations on the fastest central processing unit (CPU) tested, a quad-core 2.67 GHz Xeon processor, for the Izhikevich and Hodgkin–Huxley models, respectively. The CPU implementation used all four cores and the vector data parallelism offered by the processor. The results indicate that GPUs are well suited for this application domain.
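
For readers unfamiliar with the Izhikevich model named in the abstract, the following is a minimal sketch of a single neuron update in its standard published form, stepped with forward Euler. It is not the authors' GPU implementation, whose equations are not available on this page. The class name `IzhikevichNeuron`, the helper `count_spikes`, the regular-spiking parameter values (a = 0.02, b = 0.2, c = -65, d = 8), the 0.5 ms time step, and the input currents are all assumptions chosen for illustration.

```python
from dataclasses import dataclass


@dataclass
class IzhikevichNeuron:
    # Regular-spiking parameters from the standard model (assumed, not from the paper).
    a: float = 0.02
    b: float = 0.2
    c: float = -65.0   # membrane potential after a spike (mV)
    d: float = 8.0     # increment applied to the recovery variable after a spike
    v: float = -65.0   # membrane potential (mV)
    u: float = -13.0   # recovery variable, initialised to b * v

    def step(self, current: float, dt: float = 0.5) -> bool:
        """Advance the neuron by dt milliseconds; return True if it spiked."""
        dv = 0.04 * self.v ** 2 + 5.0 * self.v + 140.0 - self.u + current
        du = self.a * (self.b * self.v - self.u)
        self.v += dt * dv
        self.u += dt * du
        if self.v >= 30.0:      # spike threshold of the standard model
            self.v = self.c     # reset the membrane potential
            self.u += self.d    # bump the recovery variable
            return True
        return False


def count_spikes(current: float, duration_ms: float = 1000.0, dt: float = 0.5) -> int:
    """Hypothetical helper: drive one neuron with a constant current and count spikes."""
    neuron = IzhikevichNeuron()
    steps = int(duration_ms / dt)
    return sum(neuron.step(current, dt) for _ in range(steps))
```

Each neuron's update depends only on its own state and input current, which is the kind of data parallelism that maps naturally onto one GPU thread per neuron. The Hodgkin–Huxley model follows the same pattern but evaluates considerably more arithmetic per neuron per step.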

Tables (4)


Input image size	384 × 384	768 × 768	1536 × 1536	3072 × 3072
Total neurons	147,504	589,872	2,359,344	9,437,232
Level 2 neurons	48	48	48	48
Level 1 neurons	147,456	589,824	2,359,296	9,437,184
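
The counts in this table are consistent with one Level 1 neuron per input pixel plus a fixed 48 Level 2 neurons (for example, 384 × 384 = 147,456). The short sketch below reproduces the table under that reading. The one-neuron-per-pixel interpretation is inferred from the numbers rather than stated on this page, and the function name `neuron_counts` is hypothetical.

```python
LEVEL2_NEURONS = 48  # constant across all image sizes in the table


def neuron_counts(side: int) -> dict:
    """Return neuron counts for a square input image with `side` pixels per edge.

    Assumes one Level 1 neuron per pixel, which matches the table's figures.
    """
    level1 = side * side
    return {
        "level1": level1,
        "level2": LEVEL2_NEURONS,
        "total": level1 + LEVEL2_NEURONS,
    }


if __name__ == "__main__":
    for side in (384, 768, 1536, 3072):
        print(side, neuron_counts(side))
```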

Platform	384 × 384	768 × 768	1536 × 1536	3072 × 3072
S1070	11.7	20.1	23.7	24.6
9800 GX2	5.1	8.8	9.7	—
C1060 (×1)	1.2	1.5	1.6	1.7
C1060 (×2)	2.3	3.0	3.3	3.6

Platform	384 × 384	768 × 768	1536 × 1536	3072 × 3072
S1070	188.4	199.4	190.5	177.0
9800 GX2	74.4	74.6	—	—
C1060 (×1)	22.8	22.4	21.5	21.5
C1060 (×2)	45.6	45.1	43.0	43.0

Model/Platform	Bytes/Flop
Izhikevich (model)	2.11
Hodgkin–Huxley (model)	0.38
S1070 (card)	0.11
C1060 (card)	0.11
9800 GX2 (card)	0.11
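
One way to read this table is as a simple roofline-style bound: when a model needs more bytes per flop than a card can deliver, the kernel is limited by memory bandwidth, and the fraction of peak arithmetic throughput it can reach is at most the card's ratio divided by the model's ratio. The sketch below applies that bound to the table's values. This interpretation is an assumption added for illustration and is not taken from the paper; the function name `attainable_fraction` is hypothetical.

```python
def attainable_fraction(model_bytes_per_flop: float, card_bytes_per_flop: float) -> float:
    """Upper bound on the fraction of peak flops reachable under a roofline-style model.

    If the model demands no more bytes/flop than the card supplies, the bound is 1.0
    (compute-bound); otherwise it is the ratio of supply to demand (bandwidth-bound).
    """
    return min(1.0, card_bytes_per_flop / model_bytes_per_flop)


if __name__ == "__main__":
    card = 0.11  # bytes/flop listed for all three cards in the table
    for name, demand in (("Izhikevich", 2.11), ("Hodgkin-Huxley", 0.38)):
        print(f"{name}: at most {attainable_fraction(demand, card):.1%} of peak")
```

Under this reading the Izhikevich model is bounded at roughly 5% of peak and the Hodgkin–Huxley model at roughly 29%, which is at least qualitatively in line with the larger speedups reported for Hodgkin–Huxley.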
