Disk caching with an optical ring

Enrique V. Carrera; Ricardo Bianchini

doi:10.1364/AO.39.006663

Applied Optics, Vol. 39, Issue 35, pp. 6663-6680 (2000)
https://doi.org/10.1364/AO.39.006663

Enrique V. Carrera and Ricardo Bianchini, "Disk caching with an optical ring," Appl. Opt. 39, 6663-6680 (2000)


Abstract

We propose a simple extension to the optical network of a scalable multiprocessor that optimizes page swap outs. More specifically, we propose to extend the network with an optical ring that not only transfers swapped-out pages between the local memories and the disks of the multiprocessor but also acts as a systemwide write cache for these pages. This extended optical network confers several performance benefits: It provides a staging area where swapped-out pages can reside until the disk is free, it increases the possibility of combining several writes to disk, and it acts as a victim cache for pages that are swapped out and subsequently accessed by the same or a different processor. To evaluate the extent to which these benefits affect performance, we use detailed execution-driven simulations of several out-of-core parallel applications that run on an eight-node scalable multiprocessor. Our results demonstrate that our optical ring provides consistent performance improvements that derive mostly from faster page swap outs and victim caching. To show that our optical ring can also be applied successfully to traditional multiprocessors in which processors are interconnected with electronic networks, we evaluate its benefits for a mesh-connected multiprocessor. This latter evaluation shows that our optical ring improves performance for a traditional multiprocessor by roughly the same amount as it does for an optically interconnected multiprocessor. On the basis of these results and our parameter-space study, our main conclusion is that our optical ring is highly efficient under several architectural assumptions and for most out-of-core parallel applications. Even though our study focuses on optimizing page swap outs, we believe that caching data with an optical ring can be beneficial for other types of disk-write traffic as well.
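The three benefits named in the abstract (staging until the disk is free, combining writes, and victim caching) can be illustrated with a toy model. This is only a sketch under stated assumptions, not the authors' design: the class `RingWriteCache`, its method names, the rule that consecutive page numbers can be combined into one disk operation, and the choice to drop a page from the ring on a victim hit are all hypothetical.

```python
from collections import OrderedDict


class RingWriteCache:
    """Toy model of a shared write cache for swapped-out pages (hypothetical)."""

    def __init__(self, capacity_pages):
        self.capacity_pages = capacity_pages
        self.pages = OrderedDict()  # page_id -> data, oldest swap-out first
        self.disk = {}              # stand-in for the disks
        self.disk_ops = 0           # number of (possibly combined) disk writes
        self.victim_hits = 0        # page faults served from the ring

    def swap_out(self, page_id, data):
        """Stage a swapped-out page on the ring instead of waiting for the disk."""
        if page_id not in self.pages and len(self.pages) >= self.capacity_pages:
            # Assumption: a full ring stalls the swap-out until a drain makes room.
            self.disk_idle()
        self.pages[page_id] = data

    def disk_idle(self):
        """Called when the disk is free: write one run of consecutive pages."""
        if not self.pages:
            return 0
        first = next(iter(self.pages))  # oldest staged page
        run = [first]
        # Assumption: pages with consecutive numbers are adjacent on disk,
        # so they can be combined into a single disk operation.
        while run[-1] + 1 in self.pages:
            run.append(run[-1] + 1)
        for page_id in run:
            self.disk[page_id] = self.pages.pop(page_id)
        self.disk_ops += 1
        return len(run)

    def page_fault(self, page_id):
        """Serve a fault from the ring if the page is still there (victim hit)."""
        if page_id in self.pages:
            self.victim_hits += 1
            # Assumption: the page returns to memory and leaves the ring.
            return self.pages.pop(page_id)
        return self.disk[page_id]


cache = RingWriteCache(capacity_pages=4)
for page in (10, 11, 12, 40):
    cache.swap_out(page, f"data{page}")
recovered = cache.page_fault(40)   # still on the ring, so no disk read
pages_written = cache.disk_idle()  # pages 10, 11, 12 go out as one operation
print(recovered, pages_written, cache.disk_ops, cache.victim_hits)
```

In this toy run, four swap-outs cost a single disk operation and one page fault avoids the disk entirely, which is the kind of saving the abstract attributes to faster swap outs and victim caching.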


Figures (11) and equations (1) are available to subscribers only.

Tables (9)

Parameter	Value
Number of nodes	8
Number of I/O-enabled nodes	4
Page size	4 kbytes
TLB-miss latency	100 pcycles
TLB-shootdown latency	500 pcycles
Interrupt latency	400 pcycles
Write-buffer size	16 entries
First-level cache size	4 kbytes
First-level cache-block size	32 bytes
First-level cache-hit latency	1 pcycle
Second-level cache size	16 kbytes
Second-level cache-block size	64 bytes
Second-level cache-hit latency	12 pcycles
Memory size per node	256 kbytes
Memory-bus (setup) latency	12 pcycles
Memory-bus transfer rate	800 Mbytes/s
I/O-bus (setup) latency	8 pcycles
I/O-bus transfer rate	300 Mbytes/s
Optical transmission rate	10 Gbits/s
Time of flight	1 pcycle
WDM channels on optical ring	8
Optical ring round-trip latency	52 µs
Storage capacity on optical ring	512 kbytes
Optical storage per WDM channel	64 kbytes
Disk-controller cache size	16 kbytes
Minimum seek latency	2 ms
Maximum seek latency	22 ms
Rotational latency	4 ms
Disk transfer rate	20 Mbytes/s
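The ring's storage figures in the table above appear to follow from the optical parameters: the data held in a delay line is roughly the transmission rate multiplied by the round-trip latency. The short check below is a sketch of that arithmetic, assuming the 10 Gbits/s rate applies to each WDM channel and that 1 kbyte means 1024 bytes; neither assumption is stated in the table.

```python
# Values copied from the parameter table above.
OPTICAL_RATE_BITS_PER_S = 10_000_000_000  # assumed to be per WDM channel
ROUND_TRIP_US = 52                        # optical ring round-trip latency
WDM_CHANNELS = 8
KBYTE = 1024                              # assumption: binary kilobytes


def channel_capacity_bytes(rate_bits_per_s, round_trip_us):
    """Bytes in flight on one channel: rate x round-trip time, in integers."""
    bits_in_flight = rate_bits_per_s * round_trip_us // 1_000_000
    return bits_in_flight // 8


per_channel = channel_capacity_bytes(OPTICAL_RATE_BITS_PER_S, ROUND_TRIP_US)
ring_total = per_channel * WDM_CHANNELS

print(f"per channel: {per_channel} bytes ({per_channel / KBYTE:.1f} kbytes)")
print(f"whole ring:  {ring_total} bytes ({ring_total / KBYTE:.1f} kbytes)")
```

The result is within about 1% of the tabulated 64 kbytes per channel and 512 kbytes for the ring, with the small gap consistent with the round-trip latency being rounded to 52 µs.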

Program	Description of Function	Input Size	Total Data (Mbytes)
Em3D	Electromagnetic-wave propagation	32,000 nodes (5% remote, 10 iterations)	2.5
FFT	One-dimensional fast Fourier transform	64,000 points	3.1
Gauss	Unblocked Gaussian elimination	570 × 512 doubles	2.3
LU	Blocked LU factorization	576 × 576 doubles	2.7
MG	3-D Poisson solver that uses multigrid techniques	32 × 32 × 64 floats (10 iterations)	2.4
Radix	Integer Radix sort	320,000 keys (Radix 1024)	2.6
SOR	Successive overrelaxation	640 × 512 floats (10 iterations)	2.6

Program	Under OPTNET (Mpcycles)	Under OWCache (Mpcycles)
Em3D	49.1	1.8
FFT	70.6	3.1
Gauss	30.8	1.0
LU	40.2	1.9
MG	29.8	0.5
Radix	47.1	2.4
SOR	31.6	1.2

Program	Under OPTNET (kpcycles)	Under OWCache (kpcycles)
Em3D	192.7	2.1
FFT	382.1	43.6
Gauss	762.3	78.0
LU	393.3	41.2
MG	89.4	6.1
Radix	1223.1	2.1
SOR	661.3	2.1

Program	Under OPTNET	Under OWCache	Increase
Em3D	1.21	1.24	2%
FFT	1.50	2.06	37%
Gauss	1.06	1.07	1%
LU	1.15	1.25	9%
MG	1.20	1.27	6%
Radix	1.17	1.37	17%
SOR	1.64	2.90	77%
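The Increase column in the table above appears to be the relative change from the OPTNET value to the OWCache value, rounded to a whole percent. The sketch below recomputes it under that assumption; the helper name `relative_increase_pct` is hypothetical and the numbers are copied from the table.

```python
# (OPTNET value, OWCache value, reported increase in percent), from the table above.
rows = {
    "Em3D": (1.21, 1.24, 2),
    "FFT": (1.50, 2.06, 37),
    "Gauss": (1.06, 1.07, 1),
    "LU": (1.15, 1.25, 9),
    "MG": (1.20, 1.27, 6),
    "Radix": (1.17, 1.37, 17),
    "SOR": (1.64, 2.90, 77),
}


def relative_increase_pct(before, after):
    """Relative change from `before` to `after`, in percent."""
    return (after - before) / before * 100


# Compare the recomputed, rounded value with the reported one for each program.
matches = {
    program: round(relative_increase_pct(optnet, owcache)) == reported
    for program, (optnet, owcache, reported) in rows.items()
}
print(matches)
```

Under this reading, every reported percentage agrees with the two value columns after rounding.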

Program	Under OPTNET	Under OWCache	Increase
Em3D	1.16	1.17	1%
FFT	1.28	1.45	13%
Gauss	1.03	1.04	1%
LU	1.04	1.05	1%
MG	1.04	1.19	14%
Radix	1.08	1.12	4%
SOR	1.17	1.50	28%

Program	Optimal Prefetching	Naïve Prefetching
Em3D	9.2%	7.1%
FFT	12.8%	8.4%
Gauss	57.6%	60.9%
LU	18.9%	14.6%
MG	55.8%	46.2%
Radix	20.6%	18.0%
SOR	18.6%	30.7%

Program	Under OPTNET (kpcycles)	Under OWCache (kpcycles)	Decrease
Em3D	6.0	6.2	-3%
FFT	6.6	7.0	-6%
Gauss	5.4	7.1	-31%
LU	5.7	6.2	-9%
MG	6.1	7.4	-21%
Radix	6.4	6.9	-8%
SOR	5.7	6.3	-11%
