Performance improvements for iterative electron tomography reconstruction using graphics processing units (GPUs)

Title: Performance improvements for iterative electron tomography reconstruction using graphics processing units (GPUs)
Publication Type: Journal Article
Year of Publication: 2011
Authors: W. J. Palenstijn, K. J. Batenburg, and J. Sijbers
Journal: Journal of Structural Biology
Volume: 176
Issue: 2
Pagination: 250-253
Date Published: 2011 Nov
ISSN: 1095-8657
Abstract:

Iterative reconstruction algorithms are becoming increasingly important in electron tomography of biological samples. These algorithms, however, impose major computational demands. Parallelization must be employed to maintain acceptable running times. Graphics Processing Units (GPUs) have been demonstrated to be highly cost-effective for carrying out these computations with a high degree of parallelism. In a recent paper by Xu et al. (2010), a GPU implementation strategy was presented that obtains a speedup of an order of magnitude over a previously proposed GPU-based electron tomography implementation. In this technical note, we demonstrate that by making alternative design decisions in the GPU implementation, an additional speedup can be obtained, again of an order of magnitude. By carefully considering memory access locality when dividing the workload among blocks of threads, the GPU's cache is used more efficiently, making more effective use of the available memory bandwidth.

DOI: 10.1016/j.jsb.2011.07.017
Alternate Journal: J. Struct. Biol.
PubMed ID: 21840398