ebooks logo journals logo reference works logo abstract databases logo
bullet  SIGN IN Register | Why Register? | Got a Voucher? alerts   marked lists   shopping cart 

informaworld

HOME   |   SEARCH   |   BROWSE
    Issues List       Latest Issue       Forthcoming Articles       Volume 2 Issue 1 & 2       Subscribe       Article       References       Related articles      
<< firstfirst   < prevprev   Table of contentstoc   next >next   last >>last
Publisher Logo Publication Cover
Search within this journal

FINDING OPTIMUM WAVEFRONT OF PARALLEL COMPUTATION 1  

Authors: Balaram Sinharoy a; Boleslaw K. Szymanski b
Affiliations:   a Large Scale Computing Division, IBM Corporation, Poughkeepsie, NY
b Department of Computer Science, Rensselaer Polytechnic Institute, Troy, NY
DOI: 10.1080/10637199408915404
Publication Frequency: 6 issues per year
Formats available: PDF (English)
Previously published as: Parallel Algorithms and Applications (1063-7192) until 2005
Article Requests: Order Reprints : Request Permissions
View Article: View Article (PDF) View Article (PDF)


Abstract

Data parallelism, in which the same operation is performed on many elements of an n-dimensional array, is one of the most powerful methods of extracting parallelism in scientific computation. One form of data parallelism involves defining a sequence of parallel wavefronts of a computation. Each wavefront consists of an (n - l)-dimcnsional subarray of the evaluated array and all wavefront elements are evaluated simultaneously. Different wavefronts result in different performance, so the question arises how to determine the wavefronts that result in the minimum computation time. Wavefront determination should define also allocation of wavefront elements to processors.

In this paper we present efficient algorithms for determining the optimum wavefront and for partitioning it into sections assigned to individual processors. Presented algorithms are applicable to computations that are defined over two or higher dimensional arrays and are executed on distributed memory machines interconnected into a one or two-dimensional processor array.
1 *This work sponsored in part by IBM Corp. under the Development Grant, by NSF under grant CCR-8920694 and by ONR under grant N00014-93-1-0076. The content of the information does not necessarily reflect the position or the policy of the Government, and no official endorsement should be inferred.
Keywords: Parallel programming; data dependence; algorithm transformation; uniform dependence algorithm; compile-time scheduling; hyperplane
C.R. CATEGORIES: D.1.3; D.3.4; D.4.1
view references (14)
Bookmark with:
  • CiteULike
  • Del.icio.us
  • BibSonomy
  • Connotea
  • More bookmarks
Privacy Policy | Terms & Conditions | Accessibility | RSS
FAQs in: English . Français . Español . 中文(简体和繁體)
© 2009 Informa plc