STUDY OF SCALAR COMPILATION TECHNIQUES FOR PIPELINED SUPERCOMPUTERS.

Shlomo Weiss*, James E. Smith

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; peer-review

36 Scopus citations

Abstract

Two compilation techniques for enhancing scalar performance in high-speed scientific processors are studied, namely software pipelining and loop unrolling. The impact of the architecture (size of the register file) and of the hardware (size of the instruction buffer) on the efficiency of loop unrolling is examined. A methodology for classifying software pipelining techniques is developed. For loop unrolling, a straightforward scheduling algorithm is shown to produce near-optimal results when not inhibited by recurrences or memory hazards. Software pipelining requires less hardware but also achieves less speedup. It is shown that the performance produced with a modified CRAY-1S scalar architecture and a code scheduler utilizing loop unrolling is comparable to the performance achieved by the CRAY-1S with a vector unit and the CFT vectorizing Fortran compiler.
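The two transformations named in the abstract can be sketched at source level as follows. This is an illustration only, not the paper's scheduler or any CRAY-1S code: the loop (y = a*x + y), the unroll factor of 4, and the function names `daxpy_rolled`, `daxpy_unrolled4`, and `daxpy_pipelined` are assumptions chosen for the example. Python shows only the shape of the transformed loops; any performance effect would appear only in compiled code on a pipelined machine.

```python
def daxpy_rolled(a, x, y):
    """Reference loop: one element per iteration."""
    out = list(y)
    for i in range(len(x)):
        out[i] = a * x[i] + y[i]
    return out


def daxpy_unrolled4(a, x, y):
    """Loop unrolled by 4, with a cleanup loop for the leftover elements."""
    n = len(x)
    out = list(y)
    main = n - n % 4  # largest multiple of 4 that fits in n
    for i in range(0, main, 4):
        # Four independent groups of loads: a scheduler may interleave them.
        # Note the cost: more live temporaries and a longer loop body.
        x0, x1, x2, x3 = x[i], x[i + 1], x[i + 2], x[i + 3]
        y0, y1, y2, y3 = y[i], y[i + 1], y[i + 2], y[i + 3]
        out[i] = a * x0 + y0
        out[i + 1] = a * x1 + y1
        out[i + 2] = a * x2 + y2
        out[i + 3] = a * x3 + y3
    for i in range(main, n):  # cleanup loop
        out[i] = a * x[i] + y[i]
    return out


def daxpy_pipelined(a, x, y):
    """Software-pipelined form: the loads for iteration i+1 are issued
    alongside the compute and store of iteration i."""
    n = len(x)
    out = list(y)
    if n == 0:
        return out
    xi, yi = x[0], y[0]  # prologue: loads for iteration 0
    for i in range(n - 1):
        x_next, y_next = x[i + 1], y[i + 1]  # kernel: next iteration's loads
        out[i] = a * xi + yi                 # kernel: current compute and store
        xi, yi = x_next, y_next
    out[n - 1] = a * xi + yi  # epilogue: finish the last iteration
    return out
```

In this sketch the unrolled version holds eight loaded values live at once and has a loop body roughly four times as long, which is where the register file and instruction buffer sizes mentioned in the abstract come into play. The pipelined version keeps a single copy of the loop body plus a short prologue and epilogue.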

Original language: English
Title of host publication: Unknown Host Publication Title
Publisher: IEEE
Pages: 105-109
Number of pages: 5
ISBN (Print): 0818608056
State: Published - 1987
Externally published: Yes
