Advisor(s)

Miriam E. Leeser

Contributor(s)

Laurie King, James Lebak, Waleed Meleis

Date of Award

2011

Date Accepted

7-2011

Degree Grantor

Northeastern University

Degree Level

M.S.

Degree Name

Master of Science

Department or Academic Unit

College of Engineering, Department of Electrical and Computer Engineering

Keywords

computer engineering, Vforce, VSIPL, reconfigurable computing environments

Disciplines

Electrical and Computer Engineering | Engineering

Abstract

Systems with heterogeneous processing elements, such as commodity software processors combined with special-purpose processors like FPGAs or GPUs, offer enormous potential speedups for certain types of workloads. Developing programs for these systems, however, poses significant challenges. Such programs tend to interleave platform-specific code with the rest of the application code, which makes portability difficult. In addition, these systems have different programming models and tools, requiring the developer to have hardware-specific knowledge in addition to application domain expertise. Compounding these two problems is the short lifetime of these systems. A mechanism for portability across multiple architectures and generations is therefore desirable. This thesis presents Vforce, an extensible framework that extends the VSIPL++ standard to add portable and transparent support for special-purpose processors. New library elements that include portable special-purpose processor support can be added to VSIPL++ through Vforce's generic hardware interface; the user application code and binary contain nothing specific to the special-purpose processors. The decision about which, if any, special-purpose processor to use to execute the new library element is made at runtime by a hardware resource manager that runs on the system independently of the user application. This manager also provides the information necessary to bind Vforce's generic hardware interface to the specific API used by the selected special-purpose processor. The implementation of Vforce and two specific usage examples, an FFT and an adaptive time-domain beamformer, are discussed. Results for the two examples on a Cray XD1 heterogeneous supercomputer, as well as an analysis of the overhead added by Vforce, are presented. The results demonstrate the portability and performance achievable with the Vforce framework.

Document Type

Master's Thesis

Rights Information

Copyright 2011

Rights Holder

Nicholas John Moore


