Acceleration of the discrete element method on a reconfigurable co-processor

Carrion Schafer, Benjamin (2003). Acceleration of the discrete element method on a reconfigurable co-processor. University of Birmingham. Ph.D.

[img]
Preview
Acceleration_of_the_DEM_on_a_Reconfigurable_Co-Processor_2003.pdf
PDF (acrobat professional)

Download (6MB)

Abstract

Granular materials are important for many different disciplines, e.g. geomechanics, civil engineering and chemical engineering. Many approaches have been used to model their behaviour, but one of the best and most important is the Discrete Element Method (DEM). The DEM was first developed during the 70’s, but its widespread use has been hampered by its extremely computationally demanding nature. The DEM can be run on a parallel computer by farming out different sub-domains onto different processors. However, particles transiting from one sub-domain to another create communication and synchronisation overheads which limit the speed-up achieved by parallel processing. Also, if some cells become much more heavily populated than others, then there will be inefficiencies due to load imbalance between the processors. As a result of these effects, the speed-up achieved by running the DEM on parallel processor computers is far less than linear. This thesis describes work on the acceleration of the DEM using reconfigurable computing. A custom hardware architecture for the DEM has been designed and implemented on a Field Programmable Gate Array (FPGA) mounted on a reconfigurable computing card. The design exploits the low level parallelism of the DEM by using long, wide computational pipelines that compute many arithmetic operations concurrently. It also exploits the high level parallelism by overlapping the main computational tasks using domain decomposition techniques. Speed-ups of a factor of at least 30 per FPGA have been achieved for simulations involving 25,000 to 200,000 particles. A multi-FPGA system has been implemented that allows the full overlap of computation with communication, so that an almost linear speed-up can be achieved as the number of FPGAs is increased. The effect of the short wordlength arithmetic used in the FPGA has been investigated, and the accuracy of the simulations has been found to be acceptable.

Type of Work: Thesis (Doctorates > Ph.D.)
Award Type: Doctorates > Ph.D.
Supervisor(s):
Supervisor(s)EmailORCID
Quigley, Steven FrancisUNSPECIFIEDUNSPECIFIED
Chan, Andrew H. C.UNSPECIFIEDUNSPECIFIED
Licence:
College/Faculty: Schools (1998 to 2008) > School of Engineering
School or Department: School of Engineering, Department of Electronic, Electrical and Systems Engineering
Funders: Other
Other Funders: The University of Birmingham
Subjects: T Technology > TK Electrical engineering. Electronics Nuclear engineering
T Technology > TA Engineering (General). Civil engineering (General)
URI: http://etheses.bham.ac.uk/id/eprint/94

Actions

Request a Correction Request a Correction
View Item View Item

Downloads

Downloads per month over past year