KFusion: obtaining modularity and performance with regards to general purpose GPU computing and co-processors
| dc.contributor.author | Kiemele, Liam | |
| dc.contributor.supervisor | Coady, Yvonne | |
| dc.date.accessioned | 2012-12-14T21:13:06Z | |
| dc.date.available | 2012-12-14T21:13:06Z | |
| dc.date.copyright | 2012 | en_US |
| dc.date.issued | 2012-12-14 | |
| dc.degree.department | Department of Computer Science | |
| dc.degree.level | Master of Science M.Sc. | en_US |
| dc.description.abstract | Concurrency has recently come to the forefront of computing as multi-core processors become more and more common. General purpose graphics processing unit computing brings with them new language support for dealing with co-processor environments such as OpenCL and CUDA. Programming language support for multi-core architectures introduces a fundamentally new mechanism for modularity--a kernel. Developers attempting to leverage these mechanism to separate concerns often incur unanticipated performance penalties. My proposed solution aims to preserve the benefits of kernel boundaries for modularity, while at the same time eliminate these inherent costs at compile time and execution. KFusion is a prototype tool for transforming programs written in OpenCL to make them more efficient. By leveraging loop fusion and deforestation, it can eliminate the costs associated with compositions of kernels that share data. Case studies show that Kfusion can address key memory bandwidth and latency bottlenecks and result in substantial performance improvements. | en_US |
| dc.description.scholarlevel | Graduate | en_US |
| dc.identifier.uri | http://hdl.handle.net/1828/4357 | |
| dc.language | English | eng |
| dc.language.iso | en | en_US |
| dc.rights.temp | Available to the World Wide Web | en_US |
| dc.subject | modularity | en_US |
| dc.subject | performance | en_US |
| dc.subject | OpenCL | en_US |
| dc.subject | concurrency | en_US |
| dc.subject | parallelism | en_US |
| dc.title | KFusion: obtaining modularity and performance with regards to general purpose GPU computing and co-processors | en_US |
| dc.type | Thesis | en_US |