Research on modern computer architecture optimization techniques: implementation and measurements for big data processing

dc.contributor.authorHe, Yan
dc.contributor.supervisorWeber, Jens
dc.date.accessioned2021-09-11T19:37:42Z
dc.date.available2021-09-11T19:37:42Z
dc.date.copyright2021en_US
dc.date.issued2021-09-11
dc.degree.departmentDepartment of Computer Scienceen_US
dc.degree.levelMaster of Science M.Sc.en_US
dc.description.abstractWith the rapid development of big data computing, our programmers need to improve the efficiency of big data processing. In our daily development process, we normally focus on bug-free deployment and often overlook another aspect of software engineering, performance optimization. Actually, a great deal of programming effort is required to achieve good performance. Many techniques are having the potential to significantly improve software performance. To achieve efficiency, I firstly conclude some optimization techniques dealing with memory-bound issues then moving to parallel programming to deal with compute-bound problems. As an example, we apply as many techniques that I mention in the report to a popular algorithm implementation, PageRank based on C++17. By providing constant feedback from performance measurement and profiling, we can see a drastic speed up after implementing all these optimization techniques. I also experiment with a real-world graph dataset provided by IMDb which can rank the top movies, and also evaluate performance before and after optimization. This report hopefully can provide potential future direction towards applying different optimization techniques to various big data processing applications.en_US
dc.description.scholarlevelGraduateen_US
dc.identifier.urihttp://hdl.handle.net/1828/13378
dc.language.isoenen_US
dc.rightsAvailable to the World Wide Weben_US
dc.subjectbig dataen_US
dc.subjectdata localityen_US
dc.subjectOpenMPen_US
dc.subjectGPU computingen_US
dc.subjectprofile dataen_US
dc.subjectCUDAen_US
dc.subjectPageRanken_US
dc.subjectmovie referenceen_US
dc.subjectsingle-threaded optimizationen_US
dc.subjectmulti-core optimizationen_US
dc.titleResearch on modern computer architecture optimization techniques: implementation and measurements for big data processingen_US
dc.typeprojecten_US

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
He_Yan_MSc_2021.pdf
Size:
3.54 MB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
2 KB
Format:
Item-specific license agreed upon to submission
Description: