A Language-Agnostic Compression Framework for the Bitcoin Blockchain

dc.contributor.authorPapanastassiou, Orestes
dc.contributor.supervisorThomo, Alex
dc.date.accessioned2024-01-18T17:35:31Z
dc.date.available2024-01-18T17:35:31Z
dc.date.copyright2023en_US
dc.date.issued2024-01-18
dc.degree.departmentDepartment of Computer Scienceen_US
dc.degree.levelMaster of Science M.Sc.en_US
dc.description.abstractThe surging interdisciplinary interest in Bitcoin within both academic and enterprise realms underscores the need for a versatile framework that can efficiently transform the raw Bitcoin blockchain data into a streamlined format suitable for high-performance data analysis. This research proposes an abstract framework designed to convert this data into a compact, normal form, facilitating its utilization across various programming languages and data analysis tools. Our approach centers on the development of a highly efficient, language-agnostic Application Programming Interface (API). This API is designed to be implementable by any Turing-complete programming language, ensuring broad accessibility and usability. Beyond mere data extraction, our framework extends its capabilities to assemble the Bitcoin user transaction graph, a fundamental resource for downstream analysis, including network analysis, forensics, and pattern detection. To ensure compatibility and ease of integration with an array of programming languages and data analysis software, we export the processed data to the language-agnostic HDF5 file format, recognized and supported by mainstream data analysis tools. This strategic choice empowers researchers and analysts to harness the power of Bitcoin blockchain data without being constrained by software dependencies. Furthermore, to demonstrate the practicality and efficiency of our proposed framework, we present a fully functional CPython implementation as a compelling proof-of-concept. This implementation showcases the feasibility and real-world applicability of our solution, opening doors to a wide range of data-driven investigations and applications within the realm of Bitcoin research. As the interest in Bitcoin continues to evolve and expand, our research offers a comprehensive solution to the challenges of handling and analyzing the vast data included in the blockchain. By providing an accessible, language-agnostic extraction and compression framework, we contribute to the democratization of Bitcoin blockchain data analysis, enabling both novices and experts to uncover valuable insights and drive innovations within this field.en_US
dc.description.scholarlevelGraduateen_US
dc.identifier.urihttp://hdl.handle.net/1828/15827
dc.languageEnglisheng
dc.language.isoenen_US
dc.rightsAvailable to the World Wide Weben_US
dc.subjectblockchainen_US
dc.subjectbitcoinen_US
dc.subjectdata miningen_US
dc.subjectcryptoen_US
dc.subjectcompressionen_US
dc.titleA Language-Agnostic Compression Framework for the Bitcoin Blockchainen_US
dc.typeThesisen_US

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Thesis.pdf
Size:
433.7 KB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
2 KB
Format:
Item-specific license agreed upon to submission
Description: