Improving Image Matching using an Ensemble of Local Descriptors and Hardware Design

dc.contributor.authorGhaffari, Sina
dc.contributor.supervisorLi, Kin Fun
dc.contributor.supervisorCapson, David W.
dc.date.accessioned2023-04-12T19:20:09Z
dc.date.copyright2023en_US
dc.date.issued2023-04-12
dc.degree.departmentDepartment of Electrical and Computer Engineering
dc.degree.levelDoctor of Philosophy Ph.D.en_US
dc.description.abstractImage matching is one of the fundamental problems in computer vision, and has many applications such as object recognition, structure from motion, and 3D reconstruction. In this work, we aim to accelerate image matching algorithms, and improve their accuracy by proposing approaches that have a minimal impact on speed. With this in mind, we focus on handcrafted descriptor and matching algorithms. Our contributions in this dissertation are twofold. The first set of contributions are related to the acceleration of descriptor algorithms and the reduction of resource utilization for image matching by proposing novel circuits based on Field Programmable Gate Arrays (FPGAs). We use FPGAs as a platform due to their features such as parallel processing, low-power computing, and flexibility in design. This work presents a comprehensive analysis of FPGA-based implementations of the Histogram of Oriented Gradients (HOG) algorithm. A novel hardware-software co-design of the HOG algorithm is introduced to accelerate the execution of this descriptor algorithm. We propose methods such as logarithm-based bin assignment, approximate normalization, and a time-sharing protocol for sequential histogram generation for increasing the speed. A novel task allocation to optimize resource utilization on the hardware platform, in addition to acceleration of the HOG algorithm, is also presented. Next, we focus on binary descriptors and present a novel hardware implementation of the Binary Robust Invariant Scalable Keypoints (BRISK) algorithm. BRISK is faster than non-binary descriptors but is computationally expensive with respect to other binary descriptor algorithms. A new sampling pattern for the BRISK algorithm is proposed to facilitate the hardware implementation of BRISK in multiple scales. Our proposed design reduces FPGA resource utilization while maintaining the image matching accuracy. Furthermore, the proposed fully pipelined design achieves a frame rate of 78 fps on images with full HD resolution. The second set of contributions is related to improving image matching accuracy while maintaining performance in terms of computations. For this purpose, the focus is on handcrafted descriptor algorithms which are known to be more computationally efficient than deep learning based algorithms. We analyze and propose fusion of descriptor algorithms which extracts complementary information to attain higher accuracy. To this end, three ensemble methods are proposed. The first method (weighted-fusion) combines a non-binary and a binary descriptor using their weighted distance metrics. The second method (binary fusion) combines a non-binary and a binary descriptor by converting the non-binary descriptor to a binary descriptor using a learned threshold. The third method (non-binary fusion) combines a non-binary and a binary descriptor by transforming the binary descriptor to a non-binary descriptor using a learned scaling factor. Comprehensive experiments on benchmarks from the HPatches, Brown (Photo tourism) and Oxford Affine Covariant Regions datasets are provided. The experimental results and analysis demonstrate a higher mean Average Precision (mAP) of the fusion methods in comparison with the baseline algorithms. The next contribution for accuracy improvement is adding convolutional neural network (CNN) prefiltering to images prior to keypoint detection. The addition of a shallow CNN as the first step of a handcrafted algorithm to improve accuracy is proposed. The CNN is trained to filter the raw input images to achieve higher mAP. Experimental results indicate an improvement of accuracy using this method on the HPatches dataset. Finally, we demonstrate our proposed approaches on a practical application. This application is relevant to environmental research by providing the basis for a tool for the automated identification of wildlife in tracking habitat (in this case, badgers). The proposed methods outperform the commonly-used handcrafted algorithms on identifying individual badgers in multiple images using their facial characteristics. This is done without fine-tuning the algorithms on the target badger identification dataset, which shows the generality of our proposed methods.en_US
dc.description.scholarlevelGraduateen_US
dc.identifier.bibliographicCitationS. Ghaffari, P. Soleimani, K. F. Li, and D. Capson, “FPGA-based Implementation of HOG Algorithm: Techniques and Challenges,” in 2019 IEEE Pacific Rim Conference on Communications, Computers and Signal Processing (PACRIM), 2019, pp. 1–7. doi: 10.1109/PACRIM47961.2019.8985056.en_US
dc.identifier.bibliographicCitationS. Ghaffari, P. Soleimani, K. F. Li, and D. W. Capson, “Analysis and Comparison of FPGA-Based Histogram of Oriented Gradients Implementations,” IEEE Access, vol. 8, pp. 79 920–79 934, 2020. doi: 10.1109/ACCESS.2020.2989267.en_US
dc.identifier.bibliographicCitationS. Ghaffari, P. Soleimani, K. F. Li, and D. W. Capson, “A Novel Hardware–Software Co-Design and Implementation of the HOG Algorithm,” Sensors, vol. 20, no. 19, 2020, issn: 1424-8220. doi: 10.3390/s20195655. [Online]. Available: https://www.mdpi.com/1424-8220/20/19/5655.en_US
dc.identifier.bibliographicCitationS. Ghaffari, D. W. Capson, and K. F. Li, “A Fully Pipelined FPGA Architecture for Multiscale BRISK Descriptors With a Novel Hardware-Aware Sampling Pattern,” IEEE Transactions on Very Large Scale Integration (VLSI) Systems, vol. 30, no. 6, pp. 826–839, 2022. doi: 10.1109/TVLSI.2022.3151896.en_US
dc.identifier.urihttp://hdl.handle.net/1828/14940
dc.languageEnglisheng
dc.language.isoenen_US
dc.rightsAvailable to the World Wide Weben_US
dc.subjectimage matchingen_US
dc.subjectFPGAen_US
dc.subjectBRISKen_US
dc.subjectSIFTen_US
dc.subjectHOGen_US
dc.subjectHistogram of Oriented Gradientsen_US
dc.subjectHill Climbingen_US
dc.subjectbadger identificationen_US
dc.subjecthardware implementationen_US
dc.subjecthardware software co-designen_US
dc.subjectField Programmable Gate Arraysen_US
dc.subjectComputer Visionen_US
dc.subjectImage processingen_US
dc.subjectLocal Descriptoren_US
dc.subjectKeypointen_US
dc.subjectfusionen_US
dc.titleImproving Image Matching using an Ensemble of Local Descriptors and Hardware Designen_US
dc.typeThesisen_US

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Ghaffari_Sina_PhD_2023.pdf
Size:
23.15 MB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
2 KB
Format:
Item-specific license agreed upon to submission
Description: