Single-Class Instance Segmentation for Vectorization of Line Drawings
dc.contributor.author | Vohra, Rhythm | |
dc.contributor.supervisor | Branzan Albu, Alexandra | |
dc.date.accessioned | 2024-03-15T17:01:49Z | |
dc.date.available | 2024-03-15T17:01:49Z | |
dc.date.issued | 2024 | |
dc.degree.department | Department of Electrical and Computer Engineering | |
dc.degree.level | Master of Applied Science MASc | |
dc.description.abstract | Images can be represented and stored either in raster or in vector formats. Raster images are the most ubiquitous and are defined as matrices of pixel intensities/colours, while vector images consist of a finite set of geometric primitives, such as lines, curves, and polygons. Since geometric shapes are expressed via mathematical equations and defined by a limited number of control points, they can be manipulated in a much easier way than by directly working with pixels; hence, the vector format is much preferred to raster for image editing and understanding purposes. The conversion of a raster image into its vector correspondent is a non-trivial process, called image vectorization. Creating vector images from a given raster image can be time-consuming and requires the expertise of a skilled graphic user. This thesis explores the effectiveness of a Deep Learning based framework to vectorize raster images comprising line drawings with minimal user interventions. To improve the visual representation of the image, each stroke in the line drawing is represented with a different label and vectorized. In this document, we present an in-depth study of image vectorization, the objective of our research, challenges, potential resolutions, and compare the outcomes of our approach on six datasets consisting of different types of hand drawings. More specifically, this thesis begins by comparing raster images with vector images, the importance of image vectorization, and our objective to convert raster images to vector-based representations by accurately separating each stroke from the line drawings. In further chapters of this thesis, a Deep Learning based segmentation methodology is introduced to perform Single-Class Instance Segmentation of hand drawings to process the input raster image by labeling each pixel as belonging to a particular stroke instance. This segmentation approach is able to leverage the spatial relationships between each stroke instance. A novel loss function specifically designed to optimize our highly imbalanced datasets by scaling the margins and adding a regularization term to improve its feature selection technique. The weighted combination of our proposed margin regularized loss function is combined with the Dice loss to reduce the spatial overlap and improve the predictions over infrequent labels. Finally, the effectiveness of our segmentation technique of line drawing vectorization is compared experimentally with the state-of-the-art and our reference method. Our method can successfully handle a wide variety of human drawing styles. The results are comparable in terms of accuracy and way ahead in terms of speed and complexity, with other methods. | |
dc.description.scholarlevel | Graduate | |
dc.identifier.bibliographicCitation | Vohra, R.; Dash, A. and Branzan Albu, A. (2024). Single-Class Instance Segmentation for Vectorization of Line Drawings. In Proceedings of the 19th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 3: VISAPP, ISBN 978-989-758-679-8, ISSN 2184-4321, pages 215-226. | |
dc.identifier.uri | https://hdl.handle.net/1828/16108 | |
dc.language | English | eng |
dc.language.iso | en | |
dc.rights | Available to the World Wide Web | |
dc.subject | Image Vectorization | |
dc.subject | Segmentation | |
dc.subject | Visual Attention | |
dc.subject | Deep Learning | |
dc.subject | Computer Vision | |
dc.subject | Pattern Recognition | |
dc.title | Single-Class Instance Segmentation for Vectorization of Line Drawings | |
dc.type | Thesis |