Text augmentation using a graph-based approach and clonal selection algorithm
| dc.contributor.author | Ahmed, Hadeer | |
| dc.contributor.author | Traore, Issa | |
| dc.contributor.author | Mamun, Mohammad | |
| dc.contributor.author | Saad, Sherif | |
| dc.date.accessioned | 2023-11-06T17:50:09Z | |
| dc.date.available | 2023-11-06T17:50:09Z | |
| dc.date.copyright | 2023 | en_US |
| dc.date.issued | 2023 | |
| dc.description.abstract | Annotated data is critical for machine learning models, but producing large amounts of data with high-quality labeling is a time-consuming and labor-intensive process. Natural language processing (NLP) and machine learning models have traditionally relied on the labels given by human annotators with varying degrees of competency, training, and experience. These kinds of labels are incredibly problematic because they are defined and enforced by arbitrary and ambiguous standards. In order to solve these issues of insufficient high-quality labels, researchers are now investigating automated methods for enhancing training and testing data sets. In this paper, we demonstrate how our proposed method improves the quality and quantity of data in two cybersecurity problems (fake news identification & sensitive data leak) by employing the clonal selection algorithm (CLONALG) and abstract meaning representation (AMR) graphs, and how it improves the performance of a classifier by at least 5% on two datasets. | en_US |
| dc.description.reviewstatus | Reviewed | en_US |
| dc.description.scholarlevel | Faculty | en_US |
| dc.description.sponsorship | This project was supported in part by collaborative research funding from the National Research Council of Canada’s Artificial Intelligence for Logistics Program. | en_US |
| dc.identifier.citation | Ahmed, H., Traore, I., Mamun, M., Saad, S. (2023). Text augmentation using a graph-based approach and clonal selection algorithm. Machine Learning with Applications, 11, 100452. https://doi.org/10.1016/j.mlwa.2023.100452. | en_US |
| dc.identifier.uri | https://doi.org/10.1016/j.mlwa.2023.100452 | |
| dc.identifier.uri | http://hdl.handle.net/1828/15585 | |
| dc.language.iso | en | en_US |
| dc.publisher | Machine Learning with Applications | en_US |
| dc.subject | Data augmentation | |
| dc.subject | Unstructured data | |
| dc.subject | Cybersecurity | |
| dc.subject | Text generation | |
| dc.subject | Clonal selection | |
| dc.subject.department | Department of Electrical and Computer Engineering | |
| dc.title | Text augmentation using a graph-based approach and clonal selection algorithm | en_US |
| dc.type | Article | en_US |