Software Benchmark—Classification Tree Algorithms for Cell Atlases Annotation Using Single-Cell RNA-Sequencing Data

dc.contributor.author: Alaqeeli, O.
dc.contributor.author: Xing, L.
dc.contributor.author: Zhang, Xuekui
dc.date.accessioned: 2021-08-18T17:18:27Z
dc.date.available: 2021-08-18T17:18:27Z
dc.date.copyright: 2021
dc.date.issued: 2021
dc.description.abstract: The classification tree is a widely used machine learning method. It has multiple implementations as R packages: rpart, ctree, evtree, tree, and C5.0. These implementations differ in their details, so their performance varies from one application to another. We are interested in their performance in classifying cells using single-cell RNA-sequencing data. In this paper, we conducted a benchmark study using 22 single-cell RNA-sequencing data sets. Using cross-validation, we compared the packages' prediction performance based on their Precision, Recall, F1-score, and Area Under the Curve (AUC). We also compared the Complexity and Run-time of these R packages. Our study shows that rpart and evtree have the best Precision; evtree is the best in Recall, F1-score, and AUC; C5.0 prefers more complex trees; and tree is consistently much faster than the others, although its complexity is often higher.
dc.description.reviewstatus: Reviewed
dc.description.scholarlevel: Faculty
dc.description.sponsorship: The research was funded by the Natural Sciences and Engineering Research Council of Canada (NSERC) Discovery Grants (LX and XZ) and Canada Research Chair Grant (XZ). This research was enabled in part by support provided by WestGrid (www.westgrid.ca, accessed on 6 April 2021) and Compute Canada (www.computecanada.ca, accessed on 6 April 2021).
dc.identifier.citation: Alaqeeli, O., Xing, L., Zhang, X. (2021). Software benchmark—Classification tree algorithms for cell atlases annotation using single-cell RNA-sequencing data. Microbiology Research, 12, 317-334. https://doi.org/10.3390/microbiolres12020022
dc.identifier.uri: https://doi.org/10.3390/microbiolres12020022
dc.identifier.uri: http://hdl.handle.net/1828/13272
dc.language.iso: en
dc.publisher: Microbiology Research
dc.subject: classification tree
dc.subject: single-cell RNA-sequencing
dc.subject: benchmark
dc.subject: precision
dc.subject: recall
dc.subject: F1-score
dc.subject: complexity
dc.subject: area under the curve
dc.subject: run-time
dc.title: Software Benchmark—Classification Tree Algorithms for Cell Atlases Annotation Using Single-Cell RNA-Sequencing Data
dc.type: Article

Files

Original bundle
Name: alaqeeli_omar_microbiol.res.2021.pdf
Size: 2.02 MB
Format: Adobe Portable Document Format

License bundle
Name: license.txt
Size: 2 KB
Format: Item-specific license agreed upon to submission