PICS: Probabilistic Inference for ChIP-seq

dc.contributor.authorZhang, Xeukui
dc.contributor.authorRobertson, G.
dc.contributor.authorKrzywinski, M.
dc.contributor.authorDroit, A.
dc.contributor.authorJones, S.
dc.contributor.authorGottardo, R.
dc.contributor.authorNing, Kaida
dc.date.accessioned2021-08-18T18:30:41Z
dc.date.available2021-08-18T18:30:41Z
dc.date.copyright2011en_US
dc.date.issued2011
dc.description.abstractChIP-seq, which combines chromatin immunoprecipitation with massively parallel short-read sequencing, can profile in vivo genome-wide transcription factor-DNA asso- ciation with higher sensitivity, specificity and spatial resolution than ChIP-chip. While it presents new opportunities for research, ChIP-seq poses new challenges for statistical analysis that derive from the complexity of the biological systems characterized and the variability and biases in its digital sequence data. We propose a method called PICS (Probabilistic Inference for ChIP-seq) for extracting information from ChIP-seq aligned-read data in order to identify regions bound by transcription factors. PICS identifies enriched regions by modeling local concentrations of directional reads, and uses DNA fragment length prior information to discriminate closely adjacent bind- ing events via a Bayesian hierarchical t-mixture model. Its per-event fragment length estimates also allow it to remove from analysis regions that have atypical lengths. PICS uses pre-calculated, whole-genome read mappability profiles and a truncated t- distribution to adjust binding event models for reads that are missing due to local genome repetitiveness. It estimates uncertainties in model parameters that can be used to define confidence regions on binding event locations and to filter estimates. Finally, PICS calculates a per-event enrichment score relative to a control sample, and can use a control sample to estimate a false discovery rate. We compared PICS to the alternative methods MACS, QuEST, and CisGenome, using published GABP and FOXA1 data sets from human cell lines, and found that PICS’ predicted binding sites were more consistent with computationally predicted binding motifs.en_US
dc.description.reviewstatusRevieweden_US
dc.description.scholarlevelFacultyen_US
dc.description.sponsorshipThis research is supported by an NSERC Discovery Grant (RG and XZ).en_US
dc.identifier.citationZhang, X., Robertson, G., Krzywinski, M., Ning, K., Droit, A., Jones, S., & Gottardo, R. (2016). PICS: Probabilistic inference for ChIP-seq. Biometrics, 67(1):151-63. https://doi.org/10.1111/j.1541-0420.2010.01441.xen_US
dc.identifier.urihttps://doi.org/10.1111/j.1541-0420.2010.01441.x
dc.identifier.urihttp://hdl.handle.net/1828/13277
dc.language.isoenen_US
dc.publisherBiometricsen_US
dc.subjectBayesian hierarchical model
dc.subjectChIP-seq
dc.subjectEM algorithm
dc.subjectMappability
dc.subjectMissing values
dc.subjectMixture model
dc.subjectTranscription factor
dc.subjectTruncated data
dc.subjectt-distribution
dc.subject.departmentDepartment of Mathematics and Statistics
dc.titlePICS: Probabilistic Inference for ChIP-seqen_US
dc.typeArticleen_US

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
zhang_xuekui_biometrics_2011.pdf
Size:
392.71 KB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
2 KB
Format:
Item-specific license agreed upon to submission
Description: