PICS: Probabilistic Inference for ChIP-seq

Zhang, Xuekui; Robertson, G.; Krzywinski, M.; Droit, A.; Jones, S.; Gottardo, R.; Ning, Kaida

dc.contributor.author	Zhang, Xuekui
dc.contributor.author	Robertson, G.
dc.contributor.author	Krzywinski, M.
dc.contributor.author	Droit, A.
dc.contributor.author	Jones, S.
dc.contributor.author	Gottardo, R.
dc.contributor.author	Ning, Kaida
dc.date.accessioned	2021-08-18T18:30:41Z
dc.date.available	2021-08-18T18:30:41Z
dc.date.copyright	2011	en_US
dc.date.issued	2011
dc.description.abstract	ChIP-seq, which combines chromatin immunoprecipitation with massively parallel short-read sequencing, can profile in vivo genome-wide transcription factor-DNA association with higher sensitivity, specificity and spatial resolution than ChIP-chip. While it presents new opportunities for research, ChIP-seq poses new challenges for statistical analysis that derive from the complexity of the biological systems characterized and the variability and biases in its digital sequence data. We propose a method called PICS (Probabilistic Inference for ChIP-seq) for extracting information from ChIP-seq aligned-read data in order to identify regions bound by transcription factors. PICS identifies enriched regions by modeling local concentrations of directional reads, and uses DNA fragment length prior information to discriminate closely adjacent binding events via a Bayesian hierarchical t-mixture model. Its per-event fragment length estimates also allow it to remove from analysis regions that have atypical lengths. PICS uses pre-calculated, whole-genome read mappability profiles and a truncated t-distribution to adjust binding event models for reads that are missing due to local genome repetitiveness. It estimates uncertainties in model parameters that can be used to define confidence regions on binding event locations and to filter estimates. Finally, PICS calculates a per-event enrichment score relative to a control sample, and can use a control sample to estimate a false discovery rate. We compared PICS to the alternative methods MACS, QuEST, and CisGenome, using published GABP and FOXA1 data sets from human cell lines, and found that PICS’ predicted binding sites were more consistent with computationally predicted binding motifs.	en_US
dc.description.reviewstatus	Reviewed	en_US
dc.description.scholarlevel	Faculty	en_US
dc.description.sponsorship	This research is supported by an NSERC Discovery Grant (RG and XZ).	en_US
dc.identifier.citation	Zhang, X., Robertson, G., Krzywinski, M., Ning, K., Droit, A., Jones, S., & Gottardo, R. (2011). PICS: Probabilistic inference for ChIP-seq. Biometrics, 67(1):151-63. https://doi.org/10.1111/j.1541-0420.2010.01441.x	en_US
dc.identifier.uri	https://doi.org/10.1111/j.1541-0420.2010.01441.x
dc.identifier.uri	http://hdl.handle.net/1828/13277
dc.language.iso	en	en_US
dc.publisher	Biometrics	en_US
dc.subject	Bayesian hierarchical model
dc.subject	ChIP-seq
dc.subject	EM algorithm
dc.subject	Mappability
dc.subject	Missing values
dc.subject	Mixture model
dc.subject	Transcription factor
dc.subject	Truncated data
dc.subject	t-distribution
dc.subject.department	Department of Mathematics and Statistics
dc.title	PICS: Probabilistic Inference for ChIP-seq	en_US
dc.type	Article	en_US

Files

Original bundle

Name: zhang_xuekui_biometrics_2011.pdf
Size: 392.71 KB
Format: Adobe Portable Document Format

License bundle

Name: license.txt
Size: 2 KB
Format: Item-specific license agreed upon to submission

Collections

Faculty and Staff Publications