TY - JOUR
T1 - Deconvolving the contributions of cell-type heterogeneity on cortical gene expression
AU - Patrick, Ellis
AU - Taga, Mariko
AU - Ergun, Ayla
AU - Ng, Bernard
AU - Casazza, William
AU - Cimpean, Maria
AU - Yung, Christina
AU - Schneider, Julie A.
AU - Bennett, David A.
AU - Gaiteri, Chris
AU - de Jager, Philip L.
AU - Bradshaw, Elizabeth M.
AU - Mostafavi, Sara
N1 - Funding Information:
Funding: This work has been partly supported by National Institute of Health (NIH) grants R01AG15819 (DB), R01AG17917 (DB), U01AG61356 (DL, DB), R01NS089674 (EB), R01AG043617 (EB), R01AG057911 (CG) and R01AG061798 (CG), the Natural Sciences and Engineering Research Council of Canada (NSERC) Discovery Grant (SM) and an Australian Research Council Discovery Early Career Researcher Award (DE200100944) funded by the Australian Government (EP). The funders had no role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Publisher Copyright:
Copyright: © 2020 Patrick et al.
PY - 2020/8
Y1 - 2020/8
N2 - Complexity of cell-type composition has created much skepticism surrounding the interpretation of bulk tissue transcriptomic studies. Recent studies have shown that deconvolution algorithms can be applied to computationally estimate cell-type proportions from gene expression data of bulk blood samples, but their performance when applied to brain tissue is unclear. Here, we have generated an immunohistochemistry (IHC) dataset for five major cell-types from brain tissue of 70 individuals, who also have bulk cortical gene expression data. With the IHC data as the benchmark, this resource enables quantitative assessment of deconvolution algorithms for brain tissue. We apply existing deconvolution algorithms to brain tissue by using marker sets derived from human brain single cell and cell-sorted RNA-seq data. We show that these algorithms can indeed produce informative estimates of constituent cell-type proportions. In fact, neuronal subpopulations can also be estimated from bulk brain tissue samples. Further, we show that including the cell-type proportion estimates as confounding factors is important for reducing false associations between Alzheimer’s disease phenotypes and gene expression. Lastly, we demonstrate that using more accurate marker sets can substantially improve statistical power in detecting cell-type specific expression quantitative trait loci (eQTLs).
AB - Complexity of cell-type composition has created much skepticism surrounding the interpretation of bulk tissue transcriptomic studies. Recent studies have shown that deconvolution algorithms can be applied to computationally estimate cell-type proportions from gene expression data of bulk blood samples, but their performance when applied to brain tissue is unclear. Here, we have generated an immunohistochemistry (IHC) dataset for five major cell-types from brain tissue of 70 individuals, who also have bulk cortical gene expression data. With the IHC data as the benchmark, this resource enables quantitative assessment of deconvolution algorithms for brain tissue. We apply existing deconvolution algorithms to brain tissue by using marker sets derived from human brain single cell and cell-sorted RNA-seq data. We show that these algorithms can indeed produce informative estimates of constituent cell-type proportions. In fact, neuronal subpopulations can also be estimated from bulk brain tissue samples. Further, we show that including the cell-type proportion estimates as confounding factors is important for reducing false associations between Alzheimer’s disease phenotypes and gene expression. Lastly, we demonstrate that using more accurate marker sets can substantially improve statistical power in detecting cell-type specific expression quantitative trait loci (eQTLs).
UR - http://www.scopus.com/inward/record.url?scp=85090078212&partnerID=8YFLogxK
U2 - 10.1371/journal.pcbi.1008120
DO - 10.1371/journal.pcbi.1008120
M3 - Article
C2 - 32804935
AN - SCOPUS:85090078212
SN - 1553-734X
VL - 16
JO - PLoS Computational Biology
JF - PLoS Computational Biology
IS - 8
M1 - e1008120
ER -