Academia.eduAcademia.edu

Outline

An Entropy Approach for Choosing Gene Expression Cutoff

2022, bioRxiv (Cold Spring Harbor Laboratory)

https://doi.org/10.1101/2022.05.05.490711

Abstract

Annotating cell types using single-cell transcriptome data usually requires binarizing the expression data to distinguish between the background noise vs. real expression or low expression vs. high expression cases. A common approach is choosing a "reasonable" cutoff value, but it remains unclear how to choose it. In this work, we describe a simple yet effective approach for finding this threshold value.

References (3)

  1. J. Ross Quinlan. Induction of decision trees. Machine learning, 1(1):81-106, 1986.
  2. Claude Elwood Shannon. A mathematical theory of communication. The Bell system technical journal, 27(3):379-423, 1948.
  3. Lei Zhang, Xin Yu, Liangtao Zheng, Yuanyuan Zhang, Yansen Li, Qiao Fang, Ranran Gao, Boxi Kang, Qiming Zhang, Julie Y Huang, et al. Lineage tracking reveals dynamic relationships of t cells in colorectal cancer. Nature, 564(7735):268-272, 2018.