Welcome to Journal of Beijing Institute of Technology
Volume 31Issue 3
Jun. 2022
Turn off MathJax
Article Contents
Xiumin Shi, Xiyuan Wu, Hengyu Qin. A Feature Extraction Method for scRNA-seq Processing and Its Application on COVID-19 Data Analysis[J]. JOURNAL OF BEIJING INSTITUTE OF TECHNOLOGY, 2022, 31(3): 285-292. doi: 10.15918/j.jbit1004-0579.2022.052
Citation: Xiumin Shi, Xiyuan Wu, Hengyu Qin. A Feature Extraction Method for scRNA-seq Processing and Its Application on COVID-19 Data Analysis[J].JOURNAL OF BEIJING INSTITUTE OF TECHNOLOGY, 2022, 31(3): 285-292.doi:10.15918/j.jbit1004-0579.2022.052

A Feature Extraction Method for scRNA-seq Processing and Its Application on COVID-19 Data Analysis

doi:10.15918/j.jbit1004-0579.2022.052
More Information
  • Author Bio:

    Xiumin Shireceived his Ph.D. degree from Beijing Institute of Technology, China in 2012. From 2015 to 2016, he was a visiting scholar at Stanford University, USA. His current research interests include computational neuroscience, machine learning, computer-aided drug design and bioinformatics, etc

    Xiyuan Wureceived the B.S. degree from Communication University of China, China, in 2019. He is currently pursuing the master’s degree with the School of Information and Electronics, Beijing Institute of Technology, China. His research interests include machine learning, natural language processing and bioinformatics

    Hengyu Qinreceived the B.S. degree from Communication University of China, China, in 2021. He is currently pursuing the master’s degree with the School of Information and Electronics, Beijing Institute of Technology, China. His research interests include machine learning and single-cell omics

  • Corresponding author:sxm@bit.edu.cn
  • Received Date:2022-04-29
  • Rev Recd Date:2022-05-06
  • Accepted Date:2022-05-28
  • Publish Date:2022-06-28
  • Single-cell RNA-sequencing (scRNA-seq) is a rapidly increasing research area in biomedical signal processing. However, the high complexity of single-cell data makes efficient and accurate analysis difficult. To improve the performance of single-cell RNA data processing, two single-cell features calculation method and corresponding dual-input neural network structures are proposed. In this feature extraction and fusion scheme, the features at the cluster level are extracted by hierarchical clustering and differential gene analysis, and the features at the cell level are extracted by the calculation of gene frequency and cross cell frequency. Our experiments on COVID-19 data demonstrate that the combined use of these two feature achieves great results and high robustness for classification tasks.
  • loading
  • [1]
    A. Cohen, Biomedical signal processing: Volume 2: Compression and automatic recognition,Boca Raton, FL, NJ, USA: CRC, pp. 10-16, 2019.
    [2]
    S. Krishnan and Y. Athavale, “Trends in biomedical signal feature extraction, ” Biomedical Signal Processing and Control, vol. 43, pp. 41-63, 2018.
    [3]
    S. S. Potter, “Single-cell RNA sequencing for the study of development, physiology and disease, ” Nature Reviews Nephrology, vol. 14, no. 8, pp. 479-492, 2018.
    [4]
    E. Papalexi and R. Satija, “Single-cell RNA sequencing to explore immune cell heterogeneity, ” Nature Reviews Immunology, vol. 18, no. 1, pp. 35-45, 2018.
    [5]
    J. Yang, D. Zhang, A. F. Frangi, and J. Yang, “Two-dimensional PCA: A new approach to appearance-based face representation and recognition, ” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 26, no. 1, pp. 131-137, 2004.
    [6]
    B. A. Draper, K. Baek, M. S. Bartlett, and J. R. Beveridge, “Recognizing faces with PCA and ICA, ” Computer Vision and Image Understanding, vol. 91, no. 1-2, pp. 115-137, 2003.
    [7]
    L. Maaten and G. Hinton, “Visualizing data using t-SNE, ” Journal of Machine Learning Research, vol. 9, no. 11, pp. 2579-2625, 2008.
    [8]
    L. McInnes, J. Healy, and J. Melville, “UMAP: Uniform manifold approximation and projection for dimension reduction, ” preprint arXiv: 1802.03426, 2018.
    [9]
    G. Pasquini, J. E. R. Arias, P. Schäfer, and V. Busskamp, “Automated methods for cell type annotation on scRNA-seq data, ” Computational and Structural Biotechnology Journal, vol. 19, pp. 961-969, 2021.
    [10]
    J. Feng, X. Niu, J. Zhang, and J. Wang, “Gene selection and classification of scRNA-seq data combining information gain ratio and genetic algorithm with dynamic crossover, ” Wireless Communications and Mobile Computing, vol. 2022, pp. 1-16, 2022.
    [11]
    L. Li, H. Tang, R. Xia, H. Dai, R. Liu , and L. Chen, “Intrinsic entropy model for feature selection of scRNA-seq data, ” Journal of Molecular Cell Biology, vol. 14, no. 2 , pp. 1-21, 2022.
    [12]
    M. Flores, Z. Liu, T. Zhang, M. Hasib, Y. Chiu, Z. Ye, K. Paniagua, S. Jo, J. Zhang, and S. Gao, “Deep learning tackles single-cell analysis—A survey of deep learning for scRNA-seq analysis, ” Briefings in Bioinformatics, vol. 23, no. 1, pp. 1-31, 2022.
    [13]
    H. Wang and X. Ma, “Learning deep features and topological structure of cells for clustering of scRNA-sequencing data, ” Briefings in Bioinformatics, vol. 23, no. 3, pp. 1-14, 2022.
    [14]
    D. Buterez, I. Bica, I. Tariq, H. Andrés-Terré, Helena, and P. Liò, “CellVGAE: An unsupervised scRNA-seq analysis workflow with graph attention networks, ” Bioinformatics, vol. 38, no. 5, pp. 1277-1286, 2022.
    [15]
    M. Ciortan and M. Defrance, “GNN-based embedding for clustering scRNA-seq data, ” Bioinformatics, vol. 38, no. 4, pp. 1037-1044, 2022.
    [16]
    J. Hu, X. Li, and G. Hu, “Iterative transfer learning with neural network for clustering and cell type classification in single-cell RNA-seq analysis, ” Nature Machine Intelligence, vol. 2, no. 10, pp. 607-618, 2020.
    [17]
    F. Wagner and I. Yanai. “Moana: A robust and scalable cell type classification framework for single-cell RNA-Seq data, ” preprint BioRxiv, 2018.
    [18]
    X. Shao, H. Yang, X. Zhuang, J. Liao, P. Yang, J. Cheng, X. Lu, H. Chen, and X. Fan, “scDeepSort: A pre-trained cell-type annotation method for single-cell transcriptomics using deep learning with a weighted graph neural network, ” Nucleic Acids Research, vol. 49, no. 21, pp. 1-13, 2021.
    [19]
    Y. Yang, J. Song, and Z. Huang, “Multi-feature fusion via hierarchical regression for multimedia analysis, ” IEEE Transactions on Multimedia, vol. 15, no. 3, pp. 572-581, 2012.
    [20]
    F. Li, Y. Fan, and X. Zhang, “Multi-feature fusion method based on EEG signal and its application in stroke classification, ” Journal of Medical Systems, vol. 44, no. 2, pp. 1-11, 2019.
    [21]
    F. Wolf, P. Angerer, and F. Theis, “SCANPY: Large-scale single-cell gene expression data analysis, ” Genome Biology, vol. 19, no. 1, pp. 1-5, 2018.
    [22]
    A. Lun, K. Bach, and J. Marioni, “Pooling across cells to normalize single-cell RNA sequencing data with many zero counts, ” Genome Biology, vol. 17, no. 1, pp. 1-14, 2016.
    [23]
    F. Murtagh and P. Contreras, “Algorithms for hierarchical clustering: An overview, ” Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, vol. 2, no. 1, pp. 86-97, 2012.
    [24]
    J. Cuzick, “A Wilcoxon-type test for trend, ” Statistics in Medicine, vol. 4, no. 1, pp. 87-90, 1985.
    [25]
    A. Aizawa, “An information-theoretic perspective of tf-idf measures, ” Information Processing & Management, vol. 39, no. 1, pp. 45-65, 2003.
    [26]
    E. Voita, D. Talbot, and F. Moiseev, “Analyzing multi-head self-attention: Specialized heads do the heavy lifting, the rest can be pruned, ” preprint arXiv: 1905.09418, 2019.
    [27]
    R. Dey and F. Salem, “Gate-variants of gated recurrent unit (GRU) neural networks, ” in 2017 IEEE 60th International Midwest Symposium on Circuits and Systems, Boston, MA, USA, pp. 1597-1600, 2017.
    [28]
    E. Ballestar and D. Farber, “Single cell profiling of COVID-19 patients: An international data resource from multiple tissues, ” preprint Medrxiv, 2020.
  • 加载中

Catalog

    通讯作者:陈斌, bchen63@163.com
    • 1.

      沈阳化工大学材料科学与工程学院 沈阳 110142

    1. 本站搜索
    2. 百度学术搜索
    3. 万方数据库搜索
    4. CNKI搜索

    Figures(6)/Tables(6)

    Article Metrics

    Article views (1194) PDF downloads(28) Cited by()
    Proportional views
    Related

    /

    Return
    Return
      Baidu
      map