주메뉴 바로가기 본문내용 바로가기 사이트정보 바로가기

논문특허 4차년도

제목 - 설명
  • [논문] 바이오정보: Integrative Analysis of ATAC-Seq and RNA-Seq through Machine Learning Identifies 10 Signature Genes for Breast Cancer Intrinsic Subtypes

    • 등록일
      2024.10.31
    • 조회수
      12

•연구자: 의생명시스템학부 이제근

 

•발표일: 2024.10.7

 

•DOI: https://doi.org/10.3390/biology13100799

 

•Jeong-Woon Park and Je-Keun Rhee, Biology (Q1);  Volume 13, Issue 10 (2024)

 

•Abstract

Breast cancer is a heterogeneous disease composed of various biologically distinct subtypes, each characterized by unique molecular features. Its formation and progression involve a complex, multistep process that includes the accumulation of numerous genetic and epigenetic alterations. Although integrating RNA-seq transcriptome data with ATAC-seq epigenetic information provides a more comprehensive understanding of gene regulation and its impact across different conditions, no classification model has yet been developed for breast cancer intrinsic subtypes based on such integrative analyses. In this study, we employed machine learning algorithms to predict intrinsic subtypes through the integrative analysis of ATAC-seq and RNA-seq data. We identified 10 signature genes (CDH3, ERBB2, TYMS, GREB1, OSR1, MYBL2, FAM83D, ESR1, FOXC1, and NAT1) using recursive feature elimination with cross-validation (RFECV) and a support vector machine (SVM) based on SHAP (SHapley Additive exPlanations) feature importance. Furthermore, we found that these genes were primarily associated with immune responses, hormone signaling, cancer progression, and cellular proliferation.

상단으로 이동