Paper ID:
Title: The use of Gene Ontology terms and KEGG pathways for analysis and prediction of oncogenes
First author: Xing, ZH; Chu, C; Chen, L; Kong, XY
Corresponding author: Kong, XY (reprint author), Shanghai Inst Biol Sci, Inst Hlth Sci, Shanghai, Peoples R China.
External author affiliations:
Publication year: 2016
Volume: 1860
Issue: 11
Pages: 2725-2734
Abstract: Background: Oncogenes are genes that have the potential to cause cancer. Most normal cells undergo programmed cell death (apoptosis), but activated oncogenes can help cells evade apoptosis and survive. Studying oncogenes therefore helps in understanding the formation and development of various types of cancer. Methods: In this study, we proposed a computational method, called OPM, for investigating oncogenes from the perspective of Gene Ontology (GO) and biological pathways. All investigated genes, including validated oncogenes retrieved from public databases and other genes that have not been reported as oncogenes thus far, were encoded into numeric vectors according to the enrichment theory of GO terms and KEGG pathways. Popular feature selection methods (minimum redundancy maximum relevance and incremental feature selection) and an advanced machine learning algorithm, random forest, were used to analyze the numeric vectors and extract key GO terms and KEGG pathways. Results: GO terms and KEGG pathways were discussed in terms of their relevance to oncogenes. Some important GO terms and KEGG pathways were extracted using the feature selection methods and were confirmed to be highly related to oncogenes. The importance of these terms and pathways in predicting oncogenes was further demonstrated by using them to find new putative oncogenes. Conclusions: This study investigated oncogenes based on GO terms and KEGG pathways. Some important GO terms and KEGG pathways were confirmed to be highly related to oncogenes. We hope that these GO terms and KEGG pathways can provide new insight for the study of oncogenes, particularly for building more effective prediction models to identify novel oncogenes. The program is available upon request. General significance: We hope that the findings reported in this study may provide new insight for the investigation of oncogenes.
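The abstract says genes were encoded into numeric vectors by GO term and KEGG pathway enrichment, but this record does not give the formula. One common formulation, assumed here, scores a gene against a term as the negative base-10 logarithm of a hypergeometric p-value for the overlap between a gene set associated with that gene (for example its interaction neighbors) and the genes annotated to the term. The sketch below illustrates that assumption only; the function names, the choice of neighbor set, and all identifiers are hypothetical and are not taken from the authors' OPM program.

```python
from math import comb, log10


def hypergeom_tail(N, M, n, m):
    """P(X >= m) when n genes are drawn from N, of which M carry the annotation."""
    total = comb(N, n)
    upper = min(M, n)
    hits = sum(comb(M, k) * comb(N - M, n - k) for k in range(m, upper + 1))
    return hits / total


def enrichment_score(neighbors, term_genes, background):
    """Assumed score: -log10 of the hypergeometric tail p-value of the overlap."""
    background = set(background)
    neighbors = set(neighbors) & background
    term_genes = set(term_genes) & background
    overlap = len(neighbors & term_genes)
    p = hypergeom_tail(len(background), len(term_genes), len(neighbors), overlap)
    # max() also normalizes the -0.0 produced when p == 1.0.
    return max(0.0, -log10(p))


def encode_gene(neighbors, term_sets, background):
    """Encode one gene as a vector with one enrichment score per term.

    term_sets maps a GO term or KEGG pathway identifier to its annotated genes;
    scores are emitted in sorted identifier order so vectors are comparable.
    """
    return [
        enrichment_score(neighbors, term_sets[term], background)
        for term in sorted(term_sets)
    ]
```

Vectors built this way could then be passed to a feature ranking step and a classifier, as the abstract describes. For very large gene sets the p-value may underflow to zero in floating point, so a production version would need to work in log space.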
Journal: BIOCHIMICA ET BIOPHYSICA ACTA-GENERAL SUBJECTS
Full text:
Full-text link:
Other notes:
Subject: Biochemistry & Molecular Biology; Biophysics
Impact factor: 4.702
First author's department:
Source:
Paper type: Article
Contributing authors:
 
© 2014 Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences. All rights reserved.