Protein Residue Contact Prediction Based on Deep Learning and Massive Statistical Features from Multi-Sequence Alignment

2022-07-06  Classification number: Q518.2; TP18

【Authors】Huiling Zhang, Min Hao, Hao Wu, Hing-Fung Ting, Yihong Tang, Wenhui Xi, Yanjie Wei
【Affiliations】Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences; University of Chinese Academy of Sciences; College of Electronic and Information Engineering, Southwest University; School of Software Engineering, University of Science and Technology of China; Department of Computer Science, The University of Hong Kong; School of Computer Science, Beijing University of Posts and Telecommunications
【Abstract】Sequence-based protein tertiary structure prediction is of fundamental importance because the function of a protein ultimately depends on its 3D structure. An accurate residue-residue contact map is one of the essential elements of current ab initio protocols for 3D structure prediction. Recently, with the combination of deep learning and direct coupling techniques, residue contact prediction has made significant progress. However, many current Deep-Learning (DL)-based prediction methods are time-consuming, mainly because they rely on different categories of data types and third-party programs. In this research, we transformed the complex biological problem into a pure computational problem through statistics and artificial intelligence. Accordingly, we proposed a feature extraction method that obtains various categories of statistical information from the multi-sequence alignment alone, and then trained a DL model for residue-residue contact prediction on this massive statistical information. The proposed method is robust across different test sets, shows high reliability in its model confidence score, achieves high computational efficiency, and attains prediction precision comparable to that of DL methods that rely on multi-source inputs.
【Keywords】multi-sequence alignment; residue-residue contact prediction; feature extraction; statistical information; Deep Learning (DL); high computational efficiency
【Funding】Supported by the Strategic Priority CAS Project (No. XDB38050100); the National Key Research and Development Program of China (No. 2018YFB0204403); the National Natural Science Foundation of China (No. U1813203); the Shenzhen Basic Research Fund (Nos. RCYX2020071411473419, JCYJ20200109114818703, and JSGG20201102163800001); CAS Key Lab (No. 2011DP173015); Hong Kong Research Grant Council (No. GRF-17208019); the Outstanding Youth Innovation Fund (Doctoral Students) of CAS-SIAT (No. Y9G054)
【Journal】Tsinghua Science and Technology
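
The abstract describes extracting statistical information from the multi-sequence alignment alone and feeding it to a DL model. As a rough illustration of what one such statistic might look like, the sketch below computes pairwise mutual information between alignment columns. This is an assumption chosen for illustration, not the feature set used in the paper, which the abstract does not specify; the function names and the toy alignment are hypothetical.

```python
from collections import Counter
from math import log


def column_frequencies(msa, col):
    """Relative frequency of each symbol in one alignment column."""
    counts = Counter(seq[col] for seq in msa)
    n = len(msa)
    return {symbol: count / n for symbol, count in counts.items()}


def mutual_information(msa, i, j):
    """Mutual information (in nats) between alignment columns i and j."""
    n = len(msa)
    freq_i = column_frequencies(msa, i)
    freq_j = column_frequencies(msa, j)
    pair_counts = Counter((seq[i], seq[j]) for seq in msa)
    mi = 0.0
    for (a, b), count in pair_counts.items():
        p_ab = count / n
        mi += p_ab * log(p_ab / (freq_i[a] * freq_j[b]))
    return mi


def mi_matrix(msa):
    """Symmetric L x L matrix of column-pair mutual information."""
    length = len(msa[0])
    matrix = [[0.0] * length for _ in range(length)]
    for i in range(length):
        for j in range(i + 1, length):
            matrix[i][j] = matrix[j][i] = mutual_information(msa, i, j)
    return matrix


# Hypothetical toy alignment: columns 0 and 2 co-vary, column 1 is conserved.
toy_msa = ["ACDA", "ACDC", "GCEA", "GCEC"]
features = mi_matrix(toy_msa)
```

In a pipeline of the kind the abstract outlines, an L x L matrix like `features` would be one of many input channels to the contact-prediction network; co-varying column pairs (here columns 0 and 2) receive high scores, while conserved or independent columns score zero.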