Identification of Antioxidant Proteins With Deep Learning From Sequence Information

Antioxidant proteins have been found closely linked to disease control for its ability to eliminate excess free radicals. Because of its medicinal value, the study of identifying antioxidant proteins is on the upsurge. Many machine-learning classifiers have performed poorly owing to the nonlinear an...

Full description

Saved in:
Bibliographic Details
Main Authors: Lifen Shao (Author), Hui Gao (Author), Zhen Liu (Author), Juan Feng (Author), Lixia Tang (Author), Hao Lin (Author)
Format: Book
Published: Frontiers Media S.A., 2018-09-01T00:00:00Z.
Subjects:
Online Access:Connect to this object online.
Tags: Add Tag
No Tags, Be the first to tag this record!

MARC

LEADER 00000 am a22000003u 4500
001 doaj_8c5aaf98e91b419e98c2f3d30cfb3916
042 |a dc 
100 1 0 |a Lifen Shao  |e author 
700 1 0 |a Hui Gao  |e author 
700 1 0 |a Zhen Liu  |e author 
700 1 0 |a Juan Feng  |e author 
700 1 0 |a Lixia Tang  |e author 
700 1 0 |a Hao Lin  |e author 
245 0 0 |a Identification of Antioxidant Proteins With Deep Learning From Sequence Information 
260 |b Frontiers Media S.A.,   |c 2018-09-01T00:00:00Z. 
500 |a 1663-9812 
500 |a 10.3389/fphar.2018.01036 
520 |a Antioxidant proteins have been found closely linked to disease control for its ability to eliminate excess free radicals. Because of its medicinal value, the study of identifying antioxidant proteins is on the upsurge. Many machine-learning classifiers have performed poorly owing to the nonlinear and unbalanced nature of biological data. Recently, deep learning techniques showed advantages over many state-of-the-art machine learning methods in various fields. In this study, a deep learning based classifier was proposed to identify antioxidant proteins based on mixed g-gap dipeptide composition feature vector. The classifier employed deep autoencoder to extract nonlinear representation from raw input. The t-Distributed Stochastic Neighbor Embedding (t-SNE) was used for dimensionality reduction. Support vector machine was finally performed for classification. The classifier achieved F1 score of 0.8842 and MCC of 0.7409 in 10-fold cross validation. Experimental results show that our proposed method outperformed the traditional machine learning methods and could be a promising tool for antioxidant protein identification. For the convenience of others' scientific research, we have developed a user-friendly web server called IDAod for antioxidant protein identification, which can be accessed freely at http://bigroup.uestc.edu.cn/IDAod/. 
546 |a EN 
690 |a antioxidant proteins 
690 |a deep learning 
690 |a g-gap dipeptide 
690 |a feature selection 
690 |a webserver 
690 |a Therapeutics. Pharmacology 
690 |a RM1-950 
655 7 |a article  |2 local 
786 0 |n Frontiers in Pharmacology, Vol 9 (2018) 
787 0 |n https://www.frontiersin.org/article/10.3389/fphar.2018.01036/full 
787 0 |n https://doaj.org/toc/1663-9812 
856 4 1 |u https://doaj.org/article/8c5aaf98e91b419e98c2f3d30cfb3916  |z Connect to this object online.