Identification of Antioxidant Proteins With Deep Learning From Sequence Information
Antioxidant proteins have been found closely linked to disease control for its ability to eliminate excess free radicals. Because of its medicinal value, the study of identifying antioxidant proteins is on the upsurge. Many machine-learning classifiers have performed poorly owing to the nonlinear an...
Saved in:
Main Authors: | , , , , , |
---|---|
Format: | Book |
Published: |
Frontiers Media S.A.,
2018-09-01T00:00:00Z.
|
Subjects: | |
Online Access: | Connect to this object online. |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
MARC
LEADER | 00000 am a22000003u 4500 | ||
---|---|---|---|
001 | doaj_8c5aaf98e91b419e98c2f3d30cfb3916 | ||
042 | |a dc | ||
100 | 1 | 0 | |a Lifen Shao |e author |
700 | 1 | 0 | |a Hui Gao |e author |
700 | 1 | 0 | |a Zhen Liu |e author |
700 | 1 | 0 | |a Juan Feng |e author |
700 | 1 | 0 | |a Lixia Tang |e author |
700 | 1 | 0 | |a Hao Lin |e author |
245 | 0 | 0 | |a Identification of Antioxidant Proteins With Deep Learning From Sequence Information |
260 | |b Frontiers Media S.A., |c 2018-09-01T00:00:00Z. | ||
500 | |a 1663-9812 | ||
500 | |a 10.3389/fphar.2018.01036 | ||
520 | |a Antioxidant proteins have been found closely linked to disease control for its ability to eliminate excess free radicals. Because of its medicinal value, the study of identifying antioxidant proteins is on the upsurge. Many machine-learning classifiers have performed poorly owing to the nonlinear and unbalanced nature of biological data. Recently, deep learning techniques showed advantages over many state-of-the-art machine learning methods in various fields. In this study, a deep learning based classifier was proposed to identify antioxidant proteins based on mixed g-gap dipeptide composition feature vector. The classifier employed deep autoencoder to extract nonlinear representation from raw input. The t-Distributed Stochastic Neighbor Embedding (t-SNE) was used for dimensionality reduction. Support vector machine was finally performed for classification. The classifier achieved F1 score of 0.8842 and MCC of 0.7409 in 10-fold cross validation. Experimental results show that our proposed method outperformed the traditional machine learning methods and could be a promising tool for antioxidant protein identification. For the convenience of others' scientific research, we have developed a user-friendly web server called IDAod for antioxidant protein identification, which can be accessed freely at http://bigroup.uestc.edu.cn/IDAod/. | ||
546 | |a EN | ||
690 | |a antioxidant proteins | ||
690 | |a deep learning | ||
690 | |a g-gap dipeptide | ||
690 | |a feature selection | ||
690 | |a webserver | ||
690 | |a Therapeutics. Pharmacology | ||
690 | |a RM1-950 | ||
655 | 7 | |a article |2 local | |
786 | 0 | |n Frontiers in Pharmacology, Vol 9 (2018) | |
787 | 0 | |n https://www.frontiersin.org/article/10.3389/fphar.2018.01036/full | |
787 | 0 | |n https://doaj.org/toc/1663-9812 | |
856 | 4 | 1 | |u https://doaj.org/article/8c5aaf98e91b419e98c2f3d30cfb3916 |z Connect to this object online. |