4.6 Article

Machine learning can differentiate venom toxins from other proteins having non-toxic physiological functions

Journal

PEERJ COMPUTER SCIENCE
Volume -, Issue -, Pages -

Publisher

PEERJ INC
DOI: 10.7717/peerj-cs.90

Keywords

Protein sequences; Biological function; Animal venom; Automatic annotation; Functional prediction

Funding

  1. United Kingdom Medical Research Council (MRC) [G82144A]
  2. Universidade de Sao Paulo (USP) [13.1.1502.9.8]

Ask authors/readers for more resources

Ascribing function to sequence in the absence of biological data is an ongoing challenge in bioinformatics. Differentiating the toxins of venomous animals from homologues having other physiological functions is particularly problematic as there are no universally accepted methods by which to attribute toxin function using sequence data alone. Bioinformatics tools that do exist are difficult to implement for researchers with little bioinformatics training. Here we announce a machine learning tool called 'ToxClassifier' that enables simple and consistent discrimination of toxins from non-toxin sequences with >99% accuracy and compare it to commonly used toxin annotation methods. 'ToxClassifer' also reports the best-hit annotation allowing placement of a toxin into the most appropriate toxin protein family, or relates it to a non-toxic protein having the closest homology, giving enhanced curation of existing biological databases and new venomics projects. 'ToxClassifier' is available for free, either to download (https://github.com/rgacesa/ToxClassifier) or to use on a web-based server (http://bioserv7.bioinfo.pbf.hr/ToxClassifier/).

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available