4.5 Article

Allergenicity prediction by artificial neural networks

Journal

JOURNAL OF CHEMOMETRICS
Volume 28, Issue 4, Pages 282-286

Publisher

WILEY
DOI: 10.1002/cem.2597

Keywords

amino acid descriptors; ACC transformation; descriptor fingerprint; artificial neural networks

Funding

  1. Bulgarian Science Fund [DCVNP 02-1/2009, IO1/7]

Ask authors/readers for more resources

Two artificial neural network (ANN)-based algorithms for allergenicity prediction were developed and tested. The first algorithm consists of three steps. Initially, the protein sequences are described by amino acid principal properties as hydrophobicity, size, relative abundance, helix and -strand forming propensities. Second, the generated strings of different length are converted into vectors with equal length by auto-covariance and cross-covariance (ACC). At the third step, ANN is applied to discriminate between allergens and non-allergens. The second algorithm consists of four steps. It has one additional step before the final ANN modeling. At this step, the ACC vectors are transformed into binary fingerprints. The algorithms were applied to a set of 2427 known allergens and 2427 non-allergens and compared in terms of predictive ability. The three-step algorithm performed better than the four-step one identifying 82% versus 76% of the allergens and non-allergens. The ANN algorithms presented here are universal. They could be applied for any classification problem in computational biology. The amino acid descriptors are able to capture the main structural and physicochemical properties of amino acids building the proteins. The ACC transformation overcomes the main problem in the alignment-based comparative studies arising from the different length of the aligned protein sequences. The uniform-length vectors allow similarity search and classification by different computational methods. Optionally, the ACC vectors could be converted into binary descriptor fingerprints. The comparative study on several Web tools for allergenicity prediction showed that the usage of more than one predictor is reasonable and recommendable because some of the tools recognize better the allergens, some of themthe non-allergens, but none of themboth. Copyright (c) 2014 John Wiley & Sons, Ltd. Two artificial neural network (ANN)-based algorithms for allergenicity prediction were developed and tested. The first algorithm consists of three steps, and the second four steps. The algorithms were applied to a set of 2427 known allergens and 2427 non-allergens and compared in terms of predictive ability. The three-step algorithm identified 82% of the allergens and non-allergens, and the four-step algorithm 76% of them. The ANN algorithms presented here are universal. They could be applied for any classification problem in computational biology.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available