4.5 Article

Probabilistic hazard assessment for skin sensitization potency by dose-response modeling using feature elimination instead of quantitative structure-activity relationships

Journal

JOURNAL OF APPLIED TOXICOLOGY
Volume 35, Issue 11, Pages 1361-1371

Publisher

WILEY
DOI: 10.1002/jat.3172

Keywords

LLNA; in vitro; skin sensitization; integrated testing strategy; machine learning; hidden Markov model; QSAR; feature selection

Categories

Funding

  1. Conselho Nacional de Desenvolvimento Cientifico e Tecnologico - CNPq [238194/2012-4]
  2. NIEHS [PA-08-190, T32 ES007141]

Ask authors/readers for more resources

Supervised learning methods promise to improve integrated testing strategies (ITS), but must be adjusted to handle high dimensionality and dose-response data. ITS approaches are currently fueled by the increasing mechanistic understanding of adverse outcome pathways (AOP) and the development of tests reflecting these mechanisms. Simple approaches to combine skin sensitization data sets, such as weight of evidence, fail due to problems in information redundancy and high dimensionality. The problem is further amplified when potency information (dose/response) of hazards would be estimated. Skin sensitization currently serves as the foster child for AOP and ITS development, as legislative pressures combined with a very good mechanistic understanding of contact dermatitis have led to test development and relatively large high-quality data sets. We curated such a data set and combined a recursive variable selection algorithm to evaluate the information available through in silico, in chemico and in vitro assays. Chemical similarity alone could not cluster chemicals' potency, and in vitro models consistently ranked high in recursive feature elimination. This allows reducing the number of tests included in an ITS. Next, we analyzed with a hidden Markov model that takes advantage of an intrinsic inter-relationship among the local lymph node assay classes, i.e. the monotonous connection between local lymph node assay and dose. The dose-informed random forest/hidden Markov model was superior to the dose-naive random forest model on all data sets. Although balanced accuracy improvement may seem small, this obscures the actual improvement in misclassifications as the dose-informed hidden Markov model strongly reduced false-negatives (i.e. extreme sensitizers as non-sensitizer) on all data sets. Copyright (c) 2015 John Wiley & Sons, Ltd. Supervised learning methods promise to improve integrated testing strategies. We curated a skin sensitization data set and evaluated the information from in silico, in chemico and in vitro assays by recursive variable selection. Cell assays consistently ranked high in the feature elimination. A hidden Markov model taking advantage of monotonicity between local lymph node assay and dose is shown to outperform dose-naive models on all data sets and strongly reduce extreme sensitizers classified as non-sensitizers.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available