4.6 Article

A guide to pre-processing high-throughput animal tracking data

期刊

JOURNAL OF ANIMAL ECOLOGY
卷 91, 期 2, 页码 287-307

出版社

WILEY
DOI: 10.1111/1365-2656.13610

关键词

ATLAS tracking; atlastools; big data; biotelemetry; data cleaning; high-throughput movement ecology; residence patch; reverse GPS

资金

  1. Minerva Foundation
  2. Israel Science Foundation [ISF ISF--965/15]
  3. Dutch Research Council [VI.Veni.192.051]

向作者/读者索取更多资源

Cleaning modern, high-throughput animal tracking data is crucial to reduce location errors and improve data quality for subsequent analyses. Developing automated pre-processing pipelines that balance ease of use with computational efficiency is essential for handling large datasets and enhancing reproducibility in movement ecology research. The use of standardized methods and tools like the atlastools R package can lead to better inferences and exploration of animal space use patterns.
1. Modern, high-throughput animal tracking increasingly yields 'big data' at very fine temporal scales. At these scales, location error can exceed the animal's step size, leading to misestimation of behaviours inferred from movement. 'Cleaning' the data to reduce location errors is one of the main ways to deal with position uncertainty. Although data cleaning is widely recommended, inclusive, uniform guidance on this crucial step, and on how to organise the cleaning of massive datasets, is relatively scarce. 2. A pipeline for cleaning massive high-throughput datasets must balance ease of use and computationally efficiency, in which location errors are rejected while preserving valid animal movements. Another useful feature of a pre-processing pipeline is efficiently segmenting and clustering location data for statistical methods while also being scalable to large datasets and robust to imperfect sampling. Manual methods being prohibitively time-consuming, and to boost reproducibility, pre-processing pipelines must be automated. 3. We provide guidance on building pipelines for pre-processing high-throughput animal tracking data to prepare it for subsequent analyses. We apply our proposed pipeline to simulated movement data with location errors, and also show how large volumes of cleaned data can be transformed into biologically meaningful 'residence patches', for exploratory inference on animal space use. We use tracking data from the Wadden Sea ATLAS system (WATLAS) to show how preprocessing improves its quality, and to verify the usefulness of the residence patch method. Finally, with tracks from Egyptian fruit bats Rousettus aegyptiacus, we demonstrate the pre-processing pipeline and residence patch method in a fully worked out example. 4. To help with fast implementation of standardised methods, we developed the R package atlastools, which we also introduce here. Our pre-processing pipeline and atlastools can be used with any high-throughput animal movement data in which the high data-volume combined with knowledge of the tracked individuals' movement capacity can be used to reduce location errors. atlastools is easy to use for beginners while providing a template for further development. The common use of simple yet robust pre-processing steps promotes standardised methods in the field of movement ecology and leads to better inferences from data.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据