Project 3598: A. Porto, K. L. Voje. 2020. ML-morph: A fast, accurate and general approach for automated detection and landmarking of biological structures in images. Methods in Ecology and Evolution. (In Press)
Specimen: Pseudanthias (unvouchered)
View: Lateral

Abstract

1. Morphometrics has become an indispensable component of the statistical analysis of size and shape variation in biological structures. Morphometric data has traditionally been gathered through low-throughput manual landmark annotation, which represents a significant bottleneck for morphometric-based phenomics. Here we propose a machine-learning-based high-throughput pipeline to collect high-dimensional morphometric data in two dimensional images of semi rigid biological structures.

2. The proposed framework has four main strengths. First, it allows for dense phenotyping with minimal impact on specimens. Second, it presents landmarking accuracy comparable to manual annotators, when applied to standardized datasets. Third, it performs data collection at speeds several orders of magnitude higher than manual annotators. And finally, it is of general applicability (i.e., not tied to a specific study system).

3. State-of-the-art validation procedures show that the method achieves low error levels when applied to three morphometric datasets of increasing complexity, with error varying from 0.57% to 2.2% of the structure’s length in the automated placement of landmarks. As a benchmark for the speed of the entire automated landmarking pipeline, our framework places 23 landmarks on 13,686 objects (zooids) detected in 1684 pictures of fossil bryozoans in 3.12 minutes using a personal computer.

4. The proposed machine-learning-based phenotyping pipeline can greatly increase the scale, reproducibility and speed of data collection within biological research. To aid the use of the framework, we have developed a file conversion algorithm that can be used to leverage current morphometric datasets for automation, allowing the entire procedure, from model training all the way to prediction, to be performed in a matter of hours.



Read the article »

Project DOI: 10.7934/P3598, http://dx.doi.org/10.7934/P3598
This project contains
  • 861 Media
  • 3 Taxa
  • 3 Specimens
Total size of project's media files: 289.01M

Download Project SDD File
Currently Viewing:
MorphoBank Project 3598
  • Creation Date:
    12 December 2019
  • Publication Date:
    03 February 2020
  • Media downloads: 25

    Authors' Institutions

    • University of Oslo



    Members

    member name taxa specimens media
    Arthur Porto
    Project Administrator
    33861


    Project has no matrices defined.



    Project downloads

    type number of downloads Individual items downloaded (where applicable)
    Total downloads from project171
    Project downloads146
    Media downloads25M684288 (1 download); M684289 (1 download); M684290 (3 downloads); M684292 (1 download); M684293 (2 downloads); M684294 (1 download); M684295 (1 download); M684296 (1 download); M684298 (1 download); M684299 (1 download); M684303 (1 download); M684481 (1 download); M684281 (2 downloads); M684280 (1 download); M684284 (1 download); M684285 (1 download); M684301 (1 download); M684306 (1 download); M684314 (1 download); M684312 (1 download); M684320 (1 download);