Skip to main content

Change your cookie settings

We cannot change your cookie settings at the moment because JavaScript is not running in your browser. To fix this, try:

  1. turning on JavaScript in your browser settings
  2. reloading this page
View cookies

Change your cookie settings

We cannot change your cookie settings at the moment because JavaScript is not running in your browser. To fix this, try:

  1. turning on JavaScript in your browser settings
  2. reloading this page
View cookies
  1. Home
  2. Search
  3. Biotope (macrofaunal assemblage) map and associated confidence layer based on grab and core data from 1976 to 2020

Biotope (macrofaunal assemblage) map and associated confidence layer based on grab and core data from 1976 to 2020

Summary

Two vector (.shp) files are provided. The first, (*macro_assemblages.shp*) shows the modelled (random forest) macrofaunal assemblage type based on a clustering of abundance data from the OneBenthic database (see `https://sway.office.com/HM5VkWvBoZ86atYP?ref=Link`_). The second file, *(macro_assemblages_confidence.shp)* shows associated confidence in the modelled output, with darker shades (high values) indicating higher confidence and lighter shades (lower values) indicating lower confidence. Both layers can be viewed in the OneBenthic Layers tool *( `https://rconnect.cefas.co.uk/onebenthic_layers/`_)*, together with further details of the methodology used to produce them. .. _`https://sway.office.com/hm5vkwvboz86atyp?ref=link`: https://sway.office.com/HM5VkWvBoZ86atYP?ref=Link .. _`https://rconnect.cefas.co.uk/onebenthic_layers/`: https://rconnect.cefas.co.uk/onebenthic_layers/

Categories

Keywords

N/A

Use limitation statement

There are no public access constraints to this data. Use of this data is subject to the licence identified.

Attribution statement

See Cefas Data Portal for details – link to dataset below.

Technical information

Update frequency

notPlanned

Lineage

The modelled layer for macrofaunal assemblage is based on a random forest modelling of point sample data from the OneBenthic *(OB, `https://rconnect.cefas.co.uk/onebenthic_dashboard`_/)*dataset, largely following the methodology in Cooper et al. (2019), but with an expanded dataset covering the Greater North Sea and including data from the EurOBI (`https://www.eurobis.org/`_) data repository. Of the 44,407 samples within OB, we selected a subset of 31,845 for which data were considered comparable (i.e. sample acquired using a 0.1 m2 grab or core, processed using a 1 mm sieve and not taken from a known impacted site). Colonial taxa were included and given a value of one. To take account of potential differences in taxonomic resolution between surveys, macrofaunal data were aggregated to family level using the taxonomic hierarchy provided by the World Register of Marine Species (`https://www.marinespecies.org/`_). This reduced the number of taxa from 3,659 to 750. To address spatial autocorrelation in the data, and in keeping with the previous approach, samples closer than 50 m were removed from the dataset, reducing the overall number to 18,348. A fourth-root transformation was then applied to the data to down weight the influence of highly abundant taxa. Data were then subjected to clustering using k-means. A species distribution modelling approach, based on random forest, was then used to model cluster group (i.e. macrofaunal assemblage or biotope) identity across the study area (Greater North Sea). Cross-validation via repeated sub-sampling was done to evaluate the robustness of the model estimate and predictions to data sub-setting and to extract additional information from the model outputs to produce maps of confidence in the predicted distribution, following the approach described in Mitchell et al. (2018). The cross-validation was done on 10 split sample data sets with 75% used to train and 25% to test models, randomly sampled within the levels of the response variable to maintain the class balance. The final model output was plotted as the cluster class with the majority vote of all 10 model runs. An associated confidence map was produced by multiplying map layers for 1) the frequency of the most common class and ii) the average probability of the most common class. Model outputs are used in the OneBenthic Layers Tool ( `https://rconnect.cefas.co.uk/onebenthic_layers/)`_. Cooper, K.M.; Bolam, S.G.; Downie, A.-L.; Barry, J. 2019. Biological-based habitat classification approaches promote cost-efficient monitoring: An example using seabed assemblages. J. Appl. Ecol. 56:1085–1098. `https://doi.org/10.1111/1365-2664.13381`_ Mitchell, P.J., Downie, A.-L., Diesing, M. How good is my map? 2018. A tool for semi-automated thematic mapping and spatially explicit confidence assessment. Env. Model. Softw. 108, 111–122. `https://doi.org/10.1016/j.envsoft.2018.07.014`_ .. _`https://rconnect.cefas.co.uk/onebenthic_dashboard`: https://rconnect.cefas.co.uk/onebenthic_dashboard .. _`https://www.eurobis.org/`: https://www.eurobis.org/ .. _`https://www.marinespecies.org/`: https://www.marinespecies.org/ .. _`https://rconnect.cefas.co.uk/onebenthic_layers/)`: https://rconnect.cefas.co.uk/onebenthic_layers/) .. _`https://doi.org/10.1111/1365-2664.13381`: https://doi.org/10.1111/1365-2664.13381 .. _`https://doi.org/10.1016/j.envsoft.2018.07.014`: https://doi.org/10.1016/j.envsoft.2018.07.014

Spatial information

Coordinate reference system

N/A

Geographic extent

  • Latitude from: 52.4581 to 52.4595
  • Longitude from: 1.73881 to 1.74086
Metadata information

Language

English

Metadata identifier

a4d03890-4759-46cb-984f-9e9d70ccf487


Published by

Centre for Environment, Fisheries & Aquaculture Science

Contact publisher

data.manager@cefas.gov.uk

Dataset reference dates

Creation date

11 February 2022

Revision date

26 May 2023

Publication date

11 February 2022

Period

  • From: 16 November 1976
  • To: 09 August 2020

Search

Data and Supporting Information
Data services and download by area of interestLinkAction
The Cefas Data Portal contains metadata records and data sets available to download and connect to in support of our commitment to open science. Data is available in the following formats: CSV, ESRI Shapefile. The data can also be accessed via the WFS and WMS protocols.Open link