Identification

Title

Larval and juvenile fish abundance, habitat, water quality, flow and climate data from English rivers, 1984-2017

Abstract

[This dataset is embargoed until August 31, 2025]. This dataset contains monthly/annual time series of species-specific abundances and covariates for 137 targeted larval/juvenile fish surveys at sites in a range of English lowland rivers. Larval/juvenile fish data come from two different sources: The Environment Agency’s (EA) National Fish Population Database (NFPD) between 1974 and 2017 and a dataset created by the University of Hull (UoH) between 1984 and 2017 inclusive. Biological data consists of density estimates of each fish species from each survey (per metre squared) and also the average length of specified species at the end of their first year of growth. Covariate data include habitat quality indicator (River Habitat Survey), climatic variables (Gulf Stream and North Atlantic Oscillation indices), land-use change, river hydrology, water temperature, effluent dilution factor and concentrations of chemical determinands. This dataset was created as larvae and juveniles may be particularly useful indicators and respond differently than older fish to environmental stressors. The work was supported by the Natural Environment Research Council (Grant NE/S000100/2). Full details about this dataset can be found at https://doi.org/10.5285/c904c5f0-7f21-4759-a7d4-c262da230b53

Resource type

dataset

Resource locator

Unique resource identifier

code

https://catalogue.ceh.ac.uk/id/c904c5f0-7f21-4759-a7d4-c262da230b53

codeSpace

doi:

Dataset language

eng

Spatial reference system

code identifying the spatial reference system

Classification of spatial data and services

Topic category

biota

Keywords

Keyword set

keyword value

Environmental Monitoring Facilities

originating controlled vocabulary

title

GEMET - INSPIRE themes, version 1.0

reference date

date type

publication

effective date

2008-06-01

Geographic location

West bounding longitude

-6.452

East bounding longitude

1.768

North bounding latitude

55.812

South bounding latitude

49.864

Temporal reference

Temporal extent

Begin position

1984-01-01

End position

2017-12-31

Dataset reference date

date type

publication

effective date

2024-06-18

Frequency of update

Quality and validity

Lineage

Fish Data Juvenile fish density data was collected from two sources: the EA NFPD "Freshwater Fish Counts for all species for all Areas and all Years" dataset (1974-2017) and the UoH fish count dataset (1984-2017). Fish counts were converted into densities (numbers per m-2) by dividing species counts by the sampling area (converted NFPD units from 100 m-2 to m-2 to match UoH data). To determine the mean length of juvenile fish (chub, roach, common bream, dace, bleak, gudgeon, minnow, and three-spined sticklebacks) at the end of the growing season (last surveys between August and October), length frequency histograms were created for each species and survey using R 4.2.1. Only individuals in their first year were considered in the calculation, providing the mean length of first-year fish at the end of the growing season for each survey. Species in the NFPD dataset were assigned Chempop Species Codes adapted from EA species codes, and when multiple EA species codes matched one Chempop Species Code, their densities were summed for each survey. Data without a Chempop species code (e.g., hybrids or broad categories like "all species") were removed, while surveys with zero fish captures were retained in the dataset. Habitat Quality indicators Habitat scores were obtained from the Environment Agency's River Habitat Survey (RHS) dataset, with ArcGIS used to link fish sites to their nearest RHS site. Two key scores were extracted: the Habitat Modification Score (HMS), which reflects the degree of artificial channel modification, and the Habitat Quality Assessment (HQA), which provides a broad assessment of habitat quality based on the diversity and abundance of natural features in the river channel and riparian zone. The HQA is suitable for comparing similar river types. To enable comparisons with data from 1994, new rules for flow-types and channel vegetation were established and applied across all sites, resulting in the HQA adjusted for 1994 data. The most recent RHS survey data was extracted for this analysis. Land Use Land use in the catchment upstream of fish sites (NFPD) were calculated from the UKCEH Land Cover Map 2015 (25m raster, GB). The 21 land cover types were grouped to cover four broad categories of: Woodland, Arable, Seminatural and Urban. The catchment tool in DataLabs was used to extract and process the data, which were summed into percentage and areas of each land use group. The sum of the percentages of the grouped categories were checked and were found to fall in the range of 99.96-100.04 %. Water Temperature To assess its influence on fish growth, the cumulative annual degree-days ≥12 degC, indicating the total temperature sum for days exceeding 12 degC in the year prior to the survey date, was calculated. Water temperature data was sourced from the EA Surface Water Temperature Archive for sites with at least 3 years of records, and daily mean air temperature was collected from the Climate Hydrology and Ecology Research Support System (CHESS). The Baseflow Index for each water temperature gauging station and fish site was determined using the UKCEH National River Flow Archive. Generalized additive mix modeling (GAMM) was chosen to model water temperature, accounting for seasonal and year-to-year variation in water temperature, along with the nonlinear relationship between air and water temperature. Seven GAMM models, each covering a 0.1 BFI range, were developed, with the best model (Model: month, time, and air temperature, using ARMA structures) identified based on evaluation metrics like RMSE, NRMSE, and SI. This model was employed to estimate water temperature for fish sites. River Flow River flow data was sourced from the National River Flow Archive (NRFA) hosted by UKCEH and matched with fish sites based on the shortest distance along the river network. Data was extracted for the 12 months leading up to each fish survey, and various statistics were computed, including mean, median, standard deviation, coefficient of variation, Q5/mean, and Q95/mean. Additionally, threshold statistics were calculated for different percentiles (5, 25, 75, and 95), encompassing the threshold discharge value (estimated discharge quantile), days exceeded, number of exceedance events, and average event length. Water Quality-Effluent Dilution factor To gauge river pollution levels across England, the study used the LowFlows2000-WQX (lowFlows2000 Water Quality eXtension) model, which characterizes wastewater discharge by wastewater treatment works (WWTWs) in terms of location, domestic population served, dry weather flow (DWF), and treatment type. It's important to note that only significant WWTWs were considered for computational purposes, based on specific criteria: those in catchments contributing to 95% of each hydrometric area and those accounting for 95% of the total discharged DWF to the estuary, ranked by DWF. The model produced predicted wastewater percentages for all reaches in the modelled river network, which is a truncated version of the 1:50,000 digital river network derived from the UK Ordnance Survey Panorama Data Set. To estimate the percentage of wastewater at each fish site, a python script was used to link each site to the nearest reach in the river network. The results are reported in terms of mean, standard deviation, 90th percentile, and 95th percentile, denoted as EDF_Mn, EDF_SD, EDF_Q90, and EDF_Q95, respectively. Chemical Determinands Chemical concentration data spanning 1960-2017, involving 41 determinands, was extracted from the UK's Environment Agency's Water Quality Data Archive. Fish sites were matched to the nearest chemical determinand site using specific criteria like site distance, sample count, and data collection timeframe through ArcGIS. To identify and exclude erroneous data, a threshold of 10 mg/l was set for some determinands, with values exceeding this threshold omitted from statistical calculations. Handling values below the Limit of Quantification (LoQ) involved three options: setting values to LoQ (LoQ1), setting them to 0 (LoQ2), or setting them to LoQ/2 (LoQ3). For the 12 months preceding the surveys at each fish site, various statistics were computed, including minimum, maximum, median, mean, standard deviation, total sample count, and counts of samples below and above the LoQ. Some sites had to be excluded from the final dataset due to licensing restrictions. Altitude Elevation values for each fish site were extracted from the Integrated Hydrological Digital Terrain Model (IHDTM) using Arc GIS' Extract Multi Values to Points' tool. Two elevation values were calculated: Elevation values corresponding to the value of the IHDTM cell centre. Elevation values corresponding to an average value calculated from the adjacent cells with valid values using bilinear interpolation; NoData values were ignored in the interpolation unless all adjacent cells were NoData. Climate variables The Gulf Stream Index (GSI) data spanning 1975-2017 was obtained from the Plymouth Marine Laboratory website (http://www.pml-gulfstream.org.uk/data.htm). Annual mean data were employed due to substantial month-to-month variability resulting from Gulf Stream meandering (Plymouth Marine Laboratory, 2019). Positive GSI values signify a northward shift from the long-term mean location, while negative values indicate a southward movement. Data for the North Atlantic Oscillation Index (NAOI) spanning 1975-2017 was obtained from the National Centre for Atmospheric Research website (https://www.ncei.noaa.gov/access/monitoring/nao). The NAOI's winter (December-March) station-based index relies on the normalized sea level pressure (SLP) difference between Lisbon, Portugal, and Reykjavik, Iceland, dating back to 1864. Positive NAOI values indicate stronger westerlies over mid-latitudes, more intense North Atlantic weather systems, and wetter/milder conditions in Western Europe.

Conformity

Conformity report

specification

title

Commission Regulation (EU) No 1089/2010 of 23 November 2010 implementing Directive 2007/2/EC of the European Parliament and of the Council as regards interoperability of spatial data sets and services

reference date

date type

publication

effective date

2010-12-08

degree

explanation

Data format

name of format

Comma-separated values (CSV)

version of format

Constraints related to access and use

Constraint set

Limitations on public access

Embargoed

Constraint set

Limitations on public access

Constraint set

Limitations on public access

If you reuse this data, you should cite: Ainsworth, R.F., Keller, V., Bachiller-Jareno, N., Jürgens, M.D., Eastman, M., Sadykova, D, Rizzo, C., Scarlett, P., Peirson, G. , Eley, F., Antoniou, V., Cowx, I.G., Johnson, A.C., Nunn, A.D. (2024). Larval and juvenile fish abundance, habitat, water quality, flow and climate data from English rivers, 1984-2017. NERC EDS Environmental Information Data Centre https://doi.org/10.5285/c904c5f0-7f21-4759-a7d4-c262da230b53

Responsible organisations

Responsible party

organisation name

University of Hull

email address

r.ainsworth@hull.ac.uk

web address

https://orcid.org/0000-0001-6669-0356

name: ORCID record

description: ORCID is an open, non-profit, community-driven effort to create and maintain a registry of unique researcher identifiers and a transparent method of linking research activities and outputs to these identifiers.

function: information

responsible party role

author

Responsible party

organisation name

email address

enquiries@ceh.ac.uk

web address

https://orcid.org/0000-0003-4489-5363

name: ORCID record

description: ORCID is an open, non-profit, community-driven effort to create and maintain a registry of unique researcher identifiers and a transparent method of linking research activities and outputs to these identifiers.

function: information

responsible party role

author

Responsible party

organisation name

University of Portsmouth

email address

nuria.bachiller-jareno@port.ac.uk

web address

https://orcid.org/0000-0001-9732-6725

name: ORCID record

description: ORCID is an open, non-profit, community-driven effort to create and maintain a registry of unique researcher identifiers and a transparent method of linking research activities and outputs to these identifiers.

function: information

responsible party role

author

Responsible party

organisation name

email address

enquiries@ceh.ac.uk

web address

https://orcid.org/0000-0002-6526-589X

name: ORCID record

description: ORCID is an open, non-profit, community-driven effort to create and maintain a registry of unique researcher identifiers and a transparent method of linking research activities and outputs to these identifiers.

function: information

responsible party role

author

Responsible party

organisation name

Met Office

email address

Michael.eastman@metoffice.gov.uk

web address

https://orcid.org/0000-0002-8212-5872

name: ORCID record

description: ORCID is an open, non-profit, community-driven effort to create and maintain a registry of unique researcher identifiers and a transparent method of linking research activities and outputs to these identifiers.

function: information

responsible party role

author

Responsible party

organisation name

email address

enquiries@ceh.ac.uk

web address

https://orcid.org/0000-0002-3930-5974

name: ORCID record

description: ORCID is an open, non-profit, community-driven effort to create and maintain a registry of unique researcher identifiers and a transparent method of linking research activities and outputs to these identifiers.

function: information

responsible party role

author

Responsible party

organisation name

Wallingford HydroSolutions

email address

clarissa.rizzo@hydrosolutions.co.uk

responsible party role

author

Responsible party

organisation name

email address

enquiries@ceh.ac.uk

web address

https://orcid.org/0000-0001-6242-8158

name: ORCID record

description: ORCID is an open, non-profit, community-driven effort to create and maintain a registry of unique researcher identifiers and a transparent method of linking research activities and outputs to these identifiers.

function: information

responsible party role

author

Responsible party

organisation name

UK Environment Agency

email address

graeme.peirson@environment-agency.gov.uk

responsible party role

author

Responsible party

organisation name

UK Environment Agency

email address

frances.eley@environment-agency.gov.uk

responsible party role

author

Responsible party

organisation name

Agricultural University of Athens

email address

vantoniou@aua.gr

web address

https://orcid.org/0000-0001-6011-4075

name: ORCID record

description: ORCID is an open, non-profit, community-driven effort to create and maintain a registry of unique researcher identifiers and a transparent method of linking research activities and outputs to these identifiers.

function: information

responsible party role

author

Responsible party

organisation name

University of Hull

email address

i.g.cowx@hull.ac.uk

responsible party role

author

Responsible party

organisation name

email address

enquiries@ceh.ac.uk

web address

https://orcid.org/0000-0003-1570-3764

name: ORCID record

description: ORCID is an open, non-profit, community-driven effort to create and maintain a registry of unique researcher identifiers and a transparent method of linking research activities and outputs to these identifiers.

function: information

responsible party role

author

Responsible party

organisation name

University of Hull

email address

a.d.nunn@hull.ac.uk

web address

https://orcid.org/0000-0001-8370-1221

name: ORCID record

description: ORCID is an open, non-profit, community-driven effort to create and maintain a registry of unique researcher identifiers and a transparent method of linking research activities and outputs to these identifiers.

function: information

responsible party role

author

Responsible party

organisation name

email address

info@eidc.ac.uk

responsible party role

custodian

Responsible party

organisation name

University of Hull

email address

r.ainsworth@hull.ac.uk

web address

https://orcid.org/0000-0001-6669-0356

name: ORCID record

description: ORCID is an open, non-profit, community-driven effort to create and maintain a registry of unique researcher identifiers and a transparent method of linking research activities and outputs to these identifiers.

function: information

responsible party role

pointOfContact

Responsible party

organisation name

email address

info@eidc.ac.uk

responsible party role

publisher

Responsible party

organisation name

The University of Hull

email address

EMAIL NOT PROVIDED

responsible party role

owner

Metadata on metadata

Metadata point of contact

organisation name

full postal address

Lancaster Environment Centre, Library Avenue, Bailrigg

Lancaster

LA1 4AP

UK

email address

info@eidc.ac.uk

web address

https://eidc.ac.uk/

name: EIDC website

description: The Environmental Information Data Centre (EIDC) is the UK's national data centre for terrestrial and freshwater sciences.

function: information

responsible party role

pointOfContact

Metadata date

2024-09-30T11:40:21

Metadata language

eng