Abstract
RDBqc is an R package designed for quality control of data submitted to the Regional Database Fisheries Information System (RDBFIS). It provides a structured and automated approach to validate, check, and assess data consistency before submission, ensuring compliance with predefined data quality standards.
The updated version of the package is available on the GitHub repository (https://github.com/COISPA/RDBqc)
RDBqc allows to carry out a set of a priori quality checks on detailed sampling data and on aggregated landing data, and a posteriori quality check on MEDBS, FDI and GFCM data call formats.
The supported quality checks are:
A priori quality checks
The supported quality checks are:
A priori quality checks
- RCG CS - biological sampling data
- RCG CL - aggregated landing data
A posteriori quality checks
The dataset structure and formats for MED&BS data call are based on the guidelines outlined in the official Data Call for the STECF Expert Working Groups on Mediterranean and Black Sea in 2024, published by the Joint Research Centre (JRC) of the European Commission. The document, available at this link, provides a comprehensive framework for reporting fisheries-dependent and biological data in the Mediterranean and Black Sea under the EU Data Collection Framework (DCF).
The dataset includes the following structured tables:
- MEDBS - catch dataMEDBS - catch data
- MEDBS - discard data
- MEDBS - landing data
- MEDBS - GP table
- MEDBS - LW table
- MEDBS - MA table
- MEDBS - ML table
- MEDBS - SA table
- MEDBS - SL table
The FDI dataset structure and formats are based on the guidelines outlined in the official “FDI Data Call Annex 2024”, published by the Joint Research Centre (JRC) of the European Commission. The document provides a comprehensive framework for reporting fisheries-dependent data under the EU Data Collection Framework (DCF).
The dataset adheres to the standardized formats specified for tables such as:
- Table A
- Table G
- Tables H
- Tables I
- Table J
For the GFCM data call, the following tables, as further described in the dedicated manual (GFCM Data Call Manual), are included and verified:
- GFCM - Task II.2 table
- GFCM - Task III table
- GFCM - Task VII.2 table
- GFCM - Task VII.3.1 table
- GFCM - Task VII.3.2 table
Cross-checks among data calls
Since RBDqc version 0.0.15, the package also includes cross-check functions to perform checks between data call tables. In particular, the following cross-check functions are available:
- Landing consistency among the different data calls: MED & BS, FDI and Annual Economic Report (AER)
- Discard consistency between MED & BS and FDI data calls
- Landing consistency between FDI and AER data calls
- Landing value consistency between FDI and AER data calls
- Number of trips consistency between MED & BS and Annual report (AR) data calls
- Number of ages consistency between MED & BS and AR data calls
- Number of weights consistency between MED & BS and AR data calls
- Number of maturity data consistency between MED & BS and AR data calls