SMHI IFCB Plankton Image Reference Library
https://doi.org/10.17044/SCILIFELAB.25883455
This repository includes manually annotated plankton images by phytoplankton experts at the Swedish Meteorological and Hydrological Institute (SMHI). The images were captured using an Imaging FlowCytobot (IFCB, McLane Research Laboratories (https://mclanelabs.com/imaging-flowcytobot) ) from different locations and seasons in the Skagerrak, Kattegat, and Baltic Proper. These images can be used for training automatic image classifiers to identify various plankton species.
From version 6 onward, the images have been consolidated into a single dataset, combining three previously separate sources: RV Svea (Baltic Proper, 2022–2026), RV Svea (Skagerrak–Kattegat, 2022–2026), and Tångesund (2016). Previous versions are still accessible in this repository.
The dataset consists of two ZIP archives. The first, annotated_images, contains .png images organized into class-specific subfolders, along with accompanying .tsv files that store image-level and class metadata. The second, matlab_files, includes raw data files (.roi, .hdr, .adc) as well as .mat files intended for developing a random forest image classifier using MATLAB code from the ifcb-analysis repository.
The images in this dataset undergo continuous quality control, and new images are regularly added. Consequently, this dataset will be updated on a regular basis. If you find any mislabeled images, please contact the authors.
Version history
- Version 6 (2026-03-31): 86,232 annotated images. The three datasets in the previous versions has been merged into a single dataset.
- Version 5 (2025-12-19): 82,123 annotated images.
- Version 4 (2024-11-04): 76,032 annotated images. Corrected class names to better match WoRMS, and continued quality control of images in the Tångesund dataset.
- Version 3 (2024-08-05): 72,086 annotated images. Added iRfcb dataset for user and unit testing.
- Version 2 (2024-06-03): 71,525 annotated images. Updated class names and corrected manual files in the Tångesund dataset. Continued quality control of images in the Tångesund dataset.
- Version 1 (2024-05-31): 65,435 annotated images
Gå till källa för data
https://doi.org/10.17044/SCILIFELAB.25883455
Citering och åtkomst
Citering och åtkomst
Skapare/primärforskare:
- Ann-Turi Skjevik
- Malin Mohlin
- Maria Karlberg
Forskningshuvudman:
Citering:
Administrativ information
Administrativ information
Finansiering
Finansiering
Finansiär:
- Swedish Research Council
Öppnar nytt fönster hos ror.org.
ROR
Referensnummer:
2019-00242_VR
Projektnamn på ansökan:
Swedish Biodiversity Data Infrastructure
Ämnesområde och nyckelord
Ämnesområde och nyckelord
Standard för svensk indelning av forskningsämnen 2025:
Nyckelord:
- Marine and estuarine ecology (incl. marine ichthyology)
- Phycology (incl. marine grasses)
Relationer
Relationer
Har metadata:
Är del av:
Metadata
Metadata

SMHI - Sveriges meteorologiska och hydrologiska institut