An annotated high-content fluorescence microscopy dataset with EGFP-Galectin-3-stained cells and manually labelled outlineszenodo

Many forms of bioimage analysis involve the detection of objects and their outlines. In the context of microscopy-based high-throughput drug and genomic screening and even in smaller scale microscopy experiments, the objects that most often need to be detected are cells. In order to develop and benc...

Full description

Saved in:
Bibliographic Details
Main Authors: Salma Kazemi Rashed, Malou Arvidsson, Rafsan Ahmed, Sonja Aits
Format: Article
Language:English
Published: Elsevier 2025-02-01
Series:Data in Brief
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2352340924011107
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832576494242627584
author Salma Kazemi Rashed
Malou Arvidsson
Rafsan Ahmed
Sonja Aits
author_facet Salma Kazemi Rashed
Malou Arvidsson
Rafsan Ahmed
Sonja Aits
author_sort Salma Kazemi Rashed
collection DOAJ
description Many forms of bioimage analysis involve the detection of objects and their outlines. In the context of microscopy-based high-throughput drug and genomic screening and even in smaller scale microscopy experiments, the objects that most often need to be detected are cells. In order to develop and benchmark algorithms and neural networks that can perform this task, high-quality datasets with annotated cell outlines are needed.We have created a dataset, named Aitslab_bioimaging2, consisting of 60 fluorescence microscopy images with EGFP-Galectin-3 labelled cells and their hand-labelled outlines. Images were acquired on a Thermo Fischer CX7 high-content imaging system at 20x magnification created as part of an RNA interference screen with a modified U2OS osteosarcoma cell line. Outlines were labelled by three annotators, who had high inter-annotator agreement between them and with a biomedical expert, who labelled some of the objects for comparison and reviewed a subset of the labels, making minor corrections as needed.The dataset comprises over 2200 annotated cell objects in total, making it sufficient in size to train high-performing neural networks for instance or semantic segmentation. Labels can also easily be converted to boxes for object detection tasks. The dataset is already pre-divided into training, development, and test sets. Matching nuclear staining and outlines are available for part of the dataset from a previous publication (dataset Aitslab_bioimaging1) [1].
format Article
id doaj-art-976fb3b831c843aeb04efae8d6e4df66
institution Kabale University
issn 2352-3409
language English
publishDate 2025-02-01
publisher Elsevier
record_format Article
series Data in Brief
spelling doaj-art-976fb3b831c843aeb04efae8d6e4df662025-01-31T05:11:25ZengElsevierData in Brief2352-34092025-02-0158111148An annotated high-content fluorescence microscopy dataset with EGFP-Galectin-3-stained cells and manually labelled outlineszenodoSalma Kazemi Rashed0Malou Arvidsson1Rafsan Ahmed2Sonja Aits3Cell Death, Lysosomes and Artificial Intelligence Group, Department of Experimental Medical Science, Faculty of Medicine, Lund University, BMC D10, 22184 Lund, SwedenCell Death, Lysosomes and Artificial Intelligence Group, Department of Experimental Medical Science, Faculty of Medicine, Lund University, BMC D10, 22184 Lund, SwedenCell Death, Lysosomes and Artificial Intelligence Group, Department of Experimental Medical Science, Faculty of Medicine, Lund University, BMC D10, 22184 Lund, SwedenCorresponding author.; Cell Death, Lysosomes and Artificial Intelligence Group, Department of Experimental Medical Science, Faculty of Medicine, Lund University, BMC D10, 22184 Lund, SwedenMany forms of bioimage analysis involve the detection of objects and their outlines. In the context of microscopy-based high-throughput drug and genomic screening and even in smaller scale microscopy experiments, the objects that most often need to be detected are cells. In order to develop and benchmark algorithms and neural networks that can perform this task, high-quality datasets with annotated cell outlines are needed.We have created a dataset, named Aitslab_bioimaging2, consisting of 60 fluorescence microscopy images with EGFP-Galectin-3 labelled cells and their hand-labelled outlines. Images were acquired on a Thermo Fischer CX7 high-content imaging system at 20x magnification created as part of an RNA interference screen with a modified U2OS osteosarcoma cell line. Outlines were labelled by three annotators, who had high inter-annotator agreement between them and with a biomedical expert, who labelled some of the objects for comparison and reviewed a subset of the labels, making minor corrections as needed.The dataset comprises over 2200 annotated cell objects in total, making it sufficient in size to train high-performing neural networks for instance or semantic segmentation. Labels can also easily be converted to boxes for object detection tasks. The dataset is already pre-divided into training, development, and test sets. Matching nuclear staining and outlines are available for part of the dataset from a previous publication (dataset Aitslab_bioimaging1) [1].http://www.sciencedirect.com/science/article/pii/S2352340924011107Instance segmentationFluorescence microscopyBiomedical image analysisHigh-content screeningComputer visionDeep learning training and evaluation
spellingShingle Salma Kazemi Rashed
Malou Arvidsson
Rafsan Ahmed
Sonja Aits
An annotated high-content fluorescence microscopy dataset with EGFP-Galectin-3-stained cells and manually labelled outlineszenodo
Data in Brief
Instance segmentation
Fluorescence microscopy
Biomedical image analysis
High-content screening
Computer vision
Deep learning training and evaluation
title An annotated high-content fluorescence microscopy dataset with EGFP-Galectin-3-stained cells and manually labelled outlineszenodo
title_full An annotated high-content fluorescence microscopy dataset with EGFP-Galectin-3-stained cells and manually labelled outlineszenodo
title_fullStr An annotated high-content fluorescence microscopy dataset with EGFP-Galectin-3-stained cells and manually labelled outlineszenodo
title_full_unstemmed An annotated high-content fluorescence microscopy dataset with EGFP-Galectin-3-stained cells and manually labelled outlineszenodo
title_short An annotated high-content fluorescence microscopy dataset with EGFP-Galectin-3-stained cells and manually labelled outlineszenodo
title_sort annotated high content fluorescence microscopy dataset with egfp galectin 3 stained cells and manually labelled outlineszenodo
topic Instance segmentation
Fluorescence microscopy
Biomedical image analysis
High-content screening
Computer vision
Deep learning training and evaluation
url http://www.sciencedirect.com/science/article/pii/S2352340924011107
work_keys_str_mv AT salmakazemirashed anannotatedhighcontentfluorescencemicroscopydatasetwithegfpgalectin3stainedcellsandmanuallylabelledoutlineszenodo
AT malouarvidsson anannotatedhighcontentfluorescencemicroscopydatasetwithegfpgalectin3stainedcellsandmanuallylabelledoutlineszenodo
AT rafsanahmed anannotatedhighcontentfluorescencemicroscopydatasetwithegfpgalectin3stainedcellsandmanuallylabelledoutlineszenodo
AT sonjaaits anannotatedhighcontentfluorescencemicroscopydatasetwithegfpgalectin3stainedcellsandmanuallylabelledoutlineszenodo
AT salmakazemirashed annotatedhighcontentfluorescencemicroscopydatasetwithegfpgalectin3stainedcellsandmanuallylabelledoutlineszenodo
AT malouarvidsson annotatedhighcontentfluorescencemicroscopydatasetwithegfpgalectin3stainedcellsandmanuallylabelledoutlineszenodo
AT rafsanahmed annotatedhighcontentfluorescencemicroscopydatasetwithegfpgalectin3stainedcellsandmanuallylabelledoutlineszenodo
AT sonjaaits annotatedhighcontentfluorescencemicroscopydatasetwithegfpgalectin3stainedcellsandmanuallylabelledoutlineszenodo