An annotated high-content fluorescence microscopy dataset with EGFP-Galectin-3-stained cells and manually labelled outlineszenodo
Many forms of bioimage analysis involve the detection of objects and their outlines. In the context of microscopy-based high-throughput drug and genomic screening and even in smaller scale microscopy experiments, the objects that most often need to be detected are cells. In order to develop and benc...
Saved in:
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Elsevier
2025-02-01
|
Series: | Data in Brief |
Subjects: | |
Online Access: | http://www.sciencedirect.com/science/article/pii/S2352340924011107 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1832576494242627584 |
---|---|
author | Salma Kazemi Rashed Malou Arvidsson Rafsan Ahmed Sonja Aits |
author_facet | Salma Kazemi Rashed Malou Arvidsson Rafsan Ahmed Sonja Aits |
author_sort | Salma Kazemi Rashed |
collection | DOAJ |
description | Many forms of bioimage analysis involve the detection of objects and their outlines. In the context of microscopy-based high-throughput drug and genomic screening and even in smaller scale microscopy experiments, the objects that most often need to be detected are cells. In order to develop and benchmark algorithms and neural networks that can perform this task, high-quality datasets with annotated cell outlines are needed.We have created a dataset, named Aitslab_bioimaging2, consisting of 60 fluorescence microscopy images with EGFP-Galectin-3 labelled cells and their hand-labelled outlines. Images were acquired on a Thermo Fischer CX7 high-content imaging system at 20x magnification created as part of an RNA interference screen with a modified U2OS osteosarcoma cell line. Outlines were labelled by three annotators, who had high inter-annotator agreement between them and with a biomedical expert, who labelled some of the objects for comparison and reviewed a subset of the labels, making minor corrections as needed.The dataset comprises over 2200 annotated cell objects in total, making it sufficient in size to train high-performing neural networks for instance or semantic segmentation. Labels can also easily be converted to boxes for object detection tasks. The dataset is already pre-divided into training, development, and test sets. Matching nuclear staining and outlines are available for part of the dataset from a previous publication (dataset Aitslab_bioimaging1) [1]. |
format | Article |
id | doaj-art-976fb3b831c843aeb04efae8d6e4df66 |
institution | Kabale University |
issn | 2352-3409 |
language | English |
publishDate | 2025-02-01 |
publisher | Elsevier |
record_format | Article |
series | Data in Brief |
spelling | doaj-art-976fb3b831c843aeb04efae8d6e4df662025-01-31T05:11:25ZengElsevierData in Brief2352-34092025-02-0158111148An annotated high-content fluorescence microscopy dataset with EGFP-Galectin-3-stained cells and manually labelled outlineszenodoSalma Kazemi Rashed0Malou Arvidsson1Rafsan Ahmed2Sonja Aits3Cell Death, Lysosomes and Artificial Intelligence Group, Department of Experimental Medical Science, Faculty of Medicine, Lund University, BMC D10, 22184 Lund, SwedenCell Death, Lysosomes and Artificial Intelligence Group, Department of Experimental Medical Science, Faculty of Medicine, Lund University, BMC D10, 22184 Lund, SwedenCell Death, Lysosomes and Artificial Intelligence Group, Department of Experimental Medical Science, Faculty of Medicine, Lund University, BMC D10, 22184 Lund, SwedenCorresponding author.; Cell Death, Lysosomes and Artificial Intelligence Group, Department of Experimental Medical Science, Faculty of Medicine, Lund University, BMC D10, 22184 Lund, SwedenMany forms of bioimage analysis involve the detection of objects and their outlines. In the context of microscopy-based high-throughput drug and genomic screening and even in smaller scale microscopy experiments, the objects that most often need to be detected are cells. In order to develop and benchmark algorithms and neural networks that can perform this task, high-quality datasets with annotated cell outlines are needed.We have created a dataset, named Aitslab_bioimaging2, consisting of 60 fluorescence microscopy images with EGFP-Galectin-3 labelled cells and their hand-labelled outlines. Images were acquired on a Thermo Fischer CX7 high-content imaging system at 20x magnification created as part of an RNA interference screen with a modified U2OS osteosarcoma cell line. Outlines were labelled by three annotators, who had high inter-annotator agreement between them and with a biomedical expert, who labelled some of the objects for comparison and reviewed a subset of the labels, making minor corrections as needed.The dataset comprises over 2200 annotated cell objects in total, making it sufficient in size to train high-performing neural networks for instance or semantic segmentation. Labels can also easily be converted to boxes for object detection tasks. The dataset is already pre-divided into training, development, and test sets. Matching nuclear staining and outlines are available for part of the dataset from a previous publication (dataset Aitslab_bioimaging1) [1].http://www.sciencedirect.com/science/article/pii/S2352340924011107Instance segmentationFluorescence microscopyBiomedical image analysisHigh-content screeningComputer visionDeep learning training and evaluation |
spellingShingle | Salma Kazemi Rashed Malou Arvidsson Rafsan Ahmed Sonja Aits An annotated high-content fluorescence microscopy dataset with EGFP-Galectin-3-stained cells and manually labelled outlineszenodo Data in Brief Instance segmentation Fluorescence microscopy Biomedical image analysis High-content screening Computer vision Deep learning training and evaluation |
title | An annotated high-content fluorescence microscopy dataset with EGFP-Galectin-3-stained cells and manually labelled outlineszenodo |
title_full | An annotated high-content fluorescence microscopy dataset with EGFP-Galectin-3-stained cells and manually labelled outlineszenodo |
title_fullStr | An annotated high-content fluorescence microscopy dataset with EGFP-Galectin-3-stained cells and manually labelled outlineszenodo |
title_full_unstemmed | An annotated high-content fluorescence microscopy dataset with EGFP-Galectin-3-stained cells and manually labelled outlineszenodo |
title_short | An annotated high-content fluorescence microscopy dataset with EGFP-Galectin-3-stained cells and manually labelled outlineszenodo |
title_sort | annotated high content fluorescence microscopy dataset with egfp galectin 3 stained cells and manually labelled outlineszenodo |
topic | Instance segmentation Fluorescence microscopy Biomedical image analysis High-content screening Computer vision Deep learning training and evaluation |
url | http://www.sciencedirect.com/science/article/pii/S2352340924011107 |
work_keys_str_mv | AT salmakazemirashed anannotatedhighcontentfluorescencemicroscopydatasetwithegfpgalectin3stainedcellsandmanuallylabelledoutlineszenodo AT malouarvidsson anannotatedhighcontentfluorescencemicroscopydatasetwithegfpgalectin3stainedcellsandmanuallylabelledoutlineszenodo AT rafsanahmed anannotatedhighcontentfluorescencemicroscopydatasetwithegfpgalectin3stainedcellsandmanuallylabelledoutlineszenodo AT sonjaaits anannotatedhighcontentfluorescencemicroscopydatasetwithegfpgalectin3stainedcellsandmanuallylabelledoutlineszenodo AT salmakazemirashed annotatedhighcontentfluorescencemicroscopydatasetwithegfpgalectin3stainedcellsandmanuallylabelledoutlineszenodo AT malouarvidsson annotatedhighcontentfluorescencemicroscopydatasetwithegfpgalectin3stainedcellsandmanuallylabelledoutlineszenodo AT rafsanahmed annotatedhighcontentfluorescencemicroscopydatasetwithegfpgalectin3stainedcellsandmanuallylabelledoutlineszenodo AT sonjaaits annotatedhighcontentfluorescencemicroscopydatasetwithegfpgalectin3stainedcellsandmanuallylabelledoutlineszenodo |