Nucleoli segmentation and feature extraction using CellProfiler

Overview

Questions:
  • How do I run an image analysis pipeline on public data using CellProfiler?

  • How do I analyse the DNA channel of fluorescence siRNA screens?

  • How do I download public image data into my history?

  • How do I segment and label cell nuclei?

  • How do I segment nucleoli (as the absence of DNA)?

  • How do I combine nuclei and nucleoli into one segmentation mask?

  • How do I extract the background of an image?

  • How do I relate the nucleoli to their parent nucleus?

  • How do I measure the image and object features?

  • How do I measure the image quality?

Objectives:
  • How to download images from a public image repository.

  • How to segment cell nuclei using CellProfiler in Galaxy.

  • How to segment cell nucleoli using CellProfiler in Galaxy.

  • How to extract features for images, nuclei and nucleoli.

Requirements:
Time estimation: 4 hours
Supporting Materials:
Last modification: Mar 12, 2021
License: Tutorial Content is licensed under Creative Commons Attribution 4.0 International License The GTN Framework is licensed under MIT

Introduction

The nucleolus is a prominent structure of the nucleus of eukaryotic cells and is involved in ribosome biogenesis and cell cycle regulation. In DNA staining of cells, nucleoli can be identified as the absence of DNA in nuclei (Fig. 1).
Phenotypes caused by reduced gene function are widely used to elucidate gene function and image-based RNA interference (RNAi) screens are routinely used to find and characterize genes involved in a particular biological process. While screens typically focus on one biological process of interest, the molecular markers used can also inform on other processes. Re-using published screens image data can then be a cost-effective alternative to performing new experiments.
In particular, regardless of the targeted biological process, many screens include a DNA label and therefore can also reveal the effect of gene knock-downs on nucleoli.

DNA channel.
Figure 1: DNA channel from the screen described in Integration of biological data by kernels on graph nodes allows prediction of new genes involved in mitotic chromosome condensation” 2014. The red arrows point at nucleoli.

In this tutorial, we will analyse DNA channel images of publicly available RNAi screens to extract numerical descriptors (i.e. features) of nucleoli. The images and associated metadata will be retrieved from the Image Data Resource (IDR), a repository that collects image datasets of tissues and cells.

To process and analyse the images, we will use CellProfiler (CellProfiler 3.0: Next-generation image processing for biology” 2018), a popular image analysis software. CellProfiler normally comes as a desktop application in which users can compose image analysis workflows from a series of modules. Many of these modules are now also available as tools in Galaxy.

To fully emulate the behaviour of the standalone CellProfiler in Galaxy, each image analysis workflow needs to have three parts:

1) StartingModules tool to initialise the pipeline,

2) tools performing the analysis (Fig. 2): identification of the nuclei, nucleoli and background, together with the feature extraction,

3) CellProfiler tool to actually run the pipeline.

High-level view of the workflow.
Figure 2: High-level view of the workflow

In this tutorial, you will learn how to create a workflow that downloads a selection of images from the IDR, and uses CellProfiler to segment the nuclei and nucleoli. You will also learn how to extract and export features at three different levels: image, nucleus, nucleolus.

Agenda

In this tutorial, we will cover:

  1. Get data
  2. Start CellProfiler pipeline
  3. Object segmentation
    1. Segment nuclei
    2. Segment nucleoli
    3. Combine segmentation masks
  4. Background extraction
    1. Identify the foreground
    2. Remove the foreground from the original image
  5. Feature extraction
    1. Measure the granularity, texture, intensity, size and shape
    2. Relate nucleoli to their parent nucleus
    3. Measure the image quality-related parameters
    4. Export the features to a CSV
  6. Run the CellProfiler pipeline

Get data

hands_on Hands-on: Download images from the IDR

  1. If you are logged in, create a new history for this tutorial.

    Tip: Creating a new history

    Click the new-history icon at the top of the history panel.

    If the new-history is missing:

    1. Click on the galaxy-gear icon (History options) on the top of the history panel
    2. Select the option Create New from the menu
  2. IDR Download tool with the following parameters:

    • “How would you like to specify the IDs of images to download?”: As text (comma-separated list of IDs or a valid IDR link)
    • “Image IDs to download”: http://idr.openmicroscopy.org/webclient/?show=image-295900|image-295905|image-295910|image-295918|image-295928|image-295934
    • “Name of the channel to download”: Cy3
    • “z-plane of images to download”: 0
    • “Image frame to download”: 0
    • “Limit the download to a selected region of the image?”: No, download the entire image plane
    • “Skip failed retrievals?”: Yes
    • “Download images in a tarball?”: Yes

    To get the valid IDR link, go to the dataset of interest in the IDR and select in the preview of a plate a few images ((figure Fig. 3 - 1)). Once you see them at the bottom of the page (figure Fig. 3 - 2), select them again and click the link button in the top-right corner of the right panel (figure Fig. 3 - 3).

    IDR interface.
    Figure 3: IDR interface

    comment Comment

    IMPORTANT: When the number of images to download is high, it is strongly recommended to enable the option “Download images in a tarball?” in order to improve the performance.

question Question

  1. Why are we taking the Cy3 channel in the example data?
  2. How could we download 100,000 images in one go?

solution Solution

  1. The Cy3 dye was used in the study to stain DNA. Since we want to segment the abscence of DNA, Cy3 is the only channel that we need to download from the IDR.

  2. We could upload a text file with the image ids of interest.

Start CellProfiler pipeline

The tool Starting Modules tool comprises the first 4 modules of the standalone CellProfiler. It needs to be the first tool of a workflow because it sets the naming and metadata handling for the rest of tools.

hands_on Hands-on: Specify metadata to CellProfiler

  1. Starting Modules tool with the following parameters:
    • Images
      • “Do you want to filter only the images? “: Select the images only
    • Metadata
      • “Do you want to extract the metadata?”: Yes, specify metadata
        • “Metadata extraction method”: Extract from file/folder names
      • “Metadata source”: File name
      • “Select the pattern to extract metadata from the file name”: field1__field2__field3__field4__field5__field6
      • “Extract metadata from”: All images
    • NamesAndTypes
      • “Process 3D”: No, do not process 3D data
      • “Assign a name to”: Give every image the same name
      • “Name to assign these images”: DNA
      • “Select the image type”: Grayscale image
        • “Set intensity range from”: Image metadata
    • Groups
      • “Do you want to group your images?”: Yes, group the images
      • “param”: field1

    comment Comment

    The images downloaded from the IDR are named following the pattern: plateName__imageID__cropX__cropY__cropWidth__cropHeight. These fields indicate to which plate the image belongs, what is the identifier of the image in the IDR, and the 4 cropping parameters selected. In our case, the upper-left corner (X, Y) and the width and height from there. We have a total of 6 metadata values encoded in the name of the file, separated by __. The pattern to extract our metadata from the file name properly is, therefore: field1__field2__field3__field4__field5__field6. It is important to keep in mind that, for CellProfiler, our plateName will be called field1, imageID will be field2, cropX will be field3, etc. These matches are relevant for a meaningful interpretation of the features.

Object segmentation

Segment nuclei

Since we are interested in segmenting the nucleoli, you may wonder why we need to segment nuclei first. There are several reasons for that:

  • Get the nuclei features. The intensity, size, shape, number of nucleoli per nucleus, etc. can be informative to study the nucleoli.

  • Avoid the detection of wrong spots. In a first segmentation, we detect the nuclei, while in the second pass we will segment the holes (nucleoli). The holes need to fall inside the nuclei, and hence the importance of having them segmented too.

In the first step, we will identify the nuclei that are complete, meaning that they are not touching the borders of the image.

hands_on Hands-on: Segment nuclei that are complete within the boundaries of the image

  1. IdentifyPrimaryObjects tool with the following parameters:
    • param-file “Select the input CellProfiler pipeline”: output_pipeline (output of Starting Modules tool)
    • “Use advanced settings?”: Yes, use advanced settings
      • “Enter the name of the input image (from NamesAndTypes)”: DNA
      • “Enter the name of the primary objects to be identified”: Nuclei
      • “Typical minimum diameter of objects, in pixel units (Min)”: 15
      • “Typical maximum diameter of objects, in pixel units (Max)”: 200
      • “Discard objects outside the diameter range?”: Yes
      • “Discard objects touching the border of the image?”: Yes
      • “Threshold strategy”: Global
        • “Thresholding method”: Otsu
          • “Two-class or three-class thresholding?”: Two classes
          • “Threshold correction factor”: 0.9
      • “Method to distinguish clumped objects”: Shape
        • “Method to draw dividing lines between clumped objects”: Shape
          • “Automatically calculate size of smoothing filter for declumping?”: Yes
          • “Automatically calculate minimum allowed distance between local maxima?”: Yes
      • “Handling of objects if excessive number of objects identified”: Continue

    comment Comment

    • The names entered to the input image and objects have to be consistent (case sensitive) with the ones in NamesAndTypes and the tools to follow.
    • The min and max diameter of the objects (Typical minimum diameter of objects, in pixel units (Min) and Typical minimum diameter of objects, in pixel units (Max)) will have to be adjusted to the resolution of the images.
    • The Threshold correction factor is used to adjust the threshold in Otsu’s method. The value depends on the images and it might be useful to test several values and pick the one that works best for a particular kind of image.

question Questions

We are using here Otsu’s thresholding method for segmentation. What other segmentation options are available? What is the difference between them?

solution Solution

In the global methods we have Manual, Measurement, Minimum cross entropy, Otsu and Robust background. For the adaptive ones we only have Otsu. Check the parameters’ help to get more information on each one.

From the previous tool, we got a group of objects (nuclei). Now, we want to export the segmentation masks as a single image to check how well the segmentation algorithm is performing. We also want to label the nuclei with their identifiers for future visual inspection of the results. The output of this step will look like (you will only get this image after running the whole workflow):

Identified nuclei with labels.
Figure 4: Identified nuclei with labels.

question Questions

Why are some nuclei not labeled in the image above?

solution Solution

We have indicated to the tool IdentifyPrimaryObjects tool that the nuclei that are either outside the diameter range or touching the border should be discarded.

hands_on Hands-on: Mask the nuclei detected

  1. ConvertObjectsToImage tool with the following parameters:
    • param-file “Select the input CellProfiler pipeline”: output_pipeline (output of IdentifyPrimaryObjects tool)
    • “Enter the name of the input objects you want to convert to an image”: Nuclei
    • “Enter the name of the resulting image”: MaskNuclei
    • “Select the color format”: Binary (black & white)
  2. DisplayDataOnImage tool with the following parameters:
    • param-file “Select the input CellProfiler pipeline”: output_pipeline (output of ConvertObjectsToImage tool)
    • “Display object or image measurements?”: Object
      • “Enter the name of the input objects”: Nuclei
      • “Measurement category”: Number
    • “Enter the name of the image on which to display the measurements”: DNA
    • “Display mode”: Text
      • “Text color”: #ff0000
      • “Number of decimals”: 0
    • “Name the output image that has the measurements displayed”: ImageDisplay
  3. SaveImages tool with the following parameters:
    • param-file “Select the input CellProfiler pipeline”: output_pipeline (output of DisplayDataOnImage tool)
    • “Select the type of image to save”: Image
      • “Saved the format to save the image(s)”: tiff
    • “Enter the name of the image to save”: ImageDisplay
    • “Select method for constructing file names”: From image filename
      • “Enter the image name (from NamesAndTypes) to be used as file prefix”: DNA
      • “Append a suffix to the image file name?”: Yes
        • “Text to append to the image name”: _nucleiNumbers
    • “Overwrite existing files without warning?”: Yes

    comment Comment

    The Text color parameter can be any of your choice, the hexa code is not really relevant. The only consideration is that it needs to be visible on top of the nuclei.

Segment nucleoli

The nucleoli are lacking intensity in the DNA staining and therefore, we need to enhance the black holes before masking.

hands_on Hands-on: Detect and mask dark holes

  1. EnhanceOrSuppressFeatures tool with the following parameters:
    • param-file “Select the input CellProfiler pipeline”: output_pipeline (output of SaveImages tool)
    • “Enter the name of the input image”: DNA
    • “Enter a name for the resulting image”: DNAdarkholes
    • “Select the operation”: Enhance
      • “Feature type”: Dark holes
        • “Maximum hole size”: 15
  2. MaskImage tool with the following parameters:
    • param-file “Select the input CellProfiler pipeline”: output_pipeline (output of EnhanceOrSuppressFeatures tool)
    • “Enter the name of the input image”: DNAdarkholes
    • “Enter the name of the resulting image”: MaskDNAdarkholes
    • “Use objects or an image as a mask?”: Objects
      • “Enter the name objects to mask the input image”: Nuclei
    • “Invert the mask?”: No

Now that we have all the holes in one mask, we can segment the nucleoli as individual objects in the same way as we did with the nuclei. All the nucleoli can be then combined into one single image.

hands_on Hands-on: Segment nucleoli as individual objects

  1. IdentifyPrimaryObjects tool with the following parameters:
    • param-file “Select the input CellProfiler pipeline”: output_pipeline (output of MaskImage tool)
    • “Use advanced settings?”: Yes, use advanced settings
      • “Enter the name of the input image (from NamesAndTypes)”: MaskDNAdarkholes
      • “Enter the name of the primary objects to be identified”: Nucleoli
      • “Typical minimum diameter of objects, in pixel units (Min)”: 2
      • “Typical maximum diameter of objects, in pixel units (Max)”: 15
      • “Discard objects touching the border of the image?”: Yes
      • “Threshold strategy”: Global
        • “Thresholding method”: Otsu
          • “Two-class or three-class thresholding?”: Two classes
          • “Threshold correction factor”: 0.9
      • “Method to distinguish clumped objects”: Shape
        • “Method to draw dividing lines between clumped objects”: Shape
          • “Automatically calculate size of smoothing filter for declumping?”: Yes
          • “Automatically calculate minimum allowed distance between local maxima?”: Yes
      • “Handling of objects if excessive number of objects identified”: Continue
  2. ConvertObjectsToImage tool with the following parameters:
    • param-file “Select the input CellProfiler pipeline”: output_pipeline (output of IdentifyPrimaryObjects tool)
    • “Enter the name of the input objects you want to convert to an image”: Nucleoli
    • “Enter the name of the resulting image”: MaskNucleoli
    • “Select the color format”: Binary (black & white)

Combine segmentation masks

We have now one segmentation mask per image with all the nuclei detected, MaskNuclei, and another one for the nucleoli, MaskNucleoli. These are binary masks in which the background is black and the objects detected are white. We would like to check whether both segmentation steps went well. That could be achieved by combining both of them (using different colors) into one image. Here, we are converting the nucleus mask to blue and the nucleoli to magenta. The outcome that you will obtain after the execution of the workflow will look like:

Combined mask for nuclei and nucleoli.
Figure 5: Combined segmentation masks for nuclei (blue) and nucleoli (magenta).

hands_on Hands-on: Convert and save the nuclei and nucleoli masks

  1. GrayToColor tool with the following parameters:
    • param-file “Select the input CellProfiler pipeline”: output_pipeline (output of ConvertObjectsToImage tool)
    • “Enter the name of the resulting image”: CombinedMask
    • “Select a color scheme”: RGB
      • “Enter the name of the image to be colored red”: MaskNucleoli
      • “Relative weight for the red image”: 0.8
      • “Enter the name of the image to be colored blue”: MaskNuclei
      • “Relative weight for the blue image”: 0.5
  2. SaveImages tool with the following parameters:
    • param-file “Select the input CellProfiler pipeline”: output_pipeline (output of GrayToColor tool)
    • “Select the type of image to save”: Image
      • “Saved the format to save the image(s)”: tiff
    • “Enter the name of the image to save”: CombinedMask
    • “Select method for constructing file names”: From image filename
      • “Enter the image name (from NamesAndTypes) to be used as file prefix”: DNA
      • “Append a suffix to the image file name?”: Yes
        • “Text to append to the image name”: _combinedMask
    • “Overwrite existing files without warning?”: Yes

    comment Comment

    • You can pick any other color of your choice, as long as the contrast is good enough to distinguish both objects.
    • We are saving here a tiff image but any other format of your choice would work too.

Background extraction

The background extraction is useful for quality control. For instance, in a high-exposed or low-contrast image, the nuclei will not be very different from the background and that may lead to the wrong segmentation.

To extract the background, we first need to get the foreground and subtract it from the original image. We already have the nuclei mask, however, we excluded incomplete nuclei, i.e., those touching the borders or those which sizes are outside the specified range. This means that the mask is not covering all the nuclei and we need to get rid of the constraints to get the complete foreground. At this stage, we want to detect everything with a certain intensity (foreground) and subtract it from the complete image to get the background.

Identify the foreground

hands_on Hands-on: Segment all nuclei

  1. IdentifyPrimaryObjects tool with the following parameters:
    • param-file “Select the input CellProfiler pipeline”: output_pipeline (output of SaveImages tool)
    • “Use advanced settings?”: Yes, use advanced settings
      • “Enter the name of the input image (from NamesAndTypes)”: DNA
      • “Enter the name of the primary objects to be identified”: NucleiIncludingTouchingBorders
      • “Typical minimum diameter of objects, in pixel units (Min)”: 15
      • “Typical maximum diameter of objects, in pixel units (Max)”: 200
      • “Discard objects outside the diameter range?”: No
      • “Discard objects touching the border of the image?”: No
      • “Threshold strategy”: Global
        • “Thresholding method”: Otsu
          • “Two-class or three-class thresholding?”: Two classes
          • “Threshold correction factor”: 0.9
      • “Method to distinguish clumped objects”: Shape
        • “Method to draw dividing lines between clumped objects”: Shape
          • “Automatically calculate size of smoothing filter for declumping?”: Yes
          • “Automatically calculate minimum allowed distance between local maxima?”: Yes
      • “Handling of objects if excessive number of objects identified”: Continue
  2. ConvertObjectsToImage tool with the following parameters:
    • param-file “Select the input CellProfiler pipeline”: output_pipeline (output of IdentifyPrimaryObjects tool)
    • “Enter the name of the input objects you want to convert to an image”: NucleiIncludingTouchingBorders
    • “Enter the name of the resulting image”: Image_NucleiIncludingTouchingBorders
    • “Select the color format”: Binary (black & white)

Remove the foreground from the original image

hands_on Hands-on: Subtract the foreground from the original image

ImageMath tool with the following parameters:

  • param-file “Select the input CellProfiler pipeline”: output_pipeline (output of ConvertObjectsToImage tool)
  • “Enter a name for the resulting image”: BG
  • “Operation”: Subtract
    • In “First Image”:
      • “Image or measurement?”: Image
        • “Enter the name of the first image”: DNA
    • In “Second Image”:
      • “Image or measurement?”: Image
        • “Enter the name of the second image”: Image_NucleiIncludingTouchingBorders
  • “Ignore the image masks?”: No

Feature extraction

Now that we have the objects of interest segmented and the background extracted, we can start measuring parameters on them. In particular it is relevant to:

  • measure the granularity, texture and intensity of the nuclei and original image,
  • measure the intensity of the original image, the background and foreground,
  • measure the size and shape of nuclei and nucleoli,
  • assess the quality of the original image,
  • measure the area ocuppied by nuclei and nucleoli.

A step that requires special attention is the relationship nucleolus-nucleus. This is useful to compute statistics on the number of nucleoli by nucleus.

comment Comment

The order in which the tools in this section are chained is not relevant for the outcome.

Measure the granularity, texture, intensity, size and shape

hands_on Hands-on: Measure the granularity, texture, intensity, size and shape

  1. MeasureGranularity tool with the following parameters:
    • param-file “Select the input CellProfiler pipeline”: output_pipeline (output of ImageMath tool)
    • In “new image”:
      • param-repeat “Insert new image”
        • “Enter the name of a greyscale image to measure”: DNA
  2. MeasureTexture tool with the following parameters:
    • param-file “Select the input CellProfiler pipeline”: output_pipeline (output of MeasureGranularity tool)
    • In “new image”:
      • param-repeat “Insert new image”
        • “Enter the name of an image to measure”: DNA
    • “Measure images or objects?”: Objects
      • In “new object”:
        • param-repeat “Insert new object”
          • “Enter the names of the objects to measure”: Nuclei
  3. MeasureObjectIntensity tool with the following parameters:
    • param-file “Select the input CellProfiler pipeline”: output_pipeline (output of MeasureTexture tool)
    • In “new image”:
      • param-repeat “Insert new image”
        • “Enter the name of an image to measure”: DNA
    • In “new object”:
      • param-repeat “Insert new object”
        • “Enter the name of the objects to measure”: Nuclei
  4. MeasureObjectSizeShape tool with the following parameters:
    • param-file “Select the input CellProfiler pipeline”: output_pipeline (output of MeasureObjectIntensity tool)
    • In “new object”:
      • param-repeat “Insert new object”
        • “Enter the name of the object to measure”: Nuclei
      • param-repeat “Insert new object”
        • “Enter the name of the object to measure”: Nucleoli

question Questions

Why are we measuring the granularity, texture and intensity of the original image and the nuclei only?

solution Solution

The nucleoli was not stained in the DNA channel and hence, the granularity, texture and intensity are constant values.

Relate nucleoli to their parent nucleus

It might be relevant to compute some statistics on the number of nucleoli inside each nucleus. CellProfiler has a very interesting module to relate both objects, in which each one of the nucleoli is assigned an identifier and linked to the identifier of its parent nucleus.

hands_on Hands-on: Relate nucleoli to their parent nucleus

  1. RelateObjects tool with the following parameters:
    • param-file “Select the input CellProfiler pipeline”: output_pipeline (output of MeasureObjectSizeShape tool)
    • “Parent objects”: Nuclei
    • “Child objects”: Nucleoli
    • “Calculate child-parent distances?”: Both
      • “Calculate distances to other parents?”: No
    • “Do you want to save the children with parents as a new object set?”: Yes

In this section, we will measure the image quality, the area occupied by the nuclei and nucleoli in the original image and the intensity of the foreground and background.

hands_on Hands-on: Measure the image quality

  1. MeasureImageQuality tool with the following parameters:
    • param-file “Select the input CellProfiler pipeline”: output_pipeline (output of RelateObjects tool)
    • “Calculate blur metrics?”: Yes
    • “Calculate thresholds?”: Yes
      • “Use all thresholding methods?”: No
        • In “new threshold method”:
          • param-repeat “Insert new threshold method”
            • “Select a thresholding method”: Otsu
              • “Two-class or three-class thresholding?”: Two classes
  2. MeasureImageAreaOccupied tool with the following parameters:
    • param-file “Select the input CellProfiler pipeline”: output_pipeline (output of MeasureImageQuality tool)
    • In “new area”:
      • param-repeat “Insert new area”
        • “Measure the area occupied in a binary image, or in objects?”: Objects
          • “Enter the name of the objects to measure”: Nuclei
      • param-repeat “Insert new area”
        • “Measure the area occupied in a binary image, or in objects?”: Objects
          • “Enter the name of the objects to measure”: Nucleoli
  3. MeasureImageIntensity tool with the following parameters:
    • param-file “Select the input CellProfiler pipeline”: output_pipeline (output of MeasureImageAreaOccupied tool)
    • In “new image”:
      • param-repeat “Insert new image”
        • “Enter the name of the image to measure”: DNA
        • “Measure the intensity only from areas enclosed by objects?”: No
      • param-repeat “Insert new image”
        • “Enter the name of the image to measure”: DNA
        • “Measure the intensity only from areas enclosed by objects?”: Yes
          • “Enter the name of the objects that the intensity will be aggregated within”: Nuclei
      • param-repeat “Insert new image”
        • “Enter the name of the image to measure”: BG
        • “Measure the intensity only from areas enclosed by objects?”: No

question Questions

In step 3, we are measuring the area occupied by objects for the DNA twice. Why is that?

solution Solution

We want to have the global intensity of the image, so the parameter “Measure the intensity only from areas enclosed by objects?” has to be set to No. However, we also want to obtain the intensity of the foreground, so only the parts of the images that are enclosed by nuclei.

Export the features to a CSV

All the parameters that we have measured related to the images and objects need to be exported to a file for each one of 6 example images analysed.

hands_on Hands-on: Export features

ExportToSpreadsheet tool with the following parameters:

  • param-file “Select the input CellProfiler pipeline”: output_pipeline (output of MeasureImageIntensity tool)
  • “Select the column delimiter”: Tab
  • “Add a prefix to file names?”: Do not add prefix to the file name
  • “Create a GenePattern GCT file?”: No
  • “Export all measurement types?”: Yes

Run the CellProfiler pipeline

All the steps in our workflow (except for the IDR download tool) have been passing through an output_pipeline as a parameter. This was the way to assemble all the modules from CellProfiler, now we can run all of them together!

hands_on Hands-on: Run CellProfiler pipeline

CellProfiler tool with the following parameters:

  • param-file “Pipeline file”: output_pipeline (output of ExportToSpreadsheet tool)
  • “Are the input images packed into a tar archive?”: Yes
    • param-file “A tarball of images”: output_tar (output of IDR Download tool)
  • “Detailed logging file?”: Yes

comment Comment

This is the only time-consuming step of the workflow, as it needs to perform the whole analysis in the input dataset.

Conclusion

In this tutorial, you have downloaded images from a public image repository into your Galaxy history. After that, you have built and run a typical image analysis pipeline, composed of segmentation of several objects and feature extraction. As an outcome, you got plenty of features to analyse! And some masks to check that the segmentation algorithms worked as expected. Now you are ready to perform your biological data analysis!

Key points

  • Galaxy workflows can download images from the IDR, selecting specific channels, time points, z-stack positions and crop the image in different ways.

  • CellProfiler in Galaxy can segment and extract features of any object of interest.

  • The features and masks can be exported for further analysis.

Frequently Asked Questions

Have questions about this tutorial? Check out the tutorial FAQ page or the FAQ page for the Imaging topic to see if your question is listed there. If not, please ask your question on the GTN Gitter Channel or the Galaxy Help Forum

Useful literature

Further information, including links to documentation and original publications, regarding the tools, analysis techniques and the interpretation of results described in this tutorial can be found here.

References

  1. Integration of biological data by kernels on graph nodes allows prediction of new genes involved in mitotic chromosome condensation, 2014 Mol Biol Cell. 2014;25(16):2522-2536: 10.1091/mbc.E13-04-0221
  2. CellProfiler 3.0: Next-generation image processing for biology, 2018 PLoS Biol. 16(7):e2005970: 10.1371/journal.pbio.2005970 https://journals.plos.org/plosbiology/article?id=10.1371/journal.pbio.2005970

Feedback

Did you use this material as an instructor? Feel free to give us feedback on how it went.
Did you use this material as a learner or student? Click the form below to leave feedback.

Click here to load Google feedback frame

Citing this Tutorial

  1. Beatriz Serrano-Solano, Jean-Karim Hériché, 2021 Nucleoli segmentation and feature extraction using CellProfiler (Galaxy Training Materials). https://training.galaxyproject.org/training-material/topics/imaging/tutorials/tutorial-CP/tutorial.html Online; accessed TODAY
  2. Batut et al., 2018 Community-Driven Data Analysis Training for Biology Cell Systems 10.1016/j.cels.2018.05.012

details BibTeX

@misc{imaging-tutorial-CP,
author = "Beatriz Serrano-Solano and Jean-Karim Hériché",
title = "Nucleoli segmentation and feature extraction using CellProfiler (Galaxy Training Materials)",
year = "2021",
month = "03",
day = "12"
url = "\url{https://training.galaxyproject.org/training-material/topics/imaging/tutorials/tutorial-CP/tutorial.html}",
note = "[Online; accessed TODAY]"
}
@article{Batut_2018,
    doi = {10.1016/j.cels.2018.05.012},
    url = {https://doi.org/10.1016%2Fj.cels.2018.05.012},
    year = 2018,
    month = {jun},
    publisher = {Elsevier {BV}},
    volume = {6},
    number = {6},
    pages = {752--758.e1},
    author = {B{\'{e}}r{\'{e}}nice Batut and Saskia Hiltemann and Andrea Bagnacani and Dannon Baker and Vivek Bhardwaj and Clemens Blank and Anthony Bretaudeau and Loraine Brillet-Gu{\'{e}}guen and Martin {\v{C}}ech and John Chilton and Dave Clements and Olivia Doppelt-Azeroual and Anika Erxleben and Mallory Ann Freeberg and Simon Gladman and Youri Hoogstrate and Hans-Rudolf Hotz and Torsten Houwaart and Pratik Jagtap and Delphine Larivi{\`{e}}re and Gildas Le Corguill{\'{e}} and Thomas Manke and Fabien Mareuil and Fidel Ram{\'{\i}}rez and Devon Ryan and Florian Christoph Sigloch and Nicola Soranzo and Joachim Wolff and Pavankumar Videm and Markus Wolfien and Aisanjiang Wubuli and Dilmurat Yusuf and James Taylor and Rolf Backofen and Anton Nekrutenko and Björn Grüning},
    title = {Community-Driven Data Analysis Training for Biology},
    journal = {Cell Systems}
}
                

Congratulations on successfully completing this tutorial!