i.segment.gsoc

Outputs a single segmented map (raster) based on input values in an image group.

Command linePython (grass.tools)Python (grass.script)

i.segment.gsoc [-tdwfl] group=name output=name threshold=float method=string similarity=string minsize=integer radioweight=float smoothweight=float [seeds=name] [bounds=name] [endt=integer] [final_mean=name] [--overwrite] [--verbose] [--quiet] [--qq] [--ui]

Example:

i.segment.gsoc group=name output=name threshold=0.0 method=region_growing similarity=euclidean minsize=1 radioweight=0.9 smoothweight=0.5

grass.tools.Tools.i_segment_gsoc(group, output, threshold, method="region_growing", similarity="euclidean", minsize=1, radioweight=0.9, smoothweight=0.5, seeds=None, bounds=None, endt=1000, final_mean=None, flags=None, overwrite=None, verbose=None, quiet=None, superquiet=None)

Example:

tools = Tools()
tools.i_segment_gsoc(group="name", output="name", threshold=0.0, method="region_growing", similarity="euclidean", minsize=1, radioweight=0.9, smoothweight=0.5)

This grass.tools API is experimental in version 8.5 and expected to be stable in version 8.6.

grass.script.run_command("i.segment.gsoc", group, output, threshold, method="region_growing", similarity="euclidean", minsize=1, radioweight=0.9, smoothweight=0.5, seeds=None, bounds=None, endt=1000, final_mean=None, flags=None, overwrite=None, verbose=None, quiet=None, superquiet=None)

Example:

gs.run_command("i.segment.gsoc", group="name", output="name", threshold=0.0, method="region_growing", similarity="euclidean", minsize=1, radioweight=0.9, smoothweight=0.5)

Parameters

Command linePython (grass.tools)Python (grass.script)

group=name [required]
    Name of input imagery group
output=name [required]
    Name for output raster map
threshold=float [required]
    Similarity threshold.
method=string [required]
    Segmentation method.
    Allowed values: region_growing
    Default: region_growing
similarity=string [required]
    Similarity calculation method.
    Allowed values: euclidean, manhattan
    Default: euclidean
minsize=integer [required]
    Minimum number of cells in a segment.
    The final iteration will ignore the threshold for any segments with fewer pixels.
    Allowed values: 1-100000
    Default: 1
radioweight=float [required]
    Importance of radiometric (input raseters) values relative to shape
    Allowed values: 0-1
    Default: 0.9
smoothweight=float [required]
    Importance of smoothness relative to compactness
    Allowed values: 0-1
    Default: 0.5
seeds=name
    Optional raster map with starting seeds.
    Pixel values with positive integers are used as starting seeds.
bounds=name
    Optional bounding/constraining raster map
    Pixels with the same integer value will be segmented independent of the others.
endt=integer
    Maximum number of passes (time steps) to complete.
    Default: 1000
final_mean=name
    Save the final mean values for the first band in the imagery group.
-t
    Estimate a threshold based on input image group and exit.
-d
    Use 8 neighbors (3x3 neighborhood) instead of the default 4 neighbors for each pixel.
-w
    Weighted input, don't perform the default scaling of input maps.
-f
    Final forced merge only (skip the growing portion of the algorithm.
-l
    segments are limited to be included in only one merge per pass
--overwrite
    Allow output files to overwrite existing files
--help
    Print usage summary
--verbose
    Verbose module output
--quiet
    Quiet module output
--qq
    Very quiet module output
--ui
    Force launching GUI dialog

group : str, required
    Name of input imagery group
    Used as: input, group, name
output : str | type(np.ndarray) | type(np.array) | type(gs.array.array), required
    Name for output raster map
    Used as: output, raster, name
threshold : float, required
    Similarity threshold.
method : str, required
    Segmentation method.
    Allowed values: region_growing
    Default: region_growing
similarity : str, required
    Similarity calculation method.
    Allowed values: euclidean, manhattan
    Default: euclidean
minsize : int, required
    Minimum number of cells in a segment.
    The final iteration will ignore the threshold for any segments with fewer pixels.
    Allowed values: 1-100000
    Default: 1
radioweight : float, required
    Importance of radiometric (input raseters) values relative to shape
    Allowed values: 0-1
    Default: 0.9
smoothweight : float, required
    Importance of smoothness relative to compactness
    Allowed values: 0-1
    Default: 0.5
seeds : str | np.ndarray, optional
    Optional raster map with starting seeds.
    Pixel values with positive integers are used as starting seeds.
    Used as: input, raster, name
bounds : str | np.ndarray, optional
    Optional bounding/constraining raster map
    Pixels with the same integer value will be segmented independent of the others.
    Used as: input, raster, name
endt : int, optional
    Maximum number of passes (time steps) to complete.
    Default: 1000
final_mean : str | type(np.ndarray) | type(np.array) | type(gs.array.array), optional
    Save the final mean values for the first band in the imagery group.
    Used as: output, raster, name
flags : str, optional
    Allowed values: t, d, w, f, l
    t
        Estimate a threshold based on input image group and exit.
    d
        Use 8 neighbors (3x3 neighborhood) instead of the default 4 neighbors for each pixel.
    w
        Weighted input, don't perform the default scaling of input maps.
    f
        Final forced merge only (skip the growing portion of the algorithm.
    l
        segments are limited to be included in only one merge per pass
overwrite : bool, optional
    Allow output files to overwrite existing files
    Default: None
verbose : bool, optional
    Verbose module output
    Default: None
quiet : bool, optional
    Quiet module output
    Default: None
superquiet : bool, optional
    Very quiet module output
    Default: None

Returns:

result : grass.tools.support.ToolResult | np.ndarray | tuple[np.ndarray] | None
If the tool produces text as standard output, a ToolResult object will be returned. Otherwise, None will be returned. If an array type (e.g., np.ndarray) is used for one of the raster outputs, the result will be an array and will have the shape corresponding to the computational region. If an array type is used for more than one raster output, the result will be a tuple of arrays.

Raises:

grass.tools.ToolError: When the tool ended with an error.

group : str, required
    Name of input imagery group
    Used as: input, group, name
output : str, required
    Name for output raster map
    Used as: output, raster, name
threshold : float, required
    Similarity threshold.
method : str, required
    Segmentation method.
    Allowed values: region_growing
    Default: region_growing
similarity : str, required
    Similarity calculation method.
    Allowed values: euclidean, manhattan
    Default: euclidean
minsize : int, required
    Minimum number of cells in a segment.
    The final iteration will ignore the threshold for any segments with fewer pixels.
    Allowed values: 1-100000
    Default: 1
radioweight : float, required
    Importance of radiometric (input raseters) values relative to shape
    Allowed values: 0-1
    Default: 0.9
smoothweight : float, required
    Importance of smoothness relative to compactness
    Allowed values: 0-1
    Default: 0.5
seeds : str, optional
    Optional raster map with starting seeds.
    Pixel values with positive integers are used as starting seeds.
    Used as: input, raster, name
bounds : str, optional
    Optional bounding/constraining raster map
    Pixels with the same integer value will be segmented independent of the others.
    Used as: input, raster, name
endt : int, optional
    Maximum number of passes (time steps) to complete.
    Default: 1000
final_mean : str, optional
    Save the final mean values for the first band in the imagery group.
    Used as: output, raster, name
flags : str, optional
    Allowed values: t, d, w, f, l
    t
        Estimate a threshold based on input image group and exit.
    d
        Use 8 neighbors (3x3 neighborhood) instead of the default 4 neighbors for each pixel.
    w
        Weighted input, don't perform the default scaling of input maps.
    f
        Final forced merge only (skip the growing portion of the algorithm.
    l
        segments are limited to be included in only one merge per pass
overwrite : bool, optional
    Allow output files to overwrite existing files
    Default: None
verbose : bool, optional
    Verbose module output
    Default: None
quiet : bool, optional
    Quiet module output
    Default: None
superquiet : bool, optional
    Very quiet module output
    Default: None

DESCRIPTION

Image segmentation is the process of grouping similar pixels into unique segments. Boundary and region based algorithms are described in the literature, currently a region growing and merging algorithm is implemented. Each grouping (usually refered to as objects or segments) found during the segmentation process is given a unique ID and is a collection of contiguous pixels meeting some criteria. (Note the contrast with image classification, where continuity and spatial characteristics are not important, but rather only the spectral similarity.) The results can be useful on their own, or used as a preprocessing step for image classification. The segmentation preprocessing step can reduce noise and speed up the classification.

NOTES

Region Growing and Merging

This segmentation algorithm sequentially examines all current segments in the map. The similarity between the current segment and each of its neighbors is calculated according to the given distance formula. Segments will be merged if they meet a number of criteria, including: 1. The pair is mutually most similar to each other (the similarity distance will be smaller then all other neighbors), and 2. The similarity must be lower then the input threshold. All segments are checked once per pass. The process is repeated until no merges are made during a complete pass.

Similarity and Threshold

The similarity between segments and unmerged pixels is used to determine which are merged. The Euclidean version uses the radiometric distance between the two segments and also the shape characteristics. The Manhatten calculations currently only uses only the radiometric distance between the two segments, but eventually shape characteristics will be included as well. NOTE: Closer/smaller distances mean a lower value for the similarity indicates a closer match, with a similarity score of zero for identical pixels.

During normal processing, merges are only allowed when the similarity between two segments is lower then the calculated threshold value. During the final pass, however, if a minimum segment size of 2 or larger is given with the minsize parameter, segments with a smaller pixel count will be merged with their most similar neighbor even if the similarity is greater then the threshold.

Unless the -w flag for weighted data is used, the threshold should be set by the user between 0 and 1.0. A threshold of 0 would allow only identical valued pixels to be merged, while a threshold of 1 would allow everything to be merged.

The threshold will be multiplied by the number of rasters included in the image group. This will allow the same threshold to achieve similar segmentation results when the number of rasters in the image group varies.

The -t flag will estimate the threshold, it is calculated at 3% of the range of data in the imagery group. Initial empirical tests indicate threshold values of 1% to 5% are reasonable places to start.

Calculation Formulas

Both Euclidean and Manhattan distances use the normal definition, considering each raster in the image group as a dimension. Furthermore, the Euclidean calculation also takes into account the shape characteristics of the segments. The normal distances are multiplied by the input radiometric weight. Next an additional contribution is added: (1-radioweight) * {smoothness * smoothness weight + compactness * (1-smoothness weight)}, where compactness = the Perimeter Length / sqrt( Area ) and smoothness = Perimeter Length / the Bounding Box. The perimeter length is estimated as the number of pixel sides the segment has.

Seeds

The seeds map can be used to provide either seed pixels (random or selected points from which to start the segmentation process) or seed segments (results of previous segmentations or classifications). The different approaches are automatically detected by the program: any pixels that have identical seed values and are contiguous will be lumped into a single segment ID.

It is expected that the minsize will be set to 1 if a seed map is used, but the program will allow other values to be used. If both options are used, the final iteration that ignores the threshold also will ignore the seed map and force merges for all pixels (not just segments that have grown/merged from the seeds).

Maximum number of starting segments

For the region growing algorithm without starting seeds, each pixel is sequentially numbered. The current limit with CELL storage is 2 billion starting segment ID's. If the initial map has a larger number of non-null pixels, there are two workarounds:

1. Use starting seed pixels. (Maximum 2 billion pixels can be labeled with positive integers.)

2. Use starting seed segments. (By initial classification or other methods.)

Boundary Constraints

Boundary constraints limit the adjacency of pixels and segments. Each unique value present in the bounds raster are considered as a MASK. Thus no segments in the final segmentated map will cross a boundary, even if their spectral data is very similar.

Minimum Segment Size

To reduce the salt and pepper affect, a minsize greater than 1 will add one additional pass to the processing. During the final pass, the threshold is ignored for any segments smaller then the set size, thus forcing very small segments to merge with their most similar neighbor.

EXAMPLES

This example uses the ortho photograph included in the NC Sample Dataset. Set up an imagery group:

i.group group=ortho_group input=ortho_2001_t792_1m@PERMANENT

Because the segmentation process is computationally expensive, start with a small processing area to confirm if the segmentation results meet your requirements. Some manual adjustment of the threshold may be required.

g.region raster=ortho_2001_t792_1m@PERMANENT n=220400 s=220200 e=639000 w=638800

Try out a first threshold and check the results.

i.segment -w -l group=ortho_group output=ortho_segs threshold=4 \
          method=region_growing

From a visual inspection, it seems this results in oversegmentation. Increasing the threshold:

i.segment -w -l --overwrite group=ortho_group output=ortho_segs \
          threshold=10 method=region_growing

This looks better. There is some noise in the image, lets next force all segments smaller then 5 pixels to be merged into their most similar neighbor (even if they are less similar then required by our threshold):

i.segment -w -l --overwrite group=ortho_group output=ortho_segs \
          threshold=10 method=region_growing minsize=5

Each of these segmentation steps took less then 1 minute on a decent machine. Now that we are satisfied with the settings, we'll process the entire image:

g.region raster=ortho_2001_t792_1m@PERMANENT

i.segment -w -l --overwrite group=ortho_group output=ortho_segs \
          threshold=10 method=region_growing minsize=5 endt=5000

Processing the entire ortho image (over 9 million pixels) took about a day.

TODO

Functionality

Further testing of the shape characteristics (smoothness, compactness), if it looks good it should be added to the Manhatten option. in progress
Malahanobis distance for the similarity calculation.

Use of Segmentation Results

Improve the optional output from this module, or better yet, add a module for i.segment.metrics.
Providing updates to i.maxlik to ensure the segmentation output can be used as input for the existing classification functionality.
Integration/workflow for r.fuzzy.

Speed

See create_isegs.c

Memory

User input for how much RAM can be used.
Check input map type(s), currently storing in DCELL sized SEG file, could reduce this dynamically depending on input map time. (Could only reduce to FCELL, since will be storing mean we can't use CELL. Might not be worth the added code complexity.)

BUGS

If the seeds map is used to give starting seed segments, the segments are renumbered starting from 1. There is a chance a segment could be renumbered to a seed value that has not yet been processed. If they happen to be adjacent, they would be merged. (Possible fixes: a. use a processing flag to make sure the pixels hasn't been previously used, or b. use negative segment ID's as a placeholder and negate all values after the seed map has been processed.)

REFERENCES

This project was first developed during GSoC 2012. Project documentation, Image Segmentation references, and other information is at the project wiki.

Information about classification in GRASS GIS is also available on the wiki.

AUTHORS

Eric Momsen - North Dakota State University

GSoC mentor: Markus Metz

SOURCE CODE

Available at: i.segment.gsoc source code (history)
Latest change: Wednesday Jul 01 11:49:28 2026 in commit 7caa290

i.segment.gsoc

Parameters

DESCRIPTION

NOTES

Region Growing and Merging

Similarity and Threshold

Calculation Formulas

Seeds

Maximum number of starting segments

Boundary Constraints

Minimum Segment Size

EXAMPLES

TODO

Functionality

Use of Segmentation Results

Speed

Memory

BUGS

REFERENCES

SEE ALSO

AUTHORS

SOURCE CODE