OBIA conception documentation

Object Based Image Analysis (OBIA)

Note

This page describes the technical choices done during the OBIA implementation. To learn how use it, refer to the tutorial.

Managing input data

Preprocess sensor data

As in iota2 classification workflow, the first steps are dedicated to preprocess the data. Available dates are searched for to initialize the gapfilling, common masks are produced for all enabled sensors. Validity masks are produced according to cloud coverage. All these steps are also required for OBIA as they are generic to handle all information available.

Preprocess input segmentation

Here really begins the OBIA workflow. Two cases must be handled for this step:

The user provides a segmentation over all tiles required.
The user wants iota2 to produce the segmentation.

User provided segmentation

Like all user provided data, the first step is to ensure that the data will be compatible with the iota2 workflow. The segmentation can be a tif image, or a vector file like sqlite or shapefile. GML files are forbidden as most vector tools are unable to find the associated projection.

The segmentation can be exhaustive over all the required zone, i.e segments over all tiles or can be sparse. In this case, the output classification will only be produced for segments in the original segmentation.

The first step is to split the segmentation over all tiles. This is required as iota2 works by tiles. Border segments are cut but there is a covering zone between tiles. From now until the production of the final classification, the original segmentation will not be used.

If the segmentation is a raster, it is vectorized after splitting over tiles. Then, a new column using a iota2 constant name is created assigning a different identifier to all segment. This is required to avoid a segment cut in several parts to be deleted later.

iota2 provided segmentation

In this case, the chain will use the full time series and process SLIC (Simple Linear Iterative Clustering) for each tile. At the end of the process, one segmentation by tile is provided as raster and vectorized. Then we have the same data available than the previous case. The main difference will be at the end of the workflow when producing the final classification.

Handle eco-climatic region

Eco-climatic or region of interest files are very important in the OBIA workflow. Indeed, they impact the learning step by dividing samples by similarity. Unlike the pixel classification, samples are not split at the region limits, but all the objects are associated to a single region. This makes possible to create much more flexible boundaries between regions.

Then it is necessary to find the intersection between segments and region. To this end, the following workflow is applied:

Compute intersections between common mask of the tile and the region shapefile.
Compute intersections between the region grid and the segmentation.
Keep only one intersection between a segment and one region.

If no region are provided, all the segment are assigned to the same region.

Compute tile envelope

The tiles envelopes provided by iota2 are computed using tile priority and validity masks. In pixel classification, they indicate from which tile a pixel comes from in the final classification. The envelope can be not squared, and be a multipolygon with very small area around the central scene of the tile.

The tile envelope will be used in the lasts steps to reassemble the classification.

Learning steps

Split into learning and validation

In classification process, the split between learning and validation polygon is a common step. At the end of this process, the reference data is split over each tile and region using the tile envelope. This process is the same as in pixel classification.

Find intersection between segmentation and reference data

The learning samples are already split in tiles and region, then we must now associate each one with a segment. To this end, the region field in learning samples is removed and only the one attributed in the segmentation is used.

In case of duplicates:

if a learning polygon is covering more than one segment, it will be split over all segments intersecting.
if a segment intersects more than one learning polygon it is removed, as we can not choose which class is the good one.

Two modes are available at this point:

Clip learning samples and segment, only the common part of both are keep.
Keep the entire segment which intersect the learning samples.

Use the parameter full_learn_segment to manage these options.

Computing zonal statistics

Once the learning data are processed, zonal statistics can be processed too. OTB ZonalStatistics is used to provide five statistics:

mean
min
max
std
count

Two issues arise:

the number of features that will be produced.
the ram required to compute zonal statistics over an entire tile.

Indeed, for instance a Sentinel-2 full year time series counts 468 features. Then zonal statistics can produce 2340 features. However, a sqlite (main format used in iota2) can not have more than 2000 columns and shapefiles are limited to 2go.

To solve this two issues, the ESRI shapefile format is used, and a new grid is computed over the tile. The grid is computed by region in the tile. Then zonal statistics are produced over all subgrid. Each processing provides one samples file for the learning step.

Note about the grid used

The grid size must be parametrized by the user depending of its need and resources, using buffer_size parameter. By default, the grid has a 20 kilometers squared resolution. Tests have been made using the shapefile format with three statistics and a 468 bands S2 time series. In this configuration, it is required to have at least 80go of RAM to launch the process. By reducing the buffer size, the RAM requirement decreases, but processing time increases.

Learn one model by region

Once zonal statistics are computed, a model is produced for each region.

The only warning in this step is about the total number of samples files for a given region. This is directly related to the grid defined in the previous step. In addition, the OTB TrainVectorClassifier requires that all features are contained in one file (columns name) but the samples can be split over several files. Again the limitation will be the total RAM required to learn the model.

Classifications step

Splitting tile between regions

Same situation as for learning, an entire tile can be handled directly in RAM, but only one part of the tile is covered by a region.

A grid is designed too. At the difference of the learning step, the grid can be larger as only one file is processed at the same time.

Classify

The classification step is composed of six parts:

Design the grid by region and write the corresponding shapefiles.
Rasterize the shapefile to be faster in zonal statistics computation.
Compute the zonal statistics (which produces an xml).
Join the statistics and the geometry provided by the shapefile.
Classify the shapefile containing statistics.
Keep only columns of interest for the final product: original segmentation identifier, geometry, predicted class and confidence.

Outputs production

Produce the final land cover map

Using all vector files produced by the classification step, two cases are available:

User provided segmentation.
iota2 provided segmentation.

User provided segmentation

Warning

This is not yet implemented. This can improve the output map visual quality.

In this case, the first step is to merge all shapefiles into a dataframe. Then it is possible to associate the label and confidence to the initial geometry. To choose which segment is kept among the duplicates (common part between tiles), the envelope is used. As for region, the first intersection found is kept.

iota2 provided segmentation

Note

In all case, this option is currently used.

In this case, the main difficulty is to ensure the coherence of the geometry between the tiles. Without an efficient way to generate segmentation across multiple tiles without having to merge images beforehand, there is no way to provide uniform segmentation.

The solution proposed is to clip each tile according is tile envelope. For each envelope boundary, there are oversegmentation with the risk that the left object has a different label than the right one.

Validation

To validate the produced map, the input reference data was split between learning and validation dataset. As for learning, the validation polygons are clipped according to the segmentation with the same rules. If a validation polygon intersects at least two segments, it is clipped. If a segment intersected at least two validation polygons, the validation polygons are removed from the dataset.

This can be improved by ensuring that all validation polygons which intersect the segment have different class labels.

Once the validation data has been processed, one polygon corresponds to one validation sample. Then by counting and comparing the prediction with the reference data, a confusion matrix can be written. With the confusion matrix, the standard classification metrics are computed: kappa, overall accuracy, Fscore-1, precision, recall.

As OBIA is an object approach, the visual validation is important too. The geometry of the final classification depends on the input segmentation. In the case of SLIC, superpixels are used. A possible post processing could be to aggregate adjacent superpixels with same label to smooth the map.

Known issues

Learning and Validation samples number

In OBIA, the number of learning and validation polygons can vary. This comes from the intersection between the segmentation and the reference data. To know the exact number of samples used for learning, look at the files in learningSamples folder. The samples of each file by seed must be summed up to have the total number.

For validation, the total number of samples in provided by summing all elements in the confusion matrix.

Holes in map

This issue can appear if the segmentation does not respect the ratio between object size and pixels. From OTB application, if a segment has an area lower than 2 * pixel area, the application return nan or does not return statistics for this segment.

OTB error

An error std::Bad_Alloc can sometimes happen. Relaunching the chain with the restart option seem to be sufficient to pass over this error.