Area-to-Point Poisson Kriging#

To start this tutorial, it is required to understand concepts of the Ordinary Kriging and Semivariogram Regularization. The good idea is to end Poisson Kriging - Centroid based and Poisson Kriging - Area to Area tutorials too.

The Poisson Kriging technique is used to model spatial counts over areas (blocks) of varying shapes and sizes. The counts could be number of infections, and area is a county. Following this example, we will transform the areal aggregates (rates - number of cases divided by population) of Breast Cancer in the Northeastern US counties. After transformation, we will get smooth map supported by point coordinates defined by the population grid (point support).

Prerequisites#

Domain:
- kriging
- Poisson kriging and Poisson distribution
- rates
Package:
- Deconvolution()
- ordinary_kriging()
- TheoreticalVariogram()
- centroid_poisson_kriging()
- area_to_area_pk()
Programming:
- Python basics
- pandas basics

Table of contents#

Prepare data
Load regularized semivariogram model
Smooth blocks.
Export smoothed map.

1. Prepare data#

[1]:

import geopandas as gpd
import numpy as np

from pyinterpolate import TheoreticalVariogram, Blocks, PointSupport, smooth_blocks

[2]:

DATASET = '../data/blocks/cancer_data.gpkg'
OUTPUT = '../data/regularized_variogram.json'
POLYGON_LAYER = 'areas'
POPULATION_LAYER = 'points'
POP10 = 'POP10'
GEOMETRY_COL = 'geometry'
POLYGON_ID = 'FIPS'
POLYGON_VALUE = 'rate'

[3]:

blocks = Blocks(
    ds=gpd.read_file(DATASET, layer=POLYGON_LAYER),
    value_column_name=POLYGON_VALUE,
    geometry_column_name=GEOMETRY_COL,
    index_column_name=POLYGON_ID
)

[4]:

point_support = PointSupport(
    points=gpd.read_file(DATASET, layer=POPULATION_LAYER),
    blocks=blocks,
    points_value_column=POP10,
    points_geometry_column=GEOMETRY_COL,
    verbose=True
)

[5]:

blocks.ds.plot(column=blocks.value_column_name,
               cmap='Spectral_r',
               legend=True,
               figsize=(12, 8))

[5]:

<Axes: >

../../../_images/usage_tutorials_functional_4-4-poisson-kriging-area-to-point-smoothing_5_1.png

This is our initial dataset. Cancer rates are aggregated over counties, and we have to deal with a choropleth map. In many cases, it is undesired, especially if we want to use remotely sensed data for AI/ML modeling and those aggregates. We can use Poisson Kriging to transform this representation into population blocks. But first, we need to load regularized semivariogram model.

2. Load regularized semivariogram model#

We load a regularized semivariogram from the tutorial about Semivariogram Regularization. You can always perform semivariogram regularization along with the Poisson Kriging, but it is a very long process, and it is more convenient to separate those two steps.

[6]:

semivariogram = TheoreticalVariogram()
semivariogram.from_json(OUTPUT)
semivariogram.plot(experimental=False)

../../../_images/usage_tutorials_functional_4-4-poisson-kriging-area-to-point-smoothing_7_0.png

3. Smooth blocks#

The process of map smoothing is straightforward. We use the smooth_blocks() function and pass theoretical variogram and point support objects into it. Other important parameter is number_of_neighbors because the final results might be affected by the number of regions that we include as the closest neighbors of a transformed area.

The method returns GeoDataFrame with points and predicted values. It iteratively re-calculates each area risk and produces predictions per point. In Area-to-Area Kriging, those predictions are aggregated. We leave them and get the smooth risk map.

[7]:

smoothed = smooth_blocks(
    semivariogram_model=semivariogram,
    point_support=point_support,
    number_of_neighbors=16,
    data_crs=point_support.point_support.crs,
    verbose=False
)

[8]:

smoothed.head()

[8]:

	id	geometry	reg.est	reg.err	rmse
0	42049.0	POINT (1277277.625 441124.5)	0.907773	11.788809	NaN
1	42049.0	POINT (1277277.625 431124.5)	1.044693	11.529003	NaN
2	42049.0	POINT (1285937.875 446124.5)	5.031347	11.818780	NaN
3	42049.0	POINT (1285937.875 436124.5)	1.625427	11.191553	NaN
4	42049.0	POINT (1285937.875 426124.5)	2.680877	11.915994	NaN

4. Export results#

The last step is data visualization. We use a choropleth map from the GeoPandas, but then we will store a smoothed map as geopackage. Then, you can work on this dataset in a different program (e.g.: QGIS).

[9]:

base = blocks.ds.plot(figsize=(14, 14), color='white', edgecolor='black')
smooth_plot_data = smoothed.copy()
smooth_plot_data[smooth_plot_data['reg.est'] < 0.01] = np.nan
smooth_plot_data.plot(ax=base, column='reg.est', cmap='plasma', legend=True, markersize=30, alpha=0.7)

[9]:

<Axes: >

../../../_images/usage_tutorials_functional_4-4-poisson-kriging-area-to-point-smoothing_12_1.png

[10]:

base = blocks.ds.plot(figsize=(14, 14), color='white', edgecolor='black')
smooth_plot_data.plot(ax=base, column='reg.err', cmap='Reds', legend=True, markersize=20, alpha=0.7)

[10]:

<Axes: >

../../../_images/usage_tutorials_functional_4-4-poisson-kriging-area-to-point-smoothing_13_1.png

[11]:

# smoothed.to_file('smoothed.gpkg')

Changelog#

Date	Changes	Author
2025-05-31	Tutorial has been adapted to the 1.0 release	@SimonMolinsky (Szymon Moliński)