Natural Resources CanadaGovernment of Canada
Satellite image of CanadaStrong and safe communitiesGeological Survey of Canada
Spatial data analysis
Prediction models in spatial data analysis for landslide hazard mapping

Overview

views of prediction map


Brief history of spatial prediction modeling at the GSC

1972: Agterberg, F.P. et al., Prediction models for locating undiscovered copper deposits.

1977-1980: Chung C.F., Development of SIMSAG (interactive graphic system), CDC 6400 + tektronix 4014 terminal maximum number of pixels: 100 x 100

1990: Agterberg, F.P. and Bonham-Carter G.F. (eds.), GSC paper 89-9, Statistical Applications in the Earth Sciences

1992 - 1997: Spatial Data Analysis Laboratory. Development of spatial data integration system SGI, Windows NT/95/98/3.1 maximum number of pixels (in practice) 4000 x 4000

Basic idea: the favorability function

At each pixel, p:

ƒ(Tp: given m causal factors vk(p)=1 . . . m)

Tp: (p will be affected by a future landslide of type D)

The "sureness" that the proposition Tp is true given the m causal factors (vk(p)=1 . . . m), is being measured.

By "sureness" we mean: probability, certainty, belief, plausibility, possibility ...

Mathematical frameworks

Three mathematical frameworks used for the models are:

  1. Probability theory

  2. Dempster-Shafer evidential theory

  3. Zadeh's fuzzy set theory

Direct estimation

Sp: "p has been affected by a past landslide of a given type

Prob{Sp|c1,c2,...,cm}= size of S Θ / size of Θ

Prob{Sp}=size of S / size of A
where S represents the areas affected by past landslides

Prob(Fp|c1,c2,...,cm) = Prob(Sp|c1,c2,...,cm)

λD = Prob{c1,c2,...,cm|Sp} / Prob{c1,c2,...,cm|notSp} =1-Prob{Sp} / Prob{Sp} * Prob{Sp|c1,c2,...,cm} / 1- Prob{Sp|c1,c2,...,cm}

CFD{Fp|c1,c2,...,cm} =λ -1 / λ -1

WoED{Fp|c1,c2,...,cm} = logeλ

The order is preserved. The estimators are simple to compute, and do not require any mathematical assumptions. They fail badly as predictors of the occurrence of future landslides. They should be computed as benchmarks of the performance of spatial data as causal factors of landslides.

Bayesian estimation

Prob{Fp|c1,c2,...,cm}= Prob{Fp}Prob{c1,c2,...,cm|Fp} / Prob{c1,c2,...,cm}

Conditional independence assumption given Fp

Prob{c1,c2,...,cm|Fp} =Prob{c1|Fp}Prob{c2|Fp} ...Prob{cm|Fp}

Prob{Fp|c1,c2,...,cm} =(Prob{c1|Fp}...Prob{cm|Fp} / Prob{Fp|c1,c2,...,cm})

Prob{F}(Prob{Fp|c1} / Prob{F})... (Prob{Fp|cm} / Prob{F})

Prob{c1,c2,...,cm}= size of Θ / size of A,

Prob{ck}= size of Akck / size of A,

Prob{Fp|ck} = size of F ∩ size of Akck / size of A,

Prob{Sp|ck} =size ofS∩ size of Akck / size of A,

Prob{Sp} =size ofS / size of A

Bayesian estimate at each pixel p:

Prob{Fp|c1,c2,...,cm} =(Prob{c1|Sp}...Prob{cm|Sp} / Prob{Sp|c1,c2,...,cm}) Prob{S}(Prob{Sp|c1} / Prob{Sp})... (Prob{Sp|cm} / Prob{Sp})

The likelihood ratio becomes:

λ = λ1 ... λm

λ1 =Prob{c1|Fp} / Prob{c1|notFp}
=Prob{Fp|c1}(1-Prob{Fp} / Prob{Fp}(1-Prob{Fp|c1}

λ' = λ'1 ... λ'm

λ'k =Prob{ck|Sp} / Prob{ck|notSp}
=Prob{Sp|ck}(1-Prob{Sp} / Prob{Sp}(1-Prob{Sp|ck}

CF2={Fp|c1,c2,...,cm}=(λ' - 1) / (λ' + 1)

WOE2={Fp|c1,c2,...,cm}=logeλ'

The advantage of this estimator is that it depends only on bivariate conditional probabilities of the occurrences of past landslides given pixel values at each layer separately. However, the price of this advantage is adherance to the conditional independence assumption.

Validation methods


2006-09-01
http://www.gsc.nrcan.gc.ca/sda/landslide_e.php