Mauntz-Kucherenko formula

This post follows the discussion started on GitHub:

In the following, I use the notations defined on this page of the doc:

Applying Equation (12) of the original paper, the Mauntz-Kucherenko estimator of the first-order Sobol index is:

\hat{V}_i = \frac{1}{N} \sum_{k=1}^N \tilde{G}(\boldsymbol{B}_k) \left[ \tilde{G}(\boldsymbol{E}_k^{i}) - \tilde{G}(\boldsymbol{A}_k) \right].
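As a quick numeric illustration, here is a minimal numpy sketch of this estimator. Everything in it is an assumption made for the sketch (the toy model, the sample size, and the convention that \boldsymbol{E}^{i} takes its i-th column from \boldsymbol{B} and the others from \boldsymbol{A}, which is consistent with the formula above); it is not the OpenTURNS implementation.

import numpy as np

# Mauntz-Kucherenko first-order estimator on a toy additive model.
# Convention assumed here: E^i equals A except for its i-th column,
# which is taken from B, so G(B) and G(E^i) share only the input x_i.
rng = np.random.default_rng(0)
N, n_X = 100_000, 3

def g(x):
    # toy additive model, total variance 1 + 4 + 9 = 14
    return x[:, 0] + 2.0 * x[:, 1] + 3.0 * x[:, 2]

A = rng.normal(size=(N, n_X))
B = rng.normal(size=(N, n_X))
mean = g(A).mean()                    # centering constant (here: sample A)
yA_t, yB_t = g(A) - mean, g(B) - mean

for i in range(n_X):
    E = A.copy()
    E[:, i] = B[:, i]                 # build E^i
    yE_t = g(E) - mean
    V_i = np.mean(yB_t * (yE_t - yA_t))
    print(f"S_{i + 1} ~ {V_i / yA_t.var():.3f}")  # expect 1/14, 4/14, 9/14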

Logically, the estimator \hat{V}_{-i} should then be

\hat{V}_{-i} = \frac{1}{N} \sum_{k=1}^N \tilde{G}(\boldsymbol{B}_k) \left[ \tilde{G}(\boldsymbol{C}_k^{i}) - \tilde{G}(\boldsymbol{A}_k) \right].

If we cannot do that (for example because we have not implemented \boldsymbol{C}_k^{i} :wink: ), then we could define

\hat{V}_{-i}^{alt} = \frac{1}{N} \sum_{k=1}^N \tilde{G}(\boldsymbol{A}_k) \left[ \tilde{G}(\boldsymbol{E}_k^{i}) - \tilde{G}(\boldsymbol{B}_k) \right].

and that would remain consistent with the paper.
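Indeed, assuming the convention that \boldsymbol{E}^{i} coincides with \boldsymbol{A} except for its i-th column (taken from \boldsymbol{B}), the pick-freeze identity gives

\mathbb{E}\left[ G(\boldsymbol{A}) \, G(\boldsymbol{E}^{i}) \right] = V_{-i} + \mathbb{E}[G]^2 \quad \text{and} \quad \mathbb{E}\left[ G(\boldsymbol{A}) \, G(\boldsymbol{B}) \right] = \mathbb{E}[G]^2,

so the difference converges to V_{-i} (any common centering constant inside \tilde{G} cancels in the difference).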

In that case, we would define \widehat{VT}_{i}^{alt} as \frac{1}{N} \sum_{k=1}^N \tilde{G}(\boldsymbol{A}_k)^2 - \hat{V}_{-i}^{alt} (keep in mind that \sum_{k=1}^N \tilde{G}(\boldsymbol{A}_k) is supposed to be null, so this first sum is the natural variance estimator). Therefore

\widehat{VT}_{i}^{alt} = \frac{1}{N} \sum_{k=1}^N \tilde{G}(\boldsymbol{A}_k) \left[ \tilde{G}(\boldsymbol{A}_k) + \tilde{G}(\boldsymbol{B}_k) - \tilde{G}(\boldsymbol{E}_k^{i}) \right].

Of course, in OpenTURNS, we typically use unbiased variance estimators so \frac{1}{N} should be replaced in all formulas by \frac{1}{N-1}. Not that it matters with the large N involved in Sobol estimation.

Here is the formula for \widehat{VT}_i^{alt} actually implemented in the code:

VTi(q, p) = referenceVariance_[q] + (size * muA[q] * muA[q] - yEDotyA[q]) / (size - 1.0);

In the code, q is the output index. For the sake of simplicity, let us assume that the output is one-dimensional, so q = 0. The input index (denoted by i in the post above) is denoted by p in the code: p = i.
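To make the case analysis below easier to follow, here is a hypothetical Python transcription of that C++ line (the variable names mirror the code; this is just a restatement, not the actual OpenTURNS source):

def VTi(referenceVariance, muA, yEDotyA, size):
    # referenceVariance_[q] + (size * muA[q]^2 - yEDotyA[q]) / (size - 1)
    return referenceVariance + (size * muA * muA - yEDotyA) / (size - 1.0)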

Currently, \tilde{G} is centered with respect to the full sample (G(\boldsymbol{A}), G(\boldsymbol{B}), G(\boldsymbol{E}^1),...,G(\boldsymbol{E}^{n_X}))^T:

Case 1: \tilde{G} centered with respect to the sample \boldsymbol{A}

In this case, we assume that the code snippet above is changed in order to center the output sample with respect to \boldsymbol{A}.

That is to say that \tilde{G}(\cdot) = G(\cdot) - \frac{1}{N} \sum_{k=1}^N G(\boldsymbol{A}_k). In this case we would have:

referenceVariance_[q] = \frac{1}{N-1} \sum_{k=1}^{N} \tilde{G}(\boldsymbol{A}_k)^2,

size = N,

muA[q] = \frac{1}{N} \sum_{k=1}^N \tilde{G}(\boldsymbol{A}_k) = 0,

yEDotyA[q] = \sum_{k=1}^N \tilde{G}(\boldsymbol{E}_k^i) \tilde{G}(\boldsymbol{A}_k).

Therefore,

VTi(q, p) = \frac{1}{N-1} \sum_{k=1}^{N} \tilde{G}(\boldsymbol{A}_k) \left( \tilde{G}(\boldsymbol{A}_k) - \tilde{G}(\boldsymbol{E}_k^i) \right).
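As a sanity check, here is a small numpy sketch of Case 1 (the toy model and the \boldsymbol{E}^i convention are the same illustrative assumptions as in the sketch above): center the outputs with respect to \boldsymbol{A} and verify that the code formula reduces to the expression just derived.

import numpy as np

# Case 1 sanity check: with A-centering, muA vanishes (up to round-off)
# and the code formula equals (1/(N-1)) * sum G~(A)(G~(A) - G~(E^i)).
rng = np.random.default_rng(1)
N, n_X, i = 50_000, 3, 0

def g(x):
    return x[:, 0] + 2.0 * x[:, 1] + 3.0 * x[:, 2]

A = rng.normal(size=(N, n_X))
B = rng.normal(size=(N, n_X))
E = A.copy()
E[:, i] = B[:, i]

mean_A = g(A).mean()
yA_t, yE_t = g(A) - mean_A, g(E) - mean_A   # centered with respect to A

referenceVariance = np.sum(yA_t**2) / (N - 1)
muA = yA_t.mean()                            # ~ 0
yEDotyA = np.sum(yE_t * yA_t)
code_formula = referenceVariance + (N * muA**2 - yEDotyA) / (N - 1.0)
direct_formula = np.sum(yA_t * (yA_t - yE_t)) / (N - 1.0)
print(np.isclose(code_formula, direct_formula))  # True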

Case 2: \tilde{G} centered with respect to the full sample

This is what is currently implemented, so the formulas below are what is really computed in OT version 1.16.

With \tilde{G}(\cdot) = G(\cdot) - \frac{1}{(2+n_X)N} \left( \sum_{k=1}^N G(\boldsymbol{A}_k) + \sum_{k=1}^N G(\boldsymbol{B}_k) + \sum_{j=1}^{n_X} \sum_{k=1}^N G(\boldsymbol{E}_k^j) \right) (the dummy index j avoids a clash with the fixed input index i), we have

referenceVariance_[q] = \frac{1}{N-1} \sum_{k=1}^{N} \left( \tilde{G}(\boldsymbol{A}_k) - \frac{1}{N} \sum_{k'=1}^N \tilde{G}(\boldsymbol{A}_{k'})\right)^2,

size = N,

muA[q] = \frac{1}{N} \sum_{k=1}^N \tilde{G}(\boldsymbol{A}_k) \neq 0,

yEDotyA[q] = \sum_{k=1}^N \tilde{G}(\boldsymbol{E}_k^i) \tilde{G}(\boldsymbol{A}_k).

Therefore,

VTi(q, p) = \frac{1}{N-1} \sum_{k=1}^{N} \left( \tilde{G}(\boldsymbol{A}_k) - \frac{1}{N} \sum_{k'=1}^N \tilde{G}(\boldsymbol{A}_{k'})\right)^2 + \frac{N}{N-1}\left( \frac{1}{N} \sum_{k=1}^{N} \tilde{G}(\boldsymbol{A}_k) \right)^2
- \frac{1}{N-1} \sum_{k=1}^N \tilde{G}(\boldsymbol{E}_k^i) \tilde{G}(\boldsymbol{A}_k).
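Note that, using \sum_{k=1}^N (x_k - \bar{x})^2 = \sum_{k=1}^N x_k^2 - N \bar{x}^2, the first two terms combine into \frac{1}{N-1} \sum_{k=1}^N \tilde{G}(\boldsymbol{A}_k)^2, so this condenses to

VTi(q, p) = \frac{1}{N-1} \sum_{k=1}^{N} \tilde{G}(\boldsymbol{A}_k) \left( \tilde{G}(\boldsymbol{A}_k) - \tilde{G}(\boldsymbol{E}_k^i) \right),

that is, the same form as in Case 1, but with \tilde{G} centered on the full sample instead of on \boldsymbol{A}.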

Conclusion

Neither the Case 1 formula nor the Case 2 formula matches the theory. The doc is wrong too, but in a different way.

@josephmure Using what you wrote and analysing the theory and other sources a bit, we can refine the analysis.

Starting from the first-order estimate:

\hat{V}_i = \frac{1}{N}\sum_{k=1}^N \tilde{G}(\boldsymbol{B}_k)\left[\tilde{G}(\boldsymbol{E}_k^{i}) - \tilde{G}(\boldsymbol{A}_k)\right]

We get:

\hat{V}_{-i} = \frac{1}{N}\sum_{k=1}^N \tilde{G}(\boldsymbol{B}_k)\left[\tilde{G}(\boldsymbol{C}_k^{i}) - \tilde{G}(\boldsymbol{A}_k)\right]

As the \boldsymbol{C}_k^{i} are missing, we use (as you mentioned):

\widehat{V}_{-i}^{alt} = \frac{1}{N}\sum_{k=1}^N \tilde{G}(\boldsymbol{A}_k)\left[\tilde{G}(\boldsymbol{E}_k^{i}) - \tilde{G}(\boldsymbol{B}_k)\right].

Then to get \widehat{VT}_{i}^{alt}, we use:

\widehat{VT}_{i}^{alt} = \mathrm{Var}\left( \tilde{G}(\boldsymbol{A}) \right) - \widehat{V}_{-i}^{alt}

The trick here consists, as for the first order, in replacing \left(\frac{1}{N}\sum_{k=1}^N \tilde{G}(\boldsymbol{A}_k)\right)^2 by \frac{1}{N}\sum_{k=1}^N \tilde{G}(\boldsymbol{A}_k) \tilde{G}(\boldsymbol{B}_k). Thus we get:

\widehat{VT}_{i}^{alt} = \frac{1}{N}\sum_{k=1}^N \tilde{G}(\boldsymbol{A}_k)\left[\tilde{G}(\boldsymbol{A}_k) - \tilde{G}(\boldsymbol{E}_k^{i})\right]
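Indeed, spelling out the variance with the substituted cross term:

\widehat{VT}_{i}^{alt} = \frac{1}{N}\sum_{k=1}^N \tilde{G}(\boldsymbol{A}_k)^2 - \frac{1}{N}\sum_{k=1}^N \tilde{G}(\boldsymbol{A}_k)\tilde{G}(\boldsymbol{B}_k) - \frac{1}{N}\sum_{k=1}^N \tilde{G}(\boldsymbol{A}_k)\left[\tilde{G}(\boldsymbol{E}_k^{i}) - \tilde{G}(\boldsymbol{B}_k)\right],

and the two \tilde{G}(\boldsymbol{B}_k) sums cancel, which gives the expression above.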

If we look, for example, at the R sensitivity package, the estimator is implemented this way.

However, in OpenTURNS, we replace the first term \frac{1}{N}\sum_{k=1}^N \tilde{G}(\boldsymbol{A}_k)^2 by \mathrm{var}(y_A) + \mu_A^2. In other words, we should implement the correct estimator using the previous equation.
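To make the proposal concrete, here is a minimal sketch of the advocated total-order estimator (the toy model and the \boldsymbol{E}^i convention are again illustrative assumptions, not the OpenTURNS code):

import numpy as np

# Sketch of the estimator advocated above:
# VT_i^alt = (1/N) * sum G~(A) [ G~(A) - G~(E^i) ], with A-centering.
rng = np.random.default_rng(2)
N, n_X = 100_000, 3

def g(x):
    return x[:, 0] + 2.0 * x[:, 1] + 3.0 * x[:, 2]

A = rng.normal(size=(N, n_X))
B = rng.normal(size=(N, n_X))
yA_t = g(A) - g(A).mean()

for i in range(n_X):
    E = A.copy()
    E[:, i] = B[:, i]
    yE_t = g(E) - g(A).mean()
    VT_i = np.mean(yA_t * (yA_t - yE_t))
    # for this additive model, ST_i = S_i: expect 1/14, 4/14, 9/14
    print(f"ST_{i + 1} ~ {VT_i / yA_t.var():.3f}")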