Frozen inputs for calibration not really frozen

sanaaZ · May 25, 2021, 3:59pm

Hi everyone,
I have a calibration problem which resemble a lot the one described in the logistic calibration example in the Openturns documentation

Only i have had some trouble using Openturns because the input parameters that are supposedly frozen during the calibration procedure (i.e. the observed years) are slightly changed. not very much changed, just around the fifth or sixth decimal but sufficiently so to make for some useless calls to the function. You can see such a problem in the file included, which comes from the OT documentation with the addition of a print call within the calibrated function, in order to monitor on what inputs the function is being called

Given that my function is a bit costly to compute, it becomes quite cumbersome and renders any memoization measure useless

is there a way to ensure that the frozen input parameters are effectively frozen during the calibration procedure?

regards,
sanaa

regislebrun · May 25, 2021, 8:07pm

Hi Sanaa,

Using LinearLeastSquaresCalibration, at one point you need to compute the gradient of the function wrt the frozen parameters. Either you give a specific implementation of the gradient, or it is computed using finite differences (the default). It is exactly what’s happen here and there is no way to avoid it if you don’t provide the gradient. It is not a limitation of OT, but a characteristic of linear least squares calibration.
Nevertheless, you can reduce the cost a little bit by providing your own finite difference gradient based on a non-centered finite difference instead of the default centered one. It reduces the number of evaluations from 2d to d+1, and even to d as the central point is also needed in another part of the algorithm and will be cached.Have a look here:
https://openturns.github.io/openturns/latest/user_manual/_generated/openturns.NonCenteredFiniteDifferenceGradient.html

Cheers

Régis

sanaaZ · May 26, 2021, 7:18am

Thank you Regis
I makes a bit more sense in the light of your explanation. Still, i dont understand why OT should be needing to calculate the global gradient and not just a partial one ? The optimisation is carried out on two parameters only after all (in the example), to my understanding OT does not need to compute the gradient on the whole input, but only the partial gradient over the parameters that are to be calibrated

Thank you !
Sanaa

-------- Message d’origine --------

regislebrun · May 26, 2021, 8:45am

In fact it does exactly that: if you look at the LinearLeastSquaresCalibration.cxx file, line 72:

const Matrix parameterGradient(parametrizedModel.parameterGradient(inputObservations[i]));

but if your model has been built using a ParametricFunction class, this method is based on the gradient() method of the underlying full function, where the parameters and the other inputs are merged and there is no way to tell a function to compute only a part of its gradient. I can add a specific case for finite differences but at one point it becomes a crapy software design. I will propose an evolution in this spirit and discuss this point with the development team.

Cheers

Régis

sanaaZ · May 26, 2021, 12:44pm

Thank you Régis
Here is what i ended up doing in order to circumvent this problem:

from functools import partial
def create_frozen_input_func(logisticFun, arg_froz ):
    froz_func = partial(logisticFun, arg_froz=arg_froz)
    return lambda args:froz_func(args)

def logisticFun (arg_cal, arg_froz):
    arg_cal = [i for i in arg_cal]
    X = arg_froz + arg_cal 
    return logisticModel(X)

# Frozen arguments:
arg_froz = np.array(timeObservationsVector).flatten().tolist()
# Wrapped function:
logisticParametric = create_frozen_input_func(logisticFun, arg_froz )
logisticParametric (thetaPrior)
logisticParametric = ot.PythonFunction(2, nbdates,logisticParametric)

# The new parametric function, with a 0-dimension input
logisticParametric = ot.ParametricFunction(logisticParametric,[0,1],thetaPrior)

populationPredicted = logisticParametric([])

algo = ot.LinearLeastSquaresCalibration(logisticParametric, [[]], 
               populationObservationsVector, thetaPrior)
algo.run()

Sanaa

regislebrun · May 26, 2021, 1:44pm

I just implemented an OT version of this trick, taking into account the fact that the gradients are computed using finite differences (a very common case).
What makes your implementation work is because your full model does not provide its gradient. If it was provided, for example an analytical gradient (here: logisticFun implemented as e.g. a SymbolicFunction) then your construction would lead to a parameter gradient computed by finite differences over the frozen input function, not using any of the components of the analytical gradient.
In OT we made the decision to favor accuracy wrt speed as much as possible, and we made the assumption that if the user provides the gradient it is because he took the time to fine-tune it, hence the call to the full gradient and the extraction to its relevant subset.
Stay tuned to the following OT pull request:
https://github.com/openturns/openturns/pull/1846

MichaelBaudin · May 27, 2021, 9:46am

Hi!
I created an issue on this topic to make the problem as clear as possible:

github.com/openturns/openturns

`ParametricFunction` require unnecessary function evaluations

opened 01:22PM - 26 May 21 UTC

mbaudin47

When we calibrate the parameters of a function, we need to compute the gradient …of the function with respect to the parameters (e.g. with LinearLeastSquaresCalibration). With a `PythonFunction`, this may generate more function evaluations than required. This does not generate a wrong calculation, but limits the speed. The root cause of the problem is that the `ParametricFunction` implements the gradient by computing the gradient of the full underlying function, then extract the required components of the gradient with respect to the parameters. In the following script, we create a `ParametricFunction` with name `g`. This function is created based on the full model `f` which has inputs `a, b, x0, x1, x2`. Then `g` is created by setting `a` and `b` as parameters. Hence the parametric function `g` has (a,b) as parameters and (x0, x1, x2) as inputs. ``` import openturns as ot def f(x): x = ot.Point(x) a, b, x0, x1, x2 = x print("x=", x) y = a + b + x0 + x1 + x2 return [y] f_py = ot.PythonFunction(5, 1, f) indices = [0, 1] referencePoint = [1.0, 2.0] g = ot.ParametricFunction(f_py, indices, referencePoint) x = ot.Point([3.0, 4.0, 5.0]) print("g(x)=", g(x)) ``` To evaluate the gradient with respect to its input x, we use the script: ``` # Compute gradient with respect to inputs x gradient_x = g.gradient(x) print("Gradient with respect to x = (x0, x1, x2)=") print(gradient_x) ``` which prints: ``` x= [1.00001,2,3,4,5] x= [0.99999,2,3,4,5] x= [1,2.00001,3,4,5] x= [1,1.99999,3,4,5] x= [1,2,3.00001,4,5] x= [1,2,2.99999,4,5] x= [1,2,3,4.00001,5] x= [1,2,3,3.99999,5] x= [1,2,3,4,5.00001] x= [1,2,3,4,4.99999] ``` This shows that the underlying full model `f` is used to compute the gradient with respect to all inputs (using finite differences). This is useless, since only the partial derivatives with respect to x0, x1, and x2 are required. When we want to compute the gradient with respect to the parameters, we use: ``` # Compute gradient with respect to parameters gradient_p = g.parameterGradient(x) print("Gradient with respect to parameters (a, b)=") print(gradient_p) ``` This prints the same message. The problem is in: https://github.com/openturns/openturns/blob/b5797d7e4a71c71faf86df51f26ad0d8d551ad08/lib/src/Base/Func/ParametricGradient.cxx#L62 The code is: ``` const Matrix fullGradient(p_evaluation_->getFunction().gradient(x)); ``` A possible implementation to solve the problem would be to provide a way to access to a specific component of the gradient. Thanks to @sanaaZ for pointing this problem.

Regards,
Michaël

Topic		Replies	Views
How should i memoize my function in a calibration setting? Python usage calibration	4	403	May 25, 2021
Stochastic calibration Python usage	1	314	January 23, 2022
Minor feedback on covariance matrixes in a calibration context Python usage	1	217	June 8, 2022
Computation time Python usage	4	35	October 9, 2024
Calibration with noisy input parameters Python usage calibration	4	414	April 11, 2022

Frozen inputs for calibration not really frozen

Related topics