How to reuse Random forest Classifier from Otpod Adaptive Hit miss method?

gblondet · June 5, 2023, 1:18pm

Good afternoon,

I am working with otpod to estimate POD with the Adaptive Hit-Miss method. I did not manage to reuse the classifier to compute confusion matrix. Of course, there is a method to compute confusion matrix directly, but it is only computed on added points, not on the whole dataset. I would like to compute the confusion matrix on new data by using the method AdaptiveHitMissPOD.getClassifier)(), but i don’t understand the type of argument which is required. The input data object we used to run the HitMiss method is not compatible with the ot.Function returned by getClassifier().
How can I use the ot.Function properly to evaluate the confusion matrix with sklearn ?

Sincerely yours,

G.Blondet

gblondet · June 6, 2023, 6:45am

I think I’ve found a simple solution, but it is not documented.

From the example otpod/test_adaptive_hitmiss_pod.py at master · openturns/otpod · GitHub
i added this :

HM_model = POD1.getClassifier()
pred = HM_model(inputDOE)


from sklearn.metrics import confusion_matrix

# outputDOE is composed of floats, but we need 1 (>detection) or 0.
ref = np.where(np.array(outputDOE) > detection, 1, 0)

# pred is composed of 2 columns, but we want just one !
pred_melt = pred.argmax(axis=1)

# confusion matrix
cf_mat = confusion_matrix(ref, pred_melt)

Is it correct ?
I haven’t check if the classifier is used directly like this inside AdaptiveHitMissPOD.

Sincerely,

G.Blondet

dumas · June 13, 2023, 1:43pm

Hello,

This part of the module was developped using scikit learn. Yes I checked inside the code, the confusion matrix is computed using the method imported from metrics:

github.com

openturns/otpod/blob/master/otpod/_adaptive_hitmiss_pod.py#L342


      
                      ),
                      *self._ClassifierParameters
                  )
              )[0]
          
          
# Apprentissage avec self._input,self._signals
          algo_temp.fit(self._input, self._signals)
          
          
self._confMat = np.zeros((2, 2))
          for classifier in list_classifiers:
              conf_temp = 1.0 * confusion_matrix(
                  self._signals, classifier(self._input)[:, 1] >= 0.5
              )
              conf_temp = 1.0 * conf_temp / conf_temp.sum(axis=0)
              self._confMat = conf_temp + self._confMat
          
          
self._confMat = 1.0 * self._confMat / len(list_classifiers)
          classif_algo_temp = algo_temp.predict_proba
          
          
p11 = self._confMat[1, 1]
          p10 = self._confMat[1, 0]

Antoine

Topic		Replies	Views
Predict new value POD Summary Python usage	0	215	September 30, 2022
New otbenchmark module Python usage	0	391	September 25, 2020
OTbenchmark: a benchmark module for UQ available on PyPi! Announcements	2	387	February 23, 2021
Using Kriging POD model Python usage	6	367	September 30, 2022
Otagrum computeConditionalCDF Python usage	1	31	January 3, 2025

How to reuse Random forest Classifier from Otpod Adaptive Hit miss method?

Related topics