マルチクラスのsklearn.roc_auc_scoreを計算する

Question

分類器のAUC、精度、精度を計算したいと思います。私は教師付き学習をしています：

これが私の作業コードです。このコードはバイナリクラスでは正常に機能しますが、マルチクラスでは機能しません。バイナリクラスを持つデータフレームがあると仮定してください。

sample_features_dataframe = self._get_sample_features_dataframe() labeled_sample_features_dataframe = retrieve_labeled_sample_dataframe(sample_features_dataframe) labeled_sample_features_dataframe, binary_class_series, multi_class_series = self._prepare_dataframe_for_learning(labeled_sample_features_dataframe) k = 10 k_folds = StratifiedKFold(binary_class_series, k) for train_indexes, test_indexes in k_folds: train_set_dataframe = labeled_sample_features_dataframe.loc[train_indexes.tolist()] test_set_dataframe = labeled_sample_features_dataframe.loc[test_indexes.tolist()] train_class = binary_class_series[train_indexes] test_class = binary_class_series[test_indexes] selected_classifier = RandomForestClassifier(n_estimators=100) selected_classifier.fit(train_set_dataframe, train_class) predictions = selected_classifier.predict(test_set_dataframe) predictions_proba = selected_classifier.predict_proba(test_set_dataframe) roc += roc_auc_score(test_class, predictions_proba[:,1]) accuracy += accuracy_score(test_class, predictions) recall += recall_score(test_class, predictions) precision += precision_score(test_class, predictions)

最終的に、平均AUC、精度などを得るために、結果をもちろんKで分割しました。このコードは正常に機能しています。ただし、マルチクラスの場合は同じように計算できません。

 train_class = multi_class_series[train_indexes] test_class = multi_class_series[test_indexes] selected_classifier = RandomForestClassifier(n_estimators=100) selected_classifier.fit(train_set_dataframe, train_class) predictions = selected_classifier.predict(test_set_dataframe) predictions_proba = selected_classifier.predict_proba(test_set_dataframe)

マルチクラスの場合、平均値に「加重」パラメータを追加する必要があることがわかりました。

 roc += roc_auc_score(test_class, predictions_proba[:,1], average="weighted")

エラーが発生しました：ValueError（ "{0}形式はサポートされていません。"。format（y_type））を発生させます

ValueError：マルチクラス形式はサポートされていません

ogrisel · Answer

roc_auc_scoreのaverageオプションは、マルチラベルの問題に対してのみ定義されます。

Scikit-learnドキュメンテーションの次の例を参照して、マルチクラス問題のマイクロまたはマクロ平均スコアを定義してください。

http://scikit-learn.org/stable/auto_examples/model_selection/plot_roc.html#multiclass-settings

Edit：scikit-learnトラッカーに問題があり、マルチクラスの問題に対してROC AUCを実装します： https://github.com/ scikit-learn/scikit-learn/issues/3298

maxymoo · Answer

roc_aucをマルチクラスモデルの単一のサマリーメトリックとして使用することはできません。必要に応じて、クラスごとにroc_aucを計算できます。

roc = {label: [] for label in multi_class_series.unique()} for label in multi_class_series.unique(): selected_classifier.fit(train_set_dataframe, train_class == label) predictions_proba = selected_classifier.predict_proba(test_set_dataframe) roc[label] += roc_auc_score(test_class, predictions_proba[:,1])

ただし、sklearn.metrics.confusion_matrixを使用してマルチクラスモデルのパフォーマンスを評価する方が一般的です。

Raul · Answer

ここで述べたように、私の知る限り、sklearnでネイティブに複数のクラス設定のroc aucを簡単に計算する方法はまだありません。

ただし、classification_reportに精通している場合は、classification_reportと同じ出力をpandas.DataFrameとして返すこの単純な実装を好むかもしれません。

import pandas as pd import numpy as np from scipy import interp from sklearn.metrics import precision_recall_fscore_support from sklearn.metrics import roc_curve, auc from sklearn.preprocessing import LabelBinarizer def class_report(y_true, y_pred, y_score=None, average='micro'): if y_true.shape != y_pred.shape: print("Error! y_true %s is not the same shape as y_pred %s" % ( y_true.shape, y_pred.shape) ) return lb = LabelBinarizer() if len(y_true.shape) == 1: lb.fit(y_true) #Value counts of predictions labels, cnt = np.unique( y_pred, return_counts=True) n_classes = len(labels) pred_cnt = pd.Series(cnt, index=labels) metrics_summary = precision_recall_fscore_support( y_true=y_true, y_pred=y_pred, labels=labels) avg = list(precision_recall_fscore_support( y_true=y_true, y_pred=y_pred, average='weighted')) metrics_sum_index = ['precision', 'recall', 'f1-score', 'support'] class_report_df = pd.DataFrame( list(metrics_summary), index=metrics_sum_index, columns=labels) support = class_report_df.loc['support'] total = support.sum() class_report_df['avg / total'] = avg[:-1] + [total] class_report_df = class_report_df.T class_report_df['pred'] = pred_cnt class_report_df['pred'].iloc[-1] = total if not (y_score is None): fpr = dict() tpr = dict() roc_auc = dict() for label_it, label in enumerate(labels): fpr[label], tpr[label], _ = roc_curve( (y_true == label).astype(int), y_score[:, label_it]) roc_auc[label] = auc(fpr[label], tpr[label]) if average == 'micro': if n_classes <= 2: fpr["avg / total"], tpr["avg / total"], _ = roc_curve( lb.transform(y_true).ravel(), y_score[:, 1].ravel()) else: fpr["avg / total"], tpr["avg / total"], _ = roc_curve( lb.transform(y_true).ravel(), y_score.ravel()) roc_auc["avg / total"] = auc( fpr["avg / total"], tpr["avg / total"]) Elif average == 'macro': # First aggregate all false positive rates all_fpr = np.unique(np.concatenate([ fpr[i] for i in labels] )) # Then interpolate all ROC curves at this points mean_tpr = np.zeros_like(all_fpr) for i in labels: mean_tpr += interp(all_fpr, fpr[i], tpr[i]) # Finally average it and compute AUC mean_tpr /= n_classes fpr["macro"] = all_fpr tpr["macro"] = mean_tpr roc_auc["avg / total"] = auc(fpr["macro"], tpr["macro"]) class_report_df['AUC'] = pd.Series(roc_auc) return class_report_df

以下に例を示します。

from sklearn.metrics import classification_report from sklearn.model_selection import train_test_split from sklearn.ensemble import RandomForestClassifier from sklearn.datasets import make_classification X, y = make_classification(n_samples=5000, n_features=10, n_informative=5, n_redundant=0, n_classes=10, random_state=0, shuffle=False) X_train, X_test, y_train, y_test = train_test_split(X, y) model = RandomForestClassifier(max_depth=2, random_state=0) model.fit(X_train, y_train)

レギュラーclassification_report：

sk_report = classification_report( digits=6, y_true=y_test, y_pred=model.predict(X_test)) print(sk_report)

でる：

 precision recall f1-score support 0 0.262774 0.553846 0.356436 130 1 0.405405 0.333333 0.365854 135 2 0.367347 0.150000 0.213018 120 3 0.350993 0.424000 0.384058 125 4 0.379310 0.447154 0.410448 123 5 0.525000 0.182609 0.270968 115 6 0.362573 0.488189 0.416107 127 7 0.330189 0.299145 0.313901 117 8 0.328571 0.407080 0.363636 113 9 0.571429 0.248276 0.346154 145 avg / total 0.390833 0.354400 0.345438 1250

カスタム分類レポート：

report_with_auc = class_report( y_true=y_test, y_pred=model.predict(X_test), y_score=model.predict_proba(X_test)) print(report_with_auc)

でる：

 precision recall f1-score support pred AUC 0 0.262774 0.553846 0.356436 130.0 274.0 0.766477 1 0.405405 0.333333 0.365854 135.0 111.0 0.773974 2 0.367347 0.150000 0.213018 120.0 49.0 0.817341 3 0.350993 0.424000 0.384058 125.0 151.0 0.803364 4 0.379310 0.447154 0.410448 123.0 145.0 0.802436 5 0.525000 0.182609 0.270968 115.0 40.0 0.680870 6 0.362573 0.488189 0.416107 127.0 171.0 0.855768 7 0.330189 0.299145 0.313901 117.0 106.0 0.766526 8 0.328571 0.407080 0.363636 113.0 140.0 0.754812 9 0.571429 0.248276 0.346154 145.0 63.0 0.769100 avg / total 0.390833 0.354400 0.345438 1250.0 1250.0 0.776071

Afsan Abdulali Gujarati · Answer

実際のリストと予測リストを取得し、すべてのクラスをキーとして、そのroc_auc_scoreを値として含む辞書を返す比較的単純なものを探している場合は、次の方法を使用できます。

from sklearn.metrics import roc_auc_score def roc_auc_score_multiclass(actual_class, pred_class, average = "macro"): #creating a set of all the unique classes using the actual class list unique_class = set(actual_class) roc_auc_dict = {} for per_class in unique_class: #creating a list of all the classes except the current class other_class = [x for x in unique_class if x != per_class] #marking the current class as 1 and all other classes as 0 new_actual_class = [0 if x in other_class else 1 for x in actual_class] new_pred_class = [0 if x in other_class else 1 for x in pred_class] #using the sklearn metrics method to calculate the roc_auc_score roc_auc = roc_auc_score(new_actual_class, new_pred_class, average = average) roc_auc_dict[per_class] = roc_auc return roc_auc_dict print("
Logistic Regression") # assuming your already have a list of actual_class and predicted_class from the logistic regression classifier lr_roc_auc_multiclass = roc_auc_score_multiclass(actual_class, predicted_class) print(lr_roc_auc_multiclass) # Sample output # Logistic Regression # {0: 0.5087457159427196, 1: 0.5, 2: 0.5, 3: 0.5114706737345112, 4: 0.5192307692307693} # 0.5078894317816

Saad Ahmed · Answer

@Raulあなたの関数は良さそうに見えますが、n_classes <= 2でミクロ平均のroc_scoreを計算するときに関数に問題があります。寸法に問題があったので、次を変更しました。

これから

if average == 'micro': if n_classes <= 2: fpr["avg / total"], tpr["avg / total"], _ = roc_curve( lb.transform(y_true).ravel(), **y_score[:, 1]**.ravel())

これに

if average == 'micro': if n_classes <= 2: fpr["avg / total"], tpr["avg / total"], _ = roc_curve( lb.transform(y_true).ravel(), **y_score**.ravel())

この変更により、roc_scoreの計算に問題が生じないことを願っています。

Long · Answer

Maxymooの答えを更新します。

roc [label] + = roc_auc_score（test_class、predicts_proba [：、label]）

または、classifier.classes_属性を参照して、関心のあるラベルの正しい列を決定します。