NO.PZ2021083101000007
问题如下:
Select performance data from the cross- validation set confusion matrices is presented in Exhibit 1:
Azarov and Bector evaluate the Dataset XYZ performance metrics for Confusion Matrices A, B, and C in Exhibit 1. Azarov says, “For Ganymede’s purposes, we should be most concerned with the cost of Type I errors. ”
Based on Exhibit 1, which confusion matrix demonstrates the most favorable value of the performance metric that best addresses Azarov’s concern?
选项:
A.Confusion Matrix A
Confusion Matrix B
Confusion Matrix C
解释:
A is correct.
Precision is the ratio of correctly predicted positive classes to all predicted positive classes and is useful in situations where the cost of false positives or Type I errors is high.
Confusion Matrix A has the highest precision and therefore demonstrates the most favorable value of the performance metric that best addresses Azarov’s concern about the cost of Type I errors.
Confusion Matrix A has a precision score of 0.95, which is higher than the precision scores of Confusion Matrix B (0.93) and Confusion Matrix C (0.86).
B is incorrect because precision, not accuracy, is the performance measure that best addresses Azarov’s concern about the cost of Type I errors.
Confusion Matrix B demonstrates the most favorable value for the accuracy score (0.92), which is higher than the accuracy scores of Confusion Matrix A (0.91) and Confusion Matrix C (0.91).
Accuracy is a performance measure that gives equal weight to false positives and false negatives and is considered an appropriate performance measure when the class distribution in the dataset is equal (a balanced dataset).
However, Azarov is most concerned with the cost of false positives, or Type I errors, and not with finding the equilibrium between precision and recall.
Furthermore, Dataset XYZ has an unequal (unbalanced) class distribution between positive sentiment and negative sentiment sentences.
C is incorrect because precision, not recall or F1 score, is the performance measure that best addresses Azarov’s concern about the cost of Type I errors.
Confusion Matrix C demonstrates the most favorable value for the recall score (0.97), which is higher than the recall scores of Confusion Matrix A (0.87) and Confusion Matrix B (0.90).
Recall is the ratio of correctly predicted positive classes to all actual positive classes and is useful in situations where the cost of false negatives, or Type II errors, is high.
However, Azarov is most concerned with the cost of Type I errors, not Type II errors.
F1 score is more appropriate (than accuracy) when there is unequal class distribution in the dataset and it is necessary to measure the equilibrium of precision and recall.
Confusion Matrix C demonstrates the most favorable value for the F1 score (0.92), which is higher than the F1 scores of Confusion Matrix A (0.91) and Confusion Matrix B (0.91).
Although Dataset XYZ has an unequal class distribution between positive sentiment and negative sentiment sentences, Azarov is most concerned with the cost of false positives, or Type I errors, and not with finding the equilibrium between precision and recall.
考点:Model Training - Performance Evaluation
这道题我想问下,我做的时候知道用precison,但是他说worried about cost of Typ 1 error, 那不应该是FP比较大的时候,Type I error会比较大吗,那么prcesion小的时候,不应该是选C吗?还是说我哪里理解错了,谢谢