Data Science Asked on September 5, 2021
When I write "uncertainty" in this post I mean:
If I have a classifier into categories $a_1, \dots, a_n$ and, for an observation $x$, I assign $x$ to $a_i$ with probability $p_i$, then the uncertainty of this decision is $1-p_i$.
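As a small illustration of this definition, here is a minimal sketch in plain Python. The function name `decision_uncertainty` is hypothetical, and it assumes the chosen category $a_i$ is the one with the highest predicted probability, which the definition above does not state explicitly.

```python
def decision_uncertainty(probs):
    """Return (predicted index i, uncertainty 1 - p_i) for one observation.

    probs: list of class probabilities p_1..p_n (assumed to sum to 1).
    Assumption: the predicted class is the most probable one.
    """
    i = max(range(len(probs)), key=lambda k: probs[k])
    return i, 1.0 - probs[i]


# Hypothetical predicted probabilities for a 3-class problem.
pred, unc = decision_uncertainty([0.1, 0.7, 0.2])
print(pred, round(unc, 2))  # 1 0.3
```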
I’d like to ask about the connections between this notion and those of accuracy and explainability.
For example, if I have a classifier that is "very certain" (in mean or median over the test/training set), how strongly is this property correlated with achieving accurate real-time predictions? What about vice versa?
Moreover, if my classifier is "certain" how does this affect my ability to explain its decision in any sense?
I couldn’t find good resources for this notion of uncertainty and these questions so I will really appreciate some references as well!
There is a bit of confusion, I'm afraid:
Now the main problem: whatever you call a confidence measure based on the probability predicted by the classifier, it is not reliable on its own. A prediction is at best an informed decision of the classifier given the data it has seen in the training set and the features of the instance. But it could be a random classifier, or a majority-class classifier: in these cases the probability it "predicts" is arbitrary. Imagine you are a teacher and one of your students says "the answer to x=2+2 is x=5, I'm 100% sure". The fact that the student is "100% sure" doesn't make them right, and the same goes for the classifier. In other words, any reliable measure of uncertainty involves the gold-standard answer, so it's usually part of the evaluation process. That's not to say that the predicted probability is useless, but in general it has no direct link to accuracy, and it would be a mistake to interpret it in this way.
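To make the majority-class case concrete, here is a rough sketch in plain Python. The helper `mean_confidence_and_accuracy` and the toy data are made up for illustration; it only shows that a classifier can report zero uncertainty everywhere while being right half the time, and that you need the gold labels to notice.

```python
def mean_confidence_and_accuracy(prob_rows, gold):
    """Compare what the classifier claims (mean p_i) with what it achieves.

    prob_rows: one list of class probabilities per instance.
    gold: the gold-standard class index for each instance.
    """
    confidences = []
    correct = 0
    for probs, y in zip(prob_rows, gold):
        # Predicted class = most probable class (an assumption of this sketch).
        pred = max(range(len(probs)), key=lambda k: probs[k])
        confidences.append(probs[pred])
        correct += int(pred == y)
    n = len(gold)
    return sum(confidences) / n, correct / n


# Hypothetical balanced binary test set.
gold = [0, 1, 0, 1, 1, 0, 1, 0]
# Hypothetical classifier that always answers class 0, "100% sure".
always_zero = [[1.0, 0.0] for _ in gold]

conf, acc = mean_confidence_and_accuracy(always_zero, gold)
print(conf, acc)  # 1.0 0.5
```

The gap between the two numbers (here 1.0 versus 0.5) is only visible because the gold labels are used, which is why this kind of check belongs to evaluation rather than to the prediction itself.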
Interpretability (or explainability) is a completely different matter: the general idea is to know whether the answer predicted by a classifier can be understood by a human. Typically traditional models like Naive Bayes or Decision Tree models are more directly interpretable (at least with not too many features) than deep NN models.
Answered by Erwan on September 5, 2021