Cross Validated Asked by Abdu on February 20, 2021
Does anyone know an outlier detection method for a univariate categorical (nominal, unordered) variable that makes no assumptions about the variable's distribution (i.e., a non-parametric method)?
As I understand it, there is no concept of outlier detection for categorical (nominal) variables, because each value is just a label. Frequency (the mode) alone does not let us do outlier treatment on a categorical variable. Please prove me wrong :)
Answered by Kapil on February 20, 2021
Outliers are extreme values, which may or may not be influential for the model. For categorical data (say, gender: male and female), there is no way to detect outliers. Suppose you take a sample of 10 with 9 males and 1 female. Is that "1 female" an outlier? No! It's just the composition of the sample you happened to select.
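To make the 9-to-1 example concrete, here is a minimal Python sketch. The function name `category_shares` and the sample list are made up for illustration and are not from any library. All it does is tabulate relative frequencies: a low share shows that a label is rare in this particular sample, and nothing in the output marks it as an outlier.

```python
from collections import Counter

# Hypothetical sample matching the example: 9 males and 1 female.
sample = ["male"] * 9 + ["female"]

def category_shares(values):
    """Return each label's relative frequency in the sample."""
    counts = Counter(values)
    total = len(values)
    return {label: count / total for label, count in counts.items()}

shares = category_shares(sample)
print(shares)  # {'male': 0.9, 'female': 0.1}
```

Drawing a different sample from the same population would change these shares, which is why the table describes sample composition rather than extremeness.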
Answered by Dovini Jayasinghe on February 20, 2021
Think about your question once more because you ask for an algorithm to detect which of these is an outlier:
Nominal scale means that you have just labels of items like city names or car brands. You can't tell which is an outlier without additional info.
Answered by Silvestris on February 20, 2021