how to handle outliers for clustering algorithms?

Question

I am wondering what's the best way to handle outliers when using non-supervised clustering algorithms?

khwaja wisal · Answer

you can perform standardization of your data using Standard Scaler before applying clustering techniques or you can use k-mediod clustering algorithm. You can also use z-score analysis to remove your outliers.

Answered by khwaja wisal on June 6, 2021

Has QUIT--Anony-Mousse · Answer

If you have outliers, the best way is to use a clustering algorithm that can handle them.

For example DBSCAN clustering is robust against outliers when you choose minpts large enough. Don't use k-means: the squared error approach is sensitive to outliers. But there are variants such as k-means-- for handling outliers.

how to handle outliers for clustering algorithms?

2 Answers

Add your own answers!

Ask a Question