TransWikia.com

how to handle outliers for clustering algorithms?

Data Science Asked on June 6, 2021

I am wondering what’s the best way to handle outliers when using non-supervised clustering algorithms?

2 Answers

you can perform standardization of your data using Standard Scaler before applying clustering techniques or you can use k-mediod clustering algorithm. You can also use z-score analysis to remove your outliers.

Answered by khwaja wisal on June 6, 2021

If you have outliers, the best way is to use a clustering algorithm that can handle them.

For example DBSCAN clustering is robust against outliers when you choose minpts large enough. Don't use k-means: the squared error approach is sensitive to outliers. But there are variants such as k-means-- for handling outliers.

Answered by Has QUIT--Anony-Mousse on June 6, 2021

Add your own answers!

Ask a Question

Get help from others!

© 2024 TransWikia.com. All rights reserved. Sites we Love: PCI Database, UKBizDB, Menu Kuliner, Sharing RPP