Data Science Asked on June 6, 2021
I am wondering what’s the best way to handle outliers when using non-supervised clustering algorithms?
you can perform standardization of your data using Standard Scaler before applying clustering techniques or you can use k-mediod clustering algorithm. You can also use z-score analysis to remove your outliers.
Answered by khwaja wisal on June 6, 2021
If you have outliers, the best way is to use a clustering algorithm that can handle them.
For example DBSCAN clustering is robust against outliers when you choose minpts large enough. Don't use k-means: the squared error approach is sensitive to outliers. But there are variants such as k-means-- for handling outliers.
Answered by Has QUIT--Anony-Mousse on June 6, 2021
Get help from others!
Recent Questions
Recent Answers
© 2024 TransWikia.com. All rights reserved. Sites we Love: PCI Database, UKBizDB, Menu Kuliner, Sharing RPP