TransWikia.com

Visualising K-Means clusters for 3D data in R

Data Science Asked by zlee11 on July 27, 2021

I have an excel file that contains 485k rows x 3 columns of integer values. Sample data:

cluster.x    cluster.y    cluster.z
1            3            3
3            1            4
4            2            5
...

I did a kmeans clustering with 6 clusters, but I am figuring out what is the best way to display the result. I plotted it using plot3d() as shown below, but it is hard to differentiate different clusters since the values are all integers.

Does anyone have any suggestions? If possible, I also want it to be able to visualise how many data points there are in each cluster (e.g a bubble plot where the size of the bubble depicts the number of data points).

enter image description here

One Answer

Is it necessary to plot all points? If not, you can use plot3d to display 6 bubbles centered at each clusters center. You can use mean within distance for each cluster to set the radius of each cluster (you will have to normalize this) and a color range to display the number of points in each cluster.

Answered by Dimitrios Panagopoulos on July 27, 2021

Add your own answers!

Ask a Question

Get help from others!

© 2024 TransWikia.com. All rights reserved. Sites we Love: PCI Database, UKBizDB, Menu Kuliner, Sharing RPP