Data Science Asked by Arek Żyłkowski on June 8, 2021
Let’s say I have pretrained word2vec model and apply it to dataset consisting of article titles from "The Guardian". It seems pretty obvious that titles coming from "Science" section would form one cluster in latent space and titles from "Fashion" section would form another cluster in latent space. But the thing is my dataset doesn’t have category label for each title. How can I come up with such human readable interpretation of cluster centers(probably coming from Kmeans)?
The usual way is to present the top N (e.g. top 10) words for the cluster:
Answered by Erwan on June 8, 2021
Get help from others!
Recent Questions
Recent Answers
© 2024 TransWikia.com. All rights reserved. Sites we Love: PCI Database, UKBizDB, Menu Kuliner, Sharing RPP