TransWikia.com

Clustering sequences of sentence embeddings

Data Science Asked by dendog on September 20, 2020

I have a sequence of events, right now I am not worried about their actual times, just the order. This is a sequence of web page views.

I have modelled my data as the following, where each element represents the category of the web page.

user_sequence = ['A', 'A', 'B', 'C', ...]

Following this I used the code and approach from this paper: Sequence Graph Transform

My question is how could I represent more complex data in my sequences, for example, we have an embedding representing the page content, along with features including the dwell of the user on each page.

So to summarise, the goal is to do process sequences of rich event data in an unsupervised manner.

One Answer

The approach taken for anyone coming across this has been to first cluster each web page / node before passing it into SGT. This means we can encode more information into the sequence prior to using SGT.

Answered by dendog on September 20, 2020

Add your own answers!

Ask a Question

Get help from others!

© 2024 TransWikia.com. All rights reserved. Sites we Love: PCI Database, UKBizDB, Menu Kuliner, Sharing RPP