Data Science Asked on October 7, 2020
I want to build a [probabilistic] model that aims to infer the true value of an unknown categorical variable, $y in {1,2,…, K}$.
We have a dataset $(X,y): mathbb{R}^drightarrow {1,2,…, K}$ and we can train a classifier that gives $d$-dimensional data, $X$, and estimates the output $y$.
Now, suppose that $X$s are correlated and all coming from a fixed $y$. I mean, we are observing $X^1, X^2,…., X^T,…$ over time and we know that $y$ is fixed for all of them.
For example:
So, at time $t=T$ we have $hat{y}^1, hat{y}^2, …, hat{y}^T$.
Now, the question is: How can I make a model to use these estimations ($hat{y}^1, hat{y}^2, …, hat{y}^T$) and improve my belief about the true $y$ over time, considering that:
dimension $d$ is not small. e.g. $d >50$
data samples, $X$s, are not i.i.d. but all coming from a fixed unknown $y$.
classifier is not optimal (just trained on some available data) and at each round gives an estimate about the $hat{y}^t$ for the current $X^t$.
I have been reading some materials and came across the following but I am not sure which one is better to investigate more into:
Or is there any specific Bayesian framework for it?
I think all of the options you give will yield results for the problem you described. It seems you have a y like a die that yields results which are rolls of the die all unpredictable but still linked from the y and you want to infer the y based on the x values. Like a Hidden Markov model.
The part about wanting to accurately make an estimate of y at each x and the x's being correlated make me believe that LSTM technology may be of benefit. If you want to use a NN.
Optimal stopping and Sequential Hypothesis Testing and Sequential probability ratio test and HDI+ROPE will all work for the abstract problem you describe. Until you give us more details about your problem like what specifically you will be working with it is hard to give you concise direction.
If you create a LSTM That takes in an X and makes a guess about which y and train it on x data you have then you would have the predictive model you seek.
Answered by Michael Hearn on October 7, 2020
Get help from others!
Recent Answers
Recent Questions
© 2024 TransWikia.com. All rights reserved. Sites we Love: PCI Database, UKBizDB, Menu Kuliner, Sharing RPP