Data Science Asked by eartoolbox on January 9, 2021
I trained a GRU model on some data and then created a bunch of predictions on a test set.
The predictions are really bad, as indicated by a near zero R2 score.
I notice that the variance of the model predictions are much smaller than the actual training data. i.e it seems like the model is overfit to the mean:
But why is this? I made sure to stop training/use hyperparameters where there was no model overfitting, so why are model predictions centred around the mean and less dispersed than the actual variance of the data set?
My model, if it is relevant:
model = Sequential()
model.add(GRU(100, activation='relu', input_shape=(3, 280), recurrent_dropout = 0.2, dropout = 0.2))
model.add(Dense(1))
model.compile(optimizer='adam', loss='mse', metrics=['mae'])
model.summary()
# fit model
history = model.fit(X_train, y_train, epochs=40, verbose=1, validation_split=0.33)
The first thing you can do here is to scale your output data before training the model. For instance
$$ dfrac{x_i - bar{x}_{train}}{sigma_{x_{train}}} $$
If that does not work, then you can have a look at the model and / or training parameters.
Answered by RonsenbergVI on January 9, 2021
Get help from others!
Recent Answers
Recent Questions
© 2024 TransWikia.com. All rights reserved. Sites we Love: PCI Database, UKBizDB, Menu Kuliner, Sharing RPP