TransWikia.com
  1. All Categories
  2. Data Science

Data Science : Recent Questions and Answers (Page 16)

Find answers to your questions about Data Science or help others by answering their Data Science questions.

Linear Regression in Python using gradient descent

I am trying to implement a simple multivariate linear regression model without using any inbuilt machine libraries. So far, I have been able to get a root mean squared error...

Asked on 11/13/2021

1 answer

SHAP value analysis gives different feature importance on train and test set

Should SHAP value analysis be done on the train or test set? What does it mean if the feature importance based on mean |SHAP value| is different between the...

Asked on 11/13/2021

2 answer

What is the best way to visualize data that is over 100% capacity

I’m trying to visualize some data that is usually under 100% (for example, workload, or activity) but on some occasions it will rise over 100, meaning that the resource has...

Asked on 11/13/2021

1 answer

1D convolutional neural network validation improvement

I created 1D CNN in Keras, but I'm having issues with validation loss and accuracy.I have 24k records, 22 features. Is my model overfitting or what is going on...

Asked on 11/13/2021

0 answer

Large Graphs: NetworkX distributed alternative

I have built some implementations using NetworkX(graph Python module) native algorithms in which I output some attributes which I use them for classification purposes. I want to scale it to...

Asked on 11/10/2021 by 20roso

3 answer

How many epochs to run during hyperparameter search?

If I'm doing a hyperparameter search and comparing two different hyperparameters (but not number of epochs), is there some established rule of thumb for how many epochs to run? ...

Asked on 11/10/2021

1 answer

How to compare and find common values from different columns in same dataframe?

I would like to compare two columns and find common value sets in each column, then output the rows with the common values. Let's say I have a dataframe with:...

Asked on 11/10/2021

1 answer

GridSearchCV with Random Forest Classifier

I'm working with a supervised learning problem and trying to predict a binary label and using a Random Forest to do so. I'm trying to tune my hyper-parameters to give...

Asked on 11/10/2021

3 answer

Can I apply feature selection before splitting by requiring selection occurs > 90% of time

I want to move the feature selection step to before splitting to save time and allow bigger input dataset. If, in repeated subsamples, a feature is selected in over X...

Asked on 11/10/2021

1 answer

What is the best way for synthetic data generation while maintaining privacy?

For one of the projects where we are working as third party contractors, we need a way for the company to share some datasets which can be used for data...

Asked on 11/10/2021

3 answer

Ask a Question

Get help from others!

© 2024 TransWikia.com. All rights reserved. Sites we Love: PCI Database, UKBizDB, Menu Kuliner, Sharing RPP