TransWikia.com
  1. All Categories
  2. Data Science

Data Science : Recent Questions and Answers (Page 245)

Find answers to your questions about Data Science or help others by answering their Data Science questions.

Target encoding with KFold cross-validation - how to transform test set?

Let's say I have a categorical feature (cat):import randomimport pandas as pdfrom sklearn.model_selection import train_test_split, StratifiedKFoldrandom.seed(1234)y = random.choices([1, 0], weights=[0.2, 0.8], k=100)cat = random.choices(["A", "B", "C"], k=100)df = pd.DataFrame.from_dict({"y":...

Asked on 01/15/2021

1 answer

Sklearn: applying cost complexity pruning along with pipeline

I have a data set with categorical variables. I have defined a decision tree algorithm and transformed these columns to numerical equivalent using one hot encoding functionality in sklearn: Create...

Asked on 01/15/2021 by codeman340

2 answer

String Values in a data frame in Pandas

Suppose I have a data frame like this :Hospital_name State Employees ......Fortis ...

Asked on 01/15/2021

4 answer

What model is recommended: I am using text features in a regression and want to interpret coefficients

I am using the text of comments on a forum to predict how many upvotes it will get. I want to be able to say, "Reviews with X, Y, Z...

Asked on 01/14/2021 by thewhitetie

1 answer

Are there libraries similar to cleverhans for different types of classifiers?

I am seeking adversarial examples for classifiers (e.g. random forest, logistic regression, multilayer perceptron) that I've already trained (through scikit-learn). But I can't seem to find any existing libraries for...

Asked on 01/14/2021

1 answer

Look up a number inside a list within a pandas cell, and return corresponding string value from a second DF

(I've edited the first column name in the labels_df for clarity) I have two DataFrames, train_df and labels_df. train_df has integers that map to attribute names in the labels_df....

Asked on 01/14/2021 by Donum

1 answer

How to create model for selecting a set of categories with a set of attributes?

I have a couple of hundred categories where each of these categories has a specific set of attributes having different values (historical). The problem I need to solve is to...

Asked on 01/14/2021

1 answer

CycleGAN: Generator losses don't decrease, discriminators get perfect

So I´m training a CycleGAN for image-to-image transfer. The problem is: while the discriminator losses decrease, and are very small now, the generator losses don't decrease at all. The...

Asked on 01/14/2021 by Marie M.

1 answer

Is my data appropriate for Hierarchical Clustering?

I am a newbie in clustering and trying to check whether there are differences in Symptoms (example: cough, sneezing, shortness of breath, etc) reported across different comorbidity groups ( obesity,...

Asked on 01/14/2021

0 answer

What is the difference between active learning and reinforcement learning?

From Wikipedia:Active learning is a special case of machine learning in which a learning algorithm can interactively query a user (or some other information source) to label new data points...

Asked on 01/14/2021 by Moradnejad

2 answer

Ask a Question

Get help from others!

© 2024 TransWikia.com. All rights reserved. Sites we Love: PCI Database, UKBizDB, Menu Kuliner, Sharing RPP