TransWikia.com
  1. All Categories
  2. Data Science

Data Science : Recent Questions and Answers (Page 239)

Find answers to your questions about Data Science or help others by answering their Data Science questions.

Combine K-nearest neighbor with categorical embedding

I've tried a few ways to do my multi-class classification. For categorical data, I used the embedding technique with Tensorflow, which moves the entity closer with its...

Asked on 01/22/2021 by Woden

1 answer

How can I count the number of occurrences of a category in dataset as part of an Sklearn Pipeline

Let us say we have a dataset with a feature such as Surname.arr['Surname'] = ['Smith', 'Jones', 'Johnson', 'Smith']And I want to encode this categorical info as a new feature...

Asked on 01/22/2021 by Jinglesting

1 answer

Predicting products to be sold in a store - problem formulation

I have a data from a store for the products that sold since more than 5 years. Each sell process has a customer id, date, and the quantity of the...

Asked on 01/22/2021

1 answer

ndiffs for Python

Is there a function equivalent of R's ndiffs function in Python?It is used to estimate a number of diffs for creating stationary time-series. R Documentation...

Asked on 01/22/2021 by S__A_a_r_

2 answer

Temporal data types: time-series vs temporal sequences

Regarding the types of temporal data, I saw on a temporal data mining book (Mitsa, Theophano. Temporal data mining. CRC Press, 2010), that there are 3 types of temporal data:...

Asked on 01/22/2021 by Maria Oliveira

0 answer

Python (Pytorch) loss function syntax

I have seen many examples of this syntax that is being used for the loss function specifically: loss = nn.BCEWithLogitsLoss()(pred, y) Can anyone explain me what does the (pred, y)...

Asked on 01/22/2021 by ProJaqf

1 answer

What is Image Annotation?

Why do we need to use Labelimg tool for object detection? After labeling the bunch of training images using labelimg tool which will give CSV file How...

Asked on 01/22/2021

2 answer

Should I oversample my validation data to get better F1 score and PRC?

I am currently working with a dataset that is imbalanced, about 30k rows * 14 features (just for you know), and 99.5% of the data is labeled 0. Since the...

Asked on 01/22/2021 by Frank Xu

1 answer

Keras loss function cost matrix / misclassification penalty

I'll explain my problem through a simplified example. Let's say I have a matrix $X$ of a single variable with the following valuesx = np.zeros(10).reshape(-1,1)x[0] = 1x[-1] =...

Asked on 01/21/2021 by stav

0 answer

How to handle a feature vector that could be variable length?

I would like to train a machine learning model with several features as input as X[] and with one output as Y. For example Every sample has a Data frame...

Asked on 01/21/2021 by Crazy9

1 answer

Ask a Question

Get help from others!

© 2024 TransWikia.com. All rights reserved. Sites we Love: PCI Database, UKBizDB, Menu Kuliner, Sharing RPP