TransWikia.com
  1. All Categories
  2. Data Science

Data Science : Recent Questions and Answers (Page 227)

Find answers to your questions about Data Science or help others by answering their Data Science questions.

Clustering time series based on monotonic similarity

Context I am involved in a task of clustering 1500 time series of 500 observations into a few number of clusters. The time series share all the same observed property...

Asked on 02/04/2021 by Delforge

1 answer

How do we decide between XGBoost, RandomForest and Decision tree?

What do we take into consideration while deciding which technique should be used when dealing with a particular dataset? I understand that there isn't any hard and fast rule to...

Asked on 02/04/2021

1 answer

Magnification factor in image classification

If a CNN is trained on images focusing on an object, will it also recognize when multiple such objects are present in the image? For example can a network trained...

Asked on 02/04/2021

1 answer

What is the reason behind Keras choice of default (recurrent) activation functions in LSTM networks

Activation function between LSTM layers In the above link, the answer to the question whether activation function are required for LSTM layers was answered as follows: as an LSTM...

Asked on 02/04/2021 by Lauramvp

0 answer

What does the classification report interpret? Class 1 indicates abnormal data

How to interpret the report and How is precision, recall values are calculated for individual class labels. What is the significance of macro avg ? Does this report signify a...

Asked on 02/04/2021 by prnai

2 answer

Impact of a new word on word embedding vectors

QuestionWhat is the impact of a new word on the word embedding vectors already trained before the word is invented? For instance, at November 2019, there existed multiple ...

Asked on 02/04/2021

0 answer

How do I generate a laplacian matrix for a graph dataset?

If I have a dataset in a csv that looks like the one shown below. How do I convert this into a laplacian matrix using Python? ...

Asked on 02/03/2021 by Naveed

2 answer

Stratified Sampling for XGboost

I have a multiclass-classification dataset with the target (dependent) variable highly imbalanced. While using the randomForest package in R, I usually use the parameters sampsize & strata to account for...

Asked on 02/03/2021 by honeybadger

1 answer

xgboost in R have different results compared to boosted decision tree in Azure ML

I have a small data set (4000 records with 10 features) and I used XGBOOST in R as well as Boosted Decision Tree model in Azure ML studio. Unfortunately the...

Asked on 02/03/2021

1 answer

k-means for customer review analysis

I have a dataset of amazon Alexa reviews and want to group negative and positive reviews in separate groups. Is k-means a good approach to it? The dataset is unlabeled...

Asked on 02/03/2021 by Pari Ganjoo

0 answer

Ask a Question

Get help from others!

© 2024 TransWikia.com. All rights reserved. Sites we Love: PCI Database, UKBizDB, Menu Kuliner, Sharing RPP