Data Science: Recent Questions and Answers (Page 111)

Is there a safe and simple way to estimate a standard deviation for a next subset?

Suppose I receive only the standard deviation of a value $v$ (which is normally distributed) from a sensor every fourth minute, but need to provide a standard deviation...

Asked on 06/08/2021 by zina

1 answer

What is the proper way to bin variables for calculating WoE during credit scoring?

I have read this article about developing a credit scorecard in python, where it is stated that when binning the continuous variables, it needs to be ensured...

Asked on 06/08/2021

1 answer

Human readable format for clusters of word vectors

Let's say I have a pretrained word2vec model and apply it to a dataset consisting of article titles from "The Guardian". It seems pretty obvious that titles coming from the "Science" section would...

Asked on 06/08/2021 by Arek Żyłkowski

1 answer

How to use "tree boosting" with a data-driven loss function

We have a problem which has a data-driven (non-analytical) loss function. Our target contains whole numbers between 0 and 20 (the target is inherently discrete), although larger values are possible,...

Asked on 06/08/2021

2 answers

Forecasting monthly visitor count from daily values

I have a dataset of the daily visitor count of a website. Given this information, I want to forecast what the monthly visitor count will be. Depending on the visitor...

Asked on 06/08/2021 by Clock Slave

0 answers

Misclassification Rate for Random Forest Plateauing too Early

Using R, I have created 5 different random forest models using 5 different numbers of trees (3,10,30,100,300). My intention was to compute the misclassification rates of each of these models...

Asked on 06/08/2021 by user58887

1 answer

Keras model with LSTM quantization aware training

I would like to run quantization-aware training with a Keras model that has an LSTM layer. However, the LSTM layer alone seems to be unsupported...

Asked on 06/07/2021 by Karl Haebler

0 answers

Preprocessing dataset to predict salary

I'm currently a student in a machine learning course studying for an upcoming exam. Here's a question I've been given for practice: You have a very large dataset of employees...

Asked on 06/07/2021

0 answers

Error term in probabilistic interpretation of least squares update rule

I have read in Stanford's CS229 course notes that to justify the least-squares update rule with probability, the following is assumed: $$y^{(i)} = \theta^T x^{(i)} + \epsilon^{(i)},$$ where $\epsilon^{(i)}$ represents...

Asked on 06/07/2021 by Matthew Yang

0 answers

Would there be any reason to pretrain BERT on specific texts?

So the official BERT English model is trained on Wikipedia and BooksCorpus. Now, for example, let's say I want to use BERT for movie tag recommendation....

Asked on 06/07/2021

2 answers
