TransWikia.com
  1. All Categories
  2. Data Science

Data Science : Recent Questions and Answers (Page 203)

Find answers to your questions about Data Science or help others by answering their Data Science questions.

Isolation Forest Score Function Theory

I am currently reading this paper on isolation forests. In the section about the score function, they mention the following. For context, $h(x)$ is definded as...

Asked on 02/27/2021 by Samyak Shah

1 answer

How to properly compare these two confusion matrix?

I have used Vader, a sentiment analysis tool for social media, on a database of movie reviews. These two confusion matrices differ in the vader.py algorithm, as the first one...

Asked on 02/27/2021

1 answer

Should I go for a 'balanced' dataset or a 'representative' dataset?

My 'machine learning' task is of separating benign Internet traffic from malicious traffic. In the real world scenario, most (say 90% or more) of Internet traffic is benign. Thus I...

Asked on 02/27/2021 by pnp

6 answer

Yolo issue with detecting positives

I've recently tried to implement a Yolo detector for traffic light detection based on yolo v1 implementation in Tensorflow/Keras. My model really struggles with detecting small objects. Loss function...

Asked on 02/27/2021 by monolith

0 answer

Understanding how convolutional layers work

After working with a CNN using Keras and the Mnist dataset for the well-know hand written digit recognition problem, I came up with some questions about how the convolutional layer...

Asked on 02/27/2021 by Karampistis Dimitrios

4 answer

How to incorporate the uncertainty of the model coefficients in the prediction interval of a multiple linear regression

I'm dealing with the modeling of small experimental data sets. As most experimental work does not generate thousands of samples, but rather a handful, I need to be inventive in...

Asked on 02/27/2021

1 answer

Is it bad to have a lot of one class of Data [K-NN classifier]?

I am trying to train a sklearn K-NN classifier on a labeled text dataset (in Irish). There are 5 classes, 0-4, but there is a lot of variation between how...

Asked on 02/27/2021 by Justin Cunningham

2 answer

How to deal with highly skewed (on counts) dependent variables?

I am working on a binary classification problem and the dataset consists of several variables which are count variables. For example, how many times a customer defaulted on a broadband...

Asked on 02/27/2021

3 answer

Derivative of activation function used in gradient descent algorithms

Why is it necessary to calculate the derivative of activation functions while updating model( regression or NN) parameters? Why is the constant gradient of linear functions considered as a disadvantage?...

Asked on 02/27/2021 by rajarshi

1 answer

Is it acceptable to append information to word embeddings?

Let's say I have my 300 dimensional word embedding trained with Word2Vec and it contains 10,000 word vectors. I have additional data on the 10,000 words in the form of...

Asked on 02/27/2021 by forgetso

0 answer

Ask a Question

Get help from others!

© 2024 TransWikia.com. All rights reserved. Sites we Love: PCI Database, UKBizDB, Menu Kuliner, Sharing RPP