TransWikia.com

Data Science : Recent Questions and Answers (Page 57)

Find answers to your questions about Data Science or help others by answering their Data Science questions.

What would be a good randomization environment for data science?

I would like to know if there are any best practices for setting up a random environment. Currently I use this simple structure in my config: from numpy.random import Generator,...

Asked on 08/09/2021

0 answers
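
One common pattern for the randomization setup asked about above is to build a single seeded NumPy `Generator` and pass it around explicitly instead of relying on global state. This is only a minimal sketch: the helper names `make_rng` and `spawn_rngs` and the seed value are illustrative assumptions, not taken from the question.

```python
import numpy as np

def make_rng(seed=12345):
    """Return a reproducible NumPy Generator (hypothetical helper)."""
    return np.random.default_rng(seed)

def spawn_rngs(seed, n):
    """Derive n independent generators from one seed (hypothetical helper)."""
    children = np.random.SeedSequence(seed).spawn(n)
    return [np.random.default_rng(child) for child in children]

# Same seed gives the same stream, so experiments can be repeated.
rng = make_rng()
sample = rng.normal(size=3)

# One generator per worker or cross-validation fold.
fold_rngs = spawn_rngs(12345, 4)
```

Passing `rng` into functions as an argument keeps each source of randomness visible, and spawning child generators avoids reusing one stream across parallel workers.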

Tweet Classification into topics - What to do with data

Good evening, First of all, I want to apologize if the title is misleading. I have a dataset made of around 60000 tweets, their date and time as well as...

Asked on 08/09/2021 by Mauretto

1 answer

How to apply variational autoencoder for oversampling with cross-validation?

Currently, I have an imbalanced data set with proportions 84% and 16%. I want to use a VAE as an oversampling method and I want to determine the best proportions of data that...

Asked on 08/08/2021

0 answers

Hashing trick for dimensionality reduction

I am building a model that uses TF-IDF NLP features in Spark MLlib. The TF-IDF HashingTF function in MLlib uses the 'hashing trick' to efficiently allocate terms...

Asked on 08/08/2021

0 answers
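
The 'hashing trick' mentioned in the question above can be sketched in plain Python: each term is hashed to an index in a fixed-size vector, so no vocabulary has to be stored. This is an illustration only. The function names `hash_index` and `hashing_tf`, the vector size, and the use of MD5 are assumptions, and MD5 is a stand-in rather than the hash function Spark uses.

```python
import hashlib

def hash_index(term, num_features):
    """Map a term to a bucket in [0, num_features) with a stable hash."""
    digest = hashlib.md5(term.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % num_features

def hashing_tf(tokens, num_features=16):
    """Build a term-frequency vector without storing a vocabulary."""
    vec = [0] * num_features
    for tok in tokens:
        # Different terms may collide in one bucket; their counts simply add up.
        vec[hash_index(tok, num_features)] += 1
    return vec

doc = "the quick brown fox jumps over the lazy dog".split()
vec = hashing_tf(doc, num_features=16)
```

A smaller `num_features` lowers the dimensionality but raises the chance that unrelated terms share a bucket, which is the trade-off behind using hashing for dimensionality reduction.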

Impute missing value: transpose or not?

I'm building a model that fills the missing values from a Dataframe that contains the number of visitors for different stores, each day:

day         store_a  store_b  store_c
2021-01-01  100      200      300
2021-01-02  110      220      290
2021-01-03  50       110      170
2021-01-04  NAN      220      290
2021-01-05  716NAN
2021-01-06  90       NAN      NAN

I'm using the ...

Asked on 08/08/2021

1 answer

How to obtain and load a good initial data set for object localization?

I'm looking for a good data set for training a CNN based network to do object localization (i.e. a data set with class labels and bounding box data). What is...

Asked on 08/08/2021 by user3731622

1 answer

Classification algorithm that only matches trained examples

I have 10 categorical features and a multi-class target. Training data contains rows where the same 10 categorical features may map to a different target class. What classification algorithm should...

Asked on 08/08/2021 by dwkd

1 answer

Can we use fractional regression for a dependent variable that is made of a continuous numerator and denominator?

I have a dependent variable that is a ratio, i.e. it takes the values between 0 and 1. Some 30% of values are 1s. The dependent variable measures the distribution...

Asked on 08/08/2021 by Ken Lee

1 answer

High correlation between the independent and dependent variables but low performance of regression model

I have a dataset of 4900 rows and 2060 features. I calculated the correlation using the Kendall method between the dependent and independent features, and found out that 5 of these...

Asked on 08/08/2021

0 answers

How do the authors get this updating formula for all $\beta$ in $\beta$-divergence

I'm reading the paper Algorithms for nonnegative matrix factorization with the β-divergence by Cédric Févotte and Jérôme Idier. Package scikit-learn uses their algorithm for module ...

Asked on 08/08/2021 by Akira

0 answers
