TransWikia.com

Data Science : Recent Questions and Answers (Page 328)

Hive / Impala best practice code structuring

Coming from a DWH background, I am used to putting subqueries almost everywhere in my queries. On a Hadoop project (with Hive version 1.1.0 on Cloudera), I noticed we can forego...

Asked on 08/02/2020 by Gerardsson

1 answer

How to compute unseen bi-grams in a corpus (for Good-Turing Smoothing)

Consider a (somewhat nonsensical) sentence: "I see saw a see saw". The observed bi-grams would be: "I see", "see saw", "saw a", and "a see". My aim is to...

Asked on 08/01/2020 by rahs

1 answer
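
For the bi-gram question above, here is a minimal sketch of one way to count observed and unseen bi-grams. It assumes a closed vocabulary (every ordered pair of word types from the sentence is a possible bi-gram), which is a common but not the only convention for the zero-count class in Good-Turing smoothing. The helper name `bigram_counts` is hypothetical and not taken from the question.

```python
from itertools import product


def bigram_counts(tokens):
    """Count each adjacent pair of tokens (hypothetical helper)."""
    counts = {}
    for pair in zip(tokens, tokens[1:]):
        counts[pair] = counts.get(pair, 0) + 1
    return counts


tokens = "I see saw a see saw".split()
observed = bigram_counts(tokens)

# Assumption: closed vocabulary, so the possible bi-grams are all
# ordered pairs of distinct word types seen in the sentence.
vocab = sorted(set(tokens))
possible = set(product(vocab, repeat=2))

# Unseen bi-grams are the possible pairs that never occurred.
unseen = possible - set(observed)

print(len(observed), "observed bi-gram types")
print(len(unseen), "unseen bi-gram types")
```

Under this assumption the number of unseen bi-grams is simply the square of the vocabulary size minus the number of observed bi-gram types; a different vocabulary definition would change that count.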

When one model is superior in real world use?

I have an NLP neural network that I developed with Keras for multi-label classification. I have fit the model several times and saved the best results (via best...

Asked on 08/01/2020 by I_Play_With_Data

1 answer

A/B testing: How to calculate p-value on post test segments?

My question on A/B testing is about doing post test segmentation analysis. For example: I run an A/B test on my website to track bounce rate. On the...

Asked on 07/31/2020 by jxn

1 answer

Pruning input layers and output layers after training model with shared layers

I'm trying to train a search and item encoder, and this is the model I have:
input_search = Input(shape=(40,), dtype='int64', name='input_search')
input_title = Input(shape=(40,), dtype='int64', name='input_title')
input_desc = Input(shape=(40,), dtype='int64', name='input_desc')
input_brand =...

Asked on 07/31/2020 by Howard Wang

0 answers

How to cluster government census data in order to group Metropolitan statistical areas

I have collected a bunch of census data from 2012 - 2018. I wanted to apply some clustering algorithms in order to compare metropolitan statistical areas (MSAs). Ideally once I...

Asked on 07/31/2020 by Snorrlaxxx

1 answer

Inputting (a lot of) data into a dataframe one row at a time

I'm using Python. Some 2D NumPy arrays are stored in individual rows of a Series. They are 30x30 images. It looks something like this: pixels0...

Asked on 07/30/2020 by Isu Shrestha

2 answers

CUDA_ERROR_OUT_OF_MEMORY: out of memory. How to increase batch size?

I have one GPU: a GTX 1050 with ~4 GB of memory. I tried Mask R-CNN with 192x192 px images and batch=7, and got an error: CUDA_ERROR_OUT_OF_MEMORY: out of memory. I found:...

Asked on 07/30/2020

3 answers

Constructing an image from a dense layer output

I am trying to reconstruct an image from a dense layer which is a concatenation of outputs from 1) a convolutional network with image inputs; and 2) a dense layer with...

Asked on 07/29/2020

1 answer

How to find the driver features towards a particular result in Classification problems

In a classifier model, we can predict the outcome class, but here I need to find out the features that drive towards a particular result in a classification problem, that...

Asked on 07/29/2020 by Vinay

3 answers
