TransWikia.com

Help deciding the best statistic tool for my data

Data Science Asked by Torsten W. on July 7, 2021

I’m not from the data science area, and I need to perform some statistical analysis on my data, but I don’t know which analytical tools I should use.

Number of variables – 6
Independent Variables 3 – (one is scalar (e.g., 2,4,6,8,10) and the other two are nominal (e.g., A,B,C and Lote 1, Lote 2, Lote 3, Lote 4, Lote 5)
Dependent Variables – 3 (all scalar variables and dependent on all three independent variables, they are basically performance metrics)

One question I would like to answer is how the dependent variables behave when the scalar independent variable increases, taking into account the nominal variables. The picture below shows a sample of my data.

Sample Data

2 Answers

You could start with plotting the relation between any relevant pair of variables, typically with a simple scatter plot and possibly using color to represent a third variable.

Pearson correlation coefficient is a simple but useful measure of association between two variables. By calculating it between two variables on the whole data and then on subsets based on specific conditions (e.g. third column is A) you can observe the effect of the condition, i.e. see if it increases or decreases the correlation level.

Answered by Erwan on July 7, 2021

You can start with RapidMiner tool, which provides rich graphical interface to load and analyze structured data and unstructured data like text, images, and media. It can also transform unstructured data into structured. Here is link for the tool !

Answered by Soumendra Mishra on July 7, 2021

Add your own answers!

Ask a Question

Get help from others!

© 2024 TransWikia.com. All rights reserved. Sites we Love: PCI Database, UKBizDB, Menu Kuliner, Sharing RPP