Cross Validated Asked on December 13, 2021
I am analyzing my results for a high-school biology paper. My data set consists of 20 groups in total, each having 4 repeats, so each is a small sample. I have a few (4) outliers that have a value lower than the control group (which is practically impossible and thus certainly the effect of a technical error).
Can I exclude the outliers when calculating the mean value or standard deviation, as they largely affect both parameters?
I am quite green when it comes to statistics, so I would hugely appreciate your help. Many thanks!
Sometimes the outliers end up being of more interest than the rest of the data. The discovery of penicillin was from studying the outlier.
Can you verify that the outliers are due to technical problems? If you can show that they are impossible values then you have justification for not including them, or you may find something even more interesting when trying to figure out the unusual values.
The general recommendation these days is to not discard outliers without good, external reasons. If you can throw out any values that you do not like, then you can make the remaining data say anything that you want, which is not good science.
If you still do not like the outliers, but cannot show the errors that caused them then you could analyze the data both with and without outliers and show how similar/different the results are. There are also "robust" methods that are less affected by outliers that you could consider using (though you may need to consult with a statistician for those).
Answered by Greg Snow on December 13, 2021
Get help from others!
Recent Questions
Recent Answers
© 2024 TransWikia.com. All rights reserved. Sites we Love: PCI Database, UKBizDB, Menu Kuliner, Sharing RPP