Tuesday, June 16, 2020

Answer for How to determine the number of intervals

https://365datascience.com/dwqa-answer/answer-for-how-to-determine-the-number-of-intervals/ -

Hi Gilles,

Thanks for reaching out.

In general we are looking for a representation that has between 5 and 20 intervals. 

To determine the skewness, I usually start with a histogram with many bins (e.g. 100-1000, depending on the size of the dataset). This gives a good feel about the frequency distribution of the dataset.

Afterwards, I’d normally get the bins down to 20 and see if the general trend remains.

Overall, I try to preserve the general trend while using as few bins as possible.

Best,

Iliya




#365datascience #DataScience #data #science #365datascience #BigData #tutorial #infographic #career #salary #education #howto #scientist #engineer #course #engineer #MachineLearning #machine #learning #certificate #udemy

No comments:

Post a Comment