Wednesday, April 8, 2020

Answer for Customer Analytics in Python

https://365datascience.com/dwqa-answer/answer-for-customer-analytics-in-python-2/ -

Hi Ramanjaneyulu, 

the df_segm_kmeans data frame is our regular data df_segmentation + the assigned cluster from each algorithm. So, in this data frame we have a line for each customer, 2000 in total, with a column for their Age, Occupation, etc. and the cluster they’ve been assigned to. So the dimension of this table are 2000 rows by 8 columns

df_segm_analysis, on the other hand, is a summary table for the four clusters and has 4 lines in total. Here, for each cluster we have information on the distribution of each feature: Age, Occupation, Income and so on. When we create it the table has the size 4 rows by 7 columns (though we add 2 additional columns later on).

Hope this clears up the confusion!

Best, 

Eli




#365datascience #DataScience #data #science #365datascience #BigData #tutorial #infographic #career #salary #education #howto #scientist #engineer #course #engineer #MachineLearning #machine #learning #certificate #udemy

No comments:

Post a Comment