https://365datascience.com/dwqa-answer/answer-for-python-pandas/ -
Hi Satya!
Thanks for reaching out!
In Python, particularly in pandas, you do have a .groupby() method that does relate to the GROUP BY function used in MySQL. Here’s a link to the Documentation about this method: https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.DataFrame.groupby.html
Regarding using dummies – that’s something different and it rather refers to the econometric technique of converting some of your regressors/independent variables (which, in our case, refer to columns of the DataFrame we use) into dummies – i.e. variables that can take one of only two values – of 0 or 1, to indicate the absence or presence of a certain phenomenon.
So, in this scenario, we use the same term “grouping”, for the lack of a better term, perhaps. What we actually do is group, or stack, or combine, certain reasons for absence into specific groups. But it’s not related to filtering your data in anyway – we are still on the opposite site, so to speak; we are organising/preprocessing the data to bring it to a format that will be suitable for analysis.
Hope this helps but please feel free to get back to us should you need further assistance.
Best,
Martin
#365datascience #DataScience #data #science #365datascience #BigData #tutorial #infographic #career #salary #education #howto #scientist #engineer #course #engineer #MachineLearning #machine #learning #certificate #udemy
No comments:
Post a Comment