Avocado Size Preference in US Cities: Principal Component Analysis (PCA) of Volumetric Sales of Avocados in 2020
The project focused on in-depth exploratory data analysis to uncover trends and preferences in avocado sizes across major US cities. By applying the PCA to volumetric sales data of avocados in three different sizes (small, medium, and large) in 2020 categorized by PLUs 4046, 4225, 4770, respectively, a distinct preference for either small or medium-sized avocados in most cities was revealed, with outliers showing higher sales in all three sizes. The project was implemented using Python, and the dataset was sourced from www.kaggle.com. I opted to analyze this dataset because of my personal fondness for avocados, especially as I include avocado toast in my daily breakfast routine. My preference for using the entire avocado for the toast and my reluctance to store cut avocados for the next day lead me to favor small-sized avocados. However, I occasionally encounter challenges in finding small-sized avocados sold under PLU 4046 at my usual grocery store. This curiosity prompted me to delve into the trends of preferences in avocado sizes in US cities. Read more