Chocolate Ratings Analysis Using Tableau

Chocolate has weaved its way into our cultural fabric. This popular snack is enjoyed by young and old alike.

Having a sweet tooth myself, I decided to analyze the ratings of chocolates to understand what exactly makes them tick. Is it based on the manufacturer? Or is it the Cocoa Percentage? Or something else?

Lets delve deep into this problem using Tableau.

About The Dataset:

I have used the following dataset from Kaggle:

https://www.kaggle.com/datasets/andrewmvd/chocolate-ratings?select=chocolate_ratings.csv

The columns are :

  1. REF (or the ID of the chocolate)

  2. Company (Manufacturer of the chocolate)

  3. Company Location

  4. Review Date

  5. Country of Bean Origin

  6. Specific Bean Origin or Bar Name

  7. Cocoa Percent

  8. Ingredients

  9. Most Memorable Characteristics

  10. Rating

Data Preparation:

  1. I created a calculate field of "Cocoa Percentage" from "Cocoa Percent" Attribute by multiplying Cocoa Percent with 100 to bring the attribute in percentage form.

  2. I changed the geographic role of Company Location to "Country/Region" since it was initially given in text format.

Analyzing the Manufacturers in the Chocolate Industry:

I began the analysis by taking the count of records as columns and company i.e. manufacturer as rows

Then I applied a filter to get the Top 10 most popular manufacturers or those manufacturers who occur the most number of times in the table.

Analyzing the Prominent Cocoa Percentages:

To create this line chart, I took the Cocoa Percentage attribute as columns and count of records as rows.

Then I added a dropdown filter for the Company(Manufacturer) so that the graph is adjusted accordingly whenever I choose a particular company.

In order to make this line chart, I took the Rating attribute as columns and the count of records as rows.

Then I added a dropdown filter for the Company(Manufacturer) so that the graph is adjusted accordingly whenever I choose a particular company.

Average Rating of the Manufacturer:

In order to get this metric, I considered the average rating as "Text" and added a Company(Manufacturer) filter so that the value in the text box changes for each company chosen.

Total Number of Manufacturers:

To get this metric, I took the distinct count of manufacturers as "Text".

Dashboard:

To combine all the information pertaining to the manufacturers in the chocolate industry, I created the following dashboard:

Inference from this Dashboard:

  1. Soma is the most popular chocolate manufacturer

  2. Majority of the chocolates have 70% of cocoa percentage

  3. A vast amount of chocolates have a rating of 3.5 followed by 3.0

  4. The average rating of manufacturers is 3.196

Analyzing the Countries Involved in the Chocolate Industry:

To get the company locations, I made a bubble chart by taking the Company Location as "Text" and count of records in "Size".

In order to get the top locations, I created a filter on the Company Location for the top 5 locations.

Analyzing the Country of Bean Origin:

To make this map, I took the Latitude and Longitude as rows and columns respectively. Then I dragged the country of Bean Origin into detail and the count of records into color.

In order to make this line chart, I took the Rating attribute as columns and the count of records as rows.

Then I added a dropdown filter for the Country of Bean Origin so that the graph is adjusted accordingly whenever I choose a particular country.

Assessing the Cocoa Percentages based on the Country of Bean Origin:

To create this line chart, I took the Cocoa Percentage attribute as columns and count of records as rows.

Then I added a dropdown filter for the Country of Bean Origin so that the graph is adjusted accordingly whenever I choose a particular country.

Dashboard:

To combine all the graphs pertaining to the countries involved in the chocolate industry, I have created the following dashboard:

Inferences from the Dashboard:

  1. USA is the most popular company location.

  2. Venezuela and Peru, followed by Ecuador are the most prominent countries in bean origin.

Analyzing the Contents of Chocolates:

To make this bar chart, I took the Ingredients attribute as columns and count of records as rows.

Then I created a filter for the Top 5 ingredients based on the count of records.

Analyzing the Average Rating of Ingredients:

I took the average rating as columns and the ingredients as rows. Then I created a filter for the top 10 ingredients having the highest averages to declutter the graph.

Top 10 Most Memorable Characteristics of Chocolates:

In order to make this graph, I took the count of records as rows and the "Most Memorable Characteristics" attribute as columns.

Then I applied a filter on the characteristics attribute to obtain the top 10 characteristics.

Total Number of Ingredients Considered:

To obtain this metric, I took the distinct count of ingredients as "Text".

Total Number of Chocolate Characteristics:

To obtain this metric, I took the distinct count of "Most Memorable Characteristics" attribute as "Text".

Dashboard:

To combine all the information pertaining to the contents of chocolates in a single place, I have made the following dashboard:

Inferences from the Dashboard:

  1. 3-B,S,C is the most popular ingredient in chocolates.

  2. 3-B,S,C has the highest average rating among all other ingredients.

  3. "spicy, cocoa" and "creamy, nutty, cocoa" are the most memorable characteristics of chocolates.


Found this article interesting? Give it a like and share it with your friends !!

#data #datascience #dataanalysis #dataanalytics #datavisualization #visual #graph #chocolate #chocolaterating #ingredients #country #cocoa #graphs #dashboard#dashboards #tableau #barchart #bubblechart #metrics #linechart #filters