The Soccer by the Numbers blog continues to provide me great inspiration. After Manchester City destroyed Manchester United 6-1 on October 23rd (at ManU), they blogged about the results and provided a bit of statistical analysis. City has long been the “noisy neighbors” to United, but they now are doing their best to buy a title, rather than grow their own talent. Soccer by the Numbers posed this fundamental question:
We now know that the outcome of the match was truly unusual. But how unusual?
They created the table below, which is a frequency distribution of scored lines since 1907 of matches at Manchester United. To use the chart you simply identify the score line by going across City’s scores then down ManU’s scores. So the 6-1 scoreline has occurred 2.99% of the time since 1907. Clearly this was an unusual result.
But I think this table could be improved. I made these changes:
- Changed the numbers to percentages and rounded to one decimal. Two decimals is unnecessary precision.
- Removed most of the gridlines so that the lines separate the data from the categories
- Formatted the results as a heat map. I chose a red-white two-color scheme since Red is ManU’s color. This makes the largest percentage of result very obvious. For example, you can now easily see, without having to scan across all of the data points, that 1-1 is the most common score line…boring result!
- Formatted the totals as a second heat map. I chose a brown-white scheme for these. The totals show you the % of the total goals scored for each time. ManU has scored one or two goals 64.2% of the time while City has scored one or two goals 58.2% of the time.
Which format do you like best? Does one make the story easier to interpret than the other?