Given the emphasis that Columbus coach Gregg Berhalter places on possession and passing, it probably isn’t a surprise that I’ve started to focus on those statistical categories. Throughout the 2016 season, as troublesome results started to accumulate, I’ve tried to understand the Columbus approach to possession and passing within the context of other teams in MLS.
Now including all 2016 MLS games. Sorted box-and-whisker plots of team possession %. pic.twitter.com/VFu6jU3WL7
— Matt Bernhardt (@bernhardtsoccer) August 11, 2016
Comparing Columbus home and Vancouver away possession %s so far in 2016 #CLBvVAN pic.twitter.com/WJHg22zczE
— Matt Bernhardt (@bernhardtsoccer) September 10, 2016
I’m hardly alone in this, of course – which is one of my motivations in writing this post.
Now that the 2016 MLS league season has concluded, I combed through the stats pages for each game and recorded a series of data points. The resulting dataset has been posted to GitHub.
Each observation is a single team’s performance in a single game – so with 20 teams each playing 34 games, there are 680 rows available. Fields include:
- Possession %
- Pass completion %
- Passes in specific areas of the field (attacking half, final third, and crossses)
- Pass completion % in those areas
- Shots on target
Hopefully the work to assemble this data proves useful to someone. I’ve been using this data for many of the plots that I’ve shared on Twitter this season, and now that Columbus is done for the year I’ve been exploring it in more detail.
I hope to be able to share what I’ve found over the coming weeks, but for now I mostly just want to see whether anyone is interested in the data itself.
Get the data in CSV format over on GitHub.
Here are some sample plots that I’ve been working with that are generated by this data. More information will be shared in future posts.