About
Dataset
We chose to work with the ATP Tennis with Betting Odds found on Kaggle.
The dataset consists of 36'120 matches summaries. Each line contains:
- Information about the tournament (name, location, etc...)
- Information about the match (surface, date, rounds, comment, etc...)
- Information about the players (winner, loser, nationalities, world ranking, ATP points, etc...)
- Physical information about the players (playing hand, weights, height, etc...)
- Betting odds information (odds of match loser and winner from different online platforms)
Problematic
Since its creation in the nineteenth century, tennis has kept increasing in popularity to become one of the most played sports, and one of the most mediatized. You cannot ignore the names of the best figures such as Rafael Nadal or Novak Djokovic that compete in the unmissable Grand Slam tournaments: from Roland Garros to Wellington. Nonetheless, the usual picture of tennis shared by the media tends to focus on a tiny subset of both the events and the players. With this visualization, we want to give a holistic image of the community of professional tennis players, its global structure.
First, we want to visualize the pattern of matchmaking through tournaments and through the seasons. All the tournaments can be seen as a graph whose nodes are the players and the edges are the tennis matches. We want to visualize the structure of this graph.
In addition to the competitions, we would focus on individual players: besides the well-known superstars, what is the usual career of a professional tennis player, what are his characteristics? We want to share the stories of the ones whose names will never appear in any international headlines, while they make up the biggest chunk of the community. Those questions, such as the role of right or left handedness, are the subject of common discussions. Through the use of data visualization, we aim at shedding new light on these topics.
This visualization is intended for the general public, with no particular interest in tennis. Our goal is to show the fascinating patterns that emerge in the data visualization from this field and to share the statistical stories that never appear in newspapers.