To help make the relationship between height and weight clear, I'm going to set the lower bound to 100. Next let's adjust the vertical axis scale. Just select the title, type an equal sign, and click the cell. Now, since we already have a decent title in cell B3, I'll use that in the chart. X values come from column C and the Y values come from column D.
Here you can see there is one data series. Let's check Select Data to see how the chart is set up. When I click the mouse, Excel builds the chart. The first preview shows what we want - this chart shows markers only, plotted with height on the horizontal axis and weight on the vertical axis. Here I'll select all data for height and weight, then click the scatter icon next to recommended charts. When creating scatter charts, it's generally best to select only the X and Y values, to avoid confusing Excel. Let's create a scatter plot to show how height and weight are related. On this worksheet, we have the height and weight for 10 high school football players. A scatter chart has a horizontal and vertical axis, and both axes are value axes designed to plot numeric data. We can’t help but focus on the outliers.In this video, we'll look at how to create a scatter plot, sometimes called an XY scatter chart, in Excel.Ī scatter plot or scatter chart is a chart used to show the relationship between two quantitative variables. If the outlier is un-important (easily explained away by some known quirk in the data) you should use an annotation (perhaps below the chart with an asterisk in the chart) to explain it. If this outlier is important, fantastic, that’s what you want. The white space and separation will naturally draw the eyes of your reader. Visualization can be powerful, and even if you are not lying with the data, understanding that spurious correlations exist should influence your data design choices.Īny outliers in a scatter plot will become visually prominent. One of the most commonly used examples is ice cream sales and murder rates. But just because we can see a relationship does not make it meaningful. With scatter plots we are showing the relationship between two variables. NY Times – Learning Network – What’s Going On in This Graph? | ApOther Considerations In this way, it’s still a scatter plot but it’s used to illustrate more of an individual-focused story. Which is why each point (well most points, including every point on the outer edges) is labeled with the players name. One other thing to note, is that instead of the focus being on the overall distribution (as it was for the previous two examples) the focus is now on individual baseball players. With these reference lines you get to take a bunch of random points and tell a cohesive story that is easy for the reader to grasp. The players in the lower right contribute less but are paid more. It includes two reference lines showing average (one for the x axis and the other for the y axis) which splits the chart into 4 quadrants.īasically the chart suggests that the players in the upper left quadrant are paid less than average but contribute more than average to their team’s success. The following chart shows a baseball stat (wins above average) against a players salary. NY Times – Learning Network – What’s Going On in This Graph? | Nov. Annotations and reference lines are incredibly useful in scatter plots. The creator of the chart also added a single reference line, communicating what they see in the chart to the user. It’s not a simple linear relationship, and the distribution of data points shows that.
#Scatter chart excel how to label the points drivers
The following scatter plot tells a story of how much cruising time is spent by rideshare drivers compared to trip requests. NY Times – Learning Network – What’s Going On in This Graph? | Women Marathoners’ Running Times The line chart would have told the same story, but seeing the underlying data lets us see that it’s not just a few outliers bringing up the average. They could have instead, taken the average of the top 50 times for each year and drawn a line chart. Ultimately, it tells the story of marathon runners getting faster. It shows the fastest women’s marathons each year. They are one of the best ways to pack a ton of data into a single chart. There are all sorts of things you can do with scatter plots.