Do any values appear to stick out? Often such values tell us something interesting or exciting about the data. You should always point out any stragglers or outliers that stand off away from the body of the distribution. For example, if you’re studying the personal wealth of Americans and Bill Gates is in your sample, he would certainly be an outlier. Because his wealth would be so obviously atypical, you’d want to point it out as a special feature.

Outliers can affect almost every method we discuss in this book, so we’ll always be on the lookout for them. An outlier can be the most informative part of your data, or it might just be an error. Either way, you shouldn’t throw it away without comment. Treat it specially and discuss it when you report your conclusions about your data. (Or find the error and fix it if you can.)

How you characterize a distribution is often a judgment call. Do the two humps in the histogram really reveal two subgroups, or will the shape look different if you change the bin width slightly? Are those observations at the high end of the histogram truly unusual, or are they just the largest ones at the end of a long tail? These are matters of judgment on which different people can legitimately disagree. There’s no automatic calculation or rule of thumb that can make the decision for you. Understanding your data and how they arose can help. What should guide your decisions is an honest desire to understand what is happening in the data. That’s what you’ll need to make sound business decisions.

Viewing a histogram at several different bin widths can help you to see how persistent some of the features are. Some technologies offer ways to change the bin width interactively to get multiple views of the histogram. If the number of observations in each bin is so small that moving a couple of values to the next bin changes your assessment of how many modes there are, be careful. Be sure to think about the data, where they came from, and what kinds of questions you hope to answer from them.
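
As a rough illustration of viewing the same data at several bin widths, here is a minimal Python sketch. The data values and the helper names bin_counts and text_histogram are made up for this example and do not come from any particular statistics package; most software draws these histograms for you.

```python
import math

def bin_counts(values, width):
    """Count observations in equal-width bins (hypothetical helper)."""
    # Align the first bin edge to a multiple of the bin width.
    start = math.floor(min(values) / width) * width
    n_bins = int((max(values) - start) // width) + 1
    counts = [0] * n_bins
    for v in values:
        # Each bin includes its lower edge and excludes its upper edge.
        counts[int((v - start) // width)] += 1
    return start, counts

def text_histogram(values, width):
    """Return a simple text histogram, one row of asterisks per bin."""
    start, counts = bin_counts(values, width)
    rows = []
    for i, c in enumerate(counts):
        lo = start + i * width
        rows.append(f"{lo:6.1f} to {lo + width:6.1f} | {'*' * c}")
    return "\n".join(rows)

# Made-up values, for illustration only.
data = [12, 14, 15, 15, 16, 17, 18, 18, 19, 21,
        22, 24, 25, 25, 26, 27, 29, 31, 34, 52]

for width in (2, 5, 10):
    print(f"bin width = {width}")
    print(text_histogram(data, width))
    print()
```

With these made-up values, a bin width of 5 suggests two humps, but they merge into one at a bin width of 10, and the difference between them rests on only a few observations. The value 52 stays apart from the rest at every width shown. Whether either feature is real remains a judgment call that depends on where the data came from.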

