I. The Harmonies
Let me take one step closer toward fully embracing the Hall of Fame standard toward which I have been working in recent years, and let me begin with this argument, which is rather a trite argument, but necessary to us as a point of departure. Most of you recognize, I suspect, the inherent flaw in the concept of IQ, which is that it attempts to measure in one dimension something which in reality has many different dimensions. Suppose, for example, that one were to try to summarize all persons’ bodies on a scale in which an average body was 100, the best bodies were at 150 or above—five standard deviations above the norm—and the worst bodies were at 50. It is not absolutely impossible to do this, but at some point it presents us with unsolvable problems. What exactly is a "good" body? A person could be extremely attractive, but also terminally ill. Is that a good body, in that it is attractive, or a bad body, in that it is near death? One would have to measure strength, agility, quickness, balance; one would have to score 80-year-olds and 15-year-olds on a common scale; one would have to score men and women on a common scale. Even to measure "strength"—a small part of the whole—is impossibly complex, in that we have hundreds of different muscles. A person can be immensely strong in some muscles, weak in others. One can be strong but fat; one can be lean but weak. "Health" is more complex than "strength", and "health", again, is merely a small part of the whole.
And yet, this is not to say that we cannot do it or that we do not do it. If you say that your grandmother is in great shape for 85, everybody knows what that means. If you say that your blind date was a 7, everybody knows what that means. Last spring, in spring training, I asked our farm director, about one of our minor leaguers, "is he out of shape, or is he just a bad-body guy?" Everybody knows what that means, in general terms.
Most of you recognize, I suspect, that the concept of "IQ" has the same issues: What, exactly, is a "good" mind? The mind, like the body, has many, many dimensions. A person can be brilliant but crazy. A person can be both brilliant and sane, but cruel. A person who seems dull can nonetheless embody a positive philosophy that will contribute immensely to their mental health and their success in life, and that makes them a joy to be around. A person can be brilliant at math but unable to carry on a normal conversation. People who have fantastic interpersonal skills are, many times, irrational people who make bad life decisions as a matter of course. Some elements of intelligence can be refined through education; others cannot. Do we treat these the same?
The concept of "IQ" is, on a certain level, silly. We accept it to the extent that we do simply because we are unable to visualize its failings. If, in the third grade, we were all assigned a "BQ"—a "Body Quotient"—society would reject that, because we would immediately SEE the failings and limitations of the concept. We accept the one-dimensional measurement of IQ to the extent that we do only because that which it attempts to measure is elusive if not entirely invisible; therefore, we are unable to see immediately the failures and limitations of the measurement. This is not to say that there is no value in the effort. We understand in general terms what it means to be intelligent, what it means to be stupid.
Such it is with Win Shares, WAR, and all other Total Player Ratings. It is not that there is no value in the effort, but we should never forget that what we are trying to do here is, in the end, impossible. We are trying to state all contributions to a team in one dimension, but in reality they exist in many different dimensions. What we are trying to do is not merely impossibly difficult, but theoretically impossible, in the same way that it is theoretically impossible to state both height and weight in one scale. You certainly could combine height and weight into one number, but by doing so we would end up saying that a person who is 6-foot-2 and 165 pounds is the same size as a person who is 5-foot-7 and 210 pounds. You can say he is the same size, but he is not the same size. We should never lose sight of the problems that this causes.
Because this is true, I think that it is best, in reviewing Hall of Fame candidates, not to try to go all the way to the finish line with one metric. Let us say, for example, that 350 Win Shares represents a Hall of Famer, or that 60 WAR represents a Hall of Famer. You can say that Vladimir Guerrero and Dazzy Vance are the same, in that they are both at 59.9 WAR, but the reality is that they are NOT the same. They are very different. In saying that they are the same, we are merely pretending that something is true that we know very well is not actually true.
The scale on which I evaluate players, then, is Win Shares and Loss Shares, and my criterion for the Hall of Fame is this:
1) There are two relevant standards, which are 300 Career Win Shares and 100 more Win Shares than Loss Shares,
2) If a player meets both of these standards, he is a Hall of Famer, unless he has some other disqualification,
3) If a player meets neither of these standards, he is not a Hall of Famer unless he has some other qualification, and
4) If a player meets one of these standards but not the other, then it is a matter best left to judgment as to whether he should or should not be selected.
These standards do not work for relievers, for reasons that I accept but do not quite understand, and they do not work, of course, for managers, general managers, scouts or pioneers. There may or may not be an issue with catchers. Other than that, they appear to work satisfactorily in every case—accepting certain issues, which we will discuss later. I have been using these standards for several years, and I have yet to find a player, other than relievers, for whom this process does not seem to deliver an answer I can live with. Such a case may exist; I just haven’t found it yet.
These standards work because of two coincidences, or maybe they are not coincidences; maybe we should describe them as harmonies. First, it happens that the size of a player’s career is essentially the same as the size of a team. Again, the problem of measuring things in one dimension which exist in fact in multiple dimensions, but. . .how many players make up a team? There are 25 players on a roster, but I am not asking how many players are on a roster; what I am asking is, how many players does it take to win a pennant? About 17, 18, something like that. You need 8 regular position players, 9 with a DH; you need 5 starting pitchers, and you need at least 3 relievers. That’s 16, 17; you need a bench player or two. Some of the roster is filler material. A team really consists of about 17 or 18 contributing players.
And how long does a player’s career last? 17, 18 years in most cases is a full career. Think of it in terms of outs. If a team plays 162 games and makes 27 outs, that is 4,384 outs, or 8,768 outs if you include pitching, defense, hitting and baserunning. A full team is about 8,768 outs.
And how many outs is a player responsible for in his career? It’s about the same number. Let’s take. . .Willie Stargell, or Willie McCovey. McCovey made 6,259 batting and baserunning outs in his career; Stargell accounted for 5.938. McCovey, however, also played 18,818 innings in the field, and Stargell also played 17,956 innings in the field. Let’s start with the number 18,000 innings, which is 54,000 outs. About 40,000 of those are probably best assigned to pitchers; that leaves 14,000. When those 14,000 outs are split among eight fielders, that’s 1,750 per fielder. McCovey and Stargell were not defensive stars or defensive specialists; more credit must be given to shortstops, catchers and center fielders for their defensive work than to first basemen and left fielders, so. .. .let’s say 1,200 each. Add those 1,200 outs to the batting outs, and. . .a player’s career, in terms of outs, is roughly the same size as a team/season.
The second coincidence or second harmony is that the standard deviation of success for players in a career is similar to that of players on a team. I never realized this until the last year or two. I had always assumed that the spread of talent among players on a team was larger than the spread of talent among seasons of a career, but actually, it isn’t, or at least I now believe that it isn’t. Willie Stargell had some seasons in which he was absolutely great (1966, 1971, 1973, 1978) and some seasons in which he was less than average. I don’t think that anyone actually knows exactly what the normal standard deviation of winning percentage is for players on a pennant-winning team or for seasons of a long career, but I think that when sabermetrics finally arrives at consensus numbers for those two, they will be about the same.
Because of these two harmonies, a player’s career may be represented as if it were a team/season—not in terms of all of the elements, having a shortstop, having a closer, etc., but in terms of size and quality. Well, think about it: What is 300 Win Shares, and 100 more Win Shares than Loss Shares? It is a championship team. 300 Win Shares is 100 wins. 100 more Win Shares than Loss Shares is 33 games above .500. If you win about 100 games and/or finish 33 games over .500, you win the pennant, or you win your division. You’re a championship team.
What we are really saying, then, is that a player is a Hall of Famer if, when his career is looked at as a team/season, that is a championship team. A player who has a good long career is assigned responsibility for about one season’s worth of wins and losses. If that’s a championship season, he’s a Hall of Famer.
There is a third harmony here, which is that 300 Wins is a recognized standard of a Hall of Fame pitcher. A pitcher’s Win Shares and Loss Shares are, in many cases, about the same as his wins and losses. 300 Win Shares by a position player. . .300 wins by a pitcher. It’s about the same thing. That’s another way of explaining why it works.
But I could give you a thousand reasons why such an approach should work, and, if it doesn’t work, it doesn’t work. This not only should work, but actually does. I have yet to find any case in which it doesn’t—other than closers.
This article is the first of a four-part series about the Expansion era ballot; the other three will be published over the next three days. As a writer, I very much dislike publishing a "series" of articles like this, because I dislike reading articles that way. When somebody publishes a series of articles on a topic, I normally read the first one and maybe the second one and then lose interest. However, the four articles total about 12,000 words, and in the modern world it is just not practical to publish a 12,000-word article on the web.
What I am saying is, I understand if you drift off in the middle of this series, but the most significant of the series is Part IV, which I think will be published on Friday. That part of the article deals with a couple of sabermetric issues, and suggests new approaches. If you don’t get time to read the whole article, I understand, but if you could check back in on Friday, I’d appreciate that. Thanks. Bill