Username:	Password:

Remember me

Forgot your username/password?

Print Email

Home>Articles

A Hitter’s DNA

By Bill James

June 2, 2014

This study is not about similarity scores, although it does use a method which is somewhat like similarity scores, and it is not about fluke seasons, although it does point the finger toward fluke seasons. It is not about consistency, although the subject also overlaps with consistency. I have written about all of those things before. My focus at the moment is on the question of whether two seasons are or are not likely to be products of the same hitter.

Let us suppose that we take these two seasons:

G	AB	R	H	2B	3B	HR	RBI	BB	SO	SB	CS	Avg	OBA	SPct	OPS
127	447	51	116	20	5	2	34	35	51	2	0	.260	.314	.340	.654
120	461	50	127	14	2	2	34	37	44	2	2	.275	.327	.328	.655

One might guess that those two seasons were two seasons of the same hitter, and indeed, you would be correct if you were to make that guess; those are the 1974 and 1975 seasons of Marty Perez, who was at that time the Atlanta Braves’ second baseman.

On the other hand, one might guess that these two seasons are not two seasons of the same player:

G	AB	R	H	2B	3B	HR	RBI	BB	SO	SB	CS	Avg	OBA	SPct	OPS
150	533	132	188	22	5	52	130	112	99	10	1	.353	.464	.705	1.169
153	514	67	144	19	4	0	54	79	27	31	7	.280	.376	.333	.709

And, of course, they are not two seasons of the same man; those are the seasons of Mickey Mantle in 1956, and Ozzie Smith in 1986.

In those two cases it is easy, but what if it is not easy? Let us take these two seasons:

G	AB	R	H	2B	3B	HR	RBI	BB	SO	SB	CS	Avg	OBA	SPct	OPS
155	634	111	200	30	11	14	58	40	127	43	18	.315	.358	.464	.821
154	525	59	141	27	4	10	65	28	109	17	5	.269	.308	.392	.700

Or these two:

G	AB	R	H	2B	3B	HR	RBI	BB	SO	SB	CS	Avg	OBA	SPct	OPS
121	434	44	125	17	8	1	49	30	27	17	0	.288	.337	.371	.708
150	622	75	168	29	5	1	54	44	36	17	11	.270	.333	.338	.670

Or these two:

G	AB	R	H	2B	3B	HR	RBI	BB	SO	SB	CS	Avg	OBA	SPct	OPS
158	553	82	139	24	4	28	89	91	108	2	0	.251	.354	.461	.815
147	521	62	125	15	2	15	75	78	91	0	1	.240	.339	.363	.702

Are those two seasons of the same hitter, or are they not? And can one tell?

Well. . .no, one can’t reliably tell; let me get that issue off the table. It does happen that different players have seasons which are similar on every scale, and it does happen that one player will have two seasons which are radically different. You can’t always tell. The six seasons above are by Lou Brock (1964), Ian Desmond (2010), Steve Yerkes (1915), Fernando Vina (2002), Rico Petrocelli (1971) and Rico Petrocelli (1972), but. . .you can’t always tell. The zero caught stealing by Yerkes is not an actual zero; it is an absence of information.

This method has to do with, for example, scoring the accuracy of predictions—and in particular, the accuracy of predictions for rookies or for players from Japan or Cuba, for whom we have to translate the data. We have traditionally scored our predictions by the use of similarity scores. Similarity scores, however, rely upon and are content with accidental similarities. If we say that a player should drive in 65 runs and he drives in 72, we’re happy.

A lot of those similarities, though, are just luck. We have to guess the player’s playing time. If we guess the player’s playing time accurately, everything else will fall in place most of the time. That’s just luck. On the other hand, if we project a player to bat 570 times and he gets hurt, the projection looks bad, but that’s just luck, too. I’m looking for a method that takes some of the luck out of it.

It has been pointed out that if you take our projections for rookies and "score" the accuracy of, for example, batting average, it would be just as accurate—or more accurate--to predict that every rookie would hit .260. That’s true, but it’s not exactly accurate. The normal year-to-year variation in batting average for regular players (400 or more plate appearances) is almost as large as the standard deviation of batting average. The standard deviation of batting average with 400 or more plate appearances is 31.8 points, less than that if you stick to batting averages within an era. The normal year-to-year fluctuation in batting average with 400 or more plate appearances is 24.4 points.

Since the normal year-to-year fluctuation in batting average is nearly as large as the standard deviation of batting average, it is nearly impossible to predict batting averages more accurately than simply predicting that everybody will hit .260. Since many of the rookies that we print projections for will have only six at bats and will hit .000, that creates extreme projection-to-reality variation, which means that it may be entirely impossible to predict batting averages for rookies more accurately than simply predicting that everybody will hit .260.

We’re asking the wrong question, and I’ve been aware for years that we were asking the wrong question, but I’m finally trying to do something about it. The question we have been asking is, "How similar are these two records?" The question we need to ask is "How improbable is it that these two records—the projection and the actual record—would be produced by the same player?" That’s the question here: How improbable is it?

I’ve designed a method to tackle that question, and I’ll explain the method, but that will be the last thing I’ll do; before I explain the method I’ll give you all the other stuff. First, highly improbable seasons that were in fact produced by the same player in consecutive seasons. The all-time king is Fred Dunlap, 1884-1885:

YEAR	G	AB	R	H	2B	3B	HR	RBI	BB	SO	Avg	OBA	SPct	OPS
1884	101	449	160	185	39	8	13	0	29	0	.412	.448	.621	1.069
1885	106	423	70	114	11	5	2	25	41	24	.270	.334	.333	.667

Dunlap hit .412 and scored 160 runs in 101 games in 1884, and hit about like Marty Perez in 1885. Obviously, something fishy was going on there. Dunlap in 1884 was playing in what was not only a minor league, but a very bad minor league, which, by the whims of history, happened to be designated by bad historians as a "major" league, and which therefore shows up in our data. The very dramatic changes in his performance are exaggerated by changes in the data; we have him with zero RBI and zero strikeouts in 1884, because the bad minor league he was playing in didn’t keep records of those things.

Actually, most of the most improbable sequences in history were produced in the 19^th century, for reasons similar to Dunlap’s—the available data is always changing, and new leagues were always forming and folding. Second on the list is Tip O’Neill, 1886-1887, third is Orator Shaffer, 1884-1885, and fourth is Chicken Wolf, 1890-1891.

OK, we’re going to ignore the 19^th century from now on; it’s not really major league baseball. The most improbable combination of adjacent major league seasons since 1900 was by George Sisler, 1922 and 1924:

YEAR	G	AB	R	H	2B	3B	HR	RBI	BB	SO	SB	CS	Avg	OBA	SPct	OPS
1922	142	586	134	246	42	18	8	105	49	14	51	19	.420	.467	.594	1.061
1924	151	636	94	194	27	10	9	74	31	29	19	17	.305	.340	.421	.762

Sisler’s batting average dropped 115 points, his strikeout to walk ratio deteriorated, and he lost 60% of his stolen bases. We know why this happened: Sisler missed the 1923 season with an infection of the eyes, and when he returned he was not quite the same player as before.

Of course, Sisler’s 1922 and 1924 seasons—while we’re not going to ignore them, like the 19^th century—don’t actually qualify as consecutive seasons, either. The leaves as the most improbable consecutive seasons by any player since 1900 a pair of seasons that we have recently been discussing on this site: George Scott in 1967-68:

YEAR	G	AB	R	H	2B	3B	HR	RBI	BB	SO	SB	CS	Avg	OBA	SPct	OPS
1967	159	565	74	171	21	7	19	82	63	119	10	8	.303	.373	.465	.839
1968	124	350	23	60	14	0	3	25	26	88	3	5	.171	.236	.237	.473

It is not a normal thing that a player will hit .303 with 19 homers one season, and .171 with 3 homers in 124 games the next season. We’re looking at the outer boundaries of what is possible here. How dissimilar can two seasons be, and still be produced by the same player? That’s the limit—George Scott, 1967-68; that’s how dissimilar it can get. After George C. are Rogers Hornsby, 1925-26, George Burns, 1917-1918, Darin Erstad, 2000-2001 (actually 1999-2000-2001), Norm Cash, 1961-62, Max Carey, 1925-26, Jeff Heath, 1940-41, Dave Roberts, 1973-74 Harry the Hat Walker (1946-47), Tommy Harper, 1969-1970, Andres Galarraga, 1992-93, Adam Dunn, 2010-2011, Adrian Beltre, 2003-2004, and Kirby Puckett, 1985-86.

Rogers Hornsby

YEAR	G	AB	R	H	2B	3B	HR	RBI	BB	SO	SB	CS	Avg	OBA	SPct	OPS
1925	138	504	133	203	41	10	39	143	83	39	5	3	.403	.489	.756	1.245
1926	134	527	96	167	34	5	11	93	61	39	3	0	.317	.388	.463	.851

George Burns

YEAR	G	AB	R	H	2B	3B	HR	RBI	BB	SO	SB	CS	Avg	OBA	SPct	OPS
1917	119	407	42	92	14	10	1	40	15	33	3	0	.226	.264	.317	.581
1918	130	505	61	178	22	9	6	70	23	25	8	0	.352	.390	.467	.857

Darin Erstad

YEAR	G	AB	R	H	2B	3B	HR	RBI	BB	SO	SB	CS	Avg	OBA	SPct	OPS
1999	142	585	84	148	22	5	13	53	47	101	13	7	.253	.308	.374	.683
2000	157	676	121	240	39	6	25	100	64	82	28	8	.355	.409	.541	.951
2001	157	631	89	163	35	1	9	63	62	113	24	10	.258	.331	.360	.691

Norm Cash

YEAR	G	AB	R	H	2B	3B	HR	RBI	BB	SO	SB	CS	Avg	OBA	SPct	OPS
1961	159	535	119	193	22	8	41	132	124	85	11	5	.361	.487	.662	1.148
1962	148	507	94	123	16	2	39	89	104	82	6	3	.243	.382	.513	.894

Max Carey

YEAR	G	AB	R	H	2B	3B	HR	RBI	BB	SO	SB	CS	Avg	OBA	SPct	OPS
1925	133	542	109	186	39	13	5	44	66	19	46	11	.343	.418	.491	.909
1926	113	424	64	98	17	6	0	35	38	19	10	0	.231	.294	.300	.594

Jeff Heath

YEAR	G	AB	R	H	2B	3B	HR	RBI	BB	SO	SB	CS	Avg	OBA	SPct	OPS
1940	100	356	55	78	16	3	14	50	40	62	5	3	.219	.298	.399	.697
1941	151	585	89	199	32	20	24	123	50	69	18	12	.340	.396	.586	.982

Dave Roberts

YEAR	G	AB	R	H	2B	3B	HR	RBI	BB	SO	SB	CS	Avg	OBA	SPct	OPS
1973	127	479	56	137	20	3	21	64	17	83	11	2	.286	.310	.472	.782
1974	113	318	26	53	10	1	5	18	32	69	2	0	.167	.246	.252	.497

Harry Walker

YEAR	G	AB	R	H	2B	3B	HR	RBI	BB	SO	SB	CS	Avg	OBA	SPct	OPS
1946	112	346	53	82	14	6	3	27	30	29	12	0	.237	.300	.338	.638
1947	140	513	81	186	29	16	1	41	63	39	13	0	.363	.436	.487	.924

Tommy Harper

YEAR	G	AB	R	H	2B	3B	HR	RBI	BB	SO	SB	CS	Avg	OBA	SPct	OPS
1969	148	537	78	126	10	2	9	41	95	90	73	18	.235	.349	.311	.660
1970	154	604	104	179	35	4	31	82	77	107	38	16	.296	.377	.522	.899

Andres Galarraga

YEAR	G	AB	R	H	2B	3B	HR	RBI	BB	SO	SB	CS	Avg	OBA	SPct	OPS
1992	95	325	38	79	14	2	10	39	11	69	5	4	.243	.282	.391	.673
1993	120	470	71	174	35	4	22	98	24	73	2	4	.370	.403	.602	1.005

Adam Dunn

YEAR	G	AB	R	H	2B	3B	HR	RBI	BB	SO	SB	CS	Avg	OBA	SPct	OPS
2010	158	558	85	145	36	2	38	103	77	199	0	1	.260	.356	.536	.892
2011	122	415	36	66	16	0	11	42	75	177	0	1	.159	.292	.277	.569

Adrian Beltre

YEAR	G	AB	R	H	2B	3B	HR	RBI	BB	SO	SB	CS	Avg	OBA	SPct	OPS
2003	158	559	50	134	30	2	23	80	37	103	2	2	.240	.290	.424	.714
2004	156	598	104	200	32	0	48	121	53	87	7	2	.334	.388	.629	1.017

Kirby Puckett

YEAR	G	AB	R	H	2B	3B	HR	RBI	BB	SO	SB	CS	Avg	OBA	SPct	OPS
1985	161	691	80	199	29	13	4	74	41	87	21	12	.288	.330	.385	.715
1986	161	680	119	223	37	6	31	96	34	99	20	12	.328	.366	.537	.903

I don’t know if you guys remember this story. Bob Costas was broadcasting a Minnesota Twins game early in the season in 1986; I think it may have been April 26, 1986. Kirby Puckett, who had hit no home runs at all in a very fine rookie season in 1984 and had hit only 4 home runs in 1985, had already hit 5 home runs in the first 17 games of 1986, and they were talking about this power surge on the air. Costas’ wife was pregnant, due to deliver almost any day now, and Costas said, "If he hits another home run here I’ll name my son after him." And sure enough, about two seconds later—gone. He named his son Keith Michael Kirby Costas. Nice kid; he’s a grown man now. I’ve gone to ballgames with him a couple of times. (The story as it appears in Wikipedia is a little bit different, but in this particular case I trust my memory more than Wikipedia.)

Anyway, there is an "Improbability Score" which evaluates the improbability of any two seasons being produced by the same player. The Improbability Score of George Scott’s 1967 and 1968 seasons being produced by the same player is 386, the highest since 1900 for any player in consecutive seasons. The Improbability Score for Kirby Puckett, 15^th on the list of improbable but actual combinations, is 278. There are 64 cases in history of players having consecutive seasons with Improbability Scores of 200 or greater.

But if you randomly scramble seasons and compare one to another, then there are thousands of more improbable combinations than George Scott, 1967-68, and the average Improbability Score (of randomly matched seasons) is almost 200. This is the scale that I established:

Similarity_Table

You get different results if you compare consecutive seasons by the same player than if you compare non-consecutive seasons by the same player. In the case of the exceptionally consistent Henry Aaron, for example, the two most poorly-matched consecutive seasons of his career are the 1959 and 1960 seasons:

YEAR	G	AB	R	H	2B	3B	HR	RBI	BB	SO	SB	Avg	OBA	SPct	OPS
1959	154	629	116	223	46	7	39	123	51	54	8	.355	.401	.636	1.037
1960	153	590	102	172	20	11	40	126	60	63	16	.292	.352	.566	.919

Although his power numbers were the same, Aaron’s batting average fell by 63 points in 1960, which is an unusually large change. His doubles fell from 46 to 20, while his strikeouts and walks both increased. All of his "speed" numbers were way up; his stolen bases doubled, his triples increased, and (not shown) his Grounded Into Double Play count dropped from 19 to 8. One of the things we do to try to figure out if it’s the same player or not is to compare the Speed Scores. If a player is fast one year and slow the next, that’s an indication that it’s not the same player—but Aaron appears to be quite significantly faster in 1960 than in 1959.

That’s an "improbability score" of 102, which puts him in the indeterminate range. But if you compare Aaron’s prime seasons with his end-of-line, playing-out-the-string numbers in Milwaukee in the mid-1970s, then you would get scores much higher than 102:

YEAR	G	AB	R	H	2B	3B	HR	RBI	BB	SO	SB	CS	Avg	OBA	SPct	OPS
1963	161	631	121	201	29	4	44	130	78	94	31	5	.319	.391	.586	.977
1975	137	465	45	109	16	2	12	60	70	51	0	1	.234	.336	.355	.691

That’s an Improbability Score of 282—but, of course, Aaron was not really the same player in 1975 that he was in 1963. He was about the same player until 1971; after 1971 he was clearly losing the ability which defined the rest of his career. (Aaron’s 1976 season is even worse than 1975, but since he batted only 271 times in 1976, the Improbability Score is lower.)

When we compare players in consecutive seasons—what will be marked as "consecutive" in the chart below—the average Improbability Score is 28, with a standard deviation of 33.

When we compare players in non-consecutive seasons, like Aaron in 1963 and 1975—what will be marked "non-consecutive" in the chart below—the average Improbability Score is 44, with a standard deviation of 52.

When we compare randomly matched seasons—what will be marked "random" in the chart below—the average Improbability Score is 191, with a standard deviation of 182.

All of these numbers (and those below) were calculated using only seasons beginning in 1900, with 400 or more plate appearances. When we say "randomly matched seasons", we mean seasons of different players, although of course one time in a thousand the random numbers will match up two seasons of the same player, but 99.9% of the time those are seasons of different players:

		Consecutive	Non-Consecutive	Random
Zero	Strong Likeihood of the Two Seasons Being by the Same Player	10%	6%	One-Third of One Percent
1 - 60	Seasons Consistent with Being Produced by the Same Player	77%	69%	16%
61 - 120	Intermediate or Indeterminate Range. Seasons in this range are often produced by the same player, but the differences are larger than normal.	10%	17%	24%
121-180	Seasons More Consistent with Being Produced by Different Players than by the Same Player.	2%	5%	18%
181-240	Strong Likelihood of Seasons Being Produced by Different Players.	One-Half of One Percent	1%	12%
241-400	If seasons were produced by Same Player, it is a fluke of historic magnitude.	Two-Tenths of One Percent	1%	18%
401 and above	Seasons were not produced by the Same Player.	Has Never Happened	Two-Tenths of One Percent	11%

A couple of interesting examples of contrasting seasons. Take these two seasons. Do you think they’re by the same player, or a different player?

G	AB	R	H	2B	3B	HR	RBI	BB	SO	SB	CS	Avg	OBA	SPct	OPS
92	233	38	93	15	3	5	53	18	11	1	0	.399	.444	.554	.998
93	240	24	58	15	1	6	41	26	14	1	0	.242	.321	.387	.708

That’s actually Don Padgett, 1939, and Don Padgett, 1940. If you look at it, everything is pretty much the same in the two seasons—except that he had an in-play batting average of .406 one year and .229 the next. We give it an Improbability Score of 259, because, even though everything is functionally identical except the batting average, and even though the at bats are limited, it is just very, very strange that a player would hit .399 one year and .242 the next. Here’s another one like that:

YEAR	G	AB	R	H	2B	3B	HR	RBI	BB	SO	SB	CS	Avg	OBA	SPct	OPS
1995	25	73	11	30	8	0	2	6	7	11	0	0	.411	.462	.603	1.065
1996	49	82	11	11	1	0	0	3	7	15	1	0	.134	.200	.146	.346

That’s actually Todd Haney, 1995 and Todd Haney, 1996. That’s an Improbability Score of 155, which is pretty amazing given the limited number of at bats and the fact that the strikeouts and walks don’t really change. Can’t you just imagine Todd Haney trying to explain to his relatives at the Thanksgiving Dinner, "No, really. I hit the ball exactly the same this year as I did last year. I just couldn’t find a hole."

Getting back to the rookies. . ..if we create projections, for rookies, which reasonably could be records of the same hitter as the actual record, that’s really as much as we can do. If we project him for 550 at bats and he gets 20, the projection will be "wrong", but we don’t have any way of knowing whether he will bat 550 times, or 20, so we can’t worry too much about that. If it could be the same player, I’m happy.

How Improbability Scores are Figgered

Outline

We begin by figuring a "reliability score" for the comparison, based on the number of plate appearances in each season.

We compare the two seasons, A to B, on nine elements of dissimilarity, and "score" each discrepancy, then sum up the discrepancies. We compare the seasons A to B and B to A, since sometimes it will score differently one direction than the other, and the lower sum is the Improbability Score.

The nine elements of dissimilarity are:

A) Batting Average,

B) Extra Bases Hit Frequency,

C) Home Runs,

D) Strikeout and Walk Frequency,

E) Strikeout to Walk Ratio,

F) Stolen Base Attempt Frequency,

G) Hit By Pitch Frequency,

H) Sacrifice Hit Frequency, and

9) Speed Score.

Hit by Pitch Frequency and Sacrifice Hit Frequency are not "important" elements of a hitter’s record, in the sense that they are things we would normally pay a lot of attention to in assessing a hitter’s value, but they can be "signature" elements which help us establish his identity. A player does not ordinarily have zero sacrifice bunts one year and 15 the next, so if one season has 15 sac hits and the other has zero, that’s an indication that it may not be the same player.

Reliability Score

The Reliability Score is figured as

PA1 * PA2 * 4		MIN(PA1.PA2)
----------------------------	*	------------------------
(PA1 + PA2 + 1) ^2		500

Where PA1 is Plate Appearances in one of the two seasons, and PA2 is Plate Appearances in the other of the two seasons. This formula will normally result in a reliability score of less than one if the player has fewer than 500 plate appearances in one season or the other, but greater than one if the player has more than 500 plate appearances in each season. Since Pete Rose had 770 Plate Appearances in 1974 and 764 in 1975, we have an unusually reliable data set to compare, and the Reliability Score for comparing those two seasons would be 1.526.

Batting Average

The "Batting Average Discrepancy Score" is the most complicated of our nine measurements, except possibly home runs.

1) Subtract the lower batting average (of the two seasons) from the higher batting average, and subtract 35 points (.035).

2) Square that.

3) And multiply it by 25,000.

So that, for example, if the difference in batting average between the two seasons is 40 points, that’s a Discrepancy Score of 0.625; 50 points in batting average, 5.625 points of Discrepancy; 60 points batting average, 15.625 points Discrepancy; 80 points batting average, 50.625 points Discrepancy, and 100 points batting average, 105.625 points Discrepancy.

4) However, if the player has batted less than 25 times in either of the two seasons, the Batting Average Discrepancy Score is zero.

5) If the difference in batting average between the two seasons is less than 35 points, the Batting Average Discrepancy Score is zero.

6) If there are less than 300 at bats in either season, then the Batting Average Discrepancy Score is discounted as follows:

Count 1 point if one player had 100 at bats in a season,

Count 2 points if he had 300 at bats in a season,

Add the two together, add 2, and divide by six.

So that, for example, if one player had 80 at bats in a season but the other had 350, then we multiply the Batting Average Discrepancy Score by .667 (0 + 2 + 2) / 6 = .667. If one player had 150 at bats in a season and the other had 500, we multiply the Batting Average Discrepancy Score by .833 (1 + 2 + 2) / 6 = .833.

7) One more control on the Batting Average Discrepancy Points. Take the differences between the two batting averages, multiply by 1000 (so that a 120-point difference in batting average becomes 120), and add 50 (so that a 120-point difference in batting average becomes 170.) If the Batting Average Discrepancy Points as figured in parts 1-6 of this rule are greater than that number, then substitute this second number. (This almost never happens. It’s just a weird-situation control.)

8) Multiply this total (from parts 1-7) times 2.

Extra Base Hit Frequency

Designate one of the seasons as Season A and the other as Season B. Take the player’s Extra Base Hits in Season A, and project them into his Plate Appearances in Season B. Subtract his actual Extra Base Hits in Season B.

If the discrepancy is 13 extra base hits or less, no points. If the discrepancy is greater than 13 extra base hits, count six (6) Extra Base Hit Discrepancy points for each extra base hit.

Babe Ruth in 1921 had 693 Plate Appearances, and had 119 Extra Base Hits. Glenn Beckert in 1966 also had 693 Plate Appearances, but had only 31 Extra Base Hits. That’s a discrepancy of 88 Extra Base Hits, of which we will ignore the first 13. The other 75, we charge at 6 points apiece—an Extra Base Hit Discrepancy Score, comparing those two seasons, of 450 points.

On the other hand, Don Kessinger in 1975 also batted 693 times, and had 36 Extra Base Hits. So comparing Beckert (1966) and Kessinger (1975), the Extra Base Hit Discrepancy Score would be zero, since the difference is less than 13.

We compare A to B and B to A. Sometimes a player may have 4 Plate Appearances and 2 Extra Base Hits; when you compare him to a player with 700 plate Appearances, he should have 350 Extra Base Hits. But since, in the end, we only use the smaller of the two comparisons, this "projection error" has no effect on the Improbability Score at the end of the process.

Home Runs

Project the Home Runs from Season A into Season B. We will call these figures X1 (actual home runs in one season), and figure X2 (projected home runs in the other season.)

Subtract X1 from X2, and divide by 30. This we will call X3.

Add 7 to X1, add 7 to X2, and divide the larger number by the smaller. Subtract 1. This figure we will call X4.

Multiply X3 by X4. This figure we will call X5.

If X5 is less than .100, just ignore it; there is no penalty for a home run discrepancy if X5 is less than .100.

If X5 is greater than .100, then multiply it by 100. This figure we will call X6.

Take the player’s Plate Appearances in Season A, plus 1, and his Plate Appearances in Season B, plus 1. (The +1 is just to avoid the possibility of dividing by zero.) Divide the smaller figure by the larger. This figure we will call X7.

Multiply X6 by X7.

The result is the Home Run Discrepancy Penalty.

Let us suppose we are comparing two players. Player A hit 21 homers in 271 plate appearances, and player B hit 3 homers in 164 Plate Appearances.

Figure X1 is "3", the actual number of home runs hit by Player B.

Figure X2 is "13", the projected number of home runs to be hit by Player A in 164 Plate Appearances. Actually, it is 12.739; 13 rounded off.

Subtract X2 from X1; the difference is 9.739. Divide by 30; that’s .325. Figure X3 is .325.

Add 7 to each one, and divide the larger by the smaller; that is 19.739 divided by 10, which is 1.9739. Subtract 1; that’s .974. Figure X4 is .974.

We use these two figures, Figure X3 and Figure X4, because we’re looking at the Home Run difference both as a relative difference, and as an absolute difference. 13 home runs compared to 3 is an absolute difference of 10 home runs—which is fairly small—but a relative difference of 4 to 1, which is huge. And we add the "7" to both figures before we figure the relative differences because, if you don’t do something like that, then zero home runs compared to 1 home run is the same as zero compared to 65 homers.

Multiply Figure X3 by Figure X4; that’s .325 by .974, which is .316. That’s figure X5.

Multiply Figure X5 (because it is larger than .100) by 100; it becomes 31.6. That’s figure X6.

Take the player’s Plate Appearances in Season A (271), plus 1 (272), and his Plate Appearances in Season B (164), plus 1 (165). Divide the smaller figure by the larger (165 / 272 is .6066.) This figure we will call X7.

Multiply X6 times X7; 31.6 times .6066 is 19.2. . .so the Home Run Discrepancy Penalty between these two seasons is 19.2 points.

These two players are Art Shamsky, 1966, and Art Shamsky, 1967. It’s the same player; he just hit a bunch of home runs one season, and then didn’t hit them the next. Home runs are tremendously important in assessing a player’s value, but less important in looking at his identity, particularly for a part-time player. Some seasons players just don’t hit the home runs that they expect to hit, particularly if they don’t get a lot of at bats.

Strikeout and Walk Frequency

Add the player’s strikeouts and walks together. Project the strikeout/walk total in Season A into Season B, and subtract this from the actual strikeout/walk total in Season B.

If the difference is 15 or less, just ignore it; no penalty. If the difference is greater than 15, then one point for each additional strikeout or walk.

For example, Adam Dunn in 2012 had 105 walks, 222 strikeouts in 649 plate appearances. Irish Meusel in 1923 also had 649 plate appearances, but he had only 16 walks and 38 strikeouts, a total of 54 strikeouts and walks. Dunn’s total was 327. That’s a difference of 273. We ignore the first 15 (making 258), and count one Strikeout/Walk Discrepancy point for each of the others, which makes a 258-point discrepancy.

Strikeout to Walk Ratio

Figure the player’s walks as a percentage of his strikeouts and walks.

Take this percentage for one player, and project it onto the strikeout and walk total of the other player.

Subtract this from his actual walks.

If the difference is ten or less, no penalty. If the difference is greater than 10, 2 points for each walk (greater than 10).

Adam Dunn and Irish Meusel again. . .. their strikeout to walk ratios are actually about the same. Meusel had 16 walks out of 54 strikeout/walk events, or 29.6%; project that into Dunn’s strikeouts and walks, it’s 97 walks. That’s within 10 of his actual number, so. . .no penalty points.

Dunn had 105 walks among 327 strikeout/walk events, or 32.1%. Project that into Meusel’s strikeouts and walks, it’s 17 walks. Meusel actually had 16, so. . .no points.

If you compare Ozzie Smith, 1985, and Alfredo Griffin, 1985, they’re very similar hitters apart from their strikeout and walk data.

YEAR	G	AB	R	H	2B	3B	HR	RBI	BB	SO	SB	CS	Avg	OBA	SPct	OPS
Alfredo	162	614	75	166	18	7	2	64	20	50	24	9	.270	.290	.332	.622
Ozzie	158	537	70	148	22	3	6	54	65	27	31	8	.276	.355	.361	.716

There actually are no Improbability Points in this matchup, apart from the strikeouts and walks and a small difference in the sacrifice hit category.

But the strikeouts and walks suggest that it is not the same hitter. First, strikeout and walk frequency. Afredo Griffin had 70 strikeouts and walks in 646 Plate Appearances. Ozzie Smith had 615 Plate Appearances. Project Griffin’s total into Ozzie’s Plate Appearances; that’s an expectation of 67 strikeouts and walks. Ozzie actually had 92 strikeouts and walks, so that’s a discrepancy of 25 strikeout/walk events. We ignore the first 15 of those, and count one point per event above 15, and there’s a 10-point penalty for the strikeout and walk frequency.

But Griffin, given 92 strikeout and walk events, would probably have had 26 walks, 66 strikeouts. That’s a discrepancy of 39 walks. We ignore the first 10 of those and charge two points each above 10; that’s a 58-point penalty. And that throws the comparison of these two players—otherwise extremely similar—into the "inconclusive" range. The same player does not ordinarily have a 50-20 strikeout to walk ratio one season, and 27-65 another season, although such things have happened on occasion.

Stolen Base Attempt Frequency

Take the Stolen Base Attempts of Season A, and project them into Season B, based on the number of times the player was on base. Compare this to the actual Stolen Base Attempts in Season B.

The number of times the player is on base is defined for this purpose as hits, plus walks, plus hit by the pitch, minus extra base hits.

If the discrepancy is 8 or less, just ignore it; no penalty. If the discrepancy is greater than 8, 2 points for each additional stolen base attempt.

Rickey Henderson, 1986, and Rico Petrocelli, 1971, had quite similar seasons—apart from the fact that Henderson had 105 stolen base attempts, and Rico had only 2:

G	AB	R	H	2B	3B	HR	RBI	BB	SO	SB	CS	Avg	OBA	SPct	OPS
153	608	130	160	31	5	28	74	89	81	87	18	.263	.358	.469	.827
158	553	82	139	24	4	28	89	91	108	2	0	.251	.354	.461	.815

Rickey Henderson was on base 187 times, not counting the extra base hits; Rico was on base 176 times. If Rickey had been on base only 176 times, he would probably have attempted only 99 stolen bases. This is 97 more than Petrocelli attempted. We ignore the first 8 of these, and score them at 2 points apiece after 8. That’s a 178-point penalty. That marks them as almost certainly different players, since a player does not ordinarily attempt 99 stolen bases in one season, and 2 in another.

Hit By Pitch Frequency

Take the HBP total from Season A, and project it into the Plate Appearances from Season B. Subtract this from the actual Hit By Pitch Total of Season B.

If the difference is 5 or less, just ignore it; no penalty. If the difference is greater than 5, 2 points for each additional HBP.

Sacrifice Hit Frequency

Take the Sacrifice Hit (SH) total from Season A, and project it into the Plate Appearances from Season B. Subtract this from the actual Sacrifice Hit Total of Season B.

If the difference is 4 or less, just ignore it; no penalty. If the difference is greater than 4, 3 points for each additional HBP.

Speed Score

Make a "Simple Speed Score" for each hitter in this way:

3B times 10,

Plus SB times 5,

Minus CS times 3,

Minus GIDP times 4,

Plus .200 times plate appearances,

Times 20,

Divided by (Plate Appearances + 10).

If the result is greater than 10, level it off to 10; if it is less than zero, move it back to zero.

Dale Mitchell in 1949 stole only 10 bases in 13 attempts, 677 plate appearances; however, he hit 23 triples and grounded into only 8 double plays, which is half of a normal number for 677 plate appearances. Based on this, he has a speed score of 10:

[(23 * 10) + (10 * 5) – (3* 3) + (.200 * 677)] * 20

--------------------------------------------------------------

677 + 10

That works out to 8128 divided by 687, which is greater than 10, so his speed score for that season is 10. Rickey Henderson’s Simple Speed Score is 10 almost every year.

Subtract the Speed Score of Player A from the Speed Score of Player B, minus 1; make it an absolute number (6.29, rather than -6.29). Multiply that by 10, and multiply that by the reliability score of the two seasons. The result is the "Speed Discrepancy" between the two seasons.

COMMENTS (24 Comments, most recent shown first)

trn6229
Thanks Bill, nice article. Some players are more consistent than others. Fred Lynn was a real favorite of mine, he was great in 1975 as a rookie, good in 1976, hurt his ankle in 1977 and was ok, was good in 1978, great in 1979, good in 1980, hit .219 in a strike shortened 1981 season for the Angels. Angels Stadium was not the hitters park that Fenway was. Steve Garvey was very consistent from 1974 to around 1982. He hit with more power in 1977.

Tommie Agee had a crummy hitting year in 1968 as did George Scott. They both had better seasons in 1969. 1968 was the year of the pitcher, 1969 was more balanced, 1970 was a hitters years, 1971 was more balanced, 1972 was a pitchers year and 1973 was a better hitters year.

Rusty Staub was a good hitter. He had seasons were he hit .224 as a rookie hitting in a hitters grave yard, Colt Stadium. He had a fine year in 1969 for the expansion Expos. I learned from you that park effects have a large impact on stats. Sandy Koufax became Sandy Koufax when Dodger Stadium opened.

Take Care,
Tom Nahigian
5:29 PM Jun 13th

bjames
On the Costas story. . .I'm betting on my memory to be proved correct as far as it goes. I was watching on TV when Costas made the comment that I remember, and I talked to Bob about it just a few days after his son was born. I don't talk to Costas all THAT often, so it kind of sticks with me. I believe that the birth certificate was changed to more fully incorporate the "Kirby" reference, and I also think that as well as what was said on the air, something was said off the air between Puckett and Costas that I was never aware of at the time.
11:40 AM Jun 3rd

bjames
The top 15 seasons in terms of improbable "second acts"--that is, seasons inconsistent with the previous season--are, in order, 1970, 2000, 1989, 1927, 1977, 1943, 1969, 1942, 1926, 1930, 1988, 1976, 2010, 1971 and 1925. 1961 ranks 92nd on the list as a producer of unexpected seasons--that is, near the bottom--and 1962 ranks 66th. Players who had seasons in 1970 significantly inconsistent with their 1969 or 1971 seasons include Cito Gaston, Bernie Carbo, Bobby Murcer, Bob Bailey, and Carl Yastrzemski.
11:34 AM Jun 3rd

OldBackstop
Re: Norm Cash. The ’61 to ’62 HRs only went from 41-39, but slugging went from .662 to 513, and doubles from 22 to 16 and average, as Bill said, down .118. To me at least as interesting a discrepancy – based just on numbers, root causes such as expansion and age set aside -- was ’60 to ’61 for Cash, with jumps in both power and average. Cash’s HRs went from 18 to 41, slugging went .501 to .662, and BA went up 75 points.

@wovernstrap. I don’t want to clog up the article with this chatter, but, Jeez, the Kirby story is worthy of a whole trace book. There are reports from major media out there on the intraweb thingie saying it was based on HRs, a HRs comment in a game, BA at end of year, BA when son was born, the birth certificate was changed later etc. Maybe some of the confusion is that all the stories have a bit of truth. Costas was interviewed when Kirby died and said that early in the year he told Kirby if he was batting over .350 when his son was born, he would name him after him. Kirby was, May 9, and Costas did. Costas then says that when Kirby was told, he hit a HR off Mike Flanagan on the Orioles, and dedicated it to “Kirby Costas”. articles.chicagotribune.com/2006-03-09/sports/0603090160_1_bob-costas-keith-michael-kirby-costas-kirby-puckett The only HR Kirby hit off Flanagan that year was on July 5. So there are elements of both BA and an HR in the truth according to Costas. Of course, I can’t think of an area more prone to embellishment/revisionism than the birth of your firstborn. My son emerged from a unicorn streaking across the night sky like a comet.
11:26 AM Jun 3rd

bjames
Matty Alou (65/66) was close to being on the top ten list, although Alou's 66 season actually scores as very consistent with his first two seasons in San Francisco. . .I think '61 and '62.
11:15 AM Jun 3rd

hotstatrat
I would think batting averages are more subject to variations in luck (as allegedly reflected in BABiP), but home runs reflect variations in a player's physical strength - which is more subject to change than batting eye. This isn't to argue a point made by anyone, just an observation.
11:40 PM Jun 2nd

wovenstrap
Bill: I see what you mean. A matter of classification, Erstad for instance as a power surge when it's really a BA surge. Funnily enough the only two years I ever did any damage in fantasy baseball were 2000 (Erstad on my team) and 2004 (Beltre).

As it happens I also blew the Kirby story too (something to do with him hitting .350 actually) so that was a pretty worthless comment on my part.
9:45 PM Jun 2nd

AJD600
How dissimilar/unlikely was Matty Alou's season when Harry Walker got ahold of him from the season before?
9:31 PM Jun 2nd

thegue
I think it should also be noted that Norm Cash's 1961 season occurred in an expansion year - Bill, have you noted an exaggeration of dissimilar seasons around expansion years?
8:37 PM Jun 2nd

bjames
Responding to Wovenstrap. . .the comment about all of the dissimilar years being caused by power surges is just absolutely not true at all. Norm Cash. . .power is the same; his batting average just changed 118 points. George Sisler. . .batting average dropped 115 points, power didn't change. George Burns. . .minimal difference in power. Darin Erstad. .. change in power is much less notable than the change in batting average. Max Carey. . .little difference in power. Jeff Heath . . .no difference in power rate. Harry Walker. . no meaningful difference in power rate. Galarraga. . .no difference at all in home run rate. Adam Dunn. . .101 point drop in batting average. If I was revising the system, I would INCREASE the penalties for changes in home runs, not decrease them. I was surprised, for example, that Brady Anderson and Luis Gonzalez were not on the list, and I thought maybe I should bump up the "power" adjustments so that they would.
7:16 PM Jun 2nd

Hal10000
It seems like what you're moving toward here is a semi-Bayesian approach. Given a player has an innate performance level of X-Y-Z, what is the likelihood that he would produce a season with a performance of A-B-C.
7:15 PM Jun 2nd

abiggoof
Season to season, sure, these are impressive, but there's never been a career as dissonant as Mariano Duncan, has there? Every year, he was reinvented, from low average to high and back, high on doubles one year, triples the next, steals the following, and so on.

5:13 PM Jun 2nd

izzy24
I just have to know: Did anyone else originally read this article title as "Hitler's DNA?"
4:51 PM Jun 2nd

hotstatrat
Were there more dissimilar seasons by the same players in the periods from say 1995-2000 or 2001-2006 than from 2008-2013 or 1982-1987? (i.e. evidence or lack thereof regarding the impact of steroids. In my mind, which could well be off, the steroids era trickled in with the '88 A's, then really started taking off around 1994, becoming a joke around 2000. The rules against it started coming in about five years later, but really didn't become fully effective until 2008.)

I wonder how much more improbable consecutive seasons are during a managerial change than not.
4:29 PM Jun 2nd

OldBackstop
SI has it thusly: " When Bob and Randy had a baby of their own in May 1986, they named him Keith Michael Kirby Costas, the second middle name chosen to fulfill a promise he had made to name his child after Puckett if he was hitting .350 at the end of May."

sportsillustrated.cnn.com/vault/article/magazine/MAG1003993/1/index.htm

Remember, if you have two middle names, you have no middle name.
2:37 PM Jun 2nd

doncoffin
Tango--Those are, in fact, the guys. And, yeah, the career lengths are vastly different. What interests me, in this context, is the extent to which career differences can mask career similarities. I think both (career totals and something like a seasonal average) are useful ways of comparing performance.
2:34 PM Jun 2nd

wovenstrap
2 comments:

1. This may mess up all your fine math, but it's very noticeable that all of the big outliers in terms of dissimilarity derive from a player having a big power output that is not in line with other seasons. Perhaps HRs should be treated a little differently? In other words, in assessing the likelihood of two seasons being by the same person, not as much weight should be given to power stats as to, say, speed or K/BB ratio or something. I don't know if there's any basis for saying that power stats fluctuate more, but there might be.

2. On Kirby and Costas, my memory of it (I was 15 at the time, probably my maximum baseball intake) is that Costas named a figure, I think 30 HRs, and everyone had to wait until the end of the season to find out if Kirby would force Costas' hand. I'm not relying on Wikipedia (I don't know what it says there), but it's perfectly possible I'm relying on other sources from the 1990s or something that are equally suspect, who knows. But the 30-HR variant has the advantage of making it a sustained story over many months. He may have changed the name after the child was already born.
2:06 PM Jun 2nd

OldBackstop
PS: Norm Cash the year before the above.
1:49 PM Jun 2nd

OldBackstop
This is a very cool study, Bill!

I would be interested in seeing the scores for two seasons that come up in Reader Posts a lot.

Norm Cash (age 25 to 26) 1960-61
Keith Hernandez (age 24 to 25) 1978-79

I am not confident I could put the tables in here in one fell swoop, with the no-edit format. I have cherry-picked stats below which help my point, which is totally in-bounds on RP.

Norm Cash from 1960-1961 (1/3 more ABs in 1961)
HRs 18 to 41
BA .286 to .361
RBIs 63 to 132
BB 65 to 124
Speed stayed about the same. This season is discussed because of admission of bat corking and whether it helped performance.

Keith Hernandez 1978-79
BA .255 to .344
RBIs 64 to 105
.OPS 731 to .930

This is discussed because of coke admission and whether it helped performance. Speed stayed about the same.

Hernandez was a vet of five years, and this was a step up to a level he continued, which is interesting in the improbability discussions. Predict that one.

Also have always been interested in Ellsbury 2011.

This opened my eyes that if you, correctly, throw out ribbies, and look at similarities rather than differences, the fingerprints are still there. Great study.

1:46 PM Jun 2nd

tangotiger
Dave Kingman and Ron Kittle.

Given that Kingman had over double the PA, that's a huge reason why they don't sim at all.

It all depends how you construct your sim scores, be it rate based or counting based, or somehow merge the two in some manner.
1:32 PM Jun 2nd

bjames
Brady Anderson came up 16th, 17th on the improbability list, something like that. He was near the top 10 list; he just didn't quite make it.
1:02 PM Jun 2nd

doncoffin
I'm guessing we'd conclude these two seasons are from the same player:

...........Player A......Player B
PA.........620.............579
AB.........557.............520
R.............75..............68
H............131............125
2B............20..............19
3B..............2...............1
HR............37..............34
RBI..........101..............88
SB...............7...............3
CS...............4...............3
W...............51.............45
K..............152...........143
BA............236...........239
OBA..........302...........306
SA............478...........473

But they are not. These are the average 162-game stats for two different (but obviously extremely similar) players, who were both active fairly recently (1971-1986 for Player A and 1982-1991 for Player B).

This suggests another approach to "similarity scores"--using the approach Bill has outlined to look at this sort of difference. (Incidentally, neither player shows up as one of the other's top-10-career comps, using similarity scores.)

(Data: BBRef.)
12:42 PM Jun 2nd

bjjp2
Brady Anderson?
9:51 AM Jun 2nd

Steve9753
I'm suprised Mark McGwire isnt on the dissimilar list
8:48 AM Jun 2nd

A Hitter’s DNA

COMMENTS (24 Comments, most recent shown first)

Leave a comment

Report inappropriate comment


Type of Abuse:
Comments: