Username:	Password:

Remember me

Forgot your username/password?

Print Email

Home>Articles

Don Rush and Bob Cardwell

By Bill James

May 2, 2007

Bob Rush in 1956 and Don Cardwell in 1965 both finished the season with won-lost records of 13-10, a not particularly notable fact in that there are 67 pitchers in baseball history (through 2006) who finished their season 13-10. Both Rush and Cardwell, however, pitched 240 innings—240.0. Each pitcher allowed 101 runs, which in each case included 85 earned runs, giving each of them a 3.19 ERA. Rush struck out 104 and walked 59; Cardwell struck out 107 and walked 59.

Let us say for the sake of argument that these are the two most-similar seasons by pitchers in baseball history, which I don’t know that that’s true, but we’ll debate that later. When you look closely at Rush and Cardwell, you find any number of secondary or incidental similarities. Both Cardwell and Rush, for example, had relatively long careers for pitchers who had losing records. Rush pitched in the majors for 13 years, winning 127 games but losing 152. Cardwell pitched in the majors for 14 years, winning 102 games but losing 138. There are only 35 pitchers in major league history who won 100 or more games in their careers but had winning percentages under .460. Rush and Cardwell are two of them.

Both Rush and Cardwell spent significant portions of their careers with the Cubs, and both pitched for the Braves at the end of their careers (Rush winning two games for the Braves in 1960, his last season, and Cardwell winning two games for them in 1970, his last season). Rush pitched 417 games in his career and struck out 1,244 batters; Cardwell pitched 410 games and struck out 1,211 batters. Rush in his career finished 44 games that he did not start (GF) and saved 8 of them; Cardwell had 44 GF and 7 saves. Cardwell’s career high in strikeouts was 156; Rush’s was 157. Cardwell’s career high in innings pitched was 259; Rush’s was 255.

Both men were big men and right-handed pitchers. Cardwell is listed in the books a 6’4”, 210 pounds; Rush was 6’4” and 205, in a time when there were not nearly as many pitchers of that size. (Only 10% of major league pitchers in 1955 were 6’4” or taller, as opposed to 27% in 2005.) Both men were regarded as having excellent fastballs, and both men basically lived off the fastball.

Both men had blue eyes. Both listed “Golf” as their number one hobby. Rush, who was born in December, 1925, was 30 years old when he had his 13-10 season; Cardwell, born in December, 1935, was 29 years old when he had his. Two years earlier, both pitchers had gone 13-15. The next season, each pitcher won only 6 games.

Both men were very slightly above-average hitters for a pitcher, Rush with a career OPS of .426 against a norm for his era of .419, Cardwell at .387 against a norm of .374, although both men struck out 40% more often than an average pitcher and walked even less often than an average pitcher. Both men were used as pinch hitters a few times in their careers—Cardwell six times, Rush three times.

The similarities between two pitchers. . ..or two hitters, or two trees, or two automobiles, or two office managers, but we are talking for the moment about pitchers. . .the similarities between two pitchers can be put under several headings: essential similarities, defining similarities, accidental and random similarities, apparent similarities, etc. No one could doubt that Rush and Cardwell were essentially similar pitchers—but when you think about it, these are the not the defining similarities in the comparison above. I started with the fact that both men had seasons in which they were 13-10, pitched 240.0 innings and gave up 101 runs, 85 of them earned. This is not essential information about the men, as pitchers; it’s just kind of a random match. The defining similarity there is a fairly arbitrary happenstance. But when two players happen to share a range of random characteristics, it is likely that they will share essential characteristics as well. There are a wide variety of mechanisms that cause this to happen, so much so that it becomes difficult to say for sure that any shared characteristic is truly an accident.

If you look up the most-similar season to any season in history, most often you find another season that occurred within three or four years of the focus year, and almost always within 20 years. Remarkably often, the most-similar season to a line from 1973 will be another line from 1973 (or 1916, 1952. . .whatever.) The most-similar season for a right-handed pitcher will usually be another right-hander; the most-similar season for a lefty is usually another lefty. The most-similar pitcher to a tall pitcher is usually another tall pitcher; the most-similar season to a short pitcher is usually another shorty. The most-similar pitcher to a pitcher who walks 100 batters will usually be another pitcher who walks 100 batters, even if you eliminate strikeouts and walks from the similarity routine. The most-similar season to a season by a 35-year-old will usually be another season by a pitcher in his mid-thirties; the most-similar season to a pitcher by a 22-year-old is often another season by a 22- or 23-year-old. The most-similar season to a season by a rookie is often a season by another rookie. The most-similar season to a season by a Hall of Famer is often another season by a Hall of Famer—even if you focus on seasons which are not indicative of Hall of Fame performance.

The most-similar season to a season by a knuckleball pitcher is often another season by a knuckleball pitcher. If you look up the pitcher’s repertoire in the Neyer/James Guide, the most-similar season is often by a pitcher who has a very similar repertoire.

Astonishingly often, when you identify the most-similar pitcher to someone who went to jail shortly after his career, or died of prostate cancer at age 57, or went into politics after his career, you find another pitcher who went to jail shortly after his career, or died of prostate cancer at age 56, or went into politics after he got out of jail. One never knows what to make of these similarities. .whether they are truly accidents, or whether one has accidentally tapped into some profound underlying similarity the root of which is not within our view.

Back to the time when I first became a baseball fan in 1961, I have always been fascinated by players who had very similar statistics for a season or a career. In the early 1980s I introduced a method to “score” the similarity between any two seasons or two careers. The method isn’t important and I don’t really remember what it was; what was significant was not the method but the realization that, even though “similarity” is essentially a subjective concept, we could nonetheless define it and measure it, so long as we were willing to accept that our definition and our measurement were somewhat arbitrary.

This is now so much a standard practice for me. . .and I think for many others, but I won’t speak for others. ..this is so much a standard practice for me that I could hardly go through a day without it. I think that the only use of the concept of similarities in baseball analysis before the early 1980s was in Hall of Fame arguments, and there only in a very narrow and one-dimensional way. People would argue that George Kelly was in the Hall of Fame and Dixie Walker had very similar numbers to George Kelly, therefore Dixie Walker should be in the Hall of Fame, too. This argument was extremely common, although the obvious other side of the argument—that Dixie Walker had very similar numbers to Carl Furillo and Carl Furillo was not in the Hall of Fame, therefore Dixie Walker should not be in there either—never came up. Occasionally this sort of narrow and one-sided “similarities” argument would also surface in MVP discussions, but not much, since there is an obvious problem with it in that context.

Anyway, I use similarity scores, in my work with the Red Sox, on almost a daily basis. I would guess that I might use similarity scores to study an issue more often than I use runs created, secondary average or Win Shares. Let us say that the Red Sox are considering trading for. . .oh, I don’t know, Joe Rudi or Drungo Larue Hazewood. One of the first things I would do, to evaluate the target player, is to try to identify the ten or twenty-five other players who looked most similar to him at the same age.

Suppose that the Red Sox were considering offering a three-year, $28 million contract extension to, I don’t know, Sandy Consuegra or David Cone. One of the first things I would do, to evaluate that investment, would be to identify all of the pitchers who looked just like Sandy Consuegra or David Cone up until this point in his career.

Most of the time this analysis is not dispositive—but occasionally it is. Occasionally you look up the 25 most-comparable players to Joe Rudi at age 32 and realize that, of the entire group, not a single one ever came back after a bad year to re-establish his value. This doesn’t dispose of the issue from the standpoint of the Red Sox, because that’s not my decision—but it does close the book for me. At that point I know what my recommendation will be: stay away from Joe Rudi. Nothing else that I could find would convince to recommend otherwise.

Most often, of course, I get a mixed bag of comparable players, and then I move on to other types of analysis. Comparability or similarity has worked its way into analysis of almost any question related to baseball. Suppose you are wondering what the chances are that Jason Bay, now 28 years old, will one day be in the Hall of Fame. One way to study this is to identify 20 other players who had very similar numbers to Jason Bay at age 28. If 15 of those 20 other players are now in the Hall of Fame, it is likely that Bay will also reach that standard. If only 2 of the 20 went on to be Hall of Famers, it is unlikely.

Suppose that you are wondering whether Jason Schmidt is likely to get 200 career wins. Identify the 20 pitchers most similar to Jason Schmidt at the same age, and look at their career win totals. Suppose that you are wondering whether Team X, which has four regular infielders under age 27, is likely to emerge as a perennial contender. Identify 20 similar teams with young infields. Suppose that you are wondering whether intentional walks have a measurable effect in reducing a pitcher’s home runs allowed. Identify matched sets of pitchers with very different intentional walk totals, and look at their home runs allowed. It’s a ubiquitous method.

Well, no. It’s a ubiquitous approach. The statistical method used to implement the approach varies with the issue being studied. What makes it so ubiquitous is exactly that: that there is no standard and inflexible method for evaluating similarity—therefore, we can adapt it to the needs of the study. If we need to find players coming off of injury seasons, we can change the system so that it emphasizes coming off a bad season. If we need to compare pitchers who had winning records but bad strikeout/walk ratios who are then switched to the bullpen, we can change the system so that it emphasizes won-lost records and strikeout/walk ratios. It’s arbitrary; therefore, we can change it to suit our needs. Being arbitrary doesn’t make the system LESS useful; it makes it MORE useful. Who would have thunk it?

I am now balanced precariously between two articles here. When I sat down to write this article it was just a fun little riff on the most-similar pitcher’s seasons that I could find. Then I started thinking about the history and applications of similarity scores, and I wandered into a quasi-philosophical reflection on the nature and usefulness of similarity. I don’t know which article to finish. I guess I’ll tackle the rumination first.

Similarities between two pitchers (or office managers) can be put under several headings: essential similarities, defining similarities, accidental and random similarities, apparent similarities, etc.

Accidental or random similarities are similarities which could just as easily not exist—for example, the fact that two pitchers are both 13-10. If you take the same pitchers the next season, one of them could just as easily go 17-8 and the other one 8-13, totally dissimilar records.

Defining similarities are those similarities which form the basis of the comparison. Defining similarities can be accidental, or they can be essential similarities. The defining characteristics are those that we change, within the similarity formulas, to focus the study on whatever it is that we are trying to learn more about.

Essential similarities are those similarities which focus on the central characteristics of the player. For example, the essential characteristics of Stan Musial are that he was thin, durable, swift, hit for a very high average with excellent power, and batted left-handed. If you match him with a player who happens to have very similar batting stats in some season but who was a right-handed hitting steroid monster from the 1990s who only had two good seasons, that’s a arbitrary or non-essential similarity. A group of pitchers who share several essential characteristics can be referred to as a “family” of pitchers.

Profound similarities could be defined as multiple essential similarities including similarities on a non-obvious level—for example, Roger Maris’ career is profoundly similar to Hack Wilson’s, although Maris and Wilson never had seasons with similar batting stats. Maris and Wilson had careers of extremely similar shape, duration and value. Both men held very important single-season slugging records, and both men died young. They are profoundly similar although their basic hitting numbers do not really match.

Apparent similarities are random or accidental similarities between players who are profoundly dissimilar. For example, Jeff Pfeffer in 1923 and Mark Williamson in 1987 both finished the season 8-9, and both had ERAs of 4.03. However, there are profound differences between them. Pfeffer was 35; Williams was 27. Pfeffer was a starting pitcher in 1923; Williamson was a reliever in 1987. Pfeffer struck out 32 batters in 152 innings; Williamson struck out easily more than twice as many hitters in 125 innings. They really have nothing significant in common, despite having apparently similar records.

Serious studies on which actual decisions might be premised require that we identify players who are profoundly similar or at least essentially similar. You can write similarity formulas so that they find players who are essentially similar. You cannot write formulas to find players who are profoundly similar, I don’t think; that you have to recognize on your own. You can use formulas to help you.

This article is not about players who are essentially similar or profoundly similar, except by accident. This is just about random matches between pitcher’s seasons which look a lot alike:

Tommy Bond, 1877, and Tommy Bond, 1878
Pitcher	G	IP	W-L	Pct	H	SO	BB	ERA
1877	58	521.0	40-17	.702	530	170	36	2.11
1878	59	532.2	40-19	.678	571	182	33	2.06

My similarity score for these two seasons is only 932, as opposed to 980 for Rush and Cardwell, but the differences magnify with the innings, and Bond’s consistency is extraordinary for a pitcher pitching 500+ innings.

Bob Caruthers, 1886, and Pete Conway, 1888
Pitcher	G	IP	W-L	Pct	H	SO	BB	ERA
Caruthers	44	387.1	30-14	.682	323	166	86	2.32
Conway	45	391.0	30-14	.682	315	176	57	2.26

Similarity: 950. Caruthers was 22 years old and the time; Conway, 21. Caruthers also played the outfield that year, and hit .334. Conway didn’t play a position that year—he had played some in the outfield earlier in his career—but hit .275 with 3 homers, 23 RBI.

Frank Kitson, 1903, and Red Donahue, 1903
Pitcher	G	IP	W-L	Pct	H	SO	BB	ERA
Kitson	31	257.2	15-16	.484	277	102	38	2.58
Donahue	32	267.2	15-16	.484	287	96	34	2.59

Similarity: 972.

Deacon Phillippe, 1903, and Jesse Barnes, 1919
Pitcher	G	IP	W-L	Pct	H	SO	BB	ERA
Phillippe	36	289.0	25-9	.735	269	123	29	2.43
Barnes	38	296.0	25-9	.735	296	92	35	2.40

Similarity: 944. Phillippe pitched five complete games in the 1903 World Series, winning three of them, although his team lost the series. Barnes didn’t become a World Series hero until two years later, 1921.

Sam Leever, 1906, with Waite Hoyt, 1927 and Mort Cooper, 1944
Pitcher	G	IP	W-L	Pct	H	SO	BB	ERA
Leever	34	260.0	22-7	.759	232	76	48	2.32
Hoyt	36	256.0	22-7	.759	242	86	54	2.64
Cooper	34	252.0	22-7	.759	227	97	60	2.46

Similarity: Hoyt to Cooper, 960; Hoyt to Leever, 959, Leever to Cooper, 944.

Doc White, 1910, and Dennis Martinez, 1988
Pitcher	G	IP	W-L	Pct	H	SO	BB	ERA
White	36	236.2	15-13	.536	219	111	50	2.66
Martinez	34	235.1	15-13	.536	215	120	55	2.72

Similarity: 973. This one is included because it is so unusual to have very-similar seasons separated by 78 years. A couple of others of those are Gregg and Barber (below) and Joe McGinnity, 1905 with Robin Roberts, 1951 (not included.)

Vean Gregg, 1912, and Steve Barber, 1963
Pitcher	G	IP	W-L	Pct	H	SO	BB	ERA
Gregg	37	271.1	20-13	.606	242	184	90	2.59
Barber	39	259.0	20-13	.606	253	180	92	2.75

Similarity: 960

Lefty Williams, 1919, and Burleigh Grimes, 1920
Pitcher	G	IP	W-L	Pct	H	SO	BB	ERA
Williams	41	297.0	23-11	.676	265	125	58	2.64
Grimes	40	304.0	23-11	.676	271	131	67	2.22

Similarity: 954. Grimes threw the spitball. Williams threw the series.

Dazzy Vance, 1928, with Whitlow Wyatt, 1941 and Dick Ellsworth, 1963
Pitcher	G	IP	W-L	Pct	H	SO	BB	ERA
Vance	38	280.1	22-10	.688	226	200	72	2.09
Wyatt	38	288.0	22-10	.688	223	176	82	2.34
Ellsworth	37	291.0	22-10	.688	223	185	75	2.10

Similarity: Wyatt to Ellsworth, 965, Ellsworth to Vance, 962, Wyatt to Vance, 946. Ellsworth and Vance both pitched for relatively poor teams that finished just over .500, well below .500 with other pitchers on the mound.

Tiny Bonham, 1940, and Mel Stottlemyre, 1964
Pitcher	G	IP	W-L	Pct	H	SO	BB	ERA
Bonham	12	99.0	9-3	.750	83	37	13	1.91
Stottlemyre	13	96.0	9-3	.750	77	49	35	2.06

Similarity: 953. Both rookies, both Yankees. Mel Stottlemyre, called up when the Yankees were floundering in August, 1964, arrived just in time to save the Yankees’ bacon in 1964. Bonham, called up in August, 1940, got there too late to save the Yankees in 1940.

Bill Lee, 1940, with Neal Heaton, 1985 and Jimmy Anderson, 2001
Pitcher	G	IP	W-L	Pct	H	SO	BB	ERA
Lee	37	211.0	9-17	.346	246	70	70	5.03
Heaton	36	207.2	9-17	.346	244	82	80	4.90
Anderson	34	206.1	9-17	.346	232	89	83	5.10

Similarity: Anderson to Lee, 969, Lee to Heaton, 967, Anderson to Lee, 932.

Dizzy Trout, 1942, and Jim Wilson, 1955
Pitcher	G	IP	W-L	Pct	H	SO	BB	ERA
Trout	35	223.0	12-18	.400	214	91	89	3.43
Wilson	34	235.0	12-18	.400	200	96	87	3.45

Similarity: 976.

Joe Dobson, 1947, and Sal Maglie, 1952
Pitcher	G	IP	W-L	Pct	H	SO	BB	ERA
Dobson	33	229.0	18-8	.692	203	110	73	2.95
Maglie	35	216.0	18-8	.692	199	112	75	2.92

Similarity: 980. Dobson’s nickname was “Burrhead”. Maglie’s was “The Barber”.

Johnny Sain, 1947, and Joaquin Andujar, 1985
Pitcher	G	IP	W-L	Pct	H	SO	BB	ERA
Sain	38	266.0	21-12	.636	265	132	79	3.52
Andujar	38	269.2	21-12	.636	265	112	82	3.40

Similarity: 965. Somehow, it just struck me as funny to pair Joaquin Andujar with anyone named “Sain”. You had to know Joaquin to get the joke.

Luis Arroyo, 1955, and Alex Kellner, 1955
Pitcher	G	IP	W-L	Pct	H	SO	BB	ERA
Arroyo	35	159.0	11-8	.579	162	68	63	4.19
Kellner	30	163.0	11-8	.579	164	75	60	4.20

Similarity: 972. Kellner and Arroyo were both lefties who had one shining moment at one end of very modest careers. However, although the two men were about the same age (Kellner two years older), Kellner’s shining moment had come in the previous decade, when he won 20 games as a rookie for the 1949 Philadelphia A’s. Arroyo’s shining moment came in the next decade, in 1961, when he had a brilliant year as a reliever for the 1961 Yankees.

Jack Sanford, 1957, and Mark Langston, 1991
Pitcher	G	IP	W-L	Pct	H	SO	BB	ERA
Sanford	33	237.0	19-8	.704	194	188	94	3.08
Langston	34	246.1	19-8	.704	190	183	96	3.00

Similarity: 980.

Johnny Podres, 1961, and Cliff Lee, 2005
Pitcher	G	IP	W-L	Pct	H	SO	BB	ERA
Podres	32	183.0	18-5	.783	192	124	51	3.74
Lee	32	202.0	18-5	.783	194	143	52	3.79

Similarity: 968.

Whitey Ford, 1963, and Frank Viola, 1987
Pitcher	G	IP	W-L	Pct	H	SO	BB	ERA
Ford	38	269.0	24-7	.774	240	189	56	2.74
Viola	35	255.1	24-7	.774	236	193	54	2.64

Similarity: 967. Both veteran left-handers of exceptional quality. Matches this good are common among pitchers who win 12-15 games, but not common among pitchers of Cy Young quality.

Bill Monboquette, 1963, and Brad Radke, 1997
Pitcher	G	IP	W-L	Pct	H	SO	BB	ERA
Monboquette	37	267.0	20-10	.667	258	174	42	3.81
Radke	35	239.2	20-10	.667	238	174	48	3.87

Similarity: 962.

Robin Roberts, 1964, and John Tudor, 1986
Pitcher	G	IP	W-L	Pct	H	SO	BB	ERA
Roberts	31	204.0	13-7	.650	203	109	52	2.91
Tudor	30	219.0	13-7	.650	197	107	53	2.92

Similarity: 983.

Dave Giusti, 1966 with Brian Lawrence, 2004 and Greg Maddux, 2006
Pitcher	G	IP	W-L	Pct	H	SO	BB	ERA
Giusti	34	210.0	15-14	.517	215	131	54	4.20
Lawrence	34	203.0	15-14	.517	226	121	55	4.12
Maddux	34	210.0	15-14	.517	219	117	37	4.20

Similarity: Giusti to Lawrence, 977, Giusti to Maddux, 968, Lawrence to Maddux, 967.

Nelson Briles, 1969, and Charlie Leibrandt, 1991
Pitcher	G	IP	W-L	Pct	H	SO	BB	ERA
Briles	36	228.0	15-13	.536	218	126	63	3.51
Leibrandt	34	229.2	15-13	.536	212	128	56	3.49

Similarity: 986.

Sonny Siebert, 1970, and Pascual Perez, 1983
Pitcher	G	IP	W-L	Pct	H	SO	BB	ERA
Siebert	33	222.2	15-8	.652	207	142	60	3.44
Perez	33	215.1	15-8	.652	213	144	51	3.43

Similarity: 982.

Paul Splittorff, 1972, and Jack Billingham, 1972
Pitcher	G	IP	W-L	Pct	H	SO	BB	ERA
Splittorff	35	216.0	12-12	.500	189	140	67	3.13
Billingham	36	217.2	12-12	.500	197	137	64	3.18

Similarity: 979.

Ferguson Jenkins, 1973, and Bret Saberhagen, 1988
Pitcher	G	IP	W-L	Pct	H	SO	BB	ERA
Jenkins	38	271.0	14-16	.467	267	170	57	3.89
Saberhagen	35	260.2	14-16	.467	271	171	59	3.80

Similarity: 977. Jenkins and Saberhagen were both former Cy Young Award winners, Jenkins two years earlier (1971) and Saberhagen three years earlier (1985). The next year, Saberhagen bounced back to go 23-6 with a 2.16 ERA, claiming his second Cy Young Award. Jenkins bounced back to go 25-12 with a 2.82 ERA, but lost the Cy Young voting to Catfish Hunter (below).

Ferguson Jenkins, 1974, and Catfish Hunter, 1974
Pitcher	G	IP	W-L	Pct	H	SO	BB	ERA
Jenkins	41	328.0	25-12	.676	286	225	45	2.82
Hunter	41	318.0	25-12	.676	268	143	46	2.49

Similarity: Only 890. I just included them because of the Jenkins/Saberhagen matchup above. Jenkins and Hunter were extremely similar except for the large difference in strikeouts.

Jim Colborn, 1977, and Rudy May, 1977
Pitcher	G	IP	W-L	Pct	H	SO	BB	ERA
Colborn	36	239.0	18-14	.563	233	103	81	3.62
May	37	251.2	18-14	.563	243	105	78	3.61

Similarity: 980. They were not similar pitchers, but had very similar records that year. .

Steve Rogers, 1977, and Bert Blyelven, 1985
Pitcher	G	IP	W-L	Pct	H	SO	BB	ERA
Rogers	40	301.2	17-16	.515	272	206	81	3.10
Blyleven	37	293.2	17-16	.515	264	206	75	3.16

Similarity: 975.

Vern Ruhle, 1980, and John Tudor, 1990
Pitcher	G	IP	W-L	Pct	H	SO	BB	ERA
Ruhle	28	159.1	12-4	.750	148	55	29	2.37
Tudor	25	146.1	12-4	.750	120	63	30	2.40

Similarity: 960.

Tom Henke, 1980, and Akinori Otsuka, 2006
Pitcher	G	IP	W-L	Pct	H	SO	BB	ERA	Saves
Henke	61	74.2	2-4	.333	58	75	19	2.17	32
Otsuka	63	59.2	2-4	.333	53	47	11	2.11	32

Ron Guidry, 1983, and Tom Glavine, 2000
Pitcher	G	IP	W-L	Pct	H	SO	BB	ERA
Guidry	31	250.1	21-9	.700	232	156	60	3.42
Glavine	34	241.0	21-9	.700	222	152	65	3.40

Similarity: 969.

Buddy Black, 1984, and Bob Knepper, 1986
Pitcher	G	IP	W-L	Pct	H	SO	BB	ERA
Black	35	257.0	17-12	.586	226	140	64	3.12
Knepper	40	258.0	17-12	.586	232	143	62	3.14

Similarity: 976. Black and Knepper, though very different personalities, were profoundly similar pitchers.

Dave Righetti, 1984, and Lee Smith, 1990
Pitcher	G	IP	W-L	Pct	H	SO	BB	ERA	Saves
Righetti	64	96.1	5-6	.455	79	90	37	2.34	31
Smith	64	83.0	5-6	.500	71	87	29	2.06	31

Similarity: 948.

Bert Blyleven, 1984, and Roy Halladay, 2002
Pitcher	G	IP	W-L	Pct	H	SO	BB	ERA
Blyleven	33	245.0	19-7	.731	204	170	74	2.87
Halladay	34	239.1	19-7	.731	223	168	62	2.93

Similarity: 968. Watch out, Roy; they’re going to screw you in the Hall of Fame voting.

Oil Can Boyd, 1984, and Kris Benson, 2004
Pitcher	G	IP	W-L	Pct	H	SO	BB	ERA
Boyd	29	197.2	12-12	.500	207	134	53	4.37
Benson	31	200.1	12-12	.500	202	134	61	4.31

Similarity: 980. Think about it. Boyd was a colorful, eccentric character who had a famous emotional melt-down after being left off the All-Star team in 1986. Benson is more famous for his wife’s hooters than for his pitching.

Danny Cox, 1985, and Joe Magrane, 1989
Pitcher	G	IP	W-L	Pct	H	SO	BB	ERA
Cox	35	241.0	18-9	.667	226	131	64	2.88
Magrane	34	234.2	18-9	.667	219	127	72	2.91

Similarity: 977. Cox and Magrane were very dissimilar pitchers and very dissimilar personalities. But both pitchers faded instantly after giving the Cardinals just a taste of their ability. Magrane was 24 at the time, Cox 25 (both pitching for St. Louis. Sorry. . .I tend to assume that you’re old enough to remember those things.) Both finished under .500 the next season, and both finished under .500 in relatively short careers.

Bill Gullickson, 1986, and Tom Browning, 1989
Pitcher	G	IP	W-L	Pct	H	SO	BB	ERA
Gullickson	37	244.2	15-12	.556	245	121	60	3.38
Browning	37	249.2	15-12	.556	241	118	64	3.39

Similarity: 988. The measured similarity of these two seasons (988) is even higher than that of Rush and Cardwell. Gullickson and Browning both pitched for the Reds at the time of these two very-similar seasons. Gullickson had other seasons in which he was 20-9, 14-13, 14-12 and 10-14. Browning had other seasons of 20-9, 14-13, 14-14 and 10-13.

Mike Schooler, 1990, and John Franco, 1994
Pitcher	G	IP	W-L	Pct	H	SO	BB	ERA	Saves
Schooler	49	56.0	1-4	.200	47	45	16	2.25	30
Franco	47	50.0	1-4	.200	47	42	19	2.70	30

Similarity: 967.

Tom Browning, 1991, and Mark Buehrle, 2003
Pitcher	G	IP	W-L	Pct	H	SO	BB	ERA
Browning	36	230.1	14-14	.500	241	115	56	4.18
Buehrle	35	230.1	14-14	.500	250	119	61	4.14

Similarity: 982.

Jack McDowell, 1992, and Pat Hentgen, 1996
Pitcher	G	IP	W-L	Pct	H	SO	BB	ERA
McDowell	34	260.2	20-10	.667	247	178	75	3.18
Hentgen	35	265.2	20-10	.667	238	177	94	3.22

Similarity: 968.

Jimmy Key, 1992, and Tom Glavine, 2005
Pitcher	G	IP	W-L	Pct	H	SO	BB	ERA
Key	31	216.2	13-13	.500	205	117	59	3.53
Glavine	39	211.1	13-13	.500	227	105	61	3.53

Similarity: 972. Key and Glavine are profoundly similar pitchers.

Rick Aguilera, 1993, and Jason Isringhausen, 2001
Pitcher	G	IP	W-L	Pct	H	SO	BB	ERA	Saves
Aguilera	65	72.1	4-3	.571	60	59	14	3.11	34
Isringhausen	65	71.1	4-3	.571	54	74	23	2.65	34

Similarity: 954.

Ramon Martinez, 1995, and Kenny Rogers, 1995
Pitcher	G	IP	W-L	Pct	H	SO	BB	ERA
Martinez	30	206.1	17-7	.708	176	138	81	3.66
Rogers	31	208.0	17-7	.708	192	140	76	3.38

Similarity: 970. It just struck me that Rogers, still going strong more than ten years later, was actually a year OLDER than Martinez when the two had similar seasons in 1995. And still is, I guess.

Erik Hanson, 1996, and Darryl Kile, 1998
Pitcher	G	IP	W-L	Pct	H	SO	BB	ERA
Hanson	35	214.2	13-17	.433	243	156	102	5.41
Kile	36	230.1	13-17	.433	257	158	102	5.20

Similarity: 966.

Dave Burba, 1998, and Jimmy Haynes, 2002
Pitcher	G	IP	W-L	Pct	H	SO	BB	ERA
Burba	32	203.2	15-10	.600	210	132	69	4.11
Haynes	34	196.2	15-10	.600	210	126	81	4.12

Similarity: 972.

Roger Clemens, 1998, and Johan Santana, 2004
Pitcher	G	IP	W-L	Pct	H	SO	BB	ERA
Clemens	33	234.2	20-6	.769	169	271	88	2.65
Santana	34	228.0	20-6	.769	156	271	54	2.61

Similarity: 946. Perhaps the best-matched Cy Young seasons. (McDowell won the Cy Young Award in 1993, not 1992, and Whitey Ford would have won the American League Cy Young Award in 1963 had there been such an award.) In truth, Clemens and Santana could hardly be LESS similar as pitchers.

Bob Wickman, 1999, and Robb Nen, 1999
Pitcher	G	IP	W-L	Pct	H	SO	BB	ERA	Saves
Wickman	71	74.1	3-8	.273	75	60	38	3.39	37
Nen	72	72.0	3-8	.273	79	77	27	4.00	37

Similarity: 941.

Todd Jones, 1999, and Danys Baez, 1999
Pitcher	G	IP	W-L	Pct	H	SO	BB	ERA	Saves
Jones	65	66.1	4-4	.500	64	64	52	3.80	30
Baez	62	68.0	4-4	.500	60	52	29	3.57	30

Similarity: 961.

Chad Durbin, 2001, and Jeff D'Amico, 2005
Pitcher	G	IP	W-L	Pct	H	SO	BB	ERA
Durbin	29	179.0	9-16	.360	201	95	58	4.93
D'Amico	29	175.1	9-16	.360	204	100	42	4.77

Similarity: 969.

Mark Mulder, 2001, and Bartolo Colon, 2005
Pitcher	G	IP	W-L	Pct	H	SO	BB	ERA
Mulder	34	229.1	21-8	.724	214	153	51	3.45
Colon	33	222.2	21-8	.724	215	157	43	3.48

Similarity: 980. These may be the best-matched twenty-win seasons in baseball history. Mulder, however, doesn’t look anything like a hippopotamus with a hand-knit doily on his head.

Matt Clement, 2002, and Kerry Wood, 2002
Pitcher	G	IP	W-L	Pct	H	SO	BB	ERA
Clement	32	205.0	12-11	.522	162	215	85	3.60
Wood	33	213.2	12-11	.522	169	217	97	3.66

Similarity: 973. Teammates. Nolan Ryan, 1988, was also very similar.

Cory Lidle, 2006, and Claudio Vargas, 2006
Pitcher	G	IP	W-L	Pct	H	SO	BB	ERA
Lidle	31	170.2	12-10	.545	181	130	58	4.85
Vargas	31	167.2	12-10	.545	185	123	52	4.83

Similarity: 983.

It occurs to me that all of the similarities I have shown you are exceptional—and thus atypical—matches. I probably should have shown you a few more typical comparisons. If you take any two 200-inning seasons and match them together at random, you get an average similarity score, by the method I used here, of 658. This is 656:

Murry Dickson, 1952, and Billy Pierce, 1953
Pitcher	G	IP	W-L	Pct	H	SO	BB	ERA
Dickson	43	278.0	14-21	.400	278	112	76	3.56
Pierce	40	271.0	18-12	.600	216	186	102	2.72

Dickson and Pierce have about the same workload, but very different levels of effectiveness.

This is 659: Pud Galvin, 1889, and Gary Peters, 1971
Pitcher	G	IP	W-L	Pct	H	SO	BB	ERA
Galvin	41	341.0	23-16	.590	392	77	78	4.17
Peters	34	214	14-11	.560	241	104	70	4.37

Peters and Galvin have about the same level of effectiveness, but radically different workloads.

This match scores at 657: Clyde Wright, 1973, and Kenny Rogers, 1993
Pitcher	G	IP	W-L	Pct	H	SO	BB	ERA
Wright	37	257.0	11-19	.367	264	64	54	4.42
Rogers	35	208.1	16-10	.615	210	140	71	4.10

Bill James
Ft. Myers, Florida
March 28, 2007

COMMENTS

No comments have been posted.

Don Rush and Bob Cardwell

COMMENTS

Leave a comment

Report inappropriate comment