Username:	Password:

Remember me

Forgot your username/password?

Print Email

Home>Articles

The Longing

By Bill James

August 30, 2010

(Pitcher Regularity Scores)

(Poetically re-titled “Major League Managers and the Longing for Stability”)

Major league baseball has been substantially re-created and re-invented, over the last hundred years, by the desire of major league managers to create more stable workloads for their pitchers. This desire has overridden and thus essentially eliminated platooning, has sharply limited pinch hitting, has done away with third catchers, has virtually eliminated complete games, has re-defined normalcy in roster construction, pushing us gradually from 7 pitchers per team to 13, has added dozens of new statistical measurements to the game, and has caused games to get substantially longer and slower, as managers change pitchers every more frequently.

We have talked a lot, we have written a lot, we have done much research on the issue of pitcher workloads. The research that we have done—on this issue, perhaps more than any other—has invaded the game itself, and is now relied upon by people in decision-making positions to help guide the ways that pitchers are used.

But there is a parallel issue, of perhaps equal or similar importance, about which we have done very little research. That is the issue of the regularity of pitcher usage.

Suppose that we had a metric to measure not how hard the pitcher had been worked, but how regularly, how consistent and even his workload was over the course of his season or career. If we had such a metric, think of the other questions that this would open up to us. Does pitcher performance improve with more regular usage? Does regularity of usage help to prevent injuries? Is a workload of 200 innings with regular use perhaps equal to a workload of 150 innings with irregular use, or 120 innings, or 80?

Is regularity of pitcher use a variable among managers? If so, is it a relevant variable in terms of manager success, or an irrelevant variable? Do managers who keep their pitchers on a more regular schedule enjoy more success?

I have three tasks in this article. First, I will outline a simple method with which to distinguish between regular and irregular workloads. Second, having applied this method to more than 1,000 pitcher/seasons—about 3% of all pitcher/seasons in baseball history—I will report the data resulting from those studies. Third, I will try to explain and interpret the data as best I can, and then I will stick just a toe into those issues that I have outlined above.

I. The Method

It would have been impossible to have done this research—as it would have been impossible to have done any of the research that we do these days—without Retrosheet. Retrosheet.org in recent months has added daily logs for pitchers dating back to 1920. This research was done based on those logs.

My method here is extremely simple. From the game logs available through Retrosheet, we extract two figures: the dates on which the pitcher pitched, and the number of batters that he faced. For Bob Feller, 1946, this is that data:

Month	Day	Year	BFP	Month	Day	Year	BFP
4	16	1946	29	7	24	1946	31
4	21	1946	41	7	28	1946	34
4	26	1946	38	7	31	1946	34
4	30	1946	32	8	4	1946	26
5	4	1946	32	8	8	1946	33
5	8	1946	36	8	10	1946	9
5	12	1946	45	8	11	1946	1
5	17	1946	34	8	13	1946	29
5	21	1946	33	8	17	1946	36
5	26	1946	42	8	20	1946	34
5	30	1946	38	8	22	1946	5
6	4	1946	34	8	24	1946	37
6	8	1946	35	8	28	1946	32
6	12	1946	36	9	1	1946	33
6	16	1946	38	9	3	1946	7
6	21	1946	34	9	5	1946	22
6	25	1946	41	9	8	1946	33
6	29	1946	34	9	12	1946	35
7	3	1946	41	9	15	1946	29
7	7	1946	37	9	19	1946	34
7	11	1946	28	9	22	1946	38
7	16	1946	41	9	25	1946	37
7	20	1946	38	9	27	1946	21
7	21	1946	7	9	29	1946	38

Feller pitched 48 times in 1946, facing a total of 1,512 batters.

Scanning this log from a modern perspective, we can see a number of quite remarkable things. Feller pitched a complete game at Washington on July 20, 1946, came back on July 21 and pitched an inning and two-thirds of relief (scoreless relief, striking out four.) He pitched a complete game at Chicago on August 8^th (a one-hit shutout), came back to pitch in relief in St. Louis on August 10^th (3 innings of perfect relief), pitched in relief again on August 11^th, then started against Detroit on August 13^th. He gave up three hits, one run in 8 innings against Detroit, lowering his ERA to 1.88, rested three days, started again, rested two days, started again, missed one day, pitched in relief, missed one day, and started again. In the last five days of the season he pitched a total of 23 innings.

But our purpose here is not to stare at this in open-mouthed awe, but to score the regularity (or irregularity) of this assignment. The score that I developed has two parts, and a potential of 25 points per game appearance, after the first appearance.

In Part A, we compare the intervals between appearances. . .counting the number of days off between this appearance and the previous one. For Feller, 1946, this is as follows (below). The gap between the last start of Feller’s 1945 season and his first in 1946, the off-season being longer then than it is now, was 201 days. . .which actually makes Feller’s 1946 season even more remarkable, when you consider that the season then was about three weeks shorter than it is now:

Month	Day	Year	Gap
4	16	1946	201
4	21	1946	5
4	26	1946	5
4	30	1946	4
5	4	1946	4
5	8	1946	4
5	12	1946	4
5	17	1946	5
5	21	1946	4
5	26	1946	5
5	30	1946	4
6	4	1946	5
6	8	1946	4
6	12	1946	4
6	16	1946	4
6	21	1946	5
6	25	1946	4
6	29	1946	4
7	3	1946	4
7	7	1946	4
7	11	1946	4
7	16	1946	5
7	20	1946	4
7	21	1946	1
7	24	1946	3
7	28	1946	4
7	31	1946	3
8	4	1946	4
8	8	1946	4
8	10	1946	2
8	11	1946	1
8	13	1946	2
8	17	1946	4
8	20	1946	3
8	22	1946	2
8	24	1946	2
8	28	1946	4
9	1	1946	3
9	3	1946	2
9	5	1946	2
9	8	1946	3
9	12	1946	4
9	15	1946	3
9	19	1946	4
9	22	1946	3
9	25	1946	3
9	27	1946	2
9	29	1946	2

Feller’s “gaps”, as you can see, are actually fairly regular for most of the season—five days, five days, four days, four days, four days, etc.

I score these “gaps” by asking this question: Is the gap between these two appearances the same as the gap between the previous two appearances? If the gap is exactly the same, I score that as “10”, or 10/10. If the gap is different by one, I score it as 9/10. If it is different by two, I score it as 6/10. If it is different by three, I score it as 1/10. If it is different by four or more, I score it at 0/10—totally irregular work.

For Feller in 1946, this creates the following scores (S A stands for Score-A):

Month	Day	Year	Gap	S A
4	16	1946	201
4	21	1946	5	0
4	26	1946	5	10
4	30	1946	4	9
5	4	1946	4	10
5	8	1946	4	10
5	12	1946	4	10
5	17	1946	5	9
5	21	1946	4	9
5	26	1946	5	9
5	30	1946	4	9
6	4	1946	5	9
6	8	1946	4	9
6	12	1946	4	10
6	16	1946	4	10
6	21	1946	5	9
6	25	1946	4	9
6	29	1946	4	10
7	3	1946	4	10
7	7	1946	4	10
7	11	1946	4	10
7	16	1946	5	9
7	20	1946	4	9
7	21	1946	1	1
7	24	1946	3	6
7	28	1946	4	9
7	31	1946	3	9
8	4	1946	4	9
8	8	1946	4	10
8	10	1946	2	6
8	11	1946	1	9
8	13	1946	2	9
8	17	1946	4	6
8	20	1946	3	9
8	22	1946	2	9
8	24	1946	2	10
8	28	1946	4	6
9	1	1946	3	9
9	3	1946	2	9
9	5	1946	2	10
9	8	1946	3	9
9	12	1946	4	9
9	15	1946	3	9
9	19	1946	4	9
9	22	1946	3	9
9	25	1946	3	10
9	27	1946	2	9
9	29	1946	2	10

Altogether, Feller scores for 1946 at 409 out of a possible 470 points on Part One of our system—86%.

In that era, the common rule was that the ace of the staff—what we would now call the #1 starter—started on a regular four-day schedule if possible, always started against key opponents, and the rest of the staff had to move around to accommodate him. If you checked Feller’s teammates, I am certain you would find that their schedules were less regular than Feller’s. There were many more “travel days” then than now and frequent double-headers, and Feller hop-scotched other pitchers to get more work. That’s the way it was done at that time.

The other question that we ask is, “Was the number of batters that this pitcher faced in this outing the same as it was in the previous outing?”

If the number of batters faced was exactly the same or was within 3 of being the same, we score that at “15” (15/15). If it was different by 4, that’s 14; different by 5, that’s 13; different by 6, that’s 12. For each difference in batters faced in excess of three, we subtract one, down to zero. For Feller, 1946, these are the scores for each outing:

Month	Day	Year	Gap	S A	BFP	S B
4	16	1946	201		29
4	21	1946	5	0	41	6
4	26	1946	5	10	38	15
4	30	1946	4	9	32	12
5	4	1946	4	10	32	15
5	8	1946	4	10	36	14
5	12	1946	4	10	45	9
5	17	1946	5	9	34	7
5	21	1946	4	9	33	15
5	26	1946	5	9	42	9
5	30	1946	4	9	38	14
6	4	1946	5	9	34	14
6	8	1946	4	9	35	15
6	12	1946	4	10	36	15
6	16	1946	4	10	38	15
6	21	1946	5	9	34	14
6	25	1946	4	9	41	11
6	29	1946	4	10	34	11
7	3	1946	4	10	41	11
7	7	1946	4	10	37	14
7	11	1946	4	10	28	9
7	16	1946	5	9	41	5
7	20	1946	4	9	38	15
7	21	1946	1	1	7	0
7	24	1946	3	6	31	0
7	28	1946	4	9	34	15
7	31	1946	3	9	34	15
8	4	1946	4	9	26	10
8	8	1946	4	10	33	11
8	10	1946	2	6	9	0
8	11	1946	1	9	1	10
8	13	1946	2	9	29	0
8	17	1946	4	6	36	11
8	20	1946	3	9	34	15
8	22	1946	2	9	5	0
8	24	1946	2	10	37	0
8	28	1946	4	6	32	13
9	1	1946	3	9	33	15
9	3	1946	2	9	7	0
9	5	1946	2	10	22	3
9	8	1946	3	9	33	7
9	12	1946	4	9	35	15
9	15	1946	3	9	29	12
9	19	1946	4	9	34	13
9	22	1946	3	9	38	14
9	25	1946	3	10	37	15
9	27	1946	2	9	21	2
9	29	1946	2	10	38	1

Here, as we can see, Feller’s scores are somewhat lower. Feller’s S B scores for the season total up to 462 out of a possible 705, or 66%. Adding together S A and S B, Feller’s “Workload Regularity Score” for the 1946 season is 871 out of a possible 1175, or 74%.

In that era, that’s a very high score. Among the fourteen pitchers for whom I have data covering the 1946 season, Feller’s 74% is easily the highest percentage. Feller’s workload, in 1946, was more regular than that of any other pitcher that I studied.

II. The Data

In late 1960, the legendary Casey Stengel was fired by the New York Yankees and replaced with the late Ralph Houk. Houk’s first phone call, after learning that he had the job, was to Whitey Ford.

Let’s pause there a second, and read between the lines. Why did Houk call Ford first? Was Ford the leader of the team?

Well, yes, but no more than Berra or Mantle, and there is something else going on here. You remember Casey’s famous line, “The secret of managing is to keep the five guys who hate you away from the ten guys who are on the fence?” Ford, by 1960, had (we suspect) become the de facto leader of the Dump Casey faction of the Yankee roster. He wasn’t ripping Stengel is the newspapers or anything, but it seems pretty clear, reading between the lines a little, that Ford by 1960 had grown disenchanted with Stengel’s management of the team, in large part because of the way that he used his pitchers. Houk was reaching out to Ford to heal the rift.

And what did Ford say to Houk, in that phone conversation? “Just put me on the mound every fourth day and let me pitch.” (Quote is not exact.)

Stengel was a great believer in percentage baseball—that is, in deploying his players at their most effective moments. He used left-handed hitters against right-handed pitchers, he used right-handers against lefties, and he varied his pitching patterns constantly so that his best pitcher—Ford—would pitch against the strongest opposition and in the parks where he was best suited, while the lesser pitchers would soak up the starts against the teams the Yankees could beat anyway. Ford never won 20 games for Stengel, largely because he didn’t pitch on a regular four-day rotation like other staff aces.

It was smart percentage baseball, and that’s the way the game was played by some managers in 1920, in 1930, in 1940, but by 1960 it wasn’t really done that way anymore. Starting pitcher leveraging, as Chris Jaffe calls it in his new book about managing, was pretty much gone by 1960. Casey was still doing it, and Whitey didn’t like it.

These are Whitey Ford’s Regular Use Scores throughout his career:

Pitcher	Year	Points	Possible	Pct
Whitey Ford	1950	253	475	53%
Whitey Ford	1953	506	775	65%
Whitey Ford	1954	506	825	61%
Whitey Ford	1955	624	950	66%
Whitey Ford	1956	518	750	69%
Whitey Ford	1957	337	575	59%
Whitey Ford	1958	482	725	66%
Whitey Ford	1959	466	850	55%
Whitey Ford	1960	413	775	53%
Whitey Ford	1961	768	975	79%
Whitey Ford	1962	638	925	69%
Whitey Ford	1963	741	925	80%
Whitey Ford	1964	658	950	69%
Whitey Ford	1965	626	900	70%
Whitey Ford	1966	604	1075	56%
Whitey Ford	1967	102	150	68%

Ford’s pitching schedule in 1960 was 53% regular. In 1961, under Ralph Houk, it was 79% regular—a huge increase.

In 1960 Ford was 12-9. In 1961 he was 25-4, and won the Cy Young Award. Of course, the true facts are always more complicated than the simplest narrative: It was an expansion year, Ford’s ERA actually went up, etc. I’m not trying to hide the true facts; I am merely pointing out:

1) Ford was used by Stengel in a very irregular pattern,

2) Ford was unhappy about this,

3) Houk used Ford in a much, much more regular pattern,

4) Ford won twice as many games and won the Cy Young Award, and

5) Ford was much happier.

I am using this fairly well-known story to demonstrate that our method works, but there

are actually are a series of story lines from 1961/1962 that are relevant here. From 1956 to 1961 Lew Burdette had won-lost records of 19-10, 17-9, 20-10, 21-15, 19-13 and 18-11. Over that six-year span Burdette was second in the major leagues in innings pitched and in games won, behind his running mate Warren Spahn. He was one of the premier pitchers in the game.

In 1962 Burdette won just 10 games—10-9, ERA of 4.88, far worse than the league average. Unhappy, Burdette said the next spring that “the only difference between last year and any other year was that I didn’t get to pitch regularly.” There’s that word again, regularly. These are Burdette’s Regular Use Scores:

Pitcher	Year	Points	Possible	Pct
Lew Burdette	1950	14	25	56%
Lew Burdette	1951	26	50	52%
Lew Burdette	1952	586	1100	53%
Lew Burdette	1953	758	1125	67%
Lew Burdette	1954	494	925	53%
Lew Burdette	1955	597	1025	58%
Lew Burdette	1956	581	925	63%
Lew Burdette	1957	610	925	66%
Lew Burdette	1958	695	1000	70%
Lew Burdette	1959	655	975	67%
Lew Burdette	1960	551	1100	50%
Lew Burdette	1961	668	975	69%
Lew Burdette	1962	521	900	58%
Lew Burdette	1963	452	875	52%
Lew Burdette	1964	480	875	55%
Lew Burdette	1965	301	625	48%
Lew Burdette	1966	1022	1325	77%
Lew Burdette	1967	315	450	70%

The linguist in me wants to point out that what Burdette meant by “regular” was not exactly the same thing that Ford meant by regular. “Regular” has (at least) three meanings in common usage:

1) Consistent,

2) Frequent, and

3) Standard.

When a dietary supplement says that it gets you regular and keeps you regular, what it means is (1). When a boyfriend says that he wants more regular action from his girlfriend, what he means is (2). When a reporter says that the trial was somewhat irregular, what he means is (3).

When Ford said that he wanted to pitch more regularly, he meant (1), and also possibly (2). When Burdette complained that he was not being given regular work, he meant (2). When Billy Martin allowed Rick Langford to pitch 28 complete games in 1980, we could say that that was irregular, meaning (3). Small point, but. . .we need to think clearly.

Anyway, the regularity of Burdette’s usage did decline in 1960, no matter how we define “regular”, but there is another side to that story. The manager of the Milwaukee Braves in 1959, when Lew Burdette was 21-15 and getting really regular use (meaning frequent), was Fred Haney. It is the view of the author that Haney in 1959 may have had the worst season by any manager in major league history. Haney took a team that. . .I am not exaggerating for emphasis; this is exactly what I think. . .Haney took a team that should have won the National League by 20 games, and managed to lose the race. If you compare the roster of the 1959 Braves to the 1959 Dodgers, it is difficult to explain how this is possible.

One of the 28 stupid things that Haney did was, he worked Spahn and Burdette to death. Spahn was a great pitcher and Burdette was pretty good, but Haney over-worked them both, they pitched .500 ball over the last two months, and they both finished 21-15, pitching 582 innings. This might be understandable if Haney was short of other pitching options, but in fact he had a very good third starter (Bob Buhl) and a quite exceptional number of young pitchers, whom he refused to pitch. He had Carlton Willey, who in 1958 had made 19 starts, led the league in shutouts with 4, and posted a 2.70 ERA, plus Joey Jay (97 innings in 1958 with a 2.13 ERA) and Juan Pizarro (97 innings in 1958 with a 2.69 ERA). Haney didn’t believe any of these young pitchers was trustworthy, and tried instead to make Burdette and Spahn win the pennant for him.

So Haney got fired, and the team was turned over to Charlie Dressen, but Dressen was an older manager who had been around the block several times, and he liked older pitchers as well. Joey Jay didn’t get to pitch. . .well, REGULARLY, until 1961, when he was traded to Cincinnati:

Pitcher	Year	Points	Possible	Pct
Joey Jay	1954	223	350	64%
Joey Jay	1955	150	275	55%
Joey Jay	1958	267	450	59%
Joey Jay	1959	396	825	48%
Joey Jay	1960	490	775	63%
Joey Jay	1961	636	825	77%
Joey Jay	1962	618	950	65%
Joey Jay	1963	390	725	54%
Joey Jay	1964	494	825	60%
Joey Jay	1965	524	900	58%
Joey Jay	1966	247	500	49%

In 1959, pitching on a very irregular schedule for Fred Haney (48% regular), Jay was 6-11 with a 4.10 ERA. In 1960, with his regular use score increasing to 63%, Jay was 9-8 with a 3.25 ERA. In 1961, traded to Cincinnati, Jay’s Regular Use Score went up to 77%. His record improved to 21-10 in 34 starts. (His ERA went up, but that’s an illusion created by the park and league.)

It wasn’t just Jay, either. Juan Pizarro was a lefty with one of the best fastballs of that era. In 1960, fighting for regular work, he went 6-7 with an ERA a run over the league norm, park adjusted. Traded to the White Sox that winter, he also got regular work, and he also immediately became effective. I don’t have regularity scores for Pizarro, but it seems like a reasonable assumption that his regularity of use also increased sharply with the trade.

You remember what I said earlier about Bob Feller—that in this era, most teams tried to keep their #1 starter on a four-day schedule, and everybody else had to work around that. What Burdette was complaining about, really, was that from 1956 to 1961 he was one of the big dogs, that everybody else had to operate around. In 1962 he was downgraded to being one of the little dogs, who had to work around the big dogs. He didn’t like it.

But really, it should have happened sooner; the Braves should not have been jacking around Pizarro, Jay and six other young pitchers to keep Lew Burdette happy.

There is a very powerful trend, across history, of pitcher usage becoming more regular and more predictable. What we are talking about, really, are (at least) five different changes in the game, all operating toward the common purpose of regularizing a pitcher’s work schedule. Those five changes are:

1) The disappearance of double-headers,

2) The division of pitching staffs into starters and relievers,

3) The abandonment of the practice of starter leveraging,

4) The expansion of the bullpens, and

5) The adoption of pitch limits.

The key point, really, is (4). The expansion of the bullpens, I will argue, has been primarily driven by this exact reason: The desire to regularize workloads. We have written volumes about the expansion of the bullpens. But when you think about it, what are the extra pitchers really doing? They’re stabilizing the staff. They’re making it possible to keep workloads regular. That’s the main purpose of the larger bullpens; to regularize workloads. We have missed this fact, I think, because we focus on what we can measure. Not being able to measure the regularity of usage, we have essentially looked right through the real cause of the expansion of bullpens, which is the widespread belief that pitchers are most effective when they have regular work schedules. But perhaps I am getting ahead of myself.

Since 1920, the regularity of use of major league pitchers has advanced relentlessly. While my studies cover only about 3% of major league pitchers and not even a representative 3%, the study nonetheless demonstrates a massive change in the regularity of usage by pitchers over the last nine decades. For those pitchers for whom I have figured data, these are the overall Regular Use Scores for each of the last nine decades:

From	To	Pitchers	Points	Possible	Pct
1920	1929	104	49993	91350	55%
1930	1939	110	42521	76400	56%
1940	1949	112	47177	80050	59%
1950	1959	112	53793	88350	61%
1960	1969	110	62991	92500	68%
1970	1979	108	72181	97125	74%
1980	1989	111	69520	91625	76%
1990	1999	107	70182	86875	81%
2000	2009	138	105488	126525	83%

While our studies are limited, I would predict that a comprehensive study would change these percentages by only tiny amounts, and I think we may safely assume that the conclusion that the regularity of pitcher usage has shown steady increases over most of baseball history will not be undermined by additional research.

These are the top 50 and the bottom 50 pitcher/seasons in our study, in terms of regularity of use. I included pitchers with less than 20 game appearances, but didn’t count them:

Rank	Pitcher	Year	Points	Possible	Pct
1	Johan Santana	2006	775	825	94%
2	Randy Johnson	2004	795	850	94%
3	Mark Buehrle	2003	783	850	92%
4	Johan Santana	2004	759	825	92%
5	Ted Lilly	2004	713	775	92%
6	Bartolo Colon	2004	757	825	92%
7	Pedro Martinez	2004	734	800	92%
8	Ryan Franklin	2004	711	775	92%
9	Mark Buehrle	2004	778	850	92%
10	Jamie Moyer	2002	753	825	91%
11	Mark Buehrle	2009	729	800	91%
12	Jamie Moyer	2003	728	800	91%
	Trevor Hoffman	2003	182	200	91%
13	John Burkett	2001	748	825	91%
14	Jamie Moyer	2008	724	800	90%
15	Randy Johnson	1999	769	850	90%
16	Johan Santana	2008	746	825	90%
17	Randy Johnson	2002	768	850	90%
18	Jamie Moyer	1997	655	725	90%
19	Johan Santana	2007	721	800	90%
20	Jamie Moyer	2007	721	800	90%
21	Mark Buehrle	2006	698	775	90%
22	Mark Buehrle	2002	743	825	90%
23	John Smoltz	1993	762	850	90%
24	Jamie Moyer	2006	717	800	90%
25	Jamie Moyer	1998	737	825	89%
26	Johan Santana	2005	713	800	89%
27	Jamie Moyer	2000	557	625	89%
28	Mark Buehrle	2005	711	800	89%
29	Goose Gossage	1977	1575	1775	89%
30	Mark Buehrle	2008	731	825	89%
31	Adam Wainwright	2009	753	850	89%
32	Johan Santana	2009	531	600	88%
33	John Burkett	1996	730	825	88%
34	Livan Hernandez	2004	749	850	88%
35	Don Drysdale	1963	902	1025	88%
	Pedro Martinez	2007	88	100	88%
36	Steve Carlton	1973	857	975	88%
37	Pedro Martinez	1998	703	800	88%
38	John Smoltz	2004	1577	1800	88%
	Lee Smith	1980	372	425	88%
39	Livan Hernandez	2007	700	800	87%
40	Ryan Franklin	2008	1596	1825	87%
41	Bret Saberhagen	1998	655	750	87%
42	Mark Buehrle	2007	633	725	87%
43	Randy Johnson	2001	742	850	87%
44	Bartolo Colon	2002	698	800	87%
45	Goose Gossage	1976	654	750	87%
	Bret Saberhagen	1997	109	125	87%
46	Francisco Cordero	2005	1482	1700	87%
47	Randy Johnson	2006	697	800	87%
48	Adam Wainwright	2007	675	775	87%
49	Pedro Martinez	2005	653	750	87%
50	Don Drysdale	1964	847	975	87%

The earliest pitcher on the list of those with the most regular workloads was Drysdale in 1963. These are the bottom 50:

Rank	Pitcher	Year	Points	Possible	Pct
	Bucky Walters	1934	0	25	0%
	Joey Jay	1953	0	50	0%
	Tom Zachary	1936	32	175	18%
	Pedro Martinez	1992	5	25	20%
	Bucky Walters	1948	31	150	21%
	Randy Gumpert	1938	18	75	24%
	Steve Carlton	1988	20	75	27%
	Bobo Newsom	1929	15	50	30%
	Whitlow Wyatt	1931	25	75	33%
1	Ellis Kinder	1952	189	550	34%
	Whitlow Wyatt	1944	69	200	34%
	Ernie Bonham	1949	150	425	35%
	Freddie Fitzsimmons	1941	106	300	35%
	Tommy Byrne	1953	100	275	36%
2	Tom Zachary	1929	228	625	36%
	Bill Henry	1955	146	400	36%
3	Bill Henry	1953	183	500	37%
4	Whitlow Wyatt	1930	184	500	37%
	Tommy Byrne	1943	92	250	37%
5	Freddie Fitzsimmons	1939	240	650	37%
6	Elam Vangilder	1920	214	575	37%
	Mel Parnell	1955	112	300	37%
	Whitlow Wyatt	1929	28	75	37%
7	Tom Zachary	1928	243	650	37%
	Firpo Marberry	1935	38	100	38%
8	Bob Feller	1956	171	450	38%
	Larry Gura	1985	68	175	39%
9	Dolph Luque	1930	297	750	40%
10	Lefty Grove	1927	495	1250	40%
11	Tommy Byrne	1951	268	675	40%
	Freddie Fitzsimmons	1935	169	425	40%
	Tommy Byrne	1946	30	75	40%
	Lindy McDaniel	1955	30	75	40%
12	Freddie Fitzsimmons	1940	191	475	40%
13	Dolph Luque	1931	182	450	40%
14	Dazzy Vance	1933	275	675	41%
15	Eddie Rommel	1929	318	775	41%
16	Alex Ferguson	1928	339	825	41%
17	Bob Feller	1955	247	600	41%
18	Bucky Walters	1935	237	575	41%
	Whitlow Wyatt	1939	157	375	42%
19	Tom Zachary	1930	273	650	42%
20	Joe Page	1945	200	475	42%
21	Jim Konstanty	1944	200	475	42%
22	Dizzy Trout	1943	455	1075	42%
	Bobo Newsom	1953	170	400	42%
23	Firpo Marberry	1934	394	925	43%
24	Randy Gumpert	1951	341	800	43%
25	Earl Whitehill	1939	248	575	43%
	Firpo Marberry	1936	54	125	43%
26	Dizzy Trout	1939	346	800	43%
27	Dazzy Vance	1934	260	600	43%
28	Larry Gura	1976	206	475	43%
	Eddie Rommel	1932	174	400	43%
29	Early Wynn	1944	348	800	43%
	Bobby Ojeda	1994	11	25	44%
30	Herb Pennock	1930	264	600	44%
31	Herb Pennock	1922	341	775	44%
	Early Wynn	1946	177	400	44%
32	Eddie Rommel	1921	499	1125	44%
33	Bobo Newsom	1943	411	925	44%
	Steve Carlton	1966	89	200	44%
34	Tom Zachary	1922	345	775	45%
35	Dizzy Trout	1947	348	775	45%
	Freddie Fitzsimmons	1943	90	200	45%
36	Bobby Shantz	1955	248	550	45%
37	Herb Pennock	1925	519	1150	45%
38	Alex Ferguson	1922	430	950	45%
39	Dizzy Trout	1952	397	875	45%
40	Eddie Rommel	1922	568	1250	45%
41	Bobo Newsom	1946	377	825	46%
42	Elam Vangilder	1921	343	750	46%
43	Randy Gumpert	1950	446	975	46%
44	Elam Vangilder	1926	469	1025	46%
45	Ernie Bonham	1941	254	550	46%
46	Mel Parnell	1951	406	875	46%
47	Eddie Rommel	1930	395	850	46%
48	Earl Whitehill	1937	373	800	47%
49	Herb Pennock	1932	245	525	47%
50	Herb Pennock	1933	257	550	47%

Among those with 20 or more game appearances, the most recent pitcher was Larry Gura in 1976—a very interesting season, by the way. There is almost no chronological overlap between the lowest and highest scores. Since Johan Santana, 2006, had the most regular work regimen and Ellis Kinder, 1952, had the least regular, let’s compare and contrast their work patterns:

*Johan Santana, 2006*				*Ellis Kinder, 1952*
Month	Day	Year	BFP	Month	Day	Year	BFP
4	4	2006	28	4	16	1952	12
4	9	2006	23	4	18	1952	15
4	15	2006	28	4	23	1952	37
4	21	2006	29	5	3	1952	37
4	27	2006	30	5	5	1952	2
5	2	2006	28	5	16	1952	2
5	7	2006	27	5	18	1952	6
5	12	2006	26	5	19	1952	11
5	17	2006	27	5	24	1952	33
5	23	2006	30	5	30	1952	40
5	28	2006	29	6	1	1952	9
6	2	2006	26	6	5	1952	33
6	8	2006	22	6	7	1952	5
6	13	2006	28	8	14	1952	6
6	18	2006	25	8	24	1952	29
6	23	2006	30	9	1	1952	15
6	28	2006	26	9	5	1952	25
7	3	2006	25	9	7	1952	4
7	9	2006	29	9	9	1952	1
7	15	2006	26	9	13	1952	29
7	20	2006	26	9	16	1952	8
7	25	2006	27	9	24	1952	32
7	30	2006	28	9	28	1952	9
8	4	2006	25
8	9	2006	27
8	15	2006	27
8	20	2006	28
8	26	2006	29
8	31	2006	27
9	5	2006	27
9	10	2006	23
9	15	2006	31
9	21	2006	25
9	26	2006	31

Santana in 2006 never faced fewer than 23 batters in an outing, nor more than 31. His largest difference between batters faced in any two consecutive appearances was eight, and only ten times was there a difference larger than three.

The most batters that Santana faced in any outing was 31. Kinder in 1952 exceeded that six times in ten starts. The fewest batters Santa faced was 23; Kinder had less than that 14 times. Kinder’s work assignment for the day was outside the range of Santana’s extremes 20 times in 23 game appearances.

Santana worked on four or five days rest literally in every start all season, except for the first start. He never once worked off-schedule, or even slightly off-schedule.

Kinder worked on consecutive days once, and worked on one day’s rest seven times—but also had three gaps of ten days or more without taking the mound.

III. Analysis and Comment

As I said, this steady increase in the regularity of pitcher’s work is actually a result of five distinct factors, which are:

1) The disappearance of double-headers,

2) The division of pitching staffs into starters and relievers,

3) The abandonment of the practice of starter leveraging,

4) The expansion of the bullpens, and

5) The adoption of pitch limits.

1) The disappearance of double-headers.

The pitchers of the 1940s made more than 30% of their game appearances in double headers. Bobo Newsom was a very colorful pitcher who had a long career (1929-1953) with a lot of different teams. In his career he made exactly 600 game appearances, of which 208—34.7%--were in double headers. The percentages for other pitchers of his era were similar: Bucky Walters, 32%; Joe Page, 33%; Bob Feller, 29%; Whitlow Wyatt, 31%; Hugh Casey, 31%; Spud Chandler, 36%. Teams would not infrequently play back-to-back double-headers, and many, many times would have Wednesday and Thursday off for travel, then would play a double-header on Friday and another on Sunday.

Over time, more than 90% of these double headers have disappeared. Modern pitchers typically make 3% or less of their Game Appearances in double headers (Bartolo Colon, 2.7%; Mark Buehrle, 2.1%; John Smoltz, 3.0%; Adam Wainwright, 0.7%.)

Obviously, the disappearance of double-headers has helped to regularize pitcher work schedules.

2) The division of pitching staffs into starters and relievers.

There were 305 major league pitchers who made 100 or more game appearances between 1920 and 1939, every one of whom appeared both as a starter and as a reliever. All of them appeared in at least 5% of their games as starters, and at least 5% as relievers. Only a small fraction had starter percentages or reliever percentages over 80%.

Carl Hubbell, Dazzy Vance, Ted Lyons, Herb Pennock, Lefty Grove and Dizzy Dean all made 20 to 30% of their game appearances in relief (during the 1920s and 1930s. Some of them may have gone under 20% if you include the 1910s or the 1940s.)

By the late 1930s, managers were beginning to divide their pitching staffs into starters and relievers. I believe that it was Joe McCarthy who first divided his staff into pure starters and pure relievers, and I believe that he did this in the mid-1930s. Further research might better inform us about that issue, but in any case, by World War II “pure relievers” or “dedicated relievers” had emerged, and starting pitchers were generally being excused from the responsibility of finishing up for weakening teammates.

Let’s note that points (1) and (2) were totally unrelated, and that the division of pitching staffs into starters and relievers had nothing at all to do with the disappearance of double-headers. In fact, the division of staffs into starters and relievers came at a time when the number of double-headers was increasing very rapidly, due to the addition of lights to the parks. Owners of the 1920s and 1930s generally liked double-headers, which drew large crowds, but double-headers created scheduling issues because you generally had to start them no later than 3:00 in order to be sure you got both games in before darkness. The addition of lights enabled owners to start Friday-night double-headers at 5:00 or later, which caused the number of scheduled double-headers to spike upward during the 1940s.

3) The abandonment of the practice of starter leveraging.

Chris Jaffe, in Evaluating Baseball’s Managers (McFarland, 2010), writes a lot about “starter leveraging”, by which he means jiggling the starting schedule so that your best pitchers are matched up against the best teams, and left-handers are used most often against teams that may be vulnerable to left-handers and in parks that may favor left-handers. For example, he writes that (Al) “Lopez’s career helps trace the course and extinction of starting pitcher leveraging. In Cleveland, he oversaw one of the best collections of starting pitching in history (and did not engage in). . .an especially impressive degree of leveraging, but it is noteworthy that any existed given the overall quality and depth of Lopez’s starting rotation. . .When Lopez came to Chicago, he maintained this approach, using ace Billy Pierce frequently against top rivals. As noted (earlier), Lopez’s decision not to start Pierce against the Dodgers in the 1959 World Series helped bring about the end of starter leveraging (but Lopez continued to do some of it until 1963.) In 1964, Lopez changed. The reform that had blown across the baseball world over the previous few years finally came to Chicago.”

Jaffe demonstrates by a number of such narratives, documented by sabermetric analysis, that virtually all major league managers did juggle their rotations to leverage their best starting pitchers prior to 1955, but that this policy died in the late 1950s and early 1960s.

In connection with this change, I want to focus on this part of the chart that I gave you earlier:

From	To	Regular Use Pct
1920	1929	55%
1930	1939	56%
1940	1949	59%
1950	1959	61%
1960	1969	68%
1970	1979	74%
1980	1989	76%
1990	1999	81%
2000	2009	83%

The regularity with which pitchers have been used has increased in every decade, yes, and has probably increased in every five-year period, if we had enough data to measure that reliably. But the time when it really increased was between 1960 and 1979. From the 1920s through the 1950s, the regularity of pitcher use increased by a total of 6%. In the 1960s it increased by 7% in one decade, and then by another 6% in the 1970s. That 20-year period accounts for basically one-half the increase over the nine decades.

What was really driving that change? It wasn’t the disappearance of the double headers; that happened twenty years later.

I go back to the Ralph Houk story that I told before, and to the other anecdotes from that same era, having to do with Lew Burdette, Joey Jay and others. What was driving this change was that a very strong belief entered baseball, right about that moment, that pitchers would benefit from regularity of use. Workloads for pitchers didn’t go down at that time. If anything, they went up. This wasn’t a belief in avoiding overworking pitchers; that belief became important about twenty years later. It was a separate and distinct belief, in regular and predictable workloads.

Look, for example, at the Regular Use Scores of Don Drysdale, from 1959 to 1963:

Pitcher	Year	Pct
Don Drysdale	1956	48%
Don Drysdale	1957	65%
Don Drysdale	1958	61%
Don Drysdale	1959	56%
Don Drysdale	1960	67%
Don Drysdale	1961	68%
Don Drysdale	1962	80%
Don Drysdale	1963	88%
Don Drysdale	1964	87%
Don Drysdale	1965	71%
Don Drysdale	1966	77%
Don Drysdale	1967	80%
Don Drysdale	1968	84%
Don Drysdale	1969	55%

Why? Why did Drysdale’s regularity of use increase from 56% in 1959 to 80% in 1962, 88% in 1963? Drysdale was the Dodger’s number one pitcher in 1959; he was their number one pitcher in 1962. The Dodgers were in a tight pennant race, leading to a playoff, in 1959; they were in a tight pennant race, leading to a playoff, in 1962. Drysdale was managed by Walter Alston in 1959, and he was managed by Walter Alston in 1963.

What changed was simple: between 1959 and 1963, the notion entered baseball, in a really vigorous way, that pitchers would perform their best when working on a regular schedule. The Jim Bunning thing in 1964, when Mauch pulled Bunning off-rotation and the Phillies crashed and burned, cemented that link in the public mind. The reality is, managers in the heat of the pennant race had done exactly what Mauch did in 1964 quite routinely for at least 50 years before then. Sometimes it worked, sometimes it didn’t work. Either way, it was never a big deal.

The difference was, Mauch did it after the public had come to believe that pitchers performed their best given a regular work schedule. That’s all; that’s the only difference between Mauch doing that in 1964 and Eddie Sawyer and Burt Shotton both doing it in the last week of the season in 1950. In 1950 it was standard practice because there was no strong belief in the value of regular work schedules for pitchers; in 1964 it was a blunder because there was.

The Longing for regular use patterns had entered baseball in a big way.

In 1959 the league ERAs were 3.95 and 3.86. In 1968 the league ERAs were 2.98 and 2.99.

It is possible, in my opinion, that one thing that caused the pitcher domination of the 1960s was the regularization of pitcher schedules. The re-definition of the strike zone in 1962/63 certainly contributed, yes—but the trend downward continued after 1963. Most of it was after 1963. People speculate on what caused this to happen. One thing that may have caused this to happen was that pitchers in the 1960s were working in more regular patterns.

4) The expansion of the bullpens.

Of course we all know how bullpens have expanded in the last 30 years, and many of us have written about this. Many of us have written critically about this, and I don’t think I need to review that.

The particular contribution of this research, however, is

1) To establish a method to document differences between pitchers in the regularity of use, and

2) To suggest the possibility that the desire to create regular work patterns for all pitchers is in fact the major driving force behind the expansion of bullpens.

What multiple layers of relievers do for a pitching staff is to act as a shock absorber against the normal irregularity of the schedule. When you have multiple relievers with subtle distinctions in their job assignments, it becomes more practical to expand or adjust those assignments to minimize stress points for starting pitchers and the critical two relievers. Not only is this done, but that’s why those pitchers are there; that is the benefit that they give to a team.

About eight/ten years ago, there was a debate going about the appropriate use of the closer. In the middle of this debate I was hired by the Red Sox, and thus lost most of my ability to participate in that debate, because whatever I said could be played off the Red Sox in ways that were not helpful to the team.

But what I really did not understand, until about 2005, was this. If you look at Mariano Rivera, at Trevor Hoffman, Papelbon and others, these pitchers reach and sustain levels of effectiveness, relative to context, that are historically unprecedented. No starter in history—not Koufax, not Randy, not Walter, not Grove, not anybody—was as effective, inning per inning and adjusting for context, as Mariano Rivera, and I’m not just talking about ERA. Strikeouts, walks, home runs allowed. ..you name it, and the other guys are right there with him. Why is that?

It’s because the modern system of using Closers creates for these pitchers a role that is both very limited and very regular, extremely predictable. Those guys throw maybe three dozen pitches a week—and do so at moments that can be predicted with a high degree of reliability. They know when they are going to be in the game. They can loosen up, stretch, push themselves to a peak, and they take the mound only when they are at that peak. That enables them to reach fantastic levels of sustained effectiveness.

I never understood this, until about 2005, and frankly, neither did any of you. I know that you didn’t because of the things that you wrote. People write about using your closer in the 7^th inning when the high-leverage situation arises in the seventh inning. If you think you can do that, you don’t understand. You can’t reach that peak of perfection on an irregular work schedule. You’ve got to KNOW when you’re going to be needed. If you used Mariano in the 7^th inning half the time, he would NOT be Mariano.

And then there’s the press guys, who think that some guys can pitch in the 9^th inning, while other guys can only pitch in the 7^th. . .which totally misses the point, but then, we all miss half the points three-fourths of the time.

What all of those other relievers are doing, really, is allowing the critical relievers to pitch only at those moments of peak effectiveness. They’re load-levelers. We’ve missed this; up to now. We’ve left it out of the discussion. Now it’s in. That’s what this article is about. I am arguing that you can’t really evaluate the worth of a deep bullpen unless you can deal with the advantage of creating stable workloads for other pitchers, because that’s really why it’s done.

5) The adoption of pitch limits.

About which, again, enough has already been said—but again, I would argue that the real point of pitch limits is not to reduce the workload of starting pitchers, but to stabilize it.

OK, but are we dealing here with five different phenomena, which combine to create a consistent evolution in the game, or are we dealing with a single phenomenon which has five different manifestations? Or, looking again at this chart:

From	To	Regular Use Pct
1920	1929	55%
1930	1939	56%
1940	1949	59%
1950	1959	61%
1960	1969	68%
1970	1979	74%
1980	1989	76%
1990	1999	81%
2000	2009	83%

Have we gone as far as we can go?

Because if it is five different phenomenon, then it may be that these five distinct changes that were driving the Regular Use Percentages up have all played themselves out, and thus it may be that we are at the end of the line in this massive, 90-year shift.

If, on the other hand, it is not five different phenomenon but five different manifestations of the same phenomenon—the Longing to create ever more stability in pitcher workloads—then we have to assume that this nine-decade trend may continue for several more decades until the Regular Use Percentages reach 90% or 92% or 94%. And if that’s the case, then we have to ask ourselves, since these other manifestations of the shift have pretty much played themselves out, how this desire to create regular use patterns will change baseball in the future.

The disappearance of the double headers—that’s clearly a different thing. That was not caused by the desire to create regular workloads; we can be pretty sure that it wasn’t. But I would argue that the other four factors are, in reality, all simply manifestations of the same over-riding and over-whelming desire, on the part of managers and pitching coaches, to create stable workloads—thus

1) that this desire has created massive changes in the game of baseball in the past, and

2) we should expect that it will create additional significant changes in the game of baseball in the future.

I can cite for you another change, caused by this same Longing, which is already occurring. More aggressive use of the Triple-A roster as an extension of the major league roster.

I have worked for the Red Sox since 2002—eight years. Over that time, I have seen a very substantial increase in the willingness, both of the Red Sox and other teams, to make a roster move to avoid stress points for pitcher usage. Eight years ago, you could pretty much assume that the pitchers the other team had on their roster in the morning would be the pitchers they would have on their roster at game time. Now, you can’t. Same thing—front offices move pitchers up and down to protect the regular use patterns of the key pitchers on the team.

In working through this process, we skipped a critical question. Is this a legitimate benefit? Do pitchers in fact pitch better and/or get injured less when they have regular, predictable work schedules than they do when their workloads are less regular? Or are we chasing a phantom here, bending ourselves out of shape and wrenching the game of baseball out of its traditional shape merely to pursue an “advantage” which is no real advantage at all?

Well, I don’t doubt that pitchers pitch better when they have a regular work schedule. This is the universal assumption of all baseball men, very deeply ingrained; one can’t challenge that assumption based on a mere absence of evidence. We don’t have any evidence or any reason to believe that this is not true, and, intuitively, I am strongly inclined to believe that it is true.

I do have some serious questions, however, about whether the benefits derived from the regular usage of pitchers are substantial enough to justify the things that have been done in that effort. The deeply rooted and constantly growing belief in the benefits of regular work for pitchers has changed baseball like a growing tree crushing and throwing aside a meddlesome sidewalk or even an old street.

That’s why I wanted to post this method: To put us in a position to ask those questions. Let’s assume that some managers use their pitchers in more regular patterns than other managers—if not now, then certainly at some points in the past. Can we validate by sabermetric research that managers who are ahead of the game in this fashion get better results from their pitchers than managers who are behind the game? Can we show that they have fewer injuries? Can we show that they have better ERAs? Can we show that they win more games?

Can we demonstrate that pitchers who are used on a regular basis are more effective than those who are not? Can we demonstrate that they are injured less often?

If we cannot demonstrate that these benefits exist, then should we call into question the things that are done to pursue this benefit? That’s getting ahead of ourselves; I guess we shouldn’t ask that question until we see what we have.

It may be that the method that I have outlined here will be inadequate to the tasks set before us. Better methods may be needed. These issues will be entwined with issues of fatigue and workload, and it will be in some cases impossible to tease them apart, so that some of these questions will prove to be un-answerable. It just seems to me that this is too large a subject to ignore, that we have ignored it for too long, and I’m trying to move the discussion off the dime.

COMMENTS (22 Comments, most recent shown first)

bbmarks
I don't buy it that Mariano Rivera couldn't be effective in other situations. All you have to do is tell him that he will be sometimes used in a tie ballgame in the 7th or other such situations, then he can get ready just as if he is closing the game. Rivera would still be Rivera.
2:05 PM Oct 20th

OldBackstop
Very interesting, but I think the Whitey Ford anecdote needs some drill down. He did indeed go from 25-4 by being moved to "regular use" and not being held for starts against the tough teams. But that was also the expansion year...eight more games and against weaker teams. Six of Whitey's wins (in 6 starts) came against expansion teams that Stengel probably would have hopped for him -- for the good of the team. It is interesting to note that while he did have that amazing 25-4 record, he also had the highest ERA of his career to date: 3.21, and it was the second highest ERA of his 17 year career (3.24 in '65). And (with the six more games, albeit) the Yanks produced 827 runs, up from 746 in 1960.

Anyway, interesting piece. I'd be interested in more data from an injury perspective.
5:26 PM Sep 22nd

Bucky
Not to put other research questions onto your plate, but does the increasing regular load also extend to position players? Without examining it, it does seem to me that we have less platooning, more in-season stability of positions, and fewer utility players than in earlier days. I could of course be off my rocker on this. And I also don't know what impact, if any, this would have on position players. But it seems as though, if I am right, that this might go hand in glove with a more regular load for pitchers of all kinds.
2:28 PM Sep 5th

TomStrother
After almost 30 years of availing myself of your research, I believe this is one of the most significant pieces of work and most practical for understanding changes in the game, particularly for people in their 60's like me, that you've done. And it isn't weighted down with a lot of sabermetric arcana. Good stuff.
9:41 AM Sep 3rd

MarisFan61
Speaking as a Yankee fan, I hope Girardi Eiland & Co. will hear about this (or have realized it) before doing too many more odd things with Phil Hughes and Joba Chamberlain. This article systematizes much of what I've felt about the handling of them.
I have loved the fact that teams are now at least *trying* to find ways to avoid having good young pitchers flame out, but I have long felt that the "regularity" factor has been ignored; I just never thought of analyzing it in such a way. Almost-total emphasis is often put on "number of innings," without regard to the possibility that 'an inning isn't an inning isn't an inning' -- irregularity might be a hidden stress factor.
If you save a starter's innings by using him in short relief for part of the season, are you really saving him? Or, if you "shut him down" for a time late in the season before bringing him back? If pitching is much like any activity that I know, you aren't.
Viewing it as pure arithmetic -- number of innings -- misses the role of these other stressors.
2:00 AM Sep 3rd

mskarpelos
This article typifies why I started reading Bill James soon after graduating from college in the early 1980s. It wasn't the stats per se that I found interesting but rather the use of stats to explain an overarching historical narrative. I never knew about the rift between Ford and Stengel nor Fred Haney's incompetence until I read this article. Those stories were interesting in their own right. But seeing them so expertly woven into a greater historical narrative carefully built on a solid statistical foundation is something that only Bill James can do. We have fine baseball writers and fine baseball statisticians and fine baseball historians, but only Bill James can do all three at a world class level. Excellent job.

My only quibble is with the seeming arbitrariness of the reliability metric. I'm not saying that the metric is intrinsically inaccurate; I just don't understand why we have a discontinuous 10 point scale for days off and a slightly more continuous 15 point scale for batters faced. Couldn't we simply combine the standard deviation for days off and the standard deviation for batters faced for each appearance with equal weighting to form our reliability metric? I'm not necessarily advocating this, but it seems like the obvious approach to me, so I'm not sure why Bill defined the metric the way he did.
2:04 PM Aug 31st

CharlesSaeger
David: Pitchers nowadays aren't used to pitching on three days rest, and pitch worse when on it, as opposed to those of the Seventies. Personally, I'd like to see the four-and-a-half man rotation, though the half man is anti-regularity.
1:33 PM Aug 31st

Richie
Don't you mean, "all other things being equal, they won't fire you for being orthodox"?
11:33 AM Aug 31st

donmalcolm
I suspect that your percentages are about 5-7% lower once you put in all pitchers who have 10GS or more. But, yes, the trend you've unearthed here is undeniable.

I think it's likely that it can't go up as much as you surmise. But it almost certainly won't go down, as there is insulation from outlier strategies backfiring if a manager sticks to this protocol more or less blindly. All else being equal, they won't be able to fire you for doing something unorthodox!

There seems to be a secondary benefit emerging along with the potential for top-end closer performances (though those, too, are almost "freak show" stats). Middle relievers who pitch in more than one full inning are also benefiting from regularized usage. The aggregate ERA+ for these appearances has been steadily rising since 1950 (when it was 111) and is now flirting with 150. Apparently specialization is having other impacts.

To answer one of your questions: given that the stabilization of starter workloads is seemingly so uniform, there will likely be no way to tease out any conclusions concerning quality. It's more likely that teams will adopt a six-man rotation than return to a set of practices that would decrease starter uniformity and reliever specialization. It may be possible, with enough historical research, to determine if this approach limits injuries.
1:17 AM Aug 31st

bearbyz
The answers to the questions in the last few paragraphs could make a very interesting book.
9:22 PM Aug 30th

StatsGuru
Great article. The question that comes to mind is, should a four-man rotation work, as long as it's regular?
8:12 PM Aug 30th

jdw
On zqxjk's question about how much of it was/is to keep the staff happy, it would seem not a lot initially. Look again at the chart in the pitch leveraging section, and when the greatest change happened. It was in a period where management wasn't yet engaged in keeping players happy. The probably didn't kick off until the 80s, possibly even later. Even in the early days o free agency, most teams didn't put the effort into keeping people happy knowing that folks could and would walk once given the chance.

I think Bill pretty much nails it that it was "performance", or at least the strong belief in better performance, that led the charge. Frankly one couldn't find a comp than Drysdale vs Drysdale. Alston and the Dodgers alone would make an interesting macro study on this given Alston's longevity, along with the stability of the Dodgers front office / ownership, their "Dodger Way" philosophy and a number of pitchers like Drysdale and Sutton who had longer stretches in the Dodgers Rotation.

Anyway... extremely interesting article, Bill. Very good read.
7:59 PM Aug 30th

danfeinstein
This is probably Bill's article this year that has most made me think. Thanks.

I'm not surprised to see a more regimented and routine usage pattern as the sport has matured. We'd expect experimentation to be much greater in a newer industry than in one more established.

Bill suggests that increase regularity is one factor that has enabled Rivera, Hoffman, Wagner, et al to thrive. I wonder if it also has allowed or extended the peak performances of Johnson, Martinez, Maddux, Clemens, etc. By rate stats compared against their peers, the best of the last two generations of players have stood out more than their predecessors; I wonder if regular usage has a disproportionate benefit for the best of the best.

I'm not convinced that every change in strategy is an improvement. I came of age in the time of Herb Washington and the fleet afoot. That was certainly an odd strategy and one that I suspect was a step backwards. By saying that, I am not certain that the strategy of regularity - and the apparent benefits for the back end of the bullpen - outweigh the reduction in availability. Lets assume that Rivera is better if he knows he will pitch the 9th with a 1-3 run lead plus enough extra work to stay sharp; how much better does he have to be to outweigh not being available to pitch the 9th tied, the 8th in a pinch, or an infrequent 7th?
7:23 PM Aug 30th

TJNawrocki
I thought it was very interesting that Rich Gossage showed up in two consecutive years on the "stable workload" list. Not only did Gossage switch teams between 1976 and 1977, but he was a starter in 1976 and a reliever in 1977. The only stable thing about those two seasons was his workload.
5:40 PM Aug 30th

CharlesSaeger
robinsong: It's pretty obvious that this has happened. A look down at bb-ref.com for the years where we have pitch data has the number of short (less than 80 pitches) outings drop along with the long outings.
4:54 PM Aug 30th

MWeddell
Thanks for the article.

By the way, Chris Jaffe also has written about the changes in doubleheader frequency during baseball history. Here's the third part of his series (which has links to the first two parts): http://www.hardballtimes.com/main/article/history-of-the-doubleheader-part-iii/
2:53 PM Aug 30th

Robinsong
A terrific article. I think it would be interesting to look at the lower bound of workload, like your observation on Santana facing at least 23 batters. Have managers raised the bar for pulling a starter (or key reliever) having a poor outing? One could look at the frequency with which poor Game Scores are pulled in the first three innings. Having managers not overreact to a couple of runs may actually increase win probability as well as regularity. In any case, it would be interesting to study along with the high workload questions. This may be an area where the pitcher regularity could continue to increase in the future.
1:09 PM Aug 30th

CharlesSaeger
The question about closer usage should be what pattern of regular use is best for everyone involved. Instead of the save defining a closer's use, you could change it, so long as the whole staff knows what will happen. Say, the closer comes into tie games and one run leads in the eighth and ninth, and when two runs ahead or one run behind in the ninth. Define the next guy down by adding one to that range. You still have regular use, and everyone still knows when he pitches. Maybe you add the seventh to the closer; maybe you don't bring him into tie games in the eighth. But the point is to keep it regular AND maximize how often you get your best reliever into close games.
12:17 PM Aug 30th

greggborgeson
This research might enable even more radical departures in pitcher usage. For example, having a three or four day rotation of two paired pitchers per day, each pitching four or five innings max. That could create even higher levels of regularity, and potentially shrink the pitching roster by two or three.
12:17 PM Aug 30th

champ
Once again, Bill, you show your willingness to investigate EVERYTHING, including your own point of view.
12:14 PM Aug 30th

sprox
How much of this is done to keep the pitching staff happy - because they like regular work and how much of this is to improve team performance or some other business reason such as reducing injuries? Even if there is NO data to suggest regular work is better in any way - it may still be preferable if nothing else than to keep the pitching staff content.
11:16 AM Aug 30th

wovenstrap
Brilliant article. And you're totally right about the stathead bias about using the closer in the 7th or whatever -- indeed, that was largely based on things you had written (not saying you ever committed yourself to such a position). Hell, I still suspect that telling the closer he has to be ready for the last at-bat by the #3 hitter (say) would achieve an acceptable degree of "psychological" regularity, but I'm willing to concede that that could be wildly mistaken. Even if Mariano's numbers stop being so eye-popping, there might be added value in having Mariano at 90% effectiveness dealing with Papi in the 7th, rather than Daniel Nava in the 9th. There might be an analogy here with kickers in football -- who are brought in to kick 45-yard field goals in key situations etc.

It's still a little bit mystifying that 3-4 roster spots might be worth it; it's a heavy price for that regularity, and I haven't noticed ERA records being broken or anything. In other words, it still feels like received wisdom or groupthink. At best it's aesthetically unpleasing to the fan, feels inefficient and wastes time, as you said. But managers aren't graded on aesthetics. As usual, you've provided plenty to think about.
11:14 AM Aug 30th

The Longing

COMMENTS (22 Comments, most recent shown first)

Leave a comment

Report inappropriate comment


Type of Abuse:
Comments: