Remember me

The Longing

August 30, 2010

(Pitcher Regularity Scores)

(Poetically re-titled “Major League Managers and the Longing for Stability”)

 

            Major league baseball has been substantially re-created and re-invented, over the last hundred years, by the desire of major league managers to create more stable workloads for their pitchers.   This desire has overridden and thus essentially eliminated platooning, has sharply limited pinch hitting, has done away with third catchers, has virtually eliminated complete games, has re-defined normalcy in roster construction, pushing us gradually from 7 pitchers per team to 13, has added dozens of new statistical measurements to the game, and has caused games to get substantially longer and slower, as managers change pitchers every more frequently.

            We have talked a lot, we have written a lot, we have done much research on the issue of pitcher workloads.   The research that we have done—on this issue, perhaps more than any other—has invaded the game itself, and is now relied upon by people in decision-making positions to help guide the ways that pitchers are used.

            But there is a parallel issue, of perhaps equal or similar importance, about which we have done very little research.   That is the issue of the regularity of pitcher usage.

            Suppose that we had a metric to measure not how hard the pitcher had been worked, but how regularly, how consistent and even his workload was over the course of his season or career.   If we had such a metric, think of the other questions that this would open up to us.   Does pitcher performance improve with more regular usage?   Does regularity of usage help to prevent injuries?    Is a workload of 200 innings with regular use perhaps equal to a workload of 150 innings with irregular use, or 120 innings, or 80?

            Is regularity of pitcher use a variable among managers?   If so, is it a relevant variable in terms of manager success, or an irrelevant variable?  Do managers who keep their pitchers on a more regular schedule enjoy more success?

            I have three tasks in this article.   First, I will outline a simple method with which to distinguish between regular and irregular workloads.    Second, having applied this method to more than 1,000 pitcher/seasons—about 3% of all pitcher/seasons in baseball history—I will report the data resulting from those studies.    Third, I will try to explain and interpret the data as best I can, and then I will stick just a toe into those issues that I have outlined above.

 

I.  The Method

 

            It would have been impossible to have done this research—as it would have been impossible to have done any of the research that we do these days—without Retrosheet.   Retrosheet.org in recent months has added daily logs for pitchers dating back to 1920.   This research was done based on those logs.

            My method here is extremely simple.   From the game logs available through Retrosheet, we extract two figures:  the dates on which the pitcher pitched, and the number of batters that he faced.   For Bob Feller, 1946, this is that data:

 

Month

Day

Year

BFP

 

Month

Day

Year

BFP

4

16

1946

29

 

7

24

1946

31

4

21

1946

41

 

7

28

1946

34

4

26

1946

38

 

7

31

1946

34

4

30

1946

32

 

8

4

1946

26

5

4

1946

32

 

8

8

1946

33

5

8

1946

36

 

8

10

1946

9

5

12

1946

45

 

8

11

1946

1

5

17

1946

34

 

8

13

1946

29

5

21

1946

33

 

8

17

1946

36

5

26

1946

42

 

8

20

1946

34

5

30

1946

38

 

8

22

1946

5

6

4

1946

34

 

8

24

1946

37

6

8

1946

35

 

8

28

1946

32

6

12

1946

36

 

9

1

1946

33

6

16

1946

38

 

9

3

1946

7

6

21

1946

34

 

9

5

1946

22

6

25

1946

41

 

9

8

1946

33

6

29

1946

34

 

9

12

1946

35

7

3

1946

41

 

9

15

1946

29

7

7

1946

37

 

9

19

1946

34

7

11

1946

28

 

9

22

1946

38

7

16

1946

41

 

9

25

1946

37

7

20

1946

38

 

9

27

1946

21

7

21

1946

7

 

9

29

1946

38

 

            Feller pitched 48 times in 1946, facing a total of 1,512 batters.  

            Scanning this log from a modern perspective, we can see a number of quite remarkable things.   Feller pitched a complete game at Washington on July 20, 1946, came back on July 21 and pitched an inning and two-thirds of relief (scoreless relief, striking out four.)   He pitched a complete game at Chicago on August 8th (a one-hit shutout), came back to pitch in relief in St. Louis on August 10th (3 innings of perfect relief), pitched in relief again on August 11th, then started against Detroit on August 13th.   He gave up three hits, one run in 8 innings against Detroit, lowering his ERA to 1.88, rested three days, started again, rested two days, started again, missed one day, pitched in relief, missed one day, and started again.   In the last five days of the season he pitched a total of 23 innings.

            But our purpose here is not to stare at this in open-mouthed awe, but to score the regularity (or irregularity) of this assignment.   The score that I developed has two parts, and a potential of 25 points per game appearance, after the first appearance.

            In Part A, we compare the intervals between appearances. . .counting the number of days off between this appearance and the previous one.   For Feller, 1946, this is as follows (below). The gap between the last start of Feller’s 1945 season and his first in 1946, the off-season being longer then than it is now, was 201 days. . .which actually makes Feller’s 1946 season even more remarkable, when you consider that the season then was about three weeks shorter than it is now: 

 

Month

Day

Year

Gap

4

16

1946

201

4

21

1946

5

4

26

1946

5

4

30

1946

4

5

4

1946

4

5

8

1946

4

5

12

1946

4

5

17

1946

5

5

21

1946

4

5

26

1946

5

5

30

1946

4

6

4

1946

5

6

8

1946

4

6

12

1946

4

6

16

1946

4

6

21

1946

5

6

25

1946

4

6

29

1946

4

7

3

1946

4

7

7

1946

4

7

11

1946

4

7

16

1946

5

7

20

1946

4

7

21

1946

1

7

24

1946

3

7

28

1946

4

7

31

1946

3

8

4

1946

4

8

8

1946

4

8

10

1946

2

8

11

1946

1

8

13

1946

2

8

17

1946

4

8

20

1946

3

8

22

1946

2

8

24

1946

2

8

28

1946

4

9

1

1946

3

9

3

1946

2

9

5

1946

2

9

8

1946

3

9

12

1946

4

9

15

1946

3

9

19

1946

4

9

22

1946

3

9

25

1946

3

9

27

1946

2

9

29

1946

2

 

            Feller’s “gaps”, as you can see, are actually fairly regular for most of the season—five days, five days, four days, four days, four days, etc.   

            I score these “gaps” by asking this question:  Is the gap between these two appearances the same as the gap between the previous two appearances?   If the gap is exactly the same, I score that as “10”, or 10/10.   If the gap is different by one, I score it as 9/10.  If it is different by two, I score it as 6/10.   If it is different by three, I score it as 1/10.   If it is different by four or more, I score it at 0/10—totally irregular work.

            For Feller in 1946, this creates the following scores (S A stands for Score-A):

 

Month

Day

Year

Gap

S A

4

16

1946

201

 

4

21

1946

5

0

4

26

1946

5

10

4

30

1946

4

9

5

4

1946

4

10

5

8

1946

4

10

5

12

1946

4

10

5

17

1946

5

9

5

21

1946

4

9

5

26

1946

5

9

5

30

1946

4

9

6

4

1946

5

9

6

8

1946

4

9

6

12

1946

4

10

6

16

1946

4

10

6

21

1946

5

9

6

25

1946

4

9

6

29

1946

4

10

7

3

1946

4

10

7

7

1946

4

10

7

11

1946

4

10

7

16

1946

5

9

7

20

1946

4

9

7

21

1946

1

1

7

24

1946

3

6

7

28

1946

4

9

7

31

1946

3

9

8

4

1946

4

9

8

8

1946

4

10

8

10

1946

2

6

8

11

1946

1

9

8

13

1946

2

9

8

17

1946

4

6

8

20

1946

3

9

8

22

1946

2

9

8

24

1946

2

10

8

28

1946

4

6

9

1

1946

3

9

9

3

1946

2

9

9

5

1946

2

10

9

8

1946

3

9

9

12

1946

4

9

9

15

1946

3

9

9

19

1946

4

9

9

22

1946

3

9

9

25

1946

3

10

9

27

1946

2

9

9

29

1946

2

10

 

            Altogether, Feller scores for 1946 at 409 out of a possible 470 points on Part One of our system—86%.

            In that era, the common rule was that the ace of the staff—what we would now call the #1 starter—started on a regular four-day schedule if possible, always started against key opponents, and the rest of the staff had to move around to accommodate him.   If you checked Feller’s teammates, I am certain you would find that their schedules were less regular than Feller’s.   There were many more “travel days” then than now and frequent double-headers, and Feller hop-scotched other pitchers to get more work.   That’s the way it was done at that time.

            The other question that we ask is, “Was the number of batters that this pitcher faced in this outing the same as it was in the previous outing?”

            If the number of batters faced was exactly the same or was within 3 of being the same, we score that at “15” (15/15).    If it was different by 4, that’s 14; different by 5, that’s 13; different by 6, that’s 12.   For each difference in batters faced in excess of three, we subtract one, down to zero.   For Feller, 1946, these are the scores for each outing:

 

Month

Day

Year

Gap

S A

 

BFP

S B

4

16

1946

201

 

 

29

 

4

21

1946

5

0

 

41

6

4

26

1946

5

10

 

38

15

4

30

1946

4

9

 

32

12

5

4

1946

4

10

 

32

15

5

8

1946

4

10

 

36

14

5

12

1946

4

10

 

45

9

5

17

1946

5

9

 

34

7

5

21

1946

4

9

 

33

15

5

26

1946

5

9

 

42

9

5

30

1946

4

9

 

38

14

6

4

1946

5

9

 

34

14

6

8

1946

4

9

 

35

15

6

12

1946

4

10

 

36

15

6

16

1946

4

10

 

38

15

6

21

1946

5

9

 

34

14

6

25

1946

4

9

 

41

11

6

29

1946

4

10

 

34

11

7

3

1946

4

10

 

41

11

7

7

1946

4

10

 

37

14

7

11

1946

4

10

 

28

9

7

16

1946

5

9

 

41

5

7

20

1946

4

9

 

38

15

7

21

1946

1

1

 

7

0

7

24

1946

3

6

 

31

0

7

28

1946

4

9

 

34

15

7

31

1946

3

9

 

34

15

8

4

1946

4

9

 

26

10

8

8

1946

4

10

 

33

11

8

10

1946

2

6

 

9

0

8

11

1946

1

9

 

1

10

8

13

1946

2

9

 

29

0

8

17

1946

4

6

 

36

11

8

20

1946

3

9

 

34

15

8

22

1946

2

9

 

5

0

8

24

1946

2

10

 

37

0

8

28

1946

4

6

 

32

13

9

1

1946

3

9

 

33

15

9

3

1946

2

9

 

7

0

9

5

1946

2

10

 

22

3

9

8

1946

3

9

 

33

7

9

12

1946

4

9

 

35

15

9

15

1946

3

9

 

29

12

9

19

1946

4

9

 

34

13

9

22

1946

3

9

 

38

14

9

25

1946

3

10

 

37

15

9

27

1946

2

9

 

21

2

9

29

1946

2

10

 

38

1

 

 

            Here, as we can see, Feller’s scores are somewhat lower.   Feller’s S B scores for the season total up to 462 out of a possible 705, or 66%.   Adding together S A and S B, Feller’s “Workload Regularity Score” for the 1946 season is 871 out of a possible 1175, or 74%.

            In that era, that’s a very high score.   Among the fourteen pitchers for whom I have data covering the 1946 season, Feller’s 74% is easily the highest percentage.    Feller’s workload, in 1946, was more regular than that of any other pitcher that I studied. 

 

II.  The Data

 

            In late 1960, the legendary Casey Stengel was fired by the New York Yankees and replaced with the late Ralph Houk.   Houk’s first phone call, after learning that he had the job, was to Whitey Ford.

            Let’s pause there a second, and read between the lines.   Why did Houk call Ford first?   Was Ford the leader of the team?

            Well, yes, but no more than Berra or Mantle, and there is something else going on here.   You remember Casey’s famous line, “The secret of managing is to keep the five guys who hate you away from the ten guys who are on the fence?”   Ford, by 1960, had (we suspect) become the de facto leader of the Dump Casey faction of the Yankee roster.   He wasn’t ripping Stengel is the newspapers or anything, but it seems pretty clear, reading between the lines a little, that Ford by 1960 had grown disenchanted with Stengel’s management of the team, in large part because of the way that he used his pitchers.   Houk was reaching out to Ford to heal the rift.

            And what did Ford say to Houk, in that phone conversation?   “Just put me on the mound every fourth day and let me pitch.”   (Quote is not exact.) 

            Stengel was a great believer in percentage baseball—that is, in deploying his players at their most effective moments.   He used left-handed hitters against right-handed pitchers, he used right-handers against lefties, and he varied his pitching patterns constantly so that his best pitcher—Ford—would pitch against the strongest opposition and in the parks where he was best suited, while the lesser pitchers would soak up the starts against the teams the Yankees could beat anyway.   Ford never won 20 games for Stengel, largely because he didn’t pitch on a regular four-day rotation like other staff aces.

            It was smart percentage baseball, and that’s the way the game was played by some managers in 1920, in 1930, in 1940, but by 1960 it wasn’t really done that way anymore.   Starting pitcher leveraging, as Chris Jaffe calls it in his new book about managing, was pretty much gone by 1960.   Casey was still doing it, and Whitey didn’t like it.

            These are Whitey Ford’s Regular Use Scores throughout his career:

 

 

Pitcher

Year

Points

Possible

Pct

Whitey Ford

1950

253

475

53%

Whitey Ford

1953

506

775

65%

Whitey Ford

1954

506

825

61%

Whitey Ford

1955

624

950

66%

Whitey Ford

1956

518

750

69%

Whitey Ford

1957

337

575

59%

Whitey Ford

1958

482

725

66%

Whitey Ford

1959

466

850

55%

Whitey Ford

1960

413

775

53%

Whitey Ford

1961

768

975

79%

Whitey Ford

1962

638

925

69%

Whitey Ford

1963

741

925

80%

Whitey Ford

1964

658

950

69%

Whitey Ford

1965

626

900

70%

Whitey Ford

1966

604

1075

56%

Whitey Ford

1967

102

150

68%

 

            Ford’s pitching schedule in 1960 was 53% regular.   In 1961, under Ralph Houk, it was 79% regular—a huge increase.

            In 1960 Ford was 12-9.  In 1961 he was 25-4, and won the Cy Young Award.   Of course, the true facts are always more complicated than the simplest narrative:  It was an expansion year, Ford’s ERA actually went up, etc.   I’m not trying to hide the true facts; I am merely pointing out:

            1)   Ford was used by Stengel in a very irregular pattern,

            2)   Ford was unhappy about this,

            3)   Houk used Ford in a much, much more regular pattern,

            4)   Ford won twice as many games and won the Cy Young Award, and

            5)   Ford was much happier.

            I am using this fairly well-known story to demonstrate that our method works, but there

are actually are a series of story lines from 1961/1962 that are relevant here.    From 1956 to 1961 Lew Burdette had won-lost records of 19-10, 17-9, 20-10, 21-15, 19-13 and 18-11.   Over that six-year span Burdette was second in the major leagues in innings pitched and in games won, behind his running mate Warren Spahn.   He was one of the premier pitchers in the game.

            In 1962 Burdette won just 10 games—10-9, ERA of 4.88, far worse than the league average.   Unhappy, Burdette said the next spring that “the only difference between last year and any other year was that I didn’t get to pitch regularly.”   There’s that word again, regularly.  These are Burdette’s Regular Use Scores:

 

Pitcher

Year

Points

Possible

Pct

Lew Burdette

1950

14

25

56%

Lew Burdette

1951

26

50

52%

Lew Burdette

1952

586

1100

53%

Lew Burdette

1953

758

1125

67%

Lew Burdette

1954

494

925

53%

Lew Burdette

1955

597

1025

58%

Lew Burdette

1956

581

925

63%

Lew Burdette

1957

610

925

66%

Lew Burdette

1958

695

1000

70%

Lew Burdette

1959

655

975

67%

Lew Burdette

1960

551

1100

50%

Lew Burdette

1961

668

975

69%

Lew Burdette

1962

521

900

58%

Lew Burdette

1963

452

875

52%

Lew Burdette

1964

480

875

55%

Lew Burdette

1965

301

625

48%

Lew Burdette

1966

1022

1325

77%

Lew Burdette

1967

315

450

70%

 

 

            The linguist in me wants to point out that what Burdette meant by “regular” was not exactly the same thing that Ford meant by regular.    “Regular” has (at least) three meanings in common usage:

            1)  Consistent,

            2)  Frequent, and

            3)  Standard.

 

            When a dietary supplement says that it gets you regular and keeps you regular, what it means is (1).   When a boyfriend says that he wants more regular action from his girlfriend, what he means is (2).   When a reporter says that the trial was somewhat irregular, what he means is (3).

            When Ford said that he wanted to pitch more regularly, he meant (1), and also possibly (2).   When Burdette complained that he was not being given regular work, he meant (2).   When Billy Martin allowed Rick Langford to pitch 28 complete games in 1980, we could say that that was irregular, meaning (3).   Small point, but. . .we need to think clearly. 

            Anyway, the regularity of Burdette’s usage did decline in 1960, no matter how we define “regular”, but there is another side to that story.   The manager of the Milwaukee Braves in 1959, when Lew Burdette was 21-15 and getting really regular use (meaning frequent), was Fred Haney.  It is the view of the author that Haney in 1959 may have had the worst season by any manager in major league history.   Haney took a team that. . .I am not exaggerating for emphasis; this is exactly what I think. . .Haney took a team that should have won the National League by 20 games, and managed to lose the race.   If you compare the roster of the 1959 Braves to the 1959 Dodgers, it is difficult to explain how this is possible.

            One of the 28 stupid things that Haney did was, he worked Spahn and Burdette to death.   Spahn was a great pitcher and Burdette was pretty good, but Haney over-worked them both, they pitched .500 ball over the last two months, and they both finished 21-15, pitching 582 innings.   This might be understandable if Haney was short of other pitching options, but in fact he had a very good third starter (Bob Buhl) and a quite exceptional number of young pitchers, whom he refused to pitch.   He had Carlton Willey, who in 1958 had made 19 starts, led the league in shutouts with 4, and posted a 2.70 ERA, plus Joey Jay (97 innings in 1958 with a 2.13 ERA) and Juan Pizarro (97 innings in 1958 with a 2.69 ERA).    Haney didn’t believe any of these young pitchers was trustworthy, and tried instead to make Burdette and Spahn win the pennant for him.

            So Haney got fired, and the team was turned over to Charlie Dressen, but Dressen was an older manager who had been around the block several times, and he liked older pitchers as well.   Joey Jay didn’t get to pitch. . .well, REGULARLY, until 1961, when he was traded to Cincinnati:

 

Pitcher

Year

Points

Possible

Pct

Joey Jay

1954

223

350

64%

Joey Jay

1955

150

275

55%

Joey Jay

1958

267

450

59%

Joey Jay

1959

396

825

48%

Joey Jay

1960

490

775

63%

Joey Jay

1961

636

825

77%

Joey Jay

1962

618

950

65%

Joey Jay

1963

390

725

54%

Joey Jay

1964

494

825

60%

Joey Jay

1965

524

900

58%

Joey Jay

1966

247

500

49%

 

            In 1959, pitching on a very irregular schedule for Fred Haney (48% regular), Jay was 6-11 with a 4.10 ERA.  In 1960, with his regular use score increasing to 63%, Jay was 9-8 with a 3.25 ERA.   In 1961, traded to Cincinnati, Jay’s Regular Use Score went up to 77%.   His record improved to 21-10 in 34 starts.  (His ERA went up, but that’s an illusion created by the park and league.)

            It wasn’t just Jay, either.   Juan Pizarro was a lefty with one of the best fastballs of that era.  In 1960, fighting for regular work, he went 6-7 with an ERA a run over the league norm, park adjusted.   Traded to the White Sox that winter, he also got regular work, and he also immediately became effective.   I don’t have regularity scores for Pizarro, but it seems like a reasonable assumption that his regularity of use also increased sharply with the trade.

            You remember what I said earlier about Bob Feller—that in this era, most teams tried to keep their #1 starter on a four-day schedule, and everybody else had to work around that.   What Burdette was complaining about, really, was that from 1956 to 1961 he was one of the big dogs, that everybody else had to operate around.    In 1962 he was downgraded to being one of the little dogs, who had to work around the big dogs.   He didn’t like it.

            But really, it should have happened sooner; the Braves should not have been jacking around Pizarro, Jay and six other young pitchers to keep Lew Burdette happy.

            There is a very powerful trend, across history, of pitcher usage becoming more regular and more predictable.   What we are talking about, really, are (at least) five different changes in the game, all operating toward the common purpose of regularizing a pitcher’s work schedule.   Those five changes are:

            1)  The disappearance of double-headers,

            2)  The division of pitching staffs into starters and relievers,

            3)  The abandonment of the practice of starter leveraging,

            4)  The expansion of the bullpens, and

            5)  The adoption of pitch limits.

            The key point, really, is (4).   The expansion of the bullpens, I will argue, has been primarily driven by this exact reason:  The desire to regularize workloads.   We have written volumes about the expansion of the bullpens.   But when you think about it, what are the extra pitchers really doing?   They’re stabilizing the staff.   They’re making it possible to keep workloads regular.   That’s the main purpose of the larger bullpens; to regularize workloads.   We have missed this fact, I think, because we focus on what we can measure.   Not being able to measure the regularity of usage, we have essentially looked right through the real cause of the expansion of bullpens, which is the widespread belief that pitchers are most effective when they have regular work schedules.   But perhaps I am getting ahead of myself.

            Since 1920, the regularity of use of major league pitchers has advanced relentlessly.  While my studies cover only about 3% of major league pitchers and not even a representative 3%, the study nonetheless demonstrates a massive change in the regularity of usage by pitchers over the last nine decades.  For those pitchers for whom I have figured data, these are the overall Regular Use Scores for each of the last nine decades:

 

From

To

Pitchers

Points

Possible

Pct

1920

1929

104

49993

91350

55%

1930

1939

110

42521

76400

56%

1940

1949

112

47177

80050

59%

1950

1959

112

53793

88350

61%

1960

1969

110

62991

92500

68%

1970

1979

108

72181

97125

74%

1980

1989

111

69520

91625

76%

1990

1999

107

70182

86875

81%

2000

2009

138

105488

126525

83%

 

            While our studies are limited, I would predict that a comprehensive study would change these percentages by only tiny amounts, and I think we may safely assume that the conclusion that the regularity of pitcher usage has shown steady increases over most of baseball history will not be undermined by additional research.

            These are the top 50 and the bottom 50 pitcher/seasons in our study, in terms of regularity of use.   I included pitchers with less than 20 game appearances, but didn’t count them:

 

Rank

Pitcher

Year

Points

Possible

Pct

1

Johan Santana

2006

775

825

94%

2

Randy Johnson

2004

795

850

94%

3

Mark Buehrle

2003

783

850

92%

4

Johan Santana

2004

759

825

92%

5

Ted Lilly

2004

713

775

92%

6

Bartolo Colon

2004

757

825

92%

7

Pedro Martinez

2004

734

800

92%

8

Ryan Franklin

2004

711

775

92%

9

Mark Buehrle

2004

778

850

92%

10

Jamie Moyer

2002

753

825

91%

11

Mark Buehrle

2009

729

800

91%

12

Jamie Moyer

2003

728

800

91%

 

Trevor Hoffman

2003

182

200

91%

13

John Burkett

2001

748

825

91%

14

Jamie Moyer

2008

724

800

90%

15

Randy Johnson

1999

769

850

90%

16

Johan Santana

2008

746

825

90%

17

Randy Johnson

2002

768

850

90%

18

Jamie Moyer

1997

655

725

90%

19

Johan Santana

2007

721

800

90%

20

Jamie Moyer

2007

721

800

90%

21

Mark Buehrle

2006

698

775

90%

22

Mark Buehrle

2002

743

825

90%

23

John Smoltz

1993

762

850

90%

24

Jamie Moyer

2006

717

800

90%

25

Jamie Moyer

1998

737

825

89%

26

Johan Santana

2005

713

800

89%

27

Jamie Moyer

2000

557

625

89%

28

Mark Buehrle

2005

711

800

89%

29

Goose Gossage

1977

1575

1775

89%

30

Mark Buehrle

2008

731

825

89%

31

Adam Wainwright

2009

753

850

89%

32

Johan Santana

2009

531

600

88%

33

John Burkett

1996

730

825

88%

34

Livan Hernandez

2004

749

850

88%

35

Don Drysdale

1963

902

1025

88%

 

Pedro Martinez

2007

88

100

88%

36

Steve Carlton

1973

857

975

88%

37

Pedro Martinez

1998

703

800

88%

38

John Smoltz

2004

1577

1800

88%

 

Lee Smith

1980

372

425

88%

39

Livan Hernandez

2007

700

800

87%

40

Ryan Franklin

2008

1596

1825

87%

41

Bret Saberhagen

1998

655

750

87%

42

Mark Buehrle

2007

633

725

87%

43

Randy Johnson

2001

742

850

87%

44

Bartolo Colon

2002

698

800

87%

45

Goose Gossage

1976

654

750

87%

 

Bret Saberhagen

1997

109

125

87%

46

Francisco Cordero

2005

1482

1700

87%

47

Randy Johnson

2006

697

800

87%

48

Adam Wainwright

2007

675

775

87%

49

Pedro Martinez

2005

653

750

87%

50

Don Drysdale

1964

847

975

87%

 

            The earliest pitcher on the list of those with the most regular workloads was Drysdale in 1963.   These are the bottom 50:

 

Rank

Pitcher

Year

Points

Possible

Pct

 

Bucky Walters

1934

0

25

0%

 

Joey Jay

1953

0

50

0%

 

Tom Zachary

1936

32

175

18%

 

Pedro Martinez

1992

5

25

20%

 

Bucky Walters

1948

31

150

21%

 

Randy Gumpert

1938

18

75

24%

 

Steve Carlton

1988

20

75

27%

 

Bobo Newsom

1929

15

50

30%

 

Whitlow Wyatt

1931

25

75

33%

1

Ellis Kinder

1952

189

550

34%

 

Whitlow Wyatt

1944

69

200

34%

 

Ernie Bonham

1949

150

425

35%

 

Freddie Fitzsimmons

1941

106

300

35%

 

Tommy Byrne

1953

100

275

36%

2

Tom Zachary

1929

228

625

36%

 

Bill Henry

1955

146

400

36%

3

Bill Henry

1953

183

500

37%

4

Whitlow Wyatt

1930

184

500

37%

 

Tommy Byrne

1943

92

250

37%

5

Freddie Fitzsimmons

1939

240

650

37%

6

Elam Vangilder

1920

214

575

37%

 

Mel Parnell

1955

112

300

37%

 

Whitlow Wyatt

1929

28

75

37%

7

Tom Zachary

1928

243

650

37%

 

Firpo Marberry

1935

38

100

38%

8

Bob Feller

1956

171

450

38%

 

Larry Gura

1985

68

175

39%

9

Dolph Luque

1930

297

750

40%

10

Lefty Grove

1927

495

1250

40%

11

Tommy Byrne

1951

268

675

40%

 

Freddie Fitzsimmons

1935

169

425

40%

 

Tommy Byrne

1946

30

75

40%

 

Lindy McDaniel

1955

30

75

40%

12

Freddie Fitzsimmons

1940

191

475

40%

13

Dolph Luque

1931

182

450

40%

14

Dazzy Vance

1933

275

675

41%

15

Eddie Rommel

1929

318

775

41%

16

Alex Ferguson

1928

339

825

41%

17

Bob Feller

1955

247

600

41%

18

Bucky Walters

1935

237

575

41%

 

Whitlow Wyatt

1939

157

375

42%

19

Tom Zachary

1930

273

650

42%

20

Joe Page

1945

200

475

42%

21

Jim Konstanty

1944

200

475

42%

22

Dizzy Trout

1943

455

1075

42%

 

Bobo Newsom

1953

170

400

42%

23

Firpo Marberry

1934

394

925

43%

24

Randy Gumpert

1951

341

800

43%

25

Earl Whitehill

1939

248

575

43%

 

Firpo Marberry

1936

54

125

43%

26

Dizzy Trout

1939

346

800

43%

27

Dazzy Vance

1934

260

600

43%

28

Larry Gura

1976

206

475

43%

 

Eddie Rommel

1932

174

400

43%

29

Early Wynn

1944

348

800

43%

 

Bobby Ojeda

1994

11

25

44%

30

Herb Pennock

1930

264

600

44%

31

Herb Pennock

1922

341

775

44%

 

Early Wynn

1946

177

400

44%

32

Eddie Rommel

1921

499

1125

44%

33

Bobo Newsom

1943

411

925

44%

 

Steve Carlton

1966

89

200

44%

34

Tom Zachary

1922

345

775

45%

35

Dizzy Trout

1947

348

775

45%

 

Freddie Fitzsimmons

1943

90

200

45%

36

Bobby Shantz

1955

248

550

45%

37

Herb Pennock

1925

519

1150

45%

38

Alex Ferguson

1922

430

950

45%

39

Dizzy Trout

1952

397

875

45%

40

Eddie Rommel

1922

568

1250

45%

41

Bobo Newsom

1946

377

825

46%

42

Elam Vangilder

1921

343

750

46%

43

Randy Gumpert

1950

446

975

46%

44

Elam Vangilder

1926

469

1025

46%

45

Ernie Bonham

1941

254

550

46%

46

Mel Parnell

1951

406

875

46%

47

Eddie Rommel

1930

395

850

46%

48

Earl Whitehill

1937

373

800

47%

49

Herb Pennock

1932

245

525

47%

50

Herb Pennock

1933

257

550

47%

 

            Among those with 20 or more game appearances, the most recent pitcher was Larry Gura in 1976—a very interesting season, by the way.   There is almost no chronological overlap between the lowest and highest scores.  Since Johan Santana, 2006, had the most regular work regimen and Ellis Kinder, 1952, had the least regular, let’s compare and contrast their work patterns:

 

 

Johan Santana, 2006

 

 

Ellis Kinder, 1952

Month

Day

Year

BFP

 

 

Month

Day

Year

BFP

4

4

2006

28

 

 

4

16

1952

12

4

9

2006

23

 

 

4

18

1952

15

4

15

2006

28

 

 

4

23

1952

37

4

21

2006

29

 

 

5

3

1952

37

4

27

2006

30

 

 

5

5

1952

2

5

2

2006

28

 

 

5

16

1952

2

5

7

2006

27

 

 

5

18

1952

6

5

12

2006

26

 

 

5

19

1952

11

5

17

2006

27

 

 

5

24

1952

33

5

23

2006

30

 

 

5

30

1952

40

5

28

2006

29

 

 

6

1

1952

9

6

2

2006

26

 

 

6

5

1952

33

6

8

2006

22

 

 

6

7

1952

5

6

13

2006

28

 

 

8

14

1952

6

6

18

2006

25

 

 

8

24

1952

29

6

23

2006

30

 

 

9

1

1952

15

6

28

2006

26

 

 

9

5

1952

25

7

3

2006

25

 

 

9

7

1952

4

7

9

2006

29

 

 

9

9

1952

1

7

15

2006

26

 

 

9

13

1952

29

7

20

2006

26

 

 

9

16

1952

8

7

25

2006

27

 

 

9

24

1952

32

7

30

2006

28

 

 

9

28

1952

9

8

4

2006

25

 

 

 

 

 

 

8

9

2006

27

 

 

 

 

 

 

8

15

2006

27

 

 

 

 

 

 

8

20

2006

28

 

 

 

 

 

 

8

26

2006

29

 

 

 

 

 

 

8

31

2006

27

 

 

 

 

 

 

9

5

2006

27

 

 

 

 

 

 

9

10

2006

23

 

 

 

 

 

 

9

15

2006

31

 

 

 

 

 

 

9

21

2006

25

 

 

 

 

 

 

9

26

2006

31

 

 

 

 

 

 

 

 

            Santana in 2006 never faced fewer than 23 batters in an outing, nor more than 31.    His largest difference between batters faced in any two consecutive appearances was eight, and only ten times was there a difference larger than three.

            The most batters that Santana faced in any outing was 31.   Kinder in 1952 exceeded that six times in ten starts.   The fewest batters Santa faced was 23; Kinder had less than that 14 times.   Kinder’s work assignment for the day was outside the range of Santana’s extremes 20 times in 23 game appearances.

            Santana worked on four or five days rest literally in every start all season, except for the first start.   He never once worked off-schedule, or even slightly off-schedule.

            Kinder worked on consecutive days once, and worked on one day’s rest seven times—but also had three gaps of ten days or more without taking the mound.

 

III.   Analysis and Comment

 

            As I said, this steady increase in the regularity of pitcher’s work is actually a result of five distinct factors, which are:

            1)  The disappearance of double-headers,

            2)  The division of pitching staffs into starters and relievers,

            3)  The abandonment of the practice of starter leveraging,

            4)  The expansion of the bullpens, and

            5)  The adoption of pitch limits.

 

            1)  The disappearance of double-headers.

            The pitchers of the 1940s made more than 30% of their game appearances in double headers.   Bobo Newsom was a very colorful pitcher who had a long career (1929-1953) with a lot of different teams.   In his career he made exactly 600 game appearances, of which 208—34.7%--were in double headers.   The percentages for other pitchers of his era were similar:  Bucky Walters, 32%; Joe Page, 33%; Bob Feller, 29%; Whitlow Wyatt, 31%; Hugh Casey, 31%; Spud Chandler, 36%.    Teams would not infrequently play back-to-back double-headers, and many, many times would have Wednesday and Thursday off for travel, then would play a double-header on Friday and another on Sunday.

            Over time, more than 90% of these double headers have disappeared.   Modern pitchers typically make 3% or less of their Game Appearances in double headers (Bartolo Colon, 2.7%; Mark Buehrle, 2.1%; John Smoltz, 3.0%; Adam Wainwright, 0.7%.)

            Obviously, the disappearance of double-headers has helped to regularize pitcher work schedules.

 

           

            2)  The division of pitching staffs into starters and relievers.

            There were 305 major league pitchers who made 100 or more game appearances between 1920 and 1939, every one of whom appeared both as a starter and as a reliever.   All of them appeared in at least 5% of their games as starters, and at least 5% as relievers.   Only a small fraction had starter percentages or reliever percentages over 80%.

            Carl Hubbell, Dazzy Vance, Ted Lyons, Herb Pennock, Lefty Grove and Dizzy Dean all made 20 to 30% of their game appearances in relief (during the 1920s and 1930s.   Some of them may have gone under 20% if you include the 1910s or the 1940s.)

            By the late 1930s, managers were beginning to divide their pitching staffs into starters and relievers.   I believe that it was Joe McCarthy who first divided his staff into pure starters and pure relievers, and I believe that he did this in the mid-1930s.   Further research might better inform us about that issue, but in any case, by World War II “pure relievers” or “dedicated relievers” had emerged, and starting pitchers were generally being excused from the responsibility of finishing up for weakening teammates.

            Let’s note that points (1) and (2) were totally unrelated, and that the division of pitching staffs into starters and relievers had nothing at all to do with the disappearance of double-headers.  In fact, the division of staffs into starters and relievers came at a time when the number of double-headers was increasing very rapidly, due to the addition of lights to the parks.   Owners of the 1920s and 1930s generally liked double-headers, which drew large crowds, but double-headers created scheduling issues because you generally had to start them no later than 3:00 in order to be sure you got both games in before darkness.  The addition of lights enabled owners to start Friday-night double-headers at 5:00 or later, which caused the number of scheduled double-headers to spike upward during the 1940s.

 

            3)  The abandonment of the practice of starter leveraging.

            Chris Jaffe, in Evaluating Baseball’s Managers (McFarland, 2010), writes a lot about “starter leveraging”, by which he means jiggling the starting schedule so that your best pitchers are matched up against the best teams, and left-handers are used most often against teams that may be  vulnerable to left-handers and in parks that may favor left-handers.   For example, he writes that (Al) “Lopez’s career helps trace the course and extinction of starting pitcher leveraging.   In Cleveland, he oversaw one of the best collections of starting pitching in history (and did not engage in). . .an especially impressive degree of leveraging, but it is noteworthy that any existed given the overall quality and depth of Lopez’s starting rotation. . .When Lopez came to Chicago, he maintained this approach, using ace Billy Pierce frequently against top rivals.   As noted (earlier), Lopez’s decision not to start Pierce against the Dodgers in the 1959 World Series helped bring about the end of starter leveraging (but Lopez continued to do some of it until 1963.)  In 1964, Lopez changed.   The reform that had blown across the baseball world over the previous few years finally came to Chicago.”

            Jaffe demonstrates by a number of such narratives, documented by sabermetric analysis,  that virtually all major league managers did juggle their rotations to leverage their best starting pitchers prior to 1955, but that this policy died in the late 1950s and early 1960s.

            In connection with this change, I want to focus on this part of the chart that I gave you earlier:

 

From

To

Regular Use Pct

1920

1929

55%

1930

1939

56%

1940

1949

59%

1950

1959

61%

1960

1969

68%

1970

1979

74%

1980

1989

76%

1990

1999

81%

2000

2009

83%

 

 

            The regularity with which pitchers have been used has increased in every decade, yes, and has probably increased in every five-year period, if we had enough data to measure that reliably.    But the time when it really increased was between 1960 and 1979.   From the 1920s through the 1950s, the regularity of pitcher use increased by a total of 6%.   In the 1960s it increased by 7% in one decade, and then by another 6% in the 1970s.   That 20-year period accounts for basically one-half the increase over the nine decades.

            What was really driving that change?  It wasn’t the disappearance of the double headers; that happened twenty years later.

            I go back to the Ralph Houk story that I told before, and to the other anecdotes from that same era, having to do with Lew Burdette, Joey Jay and others.   What was driving this change was that a very strong belief entered baseball, right about that moment, that pitchers would benefit from regularity of use.   Workloads for pitchers didn’t go down at that time.  If anything, they went up.    This wasn’t a belief in avoiding overworking pitchers; that belief became important about twenty years later.   It was a separate and distinct belief, in regular and predictable workloads.

            Look, for example, at the Regular Use Scores of Don Drysdale, from 1959 to 1963:

 

Pitcher

Year

Pct

Don Drysdale

1956

48%

Don Drysdale

1957

65%

Don Drysdale

1958

61%

Don Drysdale

1959

56%

Don Drysdale

1960

67%

Don Drysdale

1961

68%

Don Drysdale

1962

80%

Don Drysdale

1963

88%

Don Drysdale

1964

87%

Don Drysdale

1965

71%

Don Drysdale

1966

77%

Don Drysdale

1967

80%

Don Drysdale

1968

84%

Don Drysdale

1969

55%

 

            Why?  Why did Drysdale’s regularity of use increase from 56% in 1959 to 80% in 1962, 88% in 1963?   Drysdale was the Dodger’s number one pitcher in 1959; he was their number one pitcher in 1962.   The Dodgers were in a tight pennant race, leading to a playoff, in 1959; they were in a tight pennant race, leading to a playoff, in 1962.   Drysdale was managed by Walter Alston in 1959, and he was managed by Walter Alston in 1963.

            What changed was simple:  between 1959 and 1963, the notion entered baseball, in a really vigorous way, that pitchers would perform their best when working on a regular schedule.   The Jim Bunning thing in 1964, when Mauch pulled Bunning off-rotation and the Phillies crashed and burned, cemented that link in the public mind.   The reality is, managers in the heat of the pennant race had done exactly what Mauch did in 1964 quite routinely for at least 50 years before then.   Sometimes it worked, sometimes it didn’t work.   Either way, it was never a big deal.

            The difference was, Mauch did it after the public had come to believe that pitchers performed their best given a regular work schedule.   That’s all; that’s the only difference between Mauch doing that in 1964 and Eddie Sawyer and Burt Shotton both doing it in the last week of the season in 1950.   In 1950 it was standard practice because there was no strong belief in the value of regular work schedules for pitchers; in 1964 it was a blunder because there was. 

The Longing for regular use patterns had entered baseball in a big way.

            In 1959 the league ERAs were 3.95 and 3.86.   In 1968 the league ERAs were 2.98 and 2.99.

            It is possible, in my opinion, that one thing that caused the pitcher domination of the 1960s was the regularization of pitcher schedules.    The re-definition of the strike zone in 1962/63 certainly contributed, yes—but the trend downward continued after 1963.   Most of it was after 1963.   People speculate on what caused this to happen.   One thing that may have caused this to happen was that pitchers in the 1960s were working in more regular patterns.

 

            4)  The expansion of the bullpens.

            Of course we all know how bullpens have expanded in the last 30 years, and many of us have written about this.   Many of us have written critically about this, and I don’t think I need to review that.

            The particular contribution of this research, however, is

1)  To establish a method to document differences between pitchers in the regularity of use, and

2)  To suggest the possibility that the desire to create regular work patterns for all pitchers is in fact the major driving force behind the expansion of bullpens.

 

            What multiple layers of relievers do for a pitching staff is to act as a shock absorber against the normal irregularity of the schedule.    When you have multiple relievers with subtle distinctions in their job assignments, it becomes more practical to expand or adjust those assignments to minimize stress points for starting pitchers and the critical two relievers.   Not only is this done, but that’s why those pitchers are there; that is the benefit that they give to a team.

            About eight/ten years ago, there was a debate going about the appropriate use of the closer.   In the middle of this debate I was hired by the Red Sox, and thus lost most of my ability to participate in that debate, because whatever I said could be played off the Red Sox in ways that were not helpful to the team.

            But what I really did not understand, until about 2005, was this.  If you look at Mariano Rivera, at Trevor Hoffman, Papelbon and others, these pitchers reach and sustain levels of effectiveness, relative to context, that are historically unprecedented.   No starter in history—not Koufax, not Randy, not Walter, not Grove, not anybody—was as effective, inning per inning and adjusting for context, as Mariano Rivera, and I’m not just talking about ERA.   Strikeouts, walks, home runs allowed. ..you name it, and the other guys are right there with him.  Why is that?

            It’s because the modern system of using Closers creates for these pitchers a role that is both very limited and very regular, extremely predictable.  Those guys throw maybe three dozen pitches a week—and do so at moments that can be predicted with a high degree of reliability.   They know when they are going to be in the game.   They can loosen up, stretch, push themselves to a peak, and they take the mound only when they are at that peak.   That enables them to reach fantastic levels of sustained effectiveness.

            I never understood this, until about 2005, and frankly, neither did any of you.   I know that you didn’t because of the things that you wrote.   People write about using your closer in the 7th inning when the high-leverage situation arises in the seventh inning.   If you think you can do that, you don’t understand.   You can’t reach that peak of perfection on an irregular work schedule.   You’ve got to KNOW when you’re going to be needed.    If you used Mariano in the 7th inning half the time, he would NOT be Mariano.

            And then there’s the press guys, who think that some guys can pitch in the 9th inning, while other guys can only pitch in the 7th. . .which totally misses the point, but then, we all miss half the points three-fourths of the time.

            What all of those other relievers are doing, really, is allowing the critical relievers to pitch only at those moments of peak effectiveness.   They’re load-levelers.  We’ve missed this; up to now.  We’ve left it out of the discussion.   Now it’s in.   That’s what this article is about.   I am arguing that you can’t really evaluate the worth of a deep bullpen unless you can deal with the advantage of creating stable workloads for other pitchers, because that’s really why it’s done.

           

            5)  The adoption of pitch limits.

            About which, again, enough has already been said—but again, I would argue that the real point of pitch limits is not to reduce the workload of starting pitchers, but to stabilize it.

 

            OK, but are we dealing here with five different phenomena, which combine to create a consistent evolution in the game, or are we dealing with a single phenomenon which has five different manifestations?   Or, looking again at this chart:

 

From

To

Regular Use Pct

1920

1929

55%

1930

1939

56%

1940

1949

59%

1950

1959

61%

1960

1969

68%

1970

1979

74%

1980

1989

76%

1990

1999

81%

2000

2009

83%

 

 

            Have we gone as far as we can go?   

Because if it is five different phenomenon, then it may be that these five distinct changes that were driving the Regular Use Percentages up have all played themselves out, and thus it may be that we are at the end of the line in this massive, 90-year shift.  

If, on the other hand, it is not five different phenomenon but five different manifestations of the same phenomenon—the Longing to create ever more stability in pitcher workloads—then we have to assume that this nine-decade trend may continue for several more decades until the Regular Use Percentages reach 90% or 92% or 94%.   And if that’s the case, then we have to ask ourselves, since these other manifestations of the shift have pretty much played themselves out, how this desire to create regular use patterns will change baseball in the future.

The disappearance of the double headers—that’s clearly a different thing.    That was not caused by the desire to create regular workloads; we can be pretty sure that it wasn’t.   But I would argue that the other four factors are, in reality, all simply manifestations of the same over-riding and over-whelming desire, on the part of managers and pitching coaches, to create stable workloads—thus

1)  that this desire has created massive changes in the game of baseball in the past, and

2)  we should expect that it will create additional significant changes in the game of baseball in the future. 

I can cite for you another change, caused by this same Longing, which is already occurring.    More aggressive use of the Triple-A roster as an extension of the major league roster.

I have worked for the Red Sox since 2002—eight years.  Over that time, I have seen a very substantial increase in the willingness, both of the Red Sox and other teams, to make a roster move to avoid stress points for pitcher usage.    Eight years ago, you could pretty much assume that the pitchers the other team had on their roster in the morning would be the pitchers they would have on their roster at game time.   Now, you can’t.   Same thing—front offices move pitchers up and down to protect the regular use patterns of the key pitchers on the team.

 

In working through this process, we skipped a critical question.    Is this a legitimate benefit?    Do pitchers in fact pitch better and/or get injured less when they have regular, predictable work schedules than they do when their workloads are less regular?    Or are we chasing a phantom here, bending ourselves out of shape and wrenching the game of baseball out of its traditional shape merely to pursue an “advantage” which is no real advantage at all?

Well, I don’t doubt that pitchers pitch better when they have a regular work schedule.   This is the universal assumption of all baseball men, very deeply ingrained; one can’t challenge that assumption based on a mere absence of evidence.   We don’t have any evidence or any reason to believe that this is not true, and, intuitively, I am strongly inclined to believe that it is true.

            I do have some serious questions, however, about whether the benefits derived from the regular usage of pitchers are substantial enough to justify the things that have been done in that effort.   The deeply rooted and constantly growing belief in the benefits of regular work for pitchers has changed baseball like a growing tree crushing and throwing aside a meddlesome sidewalk or even an old street.

That’s why I wanted to post this method:  To put us in a position to ask those questions.   Let’s assume that some managers use their pitchers in more regular patterns than other managers—if not now, then certainly at some points in the past.   Can we validate by sabermetric research that managers who are ahead of the game in this fashion get better results from their pitchers than managers who are behind the game?    Can we show that they have fewer injuries?  Can we show that they have better ERAs?  Can we show that they win more games?

Can we demonstrate that pitchers who are used on a regular basis are more effective than those who are not?  Can we demonstrate that they are injured less often?

If we cannot demonstrate that these benefits exist, then should we call into question the things that are done to pursue this benefit?   That’s getting ahead of ourselves; I guess we shouldn’t ask that question until we see what we have.

It may be that the method that I have outlined here will be inadequate to the tasks set before us.   Better methods may be needed.   These issues will be entwined with issues of fatigue and workload, and it will be in some cases impossible to tease them apart, so that some of these questions will prove to be un-answerable.   It just seems to me that this is too large a subject to ignore, that we have ignored it for too long, and I’m trying to move the discussion off the dime.

 
 

COMMENTS (22 Comments, most recent shown first)

bbmarks
I don't buy it that Mariano Rivera couldn't be effective in other situations. All you have to do is tell him that he will be sometimes used in a tie ballgame in the 7th or other such situations, then he can get ready just as if he is closing the game. Rivera would still be Rivera.
2:05 PM Oct 20th
 
OldBackstop
Very interesting, but I think the Whitey Ford anecdote needs some drill down. He did indeed go from 25-4 by being moved to "regular use" and not being held for starts against the tough teams. But that was also the expansion year...eight more games and against weaker teams. Six of Whitey's wins (in 6 starts) came against expansion teams that Stengel probably would have hopped for him -- for the good of the team. It is interesting to note that while he did have that amazing 25-4 record, he also had the highest ERA of his career to date: 3.21, and it was the second highest ERA of his 17 year career (3.24 in '65). And (with the six more games, albeit) the Yanks produced 827 runs, up from 746 in 1960.

Anyway, interesting piece. I'd be interested in more data from an injury perspective.
5:26 PM Sep 22nd
 
Bucky
Not to put other research questions onto your plate, but does the increasing regular load also extend to position players? Without examining it, it does seem to me that we have less platooning, more in-season stability of positions, and fewer utility players than in earlier days. I could of course be off my rocker on this. And I also don't know what impact, if any, this would have on position players. But it seems as though, if I am right, that this might go hand in glove with a more regular load for pitchers of all kinds.
2:28 PM Sep 5th
 
TomStrother
After almost 30 years of availing myself of your research, I believe this is one of the most significant pieces of work and most practical for understanding changes in the game, particularly for people in their 60's like me, that you've done. And it isn't weighted down with a lot of sabermetric arcana. Good stuff.
9:41 AM Sep 3rd
 
MarisFan61
Speaking as a Yankee fan, I hope Girardi Eiland & Co. will hear about this (or have realized it) before doing too many more odd things with Phil Hughes and Joba Chamberlain. This article systematizes much of what I've felt about the handling of them.
I have loved the fact that teams are now at least *trying* to find ways to avoid having good young pitchers flame out, but I have long felt that the "regularity" factor has been ignored; I just never thought of analyzing it in such a way. Almost-total emphasis is often put on "number of innings," without regard to the possibility that 'an inning isn't an inning isn't an inning' -- irregularity might be a hidden stress factor.
If you save a starter's innings by using him in short relief for part of the season, are you really saving him? Or, if you "shut him down" for a time late in the season before bringing him back? If pitching is much like any activity that I know, you aren't.
Viewing it as pure arithmetic -- number of innings -- misses the role of these other stressors.
2:00 AM Sep 3rd
 
mskarpelos
This article typifies why I started reading Bill James soon after graduating from college in the early 1980s. It wasn't the stats per se that I found interesting but rather the use of stats to explain an overarching historical narrative. I never knew about the rift between Ford and Stengel nor Fred Haney's incompetence until I read this article. Those stories were interesting in their own right. But seeing them so expertly woven into a greater historical narrative carefully built on a solid statistical foundation is something that only Bill James can do. We have fine baseball writers and fine baseball statisticians and fine baseball historians, but only Bill James can do all three at a world class level. Excellent job.

My only quibble is with the seeming arbitrariness of the reliability metric. I'm not saying that the metric is intrinsically inaccurate; I just don't understand why we have a discontinuous 10 point scale for days off and a slightly more continuous 15 point scale for batters faced. Couldn't we simply combine the standard deviation for days off and the standard deviation for batters faced for each appearance with equal weighting to form our reliability metric? I'm not necessarily advocating this, but it seems like the obvious approach to me, so I'm not sure why Bill defined the metric the way he did.
2:04 PM Aug 31st
 
CharlesSaeger
David: Pitchers nowadays aren't used to pitching on three days rest, and pitch worse when on it, as opposed to those of the Seventies. Personally, I'd like to see the four-and-a-half man rotation, though the half man is anti-regularity.
1:33 PM Aug 31st
 
Richie
Don't you mean, "all other things being equal, they won't fire you for being orthodox"?
11:33 AM Aug 31st
 
donmalcolm
I suspect that your percentages are about 5-7% lower once you put in all pitchers who have 10GS or more. But, yes, the trend you've unearthed here is undeniable.

I think it's likely that it can't go up as much as you surmise. But it almost certainly won't go down, as there is insulation from outlier strategies backfiring if a manager sticks to this protocol more or less blindly. All else being equal, they won't be able to fire you for doing something unorthodox!

There seems to be a secondary benefit emerging along with the potential for top-end closer performances (though those, too, are almost "freak show" stats). Middle relievers who pitch in more than one full inning are also benefiting from regularized usage. The aggregate ERA+ for these appearances has been steadily rising since 1950 (when it was 111) and is now flirting with 150. Apparently specialization is having other impacts.

To answer one of your questions: given that the stabilization of starter workloads is seemingly so uniform, there will likely be no way to tease out any conclusions concerning quality. It's more likely that teams will adopt a six-man rotation than return to a set of practices that would decrease starter uniformity and reliever specialization. It may be possible, with enough historical research, to determine if this approach limits injuries.
1:17 AM Aug 31st
 
bearbyz
The answers to the questions in the last few paragraphs could make a very interesting book.
9:22 PM Aug 30th
 
StatsGuru
Great article. The question that comes to mind is, should a four-man rotation work, as long as it's regular?
8:12 PM Aug 30th
 
jdw
On zqxjk's question about how much of it was/is to keep the staff happy, it would seem not a lot initially. Look again at the chart in the pitch leveraging section, and when the greatest change happened. It was in a period where management wasn't yet engaged in keeping players happy. The probably didn't kick off until the 80s, possibly even later. Even in the early days o free agency, most teams didn't put the effort into keeping people happy knowing that folks could and would walk once given the chance.

I think Bill pretty much nails it that it was "performance", or at least the strong belief in better performance, that led the charge. Frankly one couldn't find a comp than Drysdale vs Drysdale. Alston and the Dodgers alone would make an interesting macro study on this given Alston's longevity, along with the stability of the Dodgers front office / ownership, their "Dodger Way" philosophy and a number of pitchers like Drysdale and Sutton who had longer stretches in the Dodgers Rotation.

Anyway... extremely interesting article, Bill. Very good read.
7:59 PM Aug 30th
 
danfeinstein
This is probably Bill's article this year that has most made me think. Thanks.

I'm not surprised to see a more regimented and routine usage pattern as the sport has matured. We'd expect experimentation to be much greater in a newer industry than in one more established.

Bill suggests that increase regularity is one factor that has enabled Rivera, Hoffman, Wagner, et al to thrive. I wonder if it also has allowed or extended the peak performances of Johnson, Martinez, Maddux, Clemens, etc. By rate stats compared against their peers, the best of the last two generations of players have stood out more than their predecessors; I wonder if regular usage has a disproportionate benefit for the best of the best.

I'm not convinced that every change in strategy is an improvement. I came of age in the time of Herb Washington and the fleet afoot. That was certainly an odd strategy and one that I suspect was a step backwards. By saying that, I am not certain that the strategy of regularity - and the apparent benefits for the back end of the bullpen - outweigh the reduction in availability. Lets assume that Rivera is better if he knows he will pitch the 9th with a 1-3 run lead plus enough extra work to stay sharp; how much better does he have to be to outweigh not being available to pitch the 9th tied, the 8th in a pinch, or an infrequent 7th?
7:23 PM Aug 30th
 
TJNawrocki
I thought it was very interesting that Rich Gossage showed up in two consecutive years on the "stable workload" list. Not only did Gossage switch teams between 1976 and 1977, but he was a starter in 1976 and a reliever in 1977. The only stable thing about those two seasons was his workload.
5:40 PM Aug 30th
 
CharlesSaeger
robinsong: It's pretty obvious that this has happened. A look down at bb-ref.com for the years where we have pitch data has the number of short (less than 80 pitches) outings drop along with the long outings.
4:54 PM Aug 30th
 
MWeddell
Thanks for the article.

By the way, Chris Jaffe also has written about the changes in doubleheader frequency during baseball history. Here's the third part of his series (which has links to the first two parts): http://www.hardballtimes.com/main/article/history-of-the-doubleheader-part-iii/
2:53 PM Aug 30th
 
Robinsong
A terrific article. I think it would be interesting to look at the lower bound of workload, like your observation on Santana facing at least 23 batters. Have managers raised the bar for pulling a starter (or key reliever) having a poor outing? One could look at the frequency with which poor Game Scores are pulled in the first three innings. Having managers not overreact to a couple of runs may actually increase win probability as well as regularity. In any case, it would be interesting to study along with the high workload questions. This may be an area where the pitcher regularity could continue to increase in the future.
1:09 PM Aug 30th
 
CharlesSaeger
The question about closer usage should be what pattern of regular use is best for everyone involved. Instead of the save defining a closer's use, you could change it, so long as the whole staff knows what will happen. Say, the closer comes into tie games and one run leads in the eighth and ninth, and when two runs ahead or one run behind in the ninth. Define the next guy down by adding one to that range. You still have regular use, and everyone still knows when he pitches. Maybe you add the seventh to the closer; maybe you don't bring him into tie games in the eighth. But the point is to keep it regular AND maximize how often you get your best reliever into close games.
12:17 PM Aug 30th
 
greggborgeson
This research might enable even more radical departures in pitcher usage. For example, having a three or four day rotation of two paired pitchers per day, each pitching four or five innings max. That could create even higher levels of regularity, and potentially shrink the pitching roster by two or three.
12:17 PM Aug 30th
 
champ
Once again, Bill, you show your willingness to investigate EVERYTHING, including your own point of view.
12:14 PM Aug 30th
 
sprox
How much of this is done to keep the pitching staff happy - because they like regular work and how much of this is to improve team performance or some other business reason such as reducing injuries? Even if there is NO data to suggest regular work is better in any way - it may still be preferable if nothing else than to keep the pitching staff content.
11:16 AM Aug 30th
 
wovenstrap
Brilliant article. And you're totally right about the stathead bias about using the closer in the 7th or whatever -- indeed, that was largely based on things you had written (not saying you ever committed yourself to such a position). Hell, I still suspect that telling the closer he has to be ready for the last at-bat by the #3 hitter (say) would achieve an acceptable degree of "psychological" regularity, but I'm willing to concede that that could be wildly mistaken. Even if Mariano's numbers stop being so eye-popping, there might be added value in having Mariano at 90% effectiveness dealing with Papi in the 7th, rather than Daniel Nava in the 9th. There might be an analogy here with kickers in football -- who are brought in to kick 45-yard field goals in key situations etc.

It's still a little bit mystifying that 3-4 roster spots might be worth it; it's a heavy price for that regularity, and I haven't noticed ERA records being broken or anything. In other words, it still feels like received wisdom or groupthink. At best it's aesthetically unpleasing to the fan, feels inefficient and wastes time, as you said. But managers aren't graded on aesthetics. As usual, you've provided plenty to think about.
11:14 AM Aug 30th
 
 
©2024 Be Jolly, Inc. All Rights Reserved.|Powered by Sports Info Solutions|Terms & Conditions|Privacy Policy