Username:	Password:

Remember me

Forgot your username/password?

Print Email

Home>Articles

The Strikeout Push/Pull Effect

By Bill James

November 2, 2018

The Strikeout Push/Pull Effect

Hey, guys. I am aware that I owe you one article from the Rookie series, and I’ll get to that next, and then we’ll get back to the Bill James Life in Prison Baseball Game, try to re-launch that, but first of all I got carried away with a serious research question, and I’ve spent a week or so on that issue. This is an issue that I have discussed before and researched before, but this time I did it much better than I have before, studied it more systematically and in more depth, and came away with a better understanding of what is happening.

It has been my theory for about ten years that the historical upward drift of strikeouts, the ever-increasing number of strikeouts in the game, is caused by an imbalance of forces in the selection criteria. Basically, the problem is that

(a) Pitchers who get strikeouts are better pitchers than those who do not, but

(b) Batters who strike out are NOT significantly worse, on balance, than batters who don’t strike out.

Not only are strikeout-prone hitters historically not worse, they are, over a period of time, slightly better. Hitters who strike out, like Babe Ruth, Mickey Mantle, Mike Schmidt, Reggie Jackson and Aaron Judge, are still enormously valuable despite the strikeouts.

Because of (a) and (b) above, teams are constantly looking for strikeout pitchers—but are NOT looking to avoid strikeout hitters. They’re not looking to avoid guys like Mike Schmidt and Aaron Judge; they’re looking to find them. So you have upward push on the strikeout column from pitcher selection, but no downward push from batter selection. The result of this is that strikeouts go up over time.

In other words, what should (you would think) be a relationship of push and push back is, in practice, a relationship of push and pull. What we will call PUSH is the pitcher’s contribution—the fact that teams are always looking for pitchers who can strike people out. But what should be pushBACK—teams looking for hitters who don’t strike out—isn’t there. Very often, it’s a push/pull relationship in which both forces are operating in the same direction.

This has been my theory for about ten years, but now I need to stress that this is simply a theory. It is not a proven fact. It could be true; it could be partially true but not absolutely true. It could be false; it could be that there is some other explanation that works better. I am trying to keep an open mind as I study the issue.

This issue was brought to the front of my mind as I was writing an article for the 2019 Bill James Handbook about trendlines in baseball. I wrote the following:

At some point, this has to end, right? At some point, strikeouts have to become SO high that batter effectiveness and strikeouts have to part company, or at least this is what I believe.

In 1978 the ten teams which struck out the most often (as hitters) had a collective won-lost record of 849-769, 80 games over .500. The ten teams which struck out LEAST often had a collective record of 762-852, 90 games under .500. The high-strikeout teams were better teams—indicating that strikeouts were not a negative for their teams, but merely the fellow travelers of home runs.

By 2008 the gap perhaps was narrowing; the advantage of the strikeout teams was diminishing. In 2008 the ten teams which struck out the most were collectively 27 games over .500 (822-795), but the ten teams that struck out the least were 7 games over (814-807). The gap may (or may not) have become less than it was.

But in 2018 there was a different picture. In 2018 the ten teams that struck out the most often had a collective record of 765-857, or 92 games under .500. The ten teams which struck out LEAST often had a collective record of 861-758, or 103 games over .500. The pattern is actually stronger if you stick to five teams. The five teams which struck out most—the White Sox, San Diego, Philadelphia, Texas and San Francisco—all had losing records, while the five teams which struck out least—Cleveland, Houston, Seattle, Pittsburgh and Boston—all had winning records, and three of the five won their divisions.

What I am saying is that in modern baseball, it may no longer be true that strikeouts are not a meaningful drain on a team’s offensive energy. It may be that we have reached the point, or at least are near the point, when teams WILL begin to avoid hitters who strike out.

After reading that, I took note of this passage in Rob Neyer’s new book, Power Ball (page 128):

From the mid-1960s through the mid-90s, the strikeout rate essentially held steady—aside from an odd downturn in the late 70s—at around 15 percent of plate appearances, give or take a percentage point or so. But for the next twenty years, the rate edged toward 20 percent. Finally, three years ago fully one-fifth of all plate appearances ended in strikeouts. Last year it was 21 percent; this year, nearly 22 percent.

And there’s simply no reason to think this trend is going to reverse itself anytime soon.

So I was hoping to be able to tell Neyer, "Actually, Neyer, there IS a reason to think this trend is going to reverse itself soon." My studies didn’t exactly work out that way (he said, tipping his pitches), but that was my thinking.

My observation in the Handbook is valid and relevant, I think, but there are four obvious problems with it:

First, the observation spot-checks data from just a few seasons (1978, 2008), and compares that to 2018, leaving the appearance of a trend where no trend may exist in fact.

Second, the data that I cited is team-level data, but the effect, if it is really there, would operate on the level of individual players, not teams.

Third, I used the ten teams on the extremes to represent the "slant" of the data, rather than doing a correlation study that would include all of the teams in the study and evaluate the degree to which each one provided evidence about the issue. In other words, you can get a different answer, perhaps, if you use eight teams or six teams or twelve teams. Just using X number of teams is a loosey-goosey approach to the problem.

And fourth, the data that I sited addresses only the "batter" side of the issue, while ignoring the "pitcher" side of the issue, which may be more important.

In other words, what I referenced in the Handbook is not a real study of the issue; it is just picking and choosing data that seems to support my point—exactly what traditional sportswriters do when they say that Lefty O’Doul ought to be in the Hall of Fame because he had a career batting average of .349. THIS is a real study of the issue, what we’re doing today. I looked at both sides of the issue, looked at much larger sections of the data, looked at it on the player level rather than the team level, and did actual correlation studies although, being the obstreperous SOB that I am, I then aggregated the correlation data in ways that would probably cause a legitimate data scientist to sneer viciously and shout at me to leave the stage. My skills only go so far.

By doing this, I did in fact gain a much better understanding of the issue I was studying. I picked up a major new insight into the problem. This was satisfying, because very often you put 8 days into a study like this and you don’t learn a damn thing. This time I actually gained understanding of the issue.

First we have to define the data set. Perhaps I should put in a sub-heading there:

Outlining the Study, Part I—Batters

This is boring math, and also, if you don’t get math, you may not understand what I am talking about. If you don’t get math, you may want to skip ahead to "Conclusions, Part I—Batters".

First we have to define the data set. I studied all players (hitters) who had 300 plate appearances in a season in the last 100 years, 1919 to 2018. For each player, I used two data points: his strikeout rate and his runs created per 27 outs. For strikeout rate, I used strikeouts divided by plate appearances.

For each season, I determined the correlation of runs created/27 outs with the strikeout rate, using a Pearson Product/Moment correlation. If the two were positively correlated, that means that, in that season, the good hitters (as a group) struck out more often than the less-productive hitters. This was true, for example, in every season from 1960 to 1984. If the two were negatively correlated, as they were, for example, this season, that means that the good hitters struck out less often than the weaker hitters.

In 1970, picking a season at random for illustration. In 1970 there were 212 players who had 300 or more plate appearances. With 24 teams that is 8.83 players per team with 300 plate appearances, a normal number. The average strikeout rate of those 212 players was 130 strikeouts per 1000 plate appearances, while the average runs created per 27 outs was 4.985.

The three best hitters of 1970, runs created per 27 outs, were Carl Yastrzemski (10.42), Rico Carty (10.24) and the late Willie McCovey (10.12). All three of those players had BELOW average strikeout rates, so all three of them made NEGATIVE correlation contributions. Yastrzemski, for example, struck out only 66 times in 698 plate appearances, or .095. We multiply the extent to which he was better than the average hitter in the group (10.42 minus 4.985) by the extent to which he struck out less than the average hitter (.095 minus .130), and you get negative .193. His contribution to the correlation is negative .193.

That’s important—it is one of the largest contribution figures of the 1970 season—but it is less than the correlation contribution of Bobby Bonds. Bonds was also an above-average hitter, contributing 7.45 runs to his team for each 27 outs, but he struck out 189 times, which was a major league record at that time and a record that stood for many years after that. He struck out in 25.4% of his plate appearances—actually not that far above the 2018 average, but way above average in 1970. Bonds’ contribution to the correlation in 1970 was (7.45 – 4.985) X (.254-.130), or +.304.

Of the 212 players from 1970, 122 had positive contributions to the correlation (meaning that they were either good hitters who struck out a lot or weak hitters who struck out little), while 90 (like Yastrzemski, Carty and McCovey) had negative correlation contributions. Overall, the sum total of the 212 players was +2.94.

If the two categories had been perfectly aligned—that is, if every hitter who struck out a lot was also a good hitter, and if the runs created/27 outs were in exactly the same order as the strikeout rates, and if the proportions between them were exactly the same relative to the standard deviations of each. . ..if the two were perfectly aligned, then the sum total would have been +18.63. It was +2.94, so we divide 2.94 by 18.63, and we get +.158. The correlation between strikeouts and offensive productivity, for 1970 hitters, was +.158.

This is a high figure by historical standards. For the 100 seasons that I studied, this was the fourth-highest. The correlation was positive in 53 seasons, negative in 47 seasons, and this was the fourth-highest positive correlation.

Conclusions—Part I, Batters

There were 53 seasons in which strikeouts were positively correlated with offensive productivity, and 47 seasons in which the correlation was negative (you just said that.)

The correlation was positive in every season from 1926 to 1942 except two (1936 and 1937). This is what we could refer to as the Babe Ruth/Jimmie Foxx/Hank Greenberg era, when there were several or many big sluggers who struck out a lot by the standards of the era.

The correlation was negative for every season between 1943 and 1959, except 1956. This is what we could refer to as the Joe DiMaggio/Ted Williams/Stan Musial/Yogi Berra era, when most of the best hitters in the game did NOT strike out much.

From 1960 to 1984, the correlation was positive in every season without exception. This is what we could refer to as the Mickey Mantle/Harmon Killebrew/Dick Allen/Reggie Jackson/Mike Schmidt era, when the game was dominated by star hitters who often led the league in strikeouts.

From 1985 to the present, the correlation has generally been negative. It has been negative 25 times in that 34-year period, and positive 9 times. For the last 34 years, the majority of good hitters have NOT been hitters who struck out a lot.

While the correlation has generally been negative for a third of a century now, and while I definitely should have picked this up many years ago, the negative correlation is stronger now than at any other point over the last 100 years. The 2018 figure was negative .141, the second strongest negative relationship of any season in the study. The only stronger negative connection was in 2014, when it was -.161. This chart gives the correlation of strikeouts to offensive productivity in every season of the last 100 years.

Year		Year		Year		Year
1919	-.016	1944	-.028	1969	.117	1994	-.090
1920	.021	1945	-.069	1970	.158	1995	-.052
1921	.017	1946	-.052	1971	.141	1996	-.019
1922	.024	1947	-.092	1972	.067	1997	-.075
1923	-.018	1948	-.138	1973	.091	1998	-.008
1924	-.008	1949	-.091	1974	.078	1999	-.039
1925	-.113	1950	-.100	1975	.071	2000	.032
1926	.090	1951	-.020	1976	.036	2001	.001
1927	.028	1952	-.035	1977	.148	2002	-.003
1928	.046	1953	-.109	1978	.205	2003	-.019
1929	.100	1954	-.099	1979	.159	2004	-.030
1930	.110	1955	-.087	1980	.000	2005	.086
1931	.005	1956	.042	1981	.117	2006	.051
1932	.046	1957	-.014	1982	.118	2007	.058
1933	.159	1958	-.047	1983	.016	2008	.025
1934	.003	1959	-.008	1984	.046	2009	-.017
1935	.021	1960	.133	1985	-.007	2010	.117
1936	-.061	1961	.093	1986	-.024	2011	-.032
1937	-.030	1962	.035	1987	-.045	2012	-.069
1938	.073	1963	.005	1988	-.047	2013	-.041
1939	.051	1964	.093	1989	-.079	2014	-.161
1940	.027	1965	.041	1990	.058	2015	.009
1941	.017	1966	.105	1991	-.069	2016	-.111
1942	.040	1967	.000	1992	-.038	2017	-.110
1943	-.117	1968	.051	1993	-.059	2018	-.141

And here is the same chart for the benefit of Dave Studeman and any others of you who are color-blind:

Year		Year		Year		Year
1919	-.016	1944	-.028	1969	.117	1994	-.090
1920	.021	1945	-.069	1970	.158	1995	-.052
1921	.017	1946	-.052	1971	.141	1996	-.019
1922	.024	1947	-.092	1972	.067	1997	-.075
1923	-.018	1948	-.138	1973	.091	1998	-.008
1924	-.008	1949	-.091	1974	.078	1999	-.039
1925	-.113	1950	-.100	1975	.071	2000	.032
1926	.090	1951	-.020	1976	.036	2001	.001
1927	.028	1952	-.035	1977	.148	2002	-.003
1928	.046	1953	-.109	1978	.205	2003	-.019
1929	.100	1954	-.099	1979	.159	2004	-.030
1930	.110	1955	-.087	1980	.000	2005	.086
1931	.005	1956	.042	1981	.117	2006	.051
1932	.046	1957	-.014	1982	.118	2007	.058
1933	.159	1958	-.047	1983	.016	2008	.025
1934	.003	1959	-.008	1984	.046	2009	-.017
1935	.021	1960	.133	1985	-.007	2010	.117
1936	-.061	1961	.093	1986	-.024	2011	-.032
1937	-.030	1962	.035	1987	-.045	2012	-.069
1938	.073	1963	.005	1988	-.047	2013	-.041
1939	.051	1964	.093	1989	-.079	2014	-.161
1940	.027	1965	.041	1990	.058	2015	.009
1941	.017	1966	.105	1991	-.069	2016	-.111
1942	.040	1967	.000	1992	-.038	2017	-.110
1943	-.117	1968	.051	1993	-.059	2018	-.141

The total for the last five seasons (2014 to 2018) is negative .513, which is the lowest total in the history of baseball or at least the last 100 seasons. In other words, we ARE (in a certain sense) back to the Ted Williams/Stan Musial/Joe DiMaggio/Yogi Berra era, when the best hitters generally do NOT have high strikeout totals.

For those who would like to see an end to the endless escalation of strikeouts, that’s an encouraging development. If we stopped the study at this point, we could conclude that the endless increase is strikeouts may in fact be near an end, because there is now significant pushback from the hitters. If we stopped now, we could say "see this, Rob Neyer, there actually IS reason to believe that this trend will reverse itself."

Tempted as I was to do that, I am unfortunately too intellectually honest to let myself do it. It has never been the hitters who were basically causing the upward trend in strikeouts. It has always been the pitchers (pitcher selection) who were basically causing the upward push in strikeouts. The role of hitters (hitter selection) was simply that it was not effectively pushing BACK.

So now hitter selection is pushing back a little bit, but is it enough?

By my theory, by the strikeout push/pull theory, the upward trend in strikeouts should end when the negative correlation of hitter strikeouts to productivity is equal to the positive correlation of pitcher strikeouts to pitcher effectiveness. But have we reached that point, or are we near that point? To answer that question, we have to study the pitchers.

Outlining the Study, Part II—Pitchers

The process for pitchers was essentially the same as the process for hitters. I included in the study all pitchers who

(a) Pitched 100 or more innings in the season, or

(b) Had 50 or more "points", where points are Games Started + Game Appearances, a combination very commonly used in pitcher contracts.

Then, after defining the group, I did essentially the same things I had done with the hitters. The two things that I correlated for pitchers were:

(a) ERA, and

(b) The percentage of batters struck out. Strikeouts, divided by batters facing pitcher.

For illustration, in 1919 there were 90 major league pitchers who met the workload standards to be included in the study. Those 90 pitchers had an average strikeout rate of .083 (8.3%) and an average ERA of 2.99. (The major league ERA was 3.07, because those pitchers not included in the study were somewhat less effective than those who were included in the study.) Hod Eller of Cincinnati had the highest strikeout rate in baseball, 13.9%, and also had a good ERA (2.40), so we mark that as a positive. He had a positive correlation contribution of +.033.

The sum for all major league pitchers was +.372 out of a theoretically possible 1.508, for a 1919 correlation of +.247.

The Major New Insight that I Have into this Problem

That I gained from doing this study

The increase in strikeout rates is feeding on itself.

I feel stupid for not having realized this before now, but as you no doubt can recognize, there are many processes in nature that feed on themselves, growing stronger because they have grown stronger. A hurricane is one such effect, or a tornado. A straight wind may reach a certain maximum speed, 50 MPH or 60 MPH or whatever, with GUSTS of wind that may be higher. What happens with a tornado or a hurricane is that the wind forms a circle and blows into itself, making a sustained wind that is much more powerful than a gust of wind. It is generating its own momentum. A snowball rolling downhill is the clichéd example of this, or an avalanche.

Strikeouts are increasing because strikeouts are increasing. Stupid of me not to see it before now.

Look, Hod Eller in 1919 pitched 248 innings and struck out 137 batters. This was the highest strikeout rate in baseball, five strikeouts per nine innings. A typical pitcher, pitching the same number of innings in 1919, would have struck out 80 to 85 batters.

The correlation of strikeout rate to pitcher effectiveness was positive in 1919 (+.247), but weak by historic standards. When you think about it, this obviously has to be true. When there aren’t that many strikeouts in the game, strikeouts are just not that important, and therefore the extent to which the best pitchers are strikeout pitchers is limited. Other things become more important. Not walking batters is more important. Getting ground balls is more important. Holding the baserunners is more important. Getting strikeouts is not that important because it’s just a few plays a game.

Because strikeouts are not that important, the ability to get a strikeout is not a central or not the central selection mechanism for pitchers.

As strikeouts become more common, they become more important.

As strikeouts become more important, the extent to which the best pitchers are strikeout pitchers increases.

As the extent to which the best pitchers are strikeout pitchers increases, the extent to which future pitchers are selected by their ability to get strikeouts increases.

As the extent to which pitchers are selected by their ability to get strikeouts increases, strikeouts become more common.

As strikeouts become more common, they become more important.

As strikeouts become more important, dot dot dot.

For ten years or more, I have explained the relentless increase in strikeouts as a function of the asymmetry in the relationship of pitcher strikeouts/pitcher effectiveness to the relationship of batter strikeouts/batter effectiveness. But I now realize that there is a second dynamic involved here, which is probably as important or more important than the other one. Like a hurricane, the cycle of strikeouts increasing is feeding on itself.

Conclusions—Part II, Pitchers

We are nowhere near the point at which the downward pressure on strikeouts (from the batters) is equal to the upward pressure (from the pitchers), and, in fact, we may not be much closer to that point now than we were years ago.

The realization in the point above is an outgrowth of the realization that, over time, the correlation of strikeouts to effectiveness among pitchers has increased dramatically, as the number of strikeouts has increased. Once you realize that, it’s an "Oh, of course." I had just never before realized that, never before realized that the extent to which the best pitchers were strikeout pitchers had increased over time.

Unlike the batter correlation, which is sometimes positive and sometimes negative, the correlation between pitcher strikeouts and pitcher success has always been positive. There has never been a year when it wasn’t positive.

Year	Correlation	Year	Correlation	Year	Correlation	Year	Correlation
1919	+.247	1944	+.215	1969	+.278	1994	+.333
1920	+.346	1945	+.372	1970	+.307	1995	+.526
1921	+.309	1946	+.314	1971	+.384	1996	+.522
1922	+.105	1947	+.347	1972	+.313	1997	+.495
1923	+.290	1948	+.277	1973	+.367	1998	+.520
1924	+.222	1949	+.406	1974	+.311	1999	+.485
1925	+.225	1950	+.256	1975	+.320	2000	+.468
1926	+.266	1951	+.399	1976	+.361	2001	+.478
1927	+.203	1952	+.439	1977	+.484	2002	+.449
1928	+.311	1953	+.137	1978	+.370	2003	+.524
1929	+.201	1954	+.403	1979	+.400	2004	+.528
1930	+.470	1955	+.226	1980	+.449	2005	+.543
1931	+.361	1956	+.136	1981	+.330	2006	+.501
1932	+.175	1957	+.109	1982	+.501	2007	+.510
1933	+.284	1958	+.250	1983	+.360	2008	+.507
1934	+.317	1959	+.176	1984	+.265	2009	+.496
1935	+.380	1960	+.345	1985	+.389	2010	+.538
1936	+.362	1961	+.260	1986	+.398	2011	+.488
1937	+.408	1962	+.467	1987	+.390	2012	+.441
1938	+.197	1963	+.399	1988	+.503	2013	+.437
1939	+.403	1964	+.349	1989	+.459	2014	+.449
1940	+.190	1965	+.361	1990	+.399	2015	+.521
1941	+.301	1966	+.365	1991	+.454	2016	+.517
1942	+.352	1967	+.311	1992	+.410	2017	+.566
1943	+.235	1968	+.392	1993	+.508	2018	+.510

The correlation has always been positive, but it has become more strongly positive over the years. In the first 25 years of the century we are studying, the correlation average was +.286. In the second 25 years, it averaged +.309. In the third 25 years, it averaged +.388. In the fourth 25 years, it averaged +.494. In the last five seasons it has averaged +.513. As strikeouts have risen, so too has the value of being a strikeout pitcher.

Conclusions—Part III, Combining the Pitching and Hitting Studies

It could be true, then, that:

a) There is now meaningful pushback against rising strikeout rates, caused by the fact that the best hitters now are players who don’t strike out as much, but

b) This effect is still very small compared to the "push" effect of strikeout pitchers being better than non-strikeout pitchers, and

c) The growth in strikeout rates is still pushing on itself, more than negating the increase in pushback in recent years.

So it remains possible and perhaps likely that strikeout rates will continue to go up.

Sorry.

That wasn’t the result I wanted, either. This chart combines the pitcher’s correlation (PUSH) and the batter’s correlation (PULL) into one number.

Year	PUSH	PULL	Combined	Year	PUSH	PULL	Combined
1919	.247	-.016	.231	1969	.278	.117	.395
1920	.346	.021	.367	1970	.307	.158	.464
1921	.309	.017	.326	1971	.384	.141	.525
1922	.105	.024	.129	1972	.313	.067	.380
1923	.290	-.018	.272	1973	.367	.091	.458
1924	.222	-.008	.214	1974	.311	.078	.390
1925	.225	-.113	.112	1975	.320	.071	.391
1926	.266	.090	.355	1976	.361	.036	.397
1927	.203	.028	.231	1977	.484	.148	.631
1928	.311	.046	.358	1978	.370	.205	.575
1929	.201	.100	.302	1979	.400	.159	.559
1930	.470	.110	.580	1980	.449	.000	.450
1931	.361	.005	.367	1981	.330	.117	.447
1932	.175	.046	.221	1982	.501	.118	.619
1933	.284	.159	.444	1983	.360	.016	.376
1934	.317	.003	.320	1984	.265	.046	.311
1935	.380	.021	.401	1985	.389	-.007	.382
1936	.362	-.061	.301	1986	.398	-.024	.375
1937	.408	-.030	.378	1987	.390	-.045	.345
1938	.197	.073	.270	1988	.503	-.047	.456
1939	.403	.051	.454	1989	.459	-.079	.380
1940	.190	.027	.217	1990	.399	.058	.457
1941	.301	.017	.318	1991	.454	-.069	.385
1942	.352	.040	.392	1992	.410	-.038	.372
1943	.235	-.117	.117	1993	.508	-.059	.449
1944	.215	-.028	.188	1994	.333	-.090	.244
1945	.372	-.069	.303	1995	.526	-.052	.473
1946	.314	-.052	.262	1996	.522	-.019	.503
1947	.347	-.092	.255	1997	.495	-.075	.421
1948	.277	-.138	.139	1998	.520	-.008	.512
1949	.406	-.091	.315	1999	.485	-.039	.446
1950	.256	-.100	.156	2000	.468	.032	.499
1951	.399	-.020	.380	2001	.478	.001	.479
1952	.439	-.035	.404	2002	.449	-.003	.445
1953	.137	-.109	.028	2003	.524	-.019	.505
1954	.403	-.099	.304	2004	.528	-.030	.497
1955	.226	-.087	.139	2005	.543	.086	.628
1956	.136	.042	.179	2006	.501	.051	.552
1957	.109	-.014	.095	2007	.510	.058	.568
1958	.250	-.047	.203	2008	.507	.025	.532
1959	.176	-.008	.169	2009	.496	-.017	.479
1960	.345	.133	.478	2010	.538	.117	.655
1961	.260	.093	.354	2011	.488	-.032	.456
1962	.467	.035	.502	2012	.441	-.069	.372
1963	.399	.005	.404	2013	.437	-.041	.395
1964	.349	.093	.442	2014	.449	-.161	.288
1965	.361	.041	.402	2015	.521	.009	.530
1966	.365	.105	.470	2016	.517	-.111	.406
1967	.311	.000	.311	2017	.566	-.110	.456
1968	.392	.051	.443	2018	.510	-.141	.369

I’ll go over that one quickly, because the one-year chart doesn’t show trends clearly. It’s not a one-year effect. The theory here is that pitcher selection is driven by the types of pitchers who are successful. That’s not a one-year process; it’s a process that happens over time. These charts below combine the one-year effects (above) into five-year totals.

Year 1	to	Year 5	Total Effect
1919	to	1923	1.33
1920	to	1924	1.31
1921	to	1925	1.05

The "1.05" number above combines the Push and Pull Effects (that is, the pitcher’s correlation and the hitter’s correlation) from 1921 to 1925 into one number, which is 1.05. That means that the average of those ten effects is just +.105—a very weak effect. Strikeouts are being pushed upward, but at a relatively slow pace.

Now I will take the "from" and "to" out of the charts, and just refer to "1921 to 1925" as "1925". That means it is the five-year average, measured from 1925. From 1925 to 1934, this number went steadily upward:

Year	Total Effect
1925	1.05
1926	1.08
1927	1.18
1928	1.27
1929	1.36
1930	1.83
1931	1.84
1932	1.83
1933	1.91
1934	1.93

By 1934 the upward pressure on strikeouts had nearly doubled. From 1934 until 1957, however, this number went DOWN steadily:

Year	Total Effect
1934	1.93
1935	1.75
1936	1.69
1937	1.84
1938	1.67
1939	1.80
1940	1.62
1941	1.64
1942	1.65
1943	1.50
1944	1.23
1945	1.32
1946	1.26
1947	1.12
1948	1.15
1949	1.27
1950	1.13
1951	1.24
1952	1.39
1953	1.28
1954	1.27
1955	1.25
1956	1.05
1957	0.74

For the sake of clarity, there was always upward pressure on strikeouts resulting from the asymmetrical pressures of pitcher selection (Push) and batter selection (Pull). However, that pressure, during this period of time, was dropping steadily, and was much lower in 1957 than it had been in 1934. The 1957 measurement is the all-time low point of this chart. After 1957, the upward pressures on strikeouts accelerated at a remarkable rate:

Year	Total Effect
1957	0.74
1958	0.92
1959	0.78
1960	1.12
1961	1.30
1962	1.71
1963	1.91
1964	2.18
1965	2.10
1966	2.22

Let us ask what was happening, in that era, which would have caused this quite remarkable jump in the upward measurable pressures on the strikeout rate?

1) Sandy Koufax emerged as clearly and obviously the best pitcher in baseball. In 1957 Koufax was in the majors and had a very high strikeout rate, but was an ineffective pitcher.

2) Koufax’ emergence (and Drysdale’s dominance) was followed by the emergence of many other power pitchers—Bob Gibson, Bob Veale, Sam McDowell and others. In the 1950s the Cy Young Award winners were Don Newcombe and Vern Law, who "dominated" by never walking anybody, a 36-year-old Warren Spahn and a 39-year-old Early Wynn. (Law actually won the Cy Young in 1960.) In the 1960s they were power pitchers.

3) Many teams built up their mounds, beyond regulation height, to give their pitchers an advantage.

4) The strike zone was re-defined (1963).

5) There was a very dramatic shift in the thinking of managers about the irregular use of pitchers. In the 1950s managers commonly used pitchers in what would later be regarded as highly irregular patterns. Between 1959 and 1965 this changed, to an enormous extent, largely unrecognized by baseball historians although it can be clearly documented.

The upward pressure on strikeouts can be seen in the chart beginning in 1958, which is actually the years 1954 to 1958. Thus, it can be argued that the push/pull effect predicts the pitching-dominated 1960s, that the signs of this happening emerged several years before it did happen. That can be argued, and it may be true, or it may be just picking and choosing data from the chart that is convenient to the argument. I don’t actually know which one of those it is.

In any case the push/pull effect after 1966 was flat for ten years, but remained at a very high level:

Year	Total Effect
1966	2.22
1967	2.03
1968	2.07
1969	2.02
1970	2.08
1971	2.14
1972	2.21
1973	2.22
1974	2.22
1975	2.14
1976	2.01

After 1976 the Push/Pull Pressure grew more intense once again, reaching a new peak of 2.66 in 1981:

Year	Total Effect
1976	2.01
1977	2.27
1978	2.38
1979	2.55
1980	2.61
1981	2.66

After 1981 the pressure for more strikeouts relaxed significantly for six years:

Year	Total Effect
1981	2.66
1982	2.65
1983	2.45
1984	2.20
1985	2.14
1986	2.06
1987	1.79

But beginning in 1987, it then went up for another 23 years, reaching an all-time high of 2.79 in 2010:

Year	Total Effect
1987	1.79
1988	1.87
1989	1.94
1990	2.01
1991	2.02
1992	2.05
1993	2.04
1994	1.91
1995	1.92
1996	2.04
1997	2.09
1998	2.15
1999	2.36
2000	2.38
2001	2.36
2002	2.38
2003	2.37
2004	2.43
2005	2.55
2006	2.63
2007	2.75
2008	2.78
2009	2.76
2010	2.79

Since 2010 the upward pressure on strikeouts does appear to have relaxed somewhat:

Year	Total Effect
2010	2.79
2011	2.69
2012	2.49
2013	2.36
2014	2.17
2015	2.04
2016	1.99
2017	2.08
2018	2.05

However, while the upward pressures on the strikeout total are not now what they were in 2010, they remain relatively high by historic standards.

By any possible interpretation of the data, we are nowhere near the point at which the downward pressures on strikeouts resulting from the fact that the best hitters don’t strike out much are equal to the upward pressures resulting from (a) the fact that the best pitchers get strikeouts, and (b) the hurricane effect, that as strikeouts become more common, that pushes managers to find strikeout pitchers. In other words, there is no evidence in the data that the 120-year trend toward more strikeouts in the game is near its end.

That doesn’t mean that it isn’t near its end, either. The exact relationship between future strikeouts and the upward pressure on strikeouts resulting from asymmetrical forces and is unclear, if in fact such a relationship exists. It’s a long-term, multi-year observation of the difference which creates a long-term increase in strikeout rates (if that is in fact what is happening.) You can’t look at the data and see an obvious relationship between how strong the forces are and how rapidly the strikeout rate increases. There may be such a relationship in there somewhere, but you can’t easily spot it. The rules of science require that if we can’t demonstrate a relationship between the measured forces and the "resulting" increase in strikeouts, we have to assume that there is no such relationship.

I do believe, honestly, that we could be near to a turning point. The forces we are trying to measure here operate through the collective mind of baseball people. Two or three years ago, when most of baseball was focused on "launch angle" and apparently ready to accept unlimited strikeouts to buy launch angle, the Red Sox and the Astros went in the other direction. They went the non-strikeout route. In 2018, the two best teams in baseball, probably, were the Red Sox and the Astros. I do believe that this may turn out to be a significant development. Baseball imitates its winners, of course, understanding that you have a new winner every year or most years. What won last year may not win next, and there may be no lasting effect to the 2018 success of the low-strikeout teams.

But then again, there could be.

COMMENTS (22 Comments, most recent shown first)

bertrecords
Over the past decade, fastball velocity has increased. Pitchers who hit 95 mph are becoming the norm. The faster the pitch, the less time the hitter has to react. Therefore, more strikeouts. I read that the correlation between fastball velocity and strikeouts is high which corroborates the obvious.

In my Twitter feed, I find reference to those who suggest that throwing 95 mph may be achieved by ordinary humans via training regimens. Obviously, this wasn't the case 15 years ago or we would have seen more pitchers throwing 95. I suppose these claims do not have to be true, but Eric Sim and Kyle Brody convince me that the claims are real.

The Astros and Red Sox may be scouting or somehow training hitters that can hit 100 mph balls.

But, the biggest impact is still Nate Eovaldi and the increasing number of pitchers who throw 100 mph because that is being taught.

I expect even more strikeouts.
7:24 PM Nov 14th

KaiserD2
This is a very interesting article. I have a question arising from it.

My anecdotal opinion, which isn't worth much in itself obviously, is that strikeouts are increasing partly because pitchers refuse to throw the ball over the plate and batters tend to swing anyway. With the presence of pitchtrax and other software on tv we can see what is a strike and what isn't. I've seen quite a few strikeouts for which there was not actually a single strike thrown, or maybe one. Everyone is trying to hit the corner, all the time. (I have more than my own observation to back that up--two coaches for the PawSox confirmed how unhappy this makes them at a recent SABR meeting.)

The thing is, some one, surely, is keeping the data I am talking about--how many strikes people are throwing, how many bad balls batters are swinging at, etc. My question is, is anybody publishing it? If so, where? It should have been available for some years now and we could confirm whether there is a trend.

Just curious.

David K
7:21 AM Nov 12th

MarisFan61
Yes, depending on betweew what and what! :-)
9:39 PM Nov 6th

evanecurb
Is there a correlation betweew
7:29 PM Nov 6th

shwright50
Chihuahua332 and Bill J. I stand corrected. Very interesting article. (I'll read more carefully next time...)
4:17 PM Nov 6th

MarisFan61
How about some background on the color chart thing.... :-)
9:53 PM Nov 5th

studes
Nice post, Bill. I appreciate you reviewing past assumptions like that. Also, thanks for the non-color chart. It's good to know that years of whining can have an impact.
10:29 AM Nov 5th

shthar
Does having more or less major league teams, and therefore more or less major league players affect this?
9:26 PM Nov 4th

hotstatrat
I was wondering the same thing as rwarn17588. Did the Royals' success start the trend that Boston and Houston have apparently picked up on?
1:47 PM Nov 3rd

Gfletch
"Obstreperous?" Nah. I'd go with "Occasionally prickly," myself.
1:14 PM Nov 3rd

bjames
Charles--Don't understand your point. Just not following what you are saying.

Josh Hader WAS included in the study. You half-read the criteria.

5:29 PM Nov 2nd

Chihuahua332
Shwright50, the criteria are 100IP or 50 points (starts + appearances). The second condition will include relief specialists who appear fairly regularly.
5:26 PM Nov 2nd

CharlesSaeger
Wouldn’t the change come from a team having success telling its pitchers to ease up, let the fielders do the work, and work later in games since they don’t throw as many pitches per out? (Or whatever incentive works for the pitcher.) As you said, the pitchers are what’s driving this.
5:08 PM Nov 2nd

arnewcs
Outfielders: It does feel like it has gotten harder for slow players to hide in the corner outfield spots. And when you think of Benintendi's play in the ALCS, and what, a decade ago, Manny Ramirez could have done in the same situation, it's a clear display of the value of a strong across the board outfield defense.
3:33 PM Nov 2nd

shwright50
I wonder if the '100 innings pitched' criterion might be introducing a bias in the trends of recent years. Would the inclusion of the growing number of (near)100 mph relief specialists, Josh Hader-like guys that don't accrue 100 innings but that do strikeout a lot of batters, shift the 'push-pull pressure' to higher values the 2.0-ish values of recent years?
3:28 PM Nov 2nd

BlueRulez
I am guessing that the longest term curb on strikeouts is the sheer ability of pitchers arms to stand the strain of throwing hard.

IF (big speculative IF) MLB expands the roster and teams can have even more pitchers on staff, one might see five or six pitchers per game, with NO attempt to have the starter complete a game at all. By that time, "starter" as a term becomes obsolete. Four or five pitchers throwing between 2.25 to 1.8 innings per game are going to throw a hell of a lot harder than any current starter. Also at that point, while there are no "starters," you could have twenty to twenty-five pitchers ON ROTATION. That would increase the ability of pitchers to throw harder, longer. I don't expect MLB to go in this direction myself; but, compared to 1963 it has gone in this direction.

IF scientists come up with improved pharmaceuticals to delay ageing and increase strength, with no downsides to health, I am guessing that the ability of pitchers to throw hard MAY swamp the ability of hitters to make contact while swinging hard. That, too, would increase the ability of pitchers to throw harder.

All this is speculative. I admit I'm not making an attempt to see what might affect strikeouts in the near future.
3:26 PM Nov 2nd

bearbyz
Hey, the pitchers chart could have been all red, bloody pitchers.
2:42 PM Nov 2nd

rwarn17588
That's a very good point, MarisFan61. I believe Bill has commented here about the vast improvement in groundskeeping that's led outfielders (and all fielders, in general) to take more chances on running full-tilt and diving for balls because they know their chances of injury is far less than it once was.
2:38 PM Nov 2nd

MarisFan61
Interesting and (to me) unexpected that in recent decades there's been that kind of reversal in correlation between strikeout rate and batter effectiveness -- i.e. that now, in the era when striking out has been sort of accepted as being OK, the relation changed in favor of striking out less.

Separate thing:
I offer the idea that there's another factor that's been driving up strikeouts, a factor that I've only seen ever mentioned a couple of times, and both of those were in posts by me. :-)
Which I realize doesn't argue greatly for it being so, but nevertheless.

I've mentioned many times that it seems fielding quality has become better and better and better in these last few decades. Even just in the last decade, it seems (to me) like more and more plays which would have been considered spectacular and rare 10 years ago, especially in the outfield, are now almost expected -- if the guy can get there, even by a dive or tumble, he's going to catch it, and many of the outfielders do get there on balls where it seems like they didn't used to.

If this is so, I'd say it follows almost as a given that there's more incentive for batters to go for hard contact, either to put the ball out of the park or just to make it harder to get to; plus that teams tend more to look for players who hit the ball harder. We could also say that the growing emphasis on "hard contact" is in line with this.
Oh -- I forgot to say, I guess because I thought it was implicit but maybe it isn't:
Going more for hard contact increases the chance of swinging and missing, does it not....

We could add that the 'shifts' thing has an effect, in the same direction, but I don't want to put too much on that, because there's nothing new about bringing that up, plus that we don't know where the trend will go and how batters may adjust. The better-fielding thing is a separate factor, and if I'm right that it's so, it's here to stay.
1:35 PM Nov 2nd

Jack
Fantastic article, crystallizing and providing a sensible, evidence-based explanation for what has been happening.
1:26 PM Nov 2nd

QimingZou
The red positive, green negative chart is driving me nuts
11:33 AM Nov 2nd

rwarn17588
Did the success of the Royals -- and their penchant of avoiding strikeouts and putting the ball in play -- a few years ago lead to the Red Sox and Astros adopting some of the same approach?

I remember a lot of people commenting about the Royals' style the two years they went to the World Series because the team was such an outlier at the time.
8:50 AM Nov 2nd

The Strikeout Push/Pull Effect

COMMENTS (22 Comments, most recent shown first)

Leave a comment

Report inappropriate comment


Type of Abuse:
Comments: