Remember me

Mike and Paul and Mark and Carlos

August 26, 2011
1. Mike and Paul
 
            Paul Splittorff and Mike Flanagan have both passed away this summer.   Both were left-handed pitchers from the 1970s and 1980s (Splittorff 1970-1984, Flanagan 1975-1992).   Both pitched their entire careers in the American League, Splittorff for one team, Flanagan for two teams although he was clearly associated with one.   Both remained with their teams after their playing careers, and were broadcasters with those teams at the time of their deaths.   Both teams were powerhouse franchises at the time Splittorff and Flanagan were active, but have long since fallen onto hard times.
            Splittorff had a career won-lost record of 166-143; Flanagan, of 167-143.   Each pitcher won 20 games once and 19 games once.   Each pitcher won 14 or more games five times.
            Flanagan started 404 games in his career with a career ERA of 3.90.   Splittorff started 392 games with a career ERA of 3.81.   Flanagan pitched 101 complete games in 404 starts (25%); Splittorff pitched 88 complete games (23%).
            Both had below-average strikeout rates but better than average rates of walks, home runs, wild pitches, balks and hit batsmen.   Each issued 41 intentional walks in his career, when the league average given the number of batters faced would have been 76 for each one.    Both had above-average numbers of shutouts.   Flanagan was 3-2 in post-season play; Splittorff was 2-0.
            Splittorff was three years older than I am; Flanagan was two years younger.   Their passing thus serves powerfully to remind me that my own time here could end at any moment.   They faced each other only once in their careers (April 22, 1978), which seems very odd in that Splittorff faced Catfish Hunter, for example, eleven times, Jim Palmer six times, Tommy John eight times and Scott McGregor six, while Flanagan faced Tommy John and Dennis Leonard seven times each, and Jack Morris eight. Splittorff won their one confrontation, 5-3.   Let us hope that they have met again beyond the veil.
 
 
2. Mark and Carlos
 
            Suppose that Mark Reynolds is batting against Carlos Marmol in the National League in 2010.   What is the chance that the outcome of that matchup will be a strikeout?
            I believe. . ..if my math is correct. . .that it is about 66%.   In the National League in 2010 the overall strikeout rate (strikeouts per plate appearances) was 19.5%.   Reynolds, however, struck out at more than twice the league strikeout rate (39.6%), while Marmol struck out 42% of the hitters that he faced.   When they match up, then, both are putting enormous upward pressure on the probability of a strikeout. 
            Same league. . .suppose that Luis Atilano is facing Jeff Keppinger.   Atilano struck out only 40 batters while facing 385 batters, barely over one-half the league strikeout rate. Keppinger struck out only 36 times in 575 plate appearances, less than one-third the league strikeout rate.    The probability of a strikeout resulting from a matchup of them (ignoring the platoon factor) is about 3%.  
            The math works in this way.    Suppose that a .600 team is facing a .400 team.    What is the likelihood that the .600 team will win?
            It’s 69%.   This can be figured by a method I introduced more than 30 years ago, called the Log5 method.    We begin by asking "What is the logarithmic equivalent of a .600 winning percentage?"   The logarithmic equivalent of a .600 winning percentage is that number which, if added to .500 and divided by the total, produces a .600 winning percentage. 
            That number, for .600, is .750; the Log5 of .600 is .750:
 
 
.750
 
 
 
----------------------------
=
.600
 
.750
.500
 
 
         
           
            The Log5 of a .500 team is .500.   The Log5 of a .700 winning percentage is 1.167:
 
 
1.167
 
 
 
----------------------------
=
.700
 
1.167
.500
 
 
         
 
 
            When a .700 team plays a .500 team, the .700 team will win 70% of the time.   The Log5 of a .400 team is .333:
 
 
.333
 
 
 
----------------------------
=
.400
 
.333
.500
 
 
         
 
 
            So when a .600 team meets a .400 team, the probability of the .600 team winning is 69.2%:
           
 
.750
 
 
 
----------------------------
=
.692
 
.750
.333
 
 
         
 
 
            This can also be figured by cross-multiplying the wins and losses. . .let us assume that Texas (30-20) is playing Pittsburgh (20-30). The probability that Texas will win can be figured as:
 
            (Tex W * Pitt L) / [(Tex W * Pitt L) + (Pitt W * Tex L)]
 
            This gets the same result:
 
            (30 * 30) / [(30 * 30) + (20 * 20)] = .692
 
            You can also generalize this method to figure, in essence, a "Log5" for something like "Mark Reynolds strikeout rate"; the process to do that was developed by Dallas Adams, and I don’t exactly understand how because I’m not that good at math.     Suppose that we put ".600" in cell A1 of a spreadsheet and ".400" in cell B1:
 
.600
.400
 
            In Cell C1 we put this:
 
            =(A1*(1-B1))/((A1*(1-B1))*(B1*(1-A1))
 
            You follow?    If you put that formula in cell C1, you get .692, which means that a .600 team beats a .400 team 69.2% of the time.
            But you can also put Mark Reynolds strikeout rate in cell A1 (.396) and the league strikeout rate in B1 (.195), and get an answer in C1, which is .730:
 
.396
.195
.730
 
            What this means, literally, is that a .396 team will beat a .195 team 73.0% of the time.   And we can do the same with Carlos Marmol’s strikeout rate, compared to the league:
 
.416
.195
.746
 
            Which means, literally, that a .416 team will beat a .195 team 74.6% of the time, but also means that comparing Carlos Marmol’s strikeout rate to the league norm, Carlos Marmol has a winning percentage of .746.
            Look, all of this sounds horribly obscure and theoretical, but it works like a dream in practice, and you can take my word for that or you can check it out, but you will save a hell of a lot of time if you take my word for it.
            When a good team plays a good team they are pushing against each other and thus pushing in the direction of .500, but when a good team plays a bad team they are pushing the percentages in the same direction, and thus pushing toward 1.000 (or toward .000).   When Carlos Marmol pitches against Mark Reynolds they are both pushing the strikeout probability upward toward 1.000, so that’s not good against good; that’s good against bad.   To reflect this, we have to convert Carlos Marmol’s .746 winning percentage to its complement, .254.   Then we put Mark Reynolds "strikeout win percentage" (.730) in cell A1, and the league’s "strikeout win percentage" against Carlos Marmol (.254) in cell B1, and we get the outcome in C1:
 
.730
.254
.888
 
            Or, if you prefer, you can put Carlos Marmol’s "strikeout win percentage" (.746) in cell A1 and the league’s "strikeout win percentage" against Mark Reynolds in cell B1 (.270), and you’ll get the same thing; it makes no difference which you consider the Alpha and which you consider the Beta.   Anyway, .888 is what we could call the "Mark Reynolds/Carlos Marmol compressed strikeout win percentage." 
            Mark Reynolds and Carlos Marmol have a very high compressed strikeout win percentage (.888), but in the league as a whole, 80.5% of plate appearances do NOT result in strikeouts.   The .888, then, is battling head to head with that .805. 
 
.888
.805
.658
 
            And the result is .658, which is the frequency with which Carlos Marmol will strike out Mark Reynolds—65.8% of the time.   Let’s display that as one continuous block:
 
 
 
A
B
C
1
.396
.195
.730
2
.416
.195
.254
3
.730
.254
.888
4
.888
.805
.658
 
 
            If Mark Atilano faced Jeff Keppinger, they would both be pushing the likelihood of a strikeout down.    Atilano’s "strikeout win percentage" is .323:
 
.104
.195
.323
 
            While Keppinger’s is .216:
 
.063
.195
.216
 
            When we put those together, we have to convert one of them into its complement so that they will both push in the same direction, rather than pushing against one another, so the combined "strikeout win percentage", with both Keppinger and Atliano pushing it downward, is .116:
 
.104
.195
.323
.063
.195
.216
.323
.784
.116
 
            Which will result in strikeouts in 3.1% of plate appearances:
 
 
A
B
C
1
.104
.195
.323
2
.063
.195
.216
3
.323
.784
.116
4
.116
.805
.031
 
 
 
 

COMMENTS (14 Comments, most recent shown first)

studes
In related news, the plus sign now works.
5:23 PM Oct 5th
 
studes
Just want to point out that there should be a plus sign in that formula to be copied over to Excel. It should read like this:

=(A1*(1-B1))/((A1*(1-B1))+(B1*(1-A1))
5:23 PM Oct 5th
 
Trailbzr
The concept here can be extended to more than two categories by eliminating the steps that don't help do so.

If we partition each hitter's and pitcher's PA/BF into the Gettysburg categories: K, W, H, B (that's Home Runs and Balls in Play) expressed as percentages, then in each category calculate:

K(bat) x K(pit) / K(league) = K'
and analogously for W, H, B

Generally, K', W', H', and B' won't sum to 1, so call their total T', then:

K=K'/T' and analogously for W, H, B

Log5 is the same calculation for two categories (Yes/No).​
8:59 PM Aug 27th
 
rezk42
Ah, gotcha. My bad.

8:04 PM Aug 27th
 
bjames
Sorry. 700 plus 600. . ..1167 plus 750. We've got to get that fixed.
11:00 AM Aug 27th
 
bjames
A .700 team has a .609 winning percentage playing a .600 team. Rezk42 skipped the stage of converting the won-lost record into a log5 before treating it as a logarithm. It's not 700/(700 600), but 1167/ (1167 750).

10:59 AM Aug 27th
 
rezk42
Bill, the two different ways you gave of estimating the expected winning percentage don't give the same answer.

If I have a 70-30 team against a 60-40 team, the first method gives me

(0.700) / (0.700 plus 0.600) = 0.538.

The second method gives me

(70 x 40) / (70 x 40 60 x 30) = 0.609.

The second method is the same as dividing the W/L ratio of team 1 by the sum of the two W/L ratios. The W/L ratios for the two teams are 70/30=2.333 and 60/40=1.500, so

(2.333) / (2.333 1.500) = 0.609.
10:16 AM Aug 27th
 
studes
Sorry. This is just a test of the plus sign.

46 17 - 7 = 56
7:06 AM Aug 27th
 
evanecurb
I had not heard of Mike Flanagan's passing. He was a favorite of mine. I swear he had a very good fastball despite the low strikeout totals, especially in his Cy Young season of '79.
12:17 AM Aug 27th
 
julesig
I remember reading about this in the Abstract. Isn't it essentially what is used to determine what outcomes go a Strat-O-Matic card? You can't just use the straight percentages there because 1/2 of the results come from the pitcher's card (or batter's), which is assumed to be average, so that would push the results towards the average. You can use this method to "inflate" the card percentages to the right levels to push against the average.
7:19 PM Aug 26th
 
hotstatrat
Hmmm. This logi5 can bi used in many different areas.
4:44 PM Aug 26th
 
Robinsong
It would be interesting to look at head to head on strikeout rates (e.g. Ryan or Marmot or Johnson or Clemens against high strikeout rates hitters (other than pitchers)). What I wonder is whether high-strikeout hitters will adjust, perhaps by cutting down their swings. In other words, Bill's approach works if hitter strikeout rates and pitcher strikeout rates are just independent tendencies. On wins, we assume that teams are trying equally hard against good and bad teams - though when Casey Stengel held back Ford to matchi him against th bestteam, that assumption broke down. On strikeouts, though, hitters may adjust strategy against tough pitchers. It might matter whether the high strikeout rate was due to uppercut swings or lack of control of the strike zone.
3:01 PM Aug 26th
 
Trailbzr
For some reason Plus signs don't come out.
Use letter K below:

-.422 K 1.079 = .657

-2.154 K -2.700 - -1.418 = -3.436

1/(1 exp(K 3.436)) = 0.31


11:49 AM Aug 26th
 
Trailbzr
I wrote a Reader Post last month advocating to think of calculations like those in terms of Logits. The Logit is the probability of Yes, divided by the probability of No, and then take the natural logarithm.

Reynolds/League/Marmol=
ln(.396/.604) = -.422
ln(.195/.805) = -1.418
ln(.416/.584) = -.339 (1.079 higher then League)

-.422 1.079 = .657

To convert a Logit back into a probability, the formula is:

1/(1 exp(-.657)) = .659.
Note the change in sign inside the exponent.


For Atilano/Keppinger, the relevant Logits are:
-2.154 -2.700 - -1.418 = -3.436
1/(1 exp( 3.436)) = 0.31

The Logit of a .500 league is zero.

boards.billjamesonline.com/showthread.php?2198-The-Logit-for-Probability-Calculations

11:44 AM Aug 26th
 
 
©2024 Be Jolly, Inc. All Rights Reserved.|Powered by Sports Info Solutions|Terms & Conditions|Privacy Policy