Why AL/NL WARs Differ in a Given Year (Hint: it’s more obvious than I thought)

36 Replies

Recently I made the “shocking” discovery that the AL and NL don’t have the same season WAR totals (on a per-team basis), even before interleague play. Of course I wondered why that is. After much verbal head-scratching on my part, Ed very kindly pointed out that the obvious answer I had been rejecting was, indeed, the answer:

WAR formulas are intentionally tweaked to reflect the relative strength of the leagues; otherwise, player WAR could not be used for any meaningful comparison across different leagues or seasons.

Here’s the explanation that appears on B-R:

[T]the leagues are not always equal in their quality levels as evidenced by things like inter-league play and also player performances when shifting leagues. Taking these differences into account assign slightly different multipliers to the leagues, but centered on 20 for 162 game seasons and 19 for 154 game seasons. One example of this is the post-war integration. The National League integrated far more quickly than the American League and was a higher quality league until the 1970’s.

I still don’t know just how the difference in leagues is assessed. But as an exercise, I tried to see if league strength could be seen in the players who moved between the leagues during the period 1946-68 — the post-war, pre-expansion era, which is where I first noticed the large difference in league WAR. As it turned out, almost all of those who spent roughly 2 full seasons in each league looked better in the AL, relative to that league.

There were only 12 players who had 1,000+ PAs in both the AL and the NL during 1946-68. This list shows their OPS+ in each league for those years only:

Chico Fernandez: AL 72, NL 61, +9 points of OPS+ in the AL
Frank Bolling: AL 91, NL 79, +12
Eddie Bressoud: AL 109, NL 81, +28
Gino Cimoli: AL 84, NL 85, -1
Tito Francona: AL 110, NL 92, +18
Bill Bruton: AL 97, NL 95, +2
Jackie Brandt: AL 103, NL 97, +6
Harvey Kuenn: AL 112, NL 98, +14
Roy Sievers: AL 127, NL 107, +20
Dick Stuart: AL 119, NL 116, +3
Frank Howard: AL 148, NL 125, +23
Frank Robinson: AL 182, NL 150, +32

Average: +14 points of OPS+ in the AL.

And the ERA+ for the 23 pitchers with at least 429 innings* in each league during 1946-68:

Jim Bunning: AL 116, NL 129, -13 points of ERA+ in the AL
John Buzhardt: AL 100, NL 94, +6
Gene Conley: AL 90, NL 107, -17
Moe Drabowsky: AL 105, NL 94, +11
Jack Fisher: AL 99, NL 84, +15
Ron Kline: AL 118, NL 97, +21
Mike McCormick: AL 94, NL 100, -6
Cal McLish: AL 107, NL 91, +16
Don McMahon: AL 141, NL 111, +30
Stu Miller: AL 145, NL 107, +38
Billy O’Dell: AL 128, NL 102, +26
Claude Osteen: AL 111, NL 101, +10
Milt Pappas: AL 113, NL 98, +15
Juan Pizarro: AL 115, NL 88, +27
Robin Roberts: AL 115, NL 113, +2
Johnny Sain: AL 103, NL 109, -6
Johnny Schmitz: AL 112, NL 106, +6
Bob Shaw: AL 103, NL 107, -4
Gerry Staley: AL 140, NL 100, +40
Hoyt Wilhelm: AL 162, NL 130, +32
Stan Williams: AL 111, NL 106, +5
Jim Wilson: AL 98, NL 86, +12
Al Worthington: AL 141, NL 102, +39

Average: +13 points of ERA+ in the AL.

These are small samples, but the consistency and the size of the difference strongly suggests that the NL had a significantly higher level of competition in this period. Eleven of the 12 hitters and 18 of 23 pitchers had higher “+” ratings in the AL.

This doesn’t explain why the AL won 13 of those 23 World Series, but we all know that a 7-game series isn’t as telling as a full season.

___________________

* Why 429 IP? I only wanted to use one page of P-I results, and 429 IP was the total for the 200th guy in the NL in that span.

36 Comments

Oldest

Newest Most Voted

Inline Feedbacks

View all comments

Mike L

12 years ago

“This doesn’t explain why the AL won 13 of those 23 World Series” The Yankees won ten out of fifteen times. The rest of the AL was 3-5.

e pluribus munu

12 years ago

Reply to Mike L

Which suggests that for much of that period, the distribution of talent was probably less balanced in the AL – though not as different from the NL as the simple tally of Yankee pennants might indicate.

MikeD

12 years ago

Reply to Mike L

Exactly. The NL was the stronger league, but that doesn’t mean the strongest team wasn’t in the AL.

Author

John Autin

12 years ago

Reply to Mike L

You’re right, Mike L. I should not have mentioned the WS at all.

Bells

12 years ago

Is OPS+, then, not adjusted for league? Baseball-reference seems to suggest it’s not… copy/pasted explanation of the measure:

Statistic Description: OPS+ 100*[OBP/lg OBP + SLG/lg SLG – 1] Adjusted to the player’s ballpark(s)

Just checking, though, as I’m not really familiar with how these things are calculated. Why would OPS+ be only league-relevant, but WAR be adjusted for minor differences in league strength? Perhaps it’s simple, but my brain isn’t working so well today.

bstar

12 years ago

Reply to Bells

Yes, I think OPS+ is adjusted for league. At least that’s what Fangraphs says in their explanation of it:

http://www.fangraphs.com/library/index.php/offense/ops/

“…Since OPS+ adjusts for league and park effects, it’s possible to use OPS+ to compare players from different years and on different teams…”

Author

John Autin

12 years ago

Reply to bstar

b, I interpret that as saying OPS+ is adjusted for the offensive context of the league as compared to historic norms, not as compared to the other league in a given year.

If OPS+ were adjusted for the strength of one league relative to its opposite number, then what I noted in the post about players with significant time in both leagues would be meaningless — it would not support the theory that the NL was stronger in that period.

bstar

12 years ago

Reply to John Autin

I think you’re right about the interpretation. sorry bells.

Bells

12 years ago

Reply to bstar

No, thanks bstar, I had the same interpretation as you and was like ‘wait, wouldn’t that make John A’s comparison tautological’? But I figured I must be wrong (John is usually pretty rigorous), and am glad that I am, because otherwise the mystery would still be there.

Forrest

12 years ago

Interesting. So now that we’re about to have seasons where there’s an interleague game EVERY DAY, will WAR stop being tweaked to reflect the league the player’s in?

Author

John Autin

12 years ago

Reply to Forrest

Forrest, I think I’m missing your point. Why would a change in number(?) and timing of future interleague games alter the rationale for weighting WAR according to league strength?

e pluribus munu

12 years ago

Reply to John Autin

Wouldn’t any change in the number of games alter the formula for weighting? After all, if interleague games were 50% of all games, the context discrepancy would presumably have shrunk to zero.

Author

John Autin

12 years ago

Reply to e pluribus munu

True statement, e, but Forrest asked if they would/should stop tweaking the formula entirely. I said the intended changes wouldn’t alter the rationale for weighting WAR — not that they shouldn’t alter the specific weights used.

e pluribus munu

12 years ago

Reply to John Autin

Right, JA, and another instance of my natural talent for making statements that are both trivially true and irrelevant. In this case I thought “every day” might have gotten confused with the concept of 50/50 balance. I craftily avoided clarity in my response.

kds

12 years ago

Reply to Forrest

Won’t make that much difference. From 252 inter-league games to 300 out of 2430 for the full schedule. Let’s compare Mays in the NL to Mantle in the AL. I’ve chosen 1955-1962 to get Mantle’s best years, Mays’ best 8 consecutive years were 1958-1965, so this is a little unfair to him. Mays had a few more PA, being healthier, and is just ahead in brWAR 69.2 to 68.1. Mantle was a considerably better hitter, 512 to 392 in Rbat and Mays’ advantage in defense leaves him still 42 behind by 42 RAA, 553 to 511. This leaves Mays behind… Read more »

no statistician but

12 years ago

JA: As usual, I have to quibble a little. The pattern of several of the position players here—Bolling, Francona, Kuenn, Sievers—shows a lower OPS+ in the NL at least in part because they had entered into their declining years as performers when they switched leagues. Stuart’s two big years in the AL don’t match his two best in the NL, and it’s only his clunker 1962 season that draws down his NL numbers. Jackie Brandt spent his prime years in the AL and his early and late years in the NL. The two big dogs at the end of the… Read more »

Author

John Autin

12 years ago

Reply to no statistician but

Solid quibbles, nsb, taken in the spirit of mutual pursuit of understanding.

no statistician but

12 years ago

Reply to John Autin

Yes—except that Conley actually spent his last years in the AL. Brain cramp.

Doug

12 years ago

I think comment #11 from kds explains the league differences more succinctly (and with greater precision) than what might be inferred from a grab-bag of test case players with selection difficulties such as those identified by nsb in comment #10.

Would be interesting to see the league-wide numbers kds mentions on a year-by-year basis, to see if they correspond to the perception that the AL has had a superior level of play in recent seasons.

Author

John Autin

12 years ago

Reply to Doug

Doug, I grant you that 12 position players hardly make up a valid test of AL/NL strength over a 23-year period. But I’m not sure how to do that test any better. Using a threshold lower than 1,000 PAs would bring in more players, but each would be more subject not only to randomness, but to the specific problem that players of that era were far more likely to change leagues after a poor year or two. The difficulties of such a study leave me wondering again at how B-R (S.F.) has arrived at the estimate of league strength. The… Read more »

Editor

Doug

12 years ago

Reply to John Autin

John, This was the kds quote that intrigued me. The difference between replacement level and average was 22 runs/162 games in the NL, but only 18 runs/162 games in the AL. The average AL player was about 4 runs per season worse than the average NL player. It sounds like kds has looked at WAR minus WAA on a league-wide (all players) basis, and then converted to RBAT. Subtracting one from the the other may not be entirely legitimate statistically, but I liked the approach conceptually, based on the notion that replacement level should be about the same for both… Read more »

Bells

12 years ago

Reply to John Autin

That’s still what bothers me too, John. The 1956 differences you cite are large, especially considering that a) the leagues were closed circuits save for a 4-7 game playoff series, and b) the number of samples of players switching, although telling a pretty clear story from your original post’s analysis, is really still small. A simple ‘sign test’ of binomial probability would certainly show it unlikely to randomly get 11 out of 12 position players to be better in one league, and without doing the math, 18 out of 23 pitchers being better in the AL is unlikely too, possibly… Read more »

Author

John Autin

12 years ago

Reply to Doug

“Would be interesting to see the league-wide numbers kds mentions on a year-by-year basis, to see if they correspond to the perception that the AL has had a superior level of play in recent seasons.”

It would indeed, but I don’t know how to convert the WAR numbers to the replacement-level adjustments kds cites.

On the WAR level, the AL has been rated 20%-21% stronger each of the last 5 years, both players and pitchers.

12 years ago

Reply to John Autin

John – From what I can tell, Fangraphs does NOT make this adjustment in their version of WAR. I’m looking at the line in the comparison chart that says “Varies Replacement Level by Quality of Competition”. Would be interesting to know why they chose not to adjust.

http://www.baseball-reference.com/about/war_explained_comparison.shtml

Author

John Autin

12 years ago

Reply to Ed

Ed, that’s a good find. Still, I’m not sure that the phrase means what you’re saying. It could refer to the varying quality of competition faced within one’s own league.

For example, a hitter on the ’93 Braves did not bat against his own historically great pitching staff, but did get 13 games against the historically high-yielding expansion Rockies, who allowed almost 6 R/G.

BTW, Atlanta swept that series, 13-0, averaging 8.2 R/G. David Justice hit .392/1.269 with 6 HRs; Jeff Blauser hit .426/1.269 with 4 HRs and 15 RBI (15 and 73 for the year), etc.

12 years ago

Reply to John Autin

Perhaps John though as far as I’m aware the place in which bWAR “Varies Replacement Level” is between the leagues. I’ve never seen anything that would suggest that players within the same league are assigned different replacement levels based on not facing their own teammates. They may make adjustments for that but making adjustments is different than “varying replacement level”.

Author

John Autin

12 years ago

Reply to John Autin

Ed, you’re prob’ly right. I’m starting to get out of my depth in the replacement-level discussion.

12 years ago

So nice to be mentioned in a HHS post! 🙂 As for the World Series, I have a few different thoughts. 1) One thing to look at would be total the batter and pitching WAR for each team in the WS and see how often the team with the higher WAR won. As commenters 1, 2 and 15 have noted, just because league A is stronger then league B, doesn’t mean that the top team from league A is stronger than the top team from league B. 2) Another thought is this…one thing we know about the playoffs is that… Read more »

Editor

Doug

12 years ago

Reply to Ed

The fact that the Yankees so utterly dominated the AL during this period would also be support the premise of a lower quality of play in the AL. Easier to come out on top consistently playing against lesser opponents.

no statistician but

12 years ago

Reply to Doug

Doug: The Dodgers and Giants together did a fair job of dominating the NL in that era, with 13 titles(Dodgers with 10) to the Yankees’ 15, so I’m not sure your argument is that strong. As for the rather startling 20% difference between leagues in 1956 cited by JA @ #18, I suspect it’s a crock, but I have no way to substantiate my suspicion. The NL had a three team race that went down to the wire, and the Yankees pulled away from the pack in early July, but in terms of competition within the leagues, the rest of… Read more »

Author

John Autin

12 years ago

Reply to no statistician but

nsb — “not a lot” is a reasonable count of the number of African-American & Latino stars in 1956. “A lot more than the AL” is another way to describe it. A Mays here, an Aaron there, a Banks here, a Frank Robby there — pretty soon you’re talking about real talent. Here’s my completely amateur count, by team, of 1956 players on each team who would not have been allowed to play in 1946. Players before the slash are regulars and SPs; after the slash are reserves. (?) marks players about whom I’m not certain they’d have kept out… Read more »

Richard Chester

12 years ago

Reply to John Autin

Why is Chico Carrasquel on that list? His uncle Alex already had 8 years in the ML prior to his (Chico’s) arrival.

Author

John Autin

12 years ago

Reply to John Autin

Richard, I’m sure I whiffed on a few. I was just looking at the picture and birthplace. You have to admit there weren’t many players from Venezuela before integration — in fact, Alex Carrasquel was the first, and the only Venezuelan with more than 5 games before Chico.

I simply didn’t know about Alex.

kds

12 years ago

If you go to a players “value” section where the WAR computations are shown and highlight a part of the players career, you will get not only the totals of each column for that period, but also averages per year and per 162 games. So you can just look at Rrep per 162 games played. That is what I did for Mays/Mantle. For pitchers it is a bit more difficult since there isn’t a Rrep column. But you do have RAA and RAR, so the difference should be Rrep. (Have to be a little careful since different pitchers can have… Read more »

kds

12 years ago

Now that I’ve done this work, I reread the explanations at B-ref of how they figure WAR. In the section on position players, down near the bottom, there is a table showing the # of replacement runs for a full time player for each league in each year, 1871-2012. Interestingly, the NL has a substantial lead in 1946, the year before the start of integration. The advantage for the NL continues to 1969.

Richard Chester

12 years ago

Reply to kds

See my post #72 under the “Talk About Athletics and Tigers ALDS Game 1” blog.