MLB Model Analysis

Grammer BaseRuns + PythagenPat + xFIP · Calibration Report

2024 Season 558 bets
2025 Season 443 bets
Combined 1,001 bets
Overall Win Rate
50.8%
509 W / 492 L
Total P&L
+$10,405
+10.39% ROI
Avg Market Odds
2.32
Decimal format
Model vs Actual Gap
–5.9%
Model overpredicts
Underdog Weakness
–13.3%
Odds 2.7+ range
Model Probability vs. Actual Win Rate
Each circle represents a bucket of bets grouped by the model's predicted win probability. The closer the circles are to the green diagonal line, the better calibrated the model is.
Model pred.
Market pred.
Perfect
Regression
Performance by Odds Range
Mkt Odds Bets Model Actual Gap
1.4–1.7 24 78.3% 70.8% –7.5%
1.7–2.0 332 65.9% 61.7% –4.2%
2.0–2.3 232 57.1% 52.2% –4.9%
2.3–2.7 227 50.7% 48.0% –2.7%
2.7+ 200 41.8% 28.5% –13.3%
Gap = Actual – Model Predicted. Negative = model overestimates.
Signal Type Comparison
VALUE
50.1%
n = 667
Avg Edge: +20.5%
STRONG
50.3%
n = 348
Avg Edge: +26.3%
STRONG signals show a larger edge over the market but win at nearly the same rate as VALUE bets — the extra edge doesn't translate to extra wins.
📊 Key Findings
📈 Linear Regression Summary
A regression was fit between the model's predicted win probability (x) and whether the bet actually won (y = 1) or lost (y = 0).
Actual Win% ≈ 0.034 + 0.833 × Model Prob
A perfect model would give slope = 1.0 and intercept = 0. A slope of 0.83 means the model is slightly overconfident — when it says 70% chance, the real odds are closer to 62%. The R² of 0.04 is expected for game-level data (single games are noisy), but the bucket-level calibration is more meaningful.
Slope
0.83
Intercept
0.034
0.043
P-value
<0.001
⚠️
Primary Weakness: Big Underdogs (Odds 2.7+)
200 bets were placed on teams with market odds above 2.7. The model predicted a 41.8% win rate — but the actual result was only 28.5%. This means the model is severely overvaluing underdogs in this range. Consider filtering out or reducing bet size on any picks with market odds above 2.7, or recalibrating the xFIP/BaseRuns weighting for those matchups.