As quickly as one story in baseball ends, one other begins. And so, with the 2025 season dissipating into silence because the champions hoist the World Collection trophy, its remnants seed the following section of the game’s existence out from the quantum foam. The 4 months between now and Opening Day really feel like an interminable hole, however now we have the Scorching Range League to maintain the MLB baseballmatic universe rolling. Which means, as has been the case for almost 1 / 4 of a century now, it’s time for me to start out rolling out the ZiPS projections for subsequent season.
For these new to my projections, ZiPS is a pc projection system I initially developed in 2002–04. It formally went reside for the general public in 2005, after it had reached a stage of non-craptitude I used to be content material with. The origins of ZiPS are much like Tom Tango’s Marcel the Monkey, popping out of discussions I had within the late Nineteen Nineties with Chris Dial, one among my finest mates (our first interplay concerned Chris calling me an expletive!) and a fellow stat nerd. ZiPS rapidly developed from its authentic iteration as a fairly easy projection system, and it now each does much more and makes use of much more knowledge than I ever envisioned it could 20 years in the past. At its core, nonetheless, it’s nonetheless doing two main duties: estimating what the baseline expectation for a participant is in the mean time I hit the button, after which estimating the place that participant could also be going utilizing massive cohorts of comparatively related gamers.
So why is ZiPS named ZiPS? On the time, Voros McCracken’s theories on the interplay of pitching, protection, and balls in play have been pretty new, and since I wished to combine a few of his findings, I made a decision (together with his blessing) that the identify of my system would rhyme with DIPS (defense-independent pitching statistics). I didn’t like SIPS, so I went with the following letter in my final identify. I initially named my work ZiPs as a nod to CHiPs, one among my favourite reveals to look at as a child, however I mis-typed ZiPs as ZiPS after I launched the projections publicly, and since my now-colleague Jay Jaffe had already reported on ZiPS for his Futility Infielder weblog, I selected to simply go together with it. I by no means anticipated that every one of this could be helpful to anybody however me; if I had, I might certainly have named it in much less weird style.
ZiPS makes use of multiyear statistics, with more moderen seasons weighted extra closely; at first, all of the statistics acquired the identical yearly weighting, however finally, this turned extra various primarily based on further analysis. And analysis is a giant a part of ZiPS. Yearly, I run tons of of research on numerous points of the system to find out their predictive worth and higher calibrate the participant baselines. What began with the info obtainable in 2002 has expanded significantly. Primary hit, velocity, and pitch knowledge started taking part in a bigger function beginning in 2013, whereas knowledge derived from Statcast has been included lately as I’ve gotten a deal with on its predictive worth and the influence of these numbers on present fashions. I imagine in cautious, conservative design, so knowledge are solely included as soon as I’ve confidence of their improved accuracy, which means there are at all times builds of ZiPS which are nonetheless a few years away. Extra inside ZiPS instruments like zBABIP, zHR, zBB, and zSO are used to raised set up baseline expectations for gamers. These stats work equally to the varied flavors of “x” stats, with the “z” standing for one thing I’d wager you’ve already guessed.
How does ZiPS challenge future manufacturing? First, utilizing each current taking part in knowledge with changes for zStats, and different elements corresponding to park, league, and high quality of competitors, ZiPS establishes a baseline estimate for each participant being projected. To get an concept of the place the participant goes, the system compares that baseline to the baselines of all different gamers in its database, additionally calculated from the most effective knowledge obtainable for the participant within the context of their time. The present ZiPS database consists of about 152,000 baselines for pitchers and about 185,000 for hitters. For hitters, exterior of realizing the place performed, that is offense solely; how good a participant is defensively doesn’t yield data on how a participant will age on the plate.
Utilizing an entire lot of stats, plus data on the form of a participant’s manufacturing and different traits, ZiPS then finds a big cohort that’s most much like the participant. I exploit Mahalanobis distance extensively for this. A couple of years in the past, Brandon G. Nguyen did an exquisite job broadly demonstrating how I do that whereas he was a pc science/math pupil at Texas A&M, although the variables used aren’t an identical.
For example, listed here are the highest 50 near-age comparables for potential American League MVP Cal Raleigh proper now, a very tough participant to comp. The entire cohort is way bigger than this, however 50 should be sufficient to present you an concept:
Prime 50 ZiPS Offensive Comps for Cal Raleigh
With comp candidates within the tons of of 1000’s, not the billions, you’re by no means blessed with good comps. ZiPS would like to discover a multitude of switch-hitting catchers of their late 20s with critical Three True Final result recreation and butt-themed nicknames (OK, the final bit isn’t within the database), nevertheless it gained’t, so it tries to assemble a gaggle that’s at the least Cal Raleigh-ish. From testing, I do know that ZiPS works considerably higher over the lengthy haul when the Cal Raleighs are in comparison with Cal Raleighs, not Francisco Lindors or Juan Pierres or Matt Raleighs. The precise combine algorithm to assemble comps was decided by in depth testing, mainly by having a pc working ZiPS 24-7 for a few 12 months. The massive group of comparable gamers is then used to calculate an ensemble mannequin on the fly for a participant’s future profession prospects, each good and dangerous.
One of many tenets of projections that I observe is that it doesn’t matter what the ZiPS projection says, that’s what the projection is. Even when inserting my opinion would enhance a particular projection, I’m philosophically against doing so. ZiPS is most helpful when individuals know that it’s purely data-based, not some unknown combine of information and my opinion. Over time, I wish to suppose I’ve taken a intelligent strategy to turning extra issues into knowledge — for instance, ZiPS’ use of fundamental damage data — however some stuff simply isn’t within the mannequin. ZiPS doesn’t know if a pitcher wasn’t allowed to throw his slider getting back from damage, or if a left fielder suffered a household tragedy in July. These types of issues are exterior a projection system’s purview, regardless that they’ll have an effect on on-field efficiency. ZiPS isn’t mathemagical, and anybody utilizing a great tool ought to know its limitations and apply their very own judgment to the query at hand.
It’s additionally essential to keep in mind that the bottom-line projection is, in layman’s phrases, solely a midpoint. You don’t count on each participant to hit that midpoint; 10% of gamers are “supposed” to fail to satisfy their Tenth-percentile projection and 10% of gamers are equally “supposed” to move their Ninetieth-percentile forecast. This level can create a shocking quantity of confusion. ZiPS gave a .300 batting common projection to only one participant in 2025: Luis Arraez. However that’s not the identical factor as ZiPS considering there would solely be a single .300 hitter. On common, ZiPS thought there could be 15 hitters with at the least 100 plate appearances to eclipse .300, not one. Ultimately, there have been 13.
One other essential factor to keep in mind is that the essential ZiPS projections aren’t playing-time predictors; by design, ZiPS has no concept who will really play within the majors in 2026. Contemplating this, ZiPS solely makes its projections for the way gamers would carry out in full-time main league roles. Having ZiPS inform me how somebody would hit as a full-time participant within the large leagues is a much more attention-grabbing use of a projection system than if it have been to inform me how that very same individual would carry out as a part-time participant or a minor leaguer. For the depth charts that go reside in each article, I exploit the FanGraphs Depth Charts to find out the taking part in time for particular person gamers. Since we’re speaking about crew development, I can’t go away ZiPS to its personal units for an utility like this. It’s the identical purpose I exploit modified depth charts for crew projections in-season. There’s a probabilistic aspect within the ZiPS depth charts: Generally Joe Schmo will play a full season, generally he’ll miss taking part in time and Buck Schmuck should step in. However the fundamental idea could be very easy.
There are not any main updates this 12 months on the extent of 2025, after I formally began utilizing spring coaching knowledge (with much less weight than common season knowledge) for a ultimate ZiPS run proper earlier than the season. There are, nonetheless, the same old calibration changes and quality-of-life updates that make ZiPS less complicated to run, whereas offering extra methods to have a look at projection knowledge. I’ll be doing a little KBO/NPB items after the 30 crew articles, so there’s somewhat bonus in there. And there are at the least just a few new issues within the mannequin, corresponding to a mannequin for what number of video games a catcher is more likely to play at 1B/DH so as to get a extra correct WAR projection than simply assuming all their video games are performed as a catcher.
Have any questions, recommendations, or issues about ZiPS? I’ll attempt to reply to as many as I moderately can within the feedback beneath. Additionally, if the projections have been invaluable to you now or up to now, I might urge you to contemplate changing into a FanGraphs Member, ought to you could have the power to take action. It’s along with your continued and far appreciated assist that I’ve been in a position to preserve a lot of this work obtainable to the general public for therefore a few years. Bettering and sustaining ZiPS is a time-intensive endeavor, and reader assist permits me the flexibleness to place an obscene variety of hours into its growth. It’s onerous to imagine I’ve been growing ZiPS for almost half my life now! Hopefully, the projections and the issues we’ve realized about baseball have offered you with a return in your funding, or at the least a small measure of leisure, whether or not it’s from being delighted or enraged.


















