Tuesday, October 27, 2009
How well did the Community forecast playing time?
Here’s a quick study:
For those guys expected to get more than 580 PA:
n = 60 players
604 = expected PA
582 = actual PA
Between 481 and 580 PA:
n = 123 players
540 = expected PA
510 = actual PA
Between 381 and 480 PA:
n = 74 players
436 = expected PA
406 = actual PA
Between 281 and 380 PA:
n = 52 players
327 = expected PA
293 = actual PA
Between 181 and 280 PA:
n = 71 players
235 = expected PA
202 = actual PA
Between 81 and 180 PA:
n = 66 players
128 = expected PA
117 = actual PA
Between 1 and 80 PA:
n = 134 players
39 = expected PA
51 = actual PA
0 PA:
n = 104 players
0 = expected PA
90 = actual PA
As you can see, a consistent bias for any player with at least 180 PA, and for any player below 80 PA, the bias goes the other way (but not as bad).
If we make three groups of players:
at least 180 PA expected: expected an average of 444, actually got 414… 30 PA too much
between 1 and 179 PA expected: average expected of 68, actually 72… a match pretty much
0 PA expected: actually got 90 PA
***
Guys who actually got zero PA, but were expected to play in 2009 (by the Community):
player_id ACT_PA EXP_PA name_tx
458628 0 324 Mather, Joe
459943 0 319 Clement, Jeff
425774 0 169 McPherson, Dallas
132961 0 168 Jenkins, Geoff
430958 0 158 Rabelo, Mike
445615 0 109 Ramirez, Max
457850 0 107 Casto, Kory
440361 0 106 Hopper, Norris
453327 0 103 Antonelli, Matt
460131 0 101 Bogusevic, Brian
446382 0 101 Barton, Brian
(bunch under 100)
As you can see, Joe Mather and Jeff Clement are the two biggest surprises as to who did not play in MLB.
Guys who were not given any playing time forecast, but actually played in MLB:
player_id ACT_PA EXP_PA name_tx
325392 587 0 Podsednik, Scott
150456 587 0 Kennedy, Adam
458085 565 0 Coghlan, Chris
467827 491 0 Parra, Gerardo
434540 358 0 Jones, Garrett
440251 351 0 Roberts, Ryan
425556 337 0 Nix, Laynce
116662 331 0 Jones, Andruw
435408 306 0 Santos, Omir
132788 283 0 Millar, Kevin
425543 211 0 Wilson, Josh
(bunch under 200)
Almost none of these guys were listed on any one team’s 40-man roster by the time the survey ran. This just tells me that I need to run a bit later in Spring Training (I ran it two weeks before the season started).
The biggest surprises. These 23 guys got at least 200 PA more than expected:
player_id ACT_PA EXP_PA diff name_tx
448242 451 61 390 Gwynn, Tony
493596 430 42 388 Beckham, Gordon
472528 398 44 354 Valbuena, Luis
460099 411 62 349 Reimold, Nolan
346874 432 101 331 Uribe, Juan
430574 397 94 303 Maier, Mitch
450314 599 299 300 Zobrist, Ben
434636 376 78 298 Pagan, Angel
460579 533 261 272 Morgan, Nyjer
466320 540 268 272 Cabrera, Melky
451594 518 264 254 Fowler, Dexter
430948 634 386 248 Callaspo, Alberto
465784 438 195 243 Cabrera, Everth
457705 493 254 239 McCutchen, Andrew
407781 599 362 237 Byrd, Marlon
445988 503 270 233 Prado, Martin
425661 266 35 231 Paulino, Ronny
434704 388 159 229 Young, Delwyn
456422 678 451 227 Bourn, Michael
407797 309 88 221 Green, Nick
461416 324 108 216 Venable, Will
466988 509 296 213 Bonifacio, Emilio
452655 676 474 202 Span, Denard
And these 27 guys got at least 250 PA fewer than expected:
player_id ACT_PA EXP_PA diff name_tx
433582 110 571 -461 Jackson, Conor
449107 127 588 -461 Aviles, Mike
408314 166 625 -459 Reyes, Jose
476704 76 526 -450 Lowrie, Jed
113232 112 551 -439 Delgado, Carlos
294558 29 457 -428 Nady, Xavier
460086 189 606 -417 Gordon, Alex
425867 193 585 -392 Greene, Khalil
430001 162 546 -384 Weeks, Rickie
435623 54 417 -363 Frandsen, Kevin
424825 215 577 -362 Crisp, Coco
457727 199 557 -358 Maybin, Cameron
136267 32 389 -357 Glaus, Troy
435520 106 436 -330 Flores, Jesus
493127 260 588 -328 Iwamura, Akinori
458628 0 324 -324 Mather, Joe
459943 0 319 -319 Clement, Jeff
136767 31 344 -313 Chavez, Eric
459941 115 408 -293 Buck, Travis
451186 265 550 -285 Milledge, Lastings
459991 23 303 -280 Sanchez, Gaby
458210 256 531 -275 Casilla, Alexi
114789 254 528 -274 Giles, Brian
430965 202 473 -271 Snyder, Chris
435222 268 536 -268 Fields, Josh
114260 17 273 -256 Floyd, Cliff
346795 182 438 -256 Chavez, Endy


Would you consider that normal that the expected plate appearances were higher than the actual plate appearances until you got lower down on the chart? What would be some of the likely causes of this...? catastrophic injuries? bad estimates? players getting playing time that weren’t listed?
What is the standard variation on plate appearances (actual vs expected) for the complete sample? Or standard variation by each of your sub group if that’s more telling.
vr, Xei