Tuesday, January 26, 2010

Does Robinson Cano's Approach Change With Men on Base?

I figured I'd re-post this at this site in case.

Despite a career MLB line of .306/.339/.480, and despite having some very good overall years, Robinson Cano can really frustrate Yankee fans. In addition to what seems likes maddening inconsistency in general, Cano has hit worse with runners on base in almost every season of his career so far, with 2007 being the lone exception.

Here's how Cano's splits in this category have looked so far in his career.

Year Split G PA AB H 2B 3B HR RBI SB CS BB SO BA OBP SLG BAbip wOBA
2005 --- 129 300 293 95 24 3 7 7 0 0 6 39 .324 .340 .498 .356 .375
2005 Men On 118 251 229 60 10 1 7 55 1 3 10 29 .262 .295 .406 .270 .306
2006 --- 113 241 231 86 25 0 8 8 0 0 9 23 .372 .398 .584 .390 .424
2006 Men On 108 267 251 79 16 1 7 70 5 2 9 31 .315 .335 .470 .330 .350
2007 --- 154 331 309 92 20 3 11 11 0 0 19 42 .298 .344 .489 .316 .365
2007 Men On 144 338 308 97 21 4 8 86 4 5 20 43 .315 .362 .487 .341 .371
2008 --- 151 347 327 95 20 1 12 12 0 0 16 32 .291 .331 .468 .293 .351
2008 Men On 142 287 270 67 15 2 2 60 2 4 10 33 .248 .273 .341 .271 .277
2009 --- 150 361 343 129 32 0 16 16 0 0 17 33 .376 .407 .609 .384 .446
2009 Men On 142 313 294 75 16 2 9 69 0 0 13 30 .255 .288 .415 .255 .307
All --- 697 1580 1503 497 121 7 54 54 0 0 67 169 .331 .363 .528 .346 .391
All Men On 654 1456 1352 378 78 10 33 340 12 14 62 166 .280 .312 .425 .294 .324


Babip: Batting average on balls in play


wOBA: Weighted on-base average



FWIW, the AL has typically hit better with men on base than without, at least looking over the last few seasons. The difference isn't huge, but it's generally been in the 5% range.



The difference in wOBA between Cano with the bases empty (---) vs. Cano with men on base is .068. Even if you were to completely ignore the fact that weights of wOBA are different with runners on base vs. the bases empty (in other words, positive offensive events are worth more when runners on base and less when the bases are empty), the difference between those two wOBAs over around 1500 PAs is close to 90 runs, roughly 40 runs over a 650 PA season.



When looking at Cano's actual value to the team, this is a real and persistent problem that has made him less valuable then what a context-neutral metric would have said. The question I want to look at here is if there is some reason to think there is more going on here than the vagaries of batted balls and sample size when breaking down a player's performance into subsets that fit into neat little buckets.



The sample size thing is important here. While it may feel like 1500 PAs in both splits is significant, it's still not quite enough to start thinking we're seeing definitive proof. We generally need at least 2000 PAs in a split, but even then we have to regress them somewhat, depending on the split and the player and factoring in the fact that by the time a player has accrued those 2000 PAs, he may be a different player than he was when he accrued the first n of them. So keep that in mind when looking at the numbers that follow.



Generally, we think of luck in the batter's box in terms of BABIP (batting average on balls in play). While it's more nuanced than that, we can see that Cano has a BABIP of .346 with the bases empty and a BABIP of .294 with men on base. However, if we use Pitch F/X data to try and break down his performance by batted ball type, it would look like this.




Flyout Groundout Lineout Popout
--- 409 755 115 134
Men On 347 632 110 92


Pitch F/X data is only from 2007 on, and isn't complete for those years, but it's pretty close. Unfortunately, Pitch F/X does not break out hits by batted ball type. However, we can try to extrapolate the total batted balls by dividing the outs for each type by the average percentage of outs when each type is hit, which are:

Fly balls are outs 79% of the time.
Ground balls are outs 72% of the time.
Line drives are outs 26% of the time.
Pop ups are outs 99% of the time unless Luis Castillo is under it.

Using those figures, we'd get a revised batted ball distribution like this.

Split FB % GB % LD % PU %
--- 518 24% 1049 49% 442 21% 135 6%
Men On 439 24% 878 48% 423 23% 93 5%


Extrapolating batted ball types in this way introduces some uncertainty into this, although I suppose you could say that there's also uncertainty in the classifications of batted ball types on the margins. Anyway, keep in mind the fact that although this data is presented empirically, there's some fuzziness in here.

You can probably already tell this by looking at the numbers, but that type of batted ball distribution is pretty similar in both cases, and does not support a BABIP difference of .052.

Delving a little further into Pitch F/X, we can look at what Cano does in the batter's box depending on whether there are men on base or not to see if his actual approach is changing.

Update: Charts below have been updated to include missing columns.

Split Pitch # max min avg ball % stkS% foul% stkC% In play, out(s)% In play, no out % HBP %
--- All Fastballs 1710 99.8 79.2 91.3 36.1% 3.6% 21.1% 17.3% 13.5% 7.5% 0.1%
Men On All Fastballs 1382 99.4 78.7 91.6 34.2% 3.8% 24.2% 16.4% 14.3% 3.5% 0.3%
--- Change-up 334 91.2 72.7 82.5 30.2% 7.2% 17.1% 10.5% 22.2% 10.5% 0.3%
Men On Change-up 237 90.4 74.9 82.8 40.5% 6.8% 13.1% 8.0% 26.2% 4.2% 0.0%
--- Curveball 294 85.7 60.5 77.3 29.6% 8.8% 18.4% 15.0% 18.7% 8.2% 0.0%
Men On Curveball 241 88.6 66.8 77.0 31.5% 14.1% 18.7% 13.7% 17.8% 2.1% 0.4%
--- Cut fastball 64 93.3 74.2 86.5 28.1% 7.8% 25.0% 17.2% 18.8% 3.1% 0.0%
Men On Cut fastball 61 92.5 78.0 87.2 23.0% 6.6% 16.4% 16.4% 32.8% 3.3% 0.0%
--- Knuckleball 15 74.1 60.2 67.9 33.3% 0.0% 13.3% 20.0% 13.3% 13.3% 0.0%
Men On Knuckleball 24 73.1 58.6 66.7 41.7% 4.2% 0.0% 25.0% 20.8% 4.2% 0.0%
--- Sinker 36 97.8 81.8 90.0 38.9% 2.8% 8.3% 19.4% 22.2% 8.3% 0.0%
Men On Sinker 28 96.5 80.9 89.7 32.1% 3.6% 25.0% 7.1% 25.0% 3.6% 0.0%
--- Slider 322 90.4 72.0 83.0 29.8% 9.9% 22.0% 9.6% 19.6% 6.5% 0.9%
Men On Slider 310 90.7 68.3 83.1 38.7% 10.6% 21.0% 8.4% 18.1% 3.2% 0.6%
--- Split-finger fastball 10 89.2 82.1 85.1 30.0% 10.0% 40.0% 0.0% 20.0% 0.0% 0.0%
Men On Split-finger fastball 3 87.6 84.0 85.3 66.7% 0.0% 0.0% 0.0% 33.3% 0.0% 0.0%
--- Total 2785 99.8 60.2 83.9 33.8% 5.4% 20.4% 15.3% 16.0% 7.8% 0.2%
Men On Total 2286 99.4 58.6 83.9 35.0% 6.2% 21.6% 14.1% 17.1% 3.4% 0.3%


#: number of times pitch was thrown as recorded in Pitch F/X
max: highest recorded starting velocity
min: lowest recorded starting velocity
avg: average recorded starting velocity
ball %: percentage of time pitch was taken for a ball
stkS%: percentage of time pitch was swung on and missed
foul%: percentage of time pitch was fouled off
stkC%: percentage of time pitch was taken for a called strike

Here's the pie chart version of the last two rows.



In general, it looks like he's a little more likely to swing at pitches with runners on. He takes pitches 48.9% of the time with no one on base, and 43.6% of the time when there are runners on, although that could be due to the fact that he's more likely to see a strike when there's a runner on base. But I don't know if a difference of 5% here is necessarily all that meaningful.(Note: After revising the data to include the missing outcomes, this is no longer true. Cano takes a pitch 49.1% of the time when there are either runners on or not)


Honestly, I expected to see more of a split here in the underlying data, but it's just not there.

Cano's results to this point with runners on base are markedly worse than his results with the bases empty, but it's not because of any obvious change in his approach in the two scenarios, unless I'm missing something here or not considering something that I should be.

I guess this is encouraging, because it means we really shouldn't have any reason to think that Cano will continue to hit as poorly with men on base as he has so far.

Another update: As suggested by sam, here's a look at the pitch locations against Cano with men on vs. with the bases empty. I don't think it shows much if anything, maybe that he gets more pitches outside with runners on, but here it is anyway.





No comments: