NAME: Barry Bonds' 2002 Plate Appearances TYPE: Census SIZE: 588 observations, 8 variables DESCRIPTIVE ABSTRACT: Data are provided for Barry Bonds' plate appearances in the 2002 baseball season. Variables include characteristics of the innings before the first pitch to Bonds (e.g., the number of outs, the number of runners on each base, the opposing pitcher's earned run average) and after the first pitch to Bonds (e.g., the outcome of the appearance, how many runs scored in the inning after Bonds hits). SOURCES: The data were obtained from CBS Sportsline at http://www.cbs.sportsline.com. This site has pitch-by-pitch summaries of every baseball game in 2002. Pitchers' ERAs were obtained from ESPN at http://www.espn.com. These data were analyzed in Reiter, J. P. (2002) "Should teams walk or pitch to Barry Bonds?" By the Numbers: The Newsletter of the SABR Statistical Analysis Committee, 12 (November 2002), pp. 7-11. VARIABLE DESCRIPTIONS: Each plate appearance is on a single line of a text file with line breaks. Values are delimited by spaces. Note that there are different variables in the 2002 dataset than in bonds2001.dat. Columns Description 1 Equals one when there is a runner on first base when Bonds appears and equals zero otherwise. 4 Equals one when there is a runner on second base when Bonds appears and equals zero otherwise. 7 Equals one when there is a runner on third base when Bonds appears and equals zero otherwise. 10 Number of outs in inning when Bonds appears. 13 Equals zero if Bonds does not reach base. Equals one if Bonds reaches first base on a single or error. Equals two if Bonds reaches second base on a double or error. Equals three if Bonds reaches third base on a triple or error. Equals four if Bonds hits a home run. Equals five if Bonds walks or is hit by a pitch. 16 Number of runs scored by Giants in the inning after first pitch to Bonds. 19-22 Opposing pitchers' career earned run average as of the end of the 2001 season. 25-26 Initials of player batting immediately after Bonds: JK = Jeff Kent BS = Benito Santiago RS = Reggie Sanders RA = Rich Aurelia YT = Yorvit Torrealba DB = David Bell SD = Shawn Dunston RM = Ramon Martinez NA = missing 29 Player batting immediately after Bonds (previous column numerically coded): 0 = missing 1 = Jeff Kent 2 = Benito Santiago 3 = Reggie Sanders 4 = Rich Aurelia 5 = Yorvit Torrealba 6 = David Bell 7 = Shawn Dunston 8 = Ramon Martinez NOTES: There are a few games for which data were not available, due to invalid web links. These missing data should not bias analyses, since they are missing completely at random. The at bats are in order of appearance within the same game, but they are not in order of appearance by game date. For rookie pitchers, I used their 2002 earned run average. STORY BEHIND THE DATA: These data, along with data from the 2001 season, were used to analyze whether walking rather than pitching to Bonds reduces the chance that the Giants will score runs. PEDAGOGICAL NOTES: See the document bonds2001.txt for pedagogical notes. SUBMITTED BY: Jerome P. Reiter Institute of Statistics and Decision Sciences Duke University Box 90251 Durham, NC 27708 jerry@stat.duke.edu