This directory contains the data sets used in THE BASIC PRACTICE OF
STATISTICS, by David S. Moore. All data sets with 10 or more entries
appear in the directory. The directory also includes some data sets for
which only a graph or output from statistical software appears in the
text. These extras are:

   Exercise 1.33 (Calories and sodium in hot dogs, by type.)
   Exercise 1.70 (Monthly returns on Wal-Mart common stock for 228
                  consecutive months.)
   Figures 4.1 and 4.2 (Sample counts for SRSs of sizes 100 and 2500
                  from a population with p = 0.6.)

The data used in examples and sample examinations in the Instructor's
Guide are also included.

There is a separate file for each data set. The files are named as
follows:

   tab1-1.dat    Table 1.1
   ex1-10.dat    Exercise 1.10
   em1-7.dat     Example 1.7
   guide1.dat    Instructor's Guide data set 1

Some data sets are used in multiple exercises or examples. In that
case, they are named for the first place they occur in a chapter. If
the same data appear in several chapters, they also appear several
times on this disk to avoid the need for back references.

To allow reading by any software, all data files are in plain text
(ASCII) form, with only numerical entries. This requires that character
entries be coded as numbers. The coding is usually obvious from
comparing the data file with the data table in the text, but the notes
below make the coding explicit. Each variable occupies a column, with
columns separated by spaces.

I've tried to transfer the data carefully, but life is uncertain. If
you find errors on the data disk, please let me know:

   David S. Moore
   Department of Statistics
   Purdue University
   West Lafayette, IN 47907-1399
   email: dsm@stat.purdue.edu


NOTES

CHAPTER 1

tab1-2.dat
   column 1: State (the 50 states plus DC), numbered 1 to 51 in
             alphabetical order
   column 2: Region
             1 = ENC (East North Central)
             2 = ESC (East South Central)
             3 = MA  (Middle Atlantic)
             4 = MTN (Mountain)
             5 = NE  (Northeast)
             6 = PAC (Pacific)
             7 = SA  (South Atlantic)
             8 = WNC (West North Central)
             9 = WSC (West South Central)

ex1-33.dat
   column 1: 1 = beef, 2 = meat, 3 = poultry
   column 2: Calories per hot dog
   column 3: Sodium per hot dog

ex1-66.dat
   column 1: 1 = Control group, 2 = Experimental group
   column 2: Response (weight gain)

ex1-67.dat
   column 1: 1 = DiMaggio, 2 = Mantle
   column 2: Response (home runs)

ex1-70.dat
   Data in this file are rounded to 2 decimal places; the computer
   output in Exercise 1.70 is for the unrounded data.

CHAPTER 2

tab2-4.dat
   The 3 sets of Anscombe data appear in the column order
   x1 y1 y2 x2 y3. That is, the first two sets are (x1,y1) and (x1,y2)
   with common x-values, and the 3rd set is (x2,y3). There are n = 10
   observations in each set.

tab2-7.dat
   column 1: 1 = midsize car, 2 = large car
   column 2: EPA city mileage
   column 3: EPA highway mileage

em2-7.dat
   column 1: 1 = before (same as tab2-2.dat), 2 = after
   column 2: Degree-days per day
   column 3: 100s of cubic feet of natural gas per day

ex2-7.dat
   column 1: 1 = Female, 2 = Male
   column 2: Lean body mass (kg)
   column 3: Metabolic rate (kilocalories/24 hours)

ex2-15.dat
   column 1: 1 = yellow, 2 = white, 3 = green, 4 = blue
   column 2: Cereal leaf beetles trapped on this board

ex2-81.dat
   column 1: Age in years
   column 2: Incubation period in hours
   column 3: 1 = Survived, 0 = Died

ex2-82.dat
   column 1: Nematode count, thousands
   column 2: Response (growth in centimeters)

CHAPTER 4

fig4-1.dat
   These are the sample COUNTS behind the sample proportions whose
   distribution appears in Figure 4.1. They come from simulating the
   choosing of 1000 SRSs of size n = 100 from an infinite population
   with p = 0.6. (That is, these counts follow the binomial
   distribution with n = 100 and p = 0.6.)

fig4-2.dat
   These are the 1000 sample COUNTS behind the sample proportions
   graphed in Figure 4.2. They are counts of successes from SRSs of
   size n = 2500 from an infinite population with p = 0.6.

CHAPTER 6

tab6-2.dat
   The "NA" for unavailable entries has been coded as -999.

em6-7.dat
   column 1: 1 = Group 1 (calcium), 2 = Group 2 (control)
   column 2: Decrease in blood pressure after treatment

em6-11.dat
   column 1: 1 = Poisoned rats, 2 = Control group
   column 2: Response

ex6-35.dat
   column 1: 1 = Control group, 2 = Experimental group
   column 2: Weight gain (grams)

ex6-36.dat
   column 1: 1 = Female, 2 = Male
   column 2: SSHA score

ex6-45.dat
   column 1: 1 = Treatment group, 2 = Control group
   column 2: Response (DRP score)

ex6-63.dat
   column 1: 1 = Nitrite, 2 = Control
   column 2: Rate of amino acid uptake

CHAPTER 9

tab9-1.dat
   column 1: 1 = Compact, 2 = Midsize, 3 = Large
   column 2: EPA city mileage
   column 3: EPA highway mileage

tab9-2.dat
   column 1: 1 = Bream, 2 = Perch, 3 = Roach
   column 2: Weight of fish (grams)

em9-6.dat
   column 1: 1 = yellow, 2 = white, 3 = green, 4 = blue
   column 2: Cereal leaf beetles trapped on this board

ex9-16.dat
   column 1: Nematode count, thousands
   column 2: Response (growth in centimeters)
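
The file layout described in this README (plain text, numeric entries only, one variable per column, columns separated by spaces) can be read with a few lines of code. The following is a minimal Python sketch, not part of the original disk. The function names read_dat, recode_missing, and decode_column are hypothetical, and the sample rows are made up for illustration; only the codings (1 = beef, 2 = meat, 3 = poultry for ex1-33.dat, and -999 for "NA" in tab6-2.dat) come from the notes.

```python
import io

MISSING = -999.0  # code the notes give for "NA" entries in tab6-2.dat


def read_dat(source):
    """Read whitespace-separated numeric columns into a list of rows."""
    rows = []
    for line in source:
        fields = line.split()
        if not fields:  # skip blank lines
            continue
        rows.append([float(field) for field in fields])
    return rows


def recode_missing(rows, missing=MISSING):
    """Replace the numeric missing-value code with None."""
    return [[None if value == missing else value for value in row]
            for row in rows]


def decode_column(rows, column, labels):
    """Map the numeric codes in a 0-based column to text labels."""
    return [labels[int(row[column])] for row in rows]


# Made-up rows in the three-column layout of ex1-33.dat (not real data).
hot_dog_sample = io.StringIO("1 150 400\n2 140 390\n3 100 380\n")
HOT_DOG_TYPES = {1: "beef", 2: "meat", 3: "poultry"}
hot_dog_rows = read_dat(hot_dog_sample)
hot_dog_labels = decode_column(hot_dog_rows, 0, HOT_DOG_TYPES)

# Made-up rows showing the -999 code used in tab6-2.dat (not real data).
na_sample = io.StringIO("1 12.5\n2 -999\n")
na_rows = recode_missing(read_dat(na_sample))

print(hot_dog_labels)
print(na_rows)
```

To read a real file, pass an open file object instead of the in-memory sample, for example read_dat(open("ex1-33.dat")). Apply recode_missing only to files whose notes mention the -999 code, since in other files -999 could be a legitimate value.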
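
The notes for fig4-1.dat and fig4-2.dat describe counts of successes in 1000 simulated SRSs from an infinite population with p = 0.6, which follow a binomial distribution. The sketch below shows one way such counts could be generated in Python. It is an illustration under stated assumptions, not the procedure used to produce the files: the function name simulate_counts and the seed value are hypothetical, and the output will not reproduce the numbers on the disk.

```python
import random


def simulate_counts(num_samples, n, p, seed=None):
    """Return num_samples binomial(n, p) counts.

    Each count is the number of successes among n independent draws,
    each succeeding with probability p.
    """
    rng = random.Random(seed)
    return [sum(rng.random() < p for _ in range(n))
            for _ in range(num_samples)]


# Same design as fig4-1.dat: 1000 samples of size n = 100 with p = 0.6.
counts = simulate_counts(1000, 100, 0.6, seed=1)

# Dividing each count by n gives the sample proportions that a figure
# such as Figure 4.1 would display.
proportions = [count / 100 for count in counts]
mean_proportion = sum(proportions) / len(proportions)

print(len(counts), min(counts), max(counts))
print(round(mean_proportion, 3))
```

Calling simulate_counts(1000, 2500, 0.6) gives the design described for fig4-2.dat. The sample proportions from the larger samples should cluster more tightly around 0.6.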