Click on the title links to download the data. Please contact me before the class in which data will be used if you have any problems downloading!
cleaning Folder containing:
oil_production.csv Data on oil production by world region from 2000 to 2014, in thousands of barrels per day, from the U.S. Energy Information Administration.
ucb_stanford_2014.csv Data on federal government grants to UC Berkeley and Stanford University in 2014, downloaded from USASpending.gov.
scraping Folder containing:
urls.xls Spreadsheet with a list of urls, for use in web scraping exercise.geocoding Folder containing:
sf_test_addresses.tsv Text file containing a list of 100 addresses in San Francisco.sf_addresses_short.tsv The first 10 addresses from the previous file.refine-geocoder.json A script in JSON format that we will use to automate geocoding.pfizer Folder containing:
pfizer.sqlite SQLite database detailing payments by the drug company Pfizer to doctorspfizer.txt Tab-delimited text file containing the same datafda.txt Tab-delimited text file documenting warning letters from the Food and Drug Administration to doctors involved in research on experimental drugs or medical devices.Data used in reporting this news story.
nations Folder containing:
nations.csv Data from the World Bank Indicators portal, which is an incredibly rich resource. Contains the following fields:iso_a3 Three-letter code for each country, assigned by the International Organization for Standardization.country Country name.yearregion income_group World Bank regions and current income groups, explained here.population Estimated total population at mid-year, including all residents apart from refugees.gdp_percap Gross Domestic Product per capita in current international dollars, corrected for purchasing power in different territories.life_expect Life expectancy in years for a child born in the year in question, if prevailing patterns were to stay the same throughout its life.population Estimated total population at mid-year, including all residents apart from refugees.birth_rate Number of live births during the year per 1,000 people, based on mid-year population estimate.neonat_mortal Neonatal mortality rate: number of babies dying before reaching 28 days of age, per 1,000 live births in a given year.mapping Folder containing:
oregon_dams_csv Data on dams in Oregon, from the state’s dams inventory.oregon_seismic_risk Data om seismic risk for the state, calculated by the U.S. Geological Survey.