Click on the title links to download the data. Please contact me before the class in which data will be used if you have any problems downloading!
cleaning
Folder containing:
oil_production.csv
Data on oil production by world region from 2000 to 2014, in thousands of barrels per day, from the U.S. Energy Information Administration.
ucb_stanford_2014.csv
Data on federal government grants to UC Berkeley and Stanford University in 2014, downloaded from USASpending.gov.
scraping
Folder containing:
urls.xls
Spreadsheet with a list of URLs, for use in the web scraping exercise.
geocoding
Folder containing:
sf_test_addresses.tsv
Text file containing a list of 100 addresses in San Francisco.
sf_addresses_short.tsv
The first 10 addresses from the previous file.
refine-geocoder.json
A script in JSON format that we will use to automate geocoding.
pfizer
Folder containing:
pfizer.sqlite
SQLite database detailing payments by the drug company Pfizer to doctors.
pfizer.txt
Tab-delimited text file containing the same data.
fda.txt
Tab-delimited text file documenting warning letters from the Food and Drug Administration to doctors involved in research on experimental drugs or medical devices. Data used in reporting this news story.
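If you want to check that a SQLite file such as pfizer.sqlite downloaded correctly, one option is to list its tables and count their rows with Python's built-in sqlite3 module. This is only a minimal sketch: the helper names `list_tables` and `row_count` are made up for illustration, and the demo runs against a small in-memory stand-in (the table `payments_demo` and its rows are invented), because the actual table names in pfizer.sqlite are not listed here.

```python
import sqlite3


def list_tables(conn):
    """Return the names of all tables in an open SQLite connection."""
    rows = conn.execute(
        "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name"
    ).fetchall()
    return [name for (name,) in rows]


def row_count(conn, table):
    """Count the rows in one table (the name is quoted before interpolation)."""
    quoted = '"' + table.replace('"', '""') + '"'
    return conn.execute(f"SELECT COUNT(*) FROM {quoted}").fetchone()[0]


# Stand-in database with invented contents, used only to show the calls.
# For the real file, connect with sqlite3.connect("pfizer.sqlite") instead,
# assuming it has been saved to the working directory.
demo = sqlite3.connect(":memory:")
demo.execute("CREATE TABLE payments_demo (doctor TEXT, amount REAL)")
demo.executemany(
    "INSERT INTO payments_demo VALUES (?, ?)",
    [("Doctor A", 100.0), ("Doctor B", 250.0)],
)

for table in list_tables(demo):
    print(table, row_count(demo, table))
```

An empty list from `list_tables` on the real file would suggest the download is incomplete or the path is wrong, since sqlite3 silently creates a new empty database when the file does not exist.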
nations
Folder containing:
nations.csv
Data from the World Bank Indicators portal, which is an incredibly rich resource. Contains the following fields:
iso_a3
Three-letter code for each country, assigned by the International Organization for Standardization.
country
Country name.
year
region
income_group
World Bank regions and current income groups, explained here.
population
Estimated total population at mid-year, including all residents apart from refugees.
gdp_percap
Gross Domestic Product per capita in current international dollars, corrected for purchasing power in different territories.
life_expect
Life expectancy in years for a child born in the year in question, if prevailing patterns were to stay the same throughout its life.
birth_rate
Number of live births during the year per 1,000 people, based on mid-year population estimate.
neonat_mortal
Neonatal mortality rate: number of babies dying before reaching 28 days of age, per 1,000 live births in a given year.
mapping
Folder containing:
oregon_dams_csv
Data on dams in Oregon, from the state’s dams inventory.
oregon_seismic_risk
Data on seismic risk for the state, calculated by the U.S. Geological Survey.
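A quick way to confirm that any of the comma- or tab-delimited files listed above downloaded intact is to read the header row and count the records. The sketch below uses only Python's standard csv module; the helper name `summarize_delimited` is hypothetical, and the sample rows are invented placeholders that borrow three field names from nations.csv so the example runs without the real file.

```python
import csv
import io


def summarize_delimited(handle, delimiter=","):
    """Return (field_names, row_count) for an open delimited text file."""
    reader = csv.DictReader(handle, delimiter=delimiter)
    count = sum(1 for _ in reader)  # consumes the data rows
    return reader.fieldnames, count


# Invented sample standing in for a downloaded file (tab-delimited here).
sample = io.StringIO(
    "iso_a3\tcountry\tyear\n"
    "AAA\tExampleland\t2000\n"
    "AAA\tExampleland\t2001\n"
)
fields, n_rows = summarize_delimited(sample, delimiter="\t")
print(fields, n_rows)

# With a real download, assuming the file sits in the working directory:
# with open("nations.csv", newline="", encoding="utf-8") as f:
#     print(summarize_delimited(f))
```

Pass `delimiter="\t"` for the .tsv and tab-delimited .txt files, and leave the default for the .csv files. The file encoding is assumed to be UTF-8; adjust the `encoding` argument if a file fails to open.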