(This article was first published on united states government survey data, and kindly contributed to R-bloggers)
the arf is fun to say out loud. it's also a single county-level data table with about 6,000 variables, produced by the united states health services and resources administration (hrsa). the file contains health information and statistics for over 3,000 us counties.like many government agencies, hrsa provides only a sas importation script and an ascii file. this new github repository contains two scripts:
2011-2012 arf - download.R
- download the zipped area resource file directly onto your local computer
- load the entire table into a temporary sql database
- save the condensed file as an R data file (.rda), comma-separated value file (.csv), and/or stata-readable file (.dta).
2011-2012 arf - analysis examples.R
- limit the arf to the variables necessary for your analysis
- sum up a few county-level statistics
- merge the arf onto other data sets, using both fips and ssa county codes
- create a sweet county-level map
for more detail about the area resource file (arf), visit:
notes:
the arf may not be a survey data set itself, but it's particularly useful to merge onto other survey data.
confidential to sas, spss, stata, and sudaan users: time to put down the abacus. time to transition to r. :D
To leave a comment for the author, please follow the link and comment on his blog: united states government survey data.
R-bloggers.com offers daily e-mail updates about R news and tutorials on topics such as: visualization (ggplot2, Boxplots, maps, animation), programming (RStudio, Sweave, LaTeX, SQL, Eclipse, git, hadoop, Web Scraping) statistics (regression, PCA, time series,ecdf, trading) and more...