DATASURFING ON THE WORLD WIDE WEB - Part 2
Robin H. Lock
Department of Mathematics, Computer Science, and Statistics
St. Lawrence University
Canton, NY 13617
rlock@stlawu.edu
Outline for a talk at the 2016 Joint Statistical Meetings
ABSTRACT: This is a continuation of a presentation from the last JSM in Chicago (1996).
At that time we looked at web sources for students and instructors to obtain real data
for use in projects and class examples. What’s changed in this regard over the past 20 years?
Where are some places to go now to get easy access to useful data?
What new challenges have emerged for obtaining data from the ever expanding web? .
CATEGORIES OF DATA SOURCES
Dataset Archives with Teaching Support
Pages of Data Links
Government Sources
R Packages
Several R packages with good data for teaching (requires R to get the data) include ...
Data from Visualizations
More Data for Countries
Survey/Study Repositories
Funs and Games
SPORTS:
GAMES:
Data Scraping
Several useful R packages:
Example: A shiny app to scrape IMDb ratings for all episodes of a chosen TV show
created by Ivan Ramler and Tenzin Choeyang.