Data Sources#

This page is a semi-curated list of dataset sources for use in assignments. The sections group datasets by the kind of assignment they suit.

Best for loading directly into a notebook#

  • Tidy Tuesday: inside the folder for each year, there is a README file with a list of the datasets. These are .csv files.

  • JSON Datasets: warning, some of these require API calls, which is not recommended until at least A4.

  • National Center for Education Statistics Digest 2019: these data tables are available for download as Excel files and are also visible on the page.

  • Lots of Wikipedia pages have tables in them.

  • Messy Artists: .csv file that needs to be cleaned, containing data on artists

  • Messy Wheels: .csv file that needs to be cleaned, containing data on various wheel attractions around the globe

  • Clean Artists: .csv file, already cleaned, containing data on artists

  • Clean Wheels: .csv file, already cleaned, containing data on various wheel attractions around the globe
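As a rough illustration of what loading and lightly cleaning these formats might look like, here is a minimal sketch that uses only the Python standard library. The sample text, the column names (`name`, `height_m`, `country`), and the helper functions `load_csv_rows` and `to_float_or_none` are all hypothetical and are not taken from any of the datasets listed above. It also assumes every row has the same number of fields as the header.

```python
import csv
import io
import json

# Hypothetical stand-in for a messy .csv file; not real data from this page.
MESSY_CSV = """name , height_m,country
 Wheel A ,120 ,Exampleland
Wheel B,,Otherland
"""


def load_csv_rows(text):
    """Parse CSV text into a list of dicts, stripping stray whitespace."""
    reader = csv.DictReader(io.StringIO(text))
    rows = []
    for raw in reader:
        # Strip whitespace from both the column names and the values.
        rows.append({key.strip(): value.strip() for key, value in raw.items()})
    return rows


def to_float_or_none(value):
    """Convert a numeric string to float; treat an empty string as missing."""
    return float(value) if value else None


rows = load_csv_rows(MESSY_CSV)
for row in rows:
    row["height_m"] = to_float_or_none(row["height_m"])

# Hypothetical JSON payload shaped as a list of records.
SAMPLE_JSON = '[{"name": "Wheel A", "year": 2001}, {"name": "Wheel B", "year": 1999}]'
records = json.loads(SAMPLE_JSON)

print(rows)
print(records)
```

With a downloaded file, the same reader can be pointed at `open(path, newline="")` instead of `io.StringIO(text)`. In a notebook, a dataframe library is a common alternative to this hand-rolled approach.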

General Sources#

These may require some more work before they are ready to use.

Datasets in many parts#

Datasets with time#

Databases#

If you have others, please share them by creating a pull request or issue on this repo (from the GitHub logo at the top right, choose suggest edit).