Week 5: Data Sourcing#

Status: Draft scaffold.

Use this week for finding, downloading, querying, and scraping data from public and private sources.

Topics#

  • HTTP requests and APIs
  • Browser DevTools
  • Web scraping
  • PDF and document extraction
  • Scheduled data collection
  • Responsible scraping practices

Learning Goals#

  • Identify reliable data sources for analysis.
  • Collect data through APIs, browser inspection, and scraping workflows.
  • Extract useful data from documents and semi-structured sources.
  • Schedule repeatable data collection jobs responsibly.

Content Checklist#

  • Final lesson sequence
  • Data source examples
  • Legal and operational cautions
  • Assessment link

Lessons#