Tools in Data Science - May 2026#

This is the course readme for the May 2026 term of Tools in Data Science .

Tools in Data Science is a practical course about building, deploying, evaluating, and communicating data science work. The course focuses on working tools and reproducible workflows rather than only concepts. Students should expect to use command-line tools, Python, GitHub, APIs, LLMs, data processing tools, and visualization workflows throughout the term.

Teaching Team#

RoleNameNotes
Course instructorTBDAdd the confirmed instructor details before the term begins.
Teaching assistantsTBDAdd TA names, contact routes, and support responsibilities here.
Course supportTBDAdd forum, helpdesk, or escalation details here.

Learning Outcomes#

By the end of the course, students should be able to:

  • set up and maintain a reproducible development environment for data science work
  • use command-line tools, Git, GitHub, and structured text formats confidently
  • deploy data science outputs as websites, APIs, scheduled jobs, or lightweight services
  • use AI coding tools with review, testing, and reproducibility discipline
  • build LLM-based workflows using prompting, structured outputs, retrieval, and evaluation
  • source data from APIs, websites, documents, and public datasets responsibly
  • clean, transform, validate, and analyze data using appropriate tools
  • communicate results through charts, dashboards, narratives, and presentations

Course Structure#

The May 2026 course content is organized into eight weeks. The sidebar keeps each week collapsed by default; expand a week to see the topics covered.

WeekFocus
1Development Tools
2Deployment Tools
3AI Coding
4Large Language Models
5Data Sourcing
6Data Preparation
7Data Analysis
8Data Visualization

Assessments#

Repository Guide#

All new May 2026 course content should live inside 2026-05/. Each week and project is a folder with its own README.md plus lesson/topic pages.

2026-05/
├── README.md
├── _sidebar.md
├── assets/
├── week1/ … week8/   # README.md + lesson/topic pages
├── project-1/        # README.md, project-description.md, project-requirements.md
├── project-2/        # README.md, project-description.md, project-requirements.md
├── roe/              # README.md
└── resources/        # system requirements, course links, live sessions, assessments

Older terms remain in their term folders as archives. They should not be mixed into the current sidebar.