CMSI 3520

Database Systems

Fall 2022


Note
This page is maintained as an archival record of the course shown above, and as such, some links on this page may no longer be valid nor accessible. They are kept here as a record of the resources that were available at the time of the course offering.
  • Brightspace: Where you can access private content and check your grades
  • We will use GitHub Classroom to manage and submit assignments
  • Every now and then we may use Socrative to ask or answer questions as a class—my Socrative room is DONDILMU
  • We continue to live in very fluid and dynamic circumstances—make sure to follow the university’s pandemic portal for the latest news, updates, and policies—to which this class must, of course, adhere

Assignments

Dataset Sources

It’s hard to appreciate the power and potential of general database management systems when you don’t have seed data to populate it—so we’ll want to do our work with a database that can be fully populated from the get-go.

Our case study dataset is the one from the Netflix Prize competition from over a decade (a decade!!!) ago: https://www.kaggle.com/netflix-inc/netflix-prize-data—find one to call your own from one of these sites, or feel free to identify one independently:

  • Kaggle datasets are generally positioned for data science or machine learning, but sometimes also applicable to pure database work
  • The Awesome Public Datasets collection is similar in purpose and applicability
  • For the health-/medically-minded: the Drugs@FDA database file set is actually freely available