CMSI 3520

Database Systems

Fall 2021



  • Brightspace: Where you can access private content and check your grades
  • We will use GitHub Classroom to manage and submit assignments
  • Every now and then we may use Socrative to ask or answer questions as a class—my Socrative room is DONDILMU
  • We are back on campus and in person, but we continue to live in very fluid and dynamic circumstances—make sure to follow the university’s pandemic portal for the latest news, updates, and policies—to which this class must, of course, adhere

Assignments

Dataset Sources

It’s hard to appreciate the power and potential of general database management systems when you don’t have seed data to populate it—so we’ll want to do our work with a database that can be fully populated from the get-go.

Our case study dataset is the one from the Netflix Prize competition from over a decade (a decade!!!) ago: https://www.kaggle.com/netflix-inc/netflix-prize-data—find one to call your own from one of these sites, or feel free to identify one independently:

  • Kaggle datasets are generally positioned for data science or machine learning, but sometimes also applicable to pure database work:
  • The Awesome Public Datasets collection is similar in purpose and applicability
  • For the health-/medically-minded: the Drugs@FDA database file set is actually freely available