MIE1512H: Data Analytics

This course is a research seminar that focuses on recent developments in the area of Data Management for Analytics. Science, businesses, society, and government are been revolutionized by data-driven methods that benefit heavily from scalable data management techniques. The course provides an overview of data management concepts applied to analytics, covering methods and techniques, including distributed computations on massive datasets and frameworks for enabling large-scale parallel data processing on clusters of commodity servers. Emphasis is given to data management techniques for analyzing Web Data and Open Datasets. The course evaluation is based on student presentations, a focused bibliography survey, a hands on invigilated lab, and a course project (the last two using computational notebooks on scalable platforms). The project goal is to reproduce high quality published research in the area of data analytics, emphasizing data management aspects.

0.50
St. George
In Class