Purpose: This special topics course aims to expose students to a range of data management, data manipulation, and data analysis techniques. The class will cover all the major data management paradigms (Relational/SQL, XML/XQuery, RDF/SPARQL), as well as NoSQL and data stream processing approaches. Going beyond traditional data management techniques, the class will also introduce information retrieval, data mining, data warehousing, network analysis, and other data analysis topics. Time permitting, the class will include Big Data processing techniques, such as the map/reduce framework.
Prereq: A grade of C or better in CS 441 and CS 445 is required (or permission of the instructor). Good working knowledge of Java and familiarity with Unix are assumed. Completing a statistics course beforehand is strongly encouraged.
Textbook: No single textbook covers all the material we want to discuss in this class. We will rely on online references and on O'Reilly's Safari Bookshelf, to which the University has institutional access (i.e., you will not have to buy extra books).
