Class Notes

Notes: 1. please visit this page frequently, as it will be updated constantly during the term
2. Items under Required Material are considered mandatory reading and will be tested in the exams.
3. Items under Additional Material should be useful in helping you understand the Required Material.
4. Items in the third column (Online Resources and Reference Books) are provided for reference and in order to help you explore the different topics further.
5. Links to O'Reilly's Safari Online Bookshelf are only available within pitt.edu or from outside pitt.edu using the University's VPN service (https://sremote.pitt.edu). Please remember to sign out once you finished reading -- the University has a very limited number of concurrent user licenses.

Shortcuts:

13: Review for Final Exam (Apr 15)
Required Reading
(25) Whiteboard photos
Additional Material
Online Resources
Reference Books

12: RDF/SPARQL (Apr 6, 8, 13)
Required Reading
(22) Intro to RDF / SPARQL

(23) SPARQL

(24) SPARQL II

Additional Material
(22) Intro to RDF / SPARQL - Handout

(23) SPARQL - Handout

Online Resources
SPARQL Examples from Learning SPARQL, 1st Edition

DBpedia

ARQ - a SPARQL processor

twinkle - simple SPARQL GUI

topbraid composer - free sparql tool

Reference Books
Learning SPARQL, 2nd Edition, O'Reilly, 2013

11: SQL (Mar 16, 18, 23, 25, 30, Apr 1)
Required Reading
(17) Intro to SQL
(18) SQL II (answers)
(19) SQL III
(20) SQL IV
(21) SQL IV (cont)
Additional Material
(17) Intro to SQL -- Handout
(19) SQL III -- Handout
(20) SQL IV -- Handout
Online Resources
Interactive SQL Tutorial

SQL Tutorial

Reference Books

10: XQuery (Mar 2, 4)
Required Reading
(15) XQuery

(16) XQuery II

Additional Material
(15) XQuery Handout

Examples - Chapter 1:
1.5 1.6 1.7 1.8 1.9 1.10 1.11 1.12

Examples - Chapter 3:
3.1 3.2 3.3

(16) XQuery II Handout

Examples - Chapter 6:
6.1 6.2 6.3 6.4 6.5 6.6 6.7 6.8 6.9 6.10 6.11 6.12 6.13 6.14 6.15 6.16

Examples - Chapter 7:
7.1 7.2 7.4 7.5 7.9 7.10 7.11 7.12

Online Resources
XQuery tutorial

OxygenXML Editor

Reference Books
XQuery, by Priscilla Walmsley, O'Reilly, 2007

09: XML/XPath (Feb 27)
Required Reading
(14) XML and XPath

Additional Material
(14) XML and XPath Handout (Solutions)

Online Resources
XPath tutorial

XPath Tester

Reference Books

08: Data Summarization and Visualization (Feb 18)
Required Reading
(11) Data Summarization and Visualization
Additional Material
(11) Data Summarization and Visualization Handout
Online Resources
Reference Books
Data Mining Concepts and Techniques (3rd Edition), 2012 (Chapter 4)

07: Data Warehousing (Feb 16)
Required Reading
(10) Data Warehousing
Additional Material
(10) Data Warehousing Handout
Online Resources
Reference Books

06: Network Analysis (Feb 11)
Required Reading
(09) Network Analysis
Additional Material
(09) Network Analysis Handout
Online Resources
Reference Books

05: Web Information Retrieval (Feb 9)
Required Reading
(08) Web Information Retrieval
Additional Material
Online Resources
The Google Pagerank Algorithm and How It Works

PageRank Calculator

PageRank explained

Reference Books

04: Recommender Systems (Feb 2, 4)
Required Reading
(06) Recommender Systems
(07) Recommender Systems - II
Additional Material
(06) Recommender Systems Class Handout
(07) Recommender Systems - II Class Handout
Online Resources
A Programmer's Guide to Data Mining (Chapter 2, 3)
Reference Books

03: Information Retrieval (Jan 26, 28)
Required Reading
(04) Information Retrieval
(05) Information Retrieval II
Additional Material
(04) Information Retrieval Class Handout
(05) Information Retrieval II Class Handout and Solutions
Online Resources
Online Log Base 2 Calculator
Reference Books
Modern Information Retrieval (2nd Edition), 2011

02: Data Mining (Jan 14, 21)
Required Reading
(02) Introduction to Data Mining (Clustering)

(03) Association Rule Mining

Additional Material
(03) Association Rule Mining Class Handout
Online Resources
How Target Figured Out A Teen Girl Was Pregnant Before Her Father Did, Forbes, Feb 16, 2012

The parable of the beer and diapers, The Register, August 15, 2006

Reference Books
Data Mining Concepts and Techniques (3rd Edition), 2012

Mining of Massive Datasets (Sec 6.1, 6.2, 7.1.1, 7.1.2, 7.2.1, 7.3.1, 7.3.2)

01: Introduction to Big Data (Jan 12)
Required Reading
(01) Introduction to Big Data
Additional Material
Online Resources
Big Data and Its Technical Challenges in Communications of the ACM (July 2014)
Reference Books
The Fourth Paradigm: Data-Intensive Scientific Discovery