Class Notes

Notes: 1. please visit this page frequently, as it will be updated constantly during the term
2. Items under Required Material are considered mandatory reading and will be tested in the exams.
3. Items under Additional Material should be useful in helping you understand the Required Material.
4. Items in the third column (Online Resources and Reference Books) are provided for reference and in order to help you explore the different topics further.
5. Links to O'Reilly's Safari Online Bookshelf are only available within pitt.edu or from outside pitt.edu using the University's VPN service (https://sremote.pitt.edu). Please remember to sign out once you finished reading -- the University has a very limited number of concurrent user licenses.

Shortcuts:

19: Final Exam Review (Dec 3, 2014)
Required Reading
Additional Material
Online Resources
Reference Books

18: Advanced Topics (Dec 1) NEW
Required Reading
(25) Data Stream Management Systems
Additional Material
(25) Data Stream Management Systems - Handout
Online Resources
Reference Books

17: RDF/SPARQL (Nov 17, 19, 24)
Required Reading
(22) Intro to RDF / SPARQL

(23) SPARQL

(24) SPARQL II

Additional Material
(22) Intro to RDF / SPARQL - Handout

(23) SPARQL - Handout

Online Resources
SPARQL Examples from Learning SPARQL, 1st Edition

DBpedia

ARQ - a SPARQL processor

twinkle - simple SPARQL GUI

topbraid composer - free sparql tool

Reference Books
Learning SPARQL, 2nd Edition, O'Reilly, 2013

16: SQL Injection Avoidance (Nov 10)
Required Reading
(21) SQL Injection Avoidance
Additional Material
05.prepp.php (run)
05.query.html
06.prepp.php (run)
07.prepp.php (run)
08.prepc.php (run)
09.multip.php (run)

(19) SQL III - Handout (copy)

Online Resources
Reference Books

15: SQL Injection (Nov 7)
Required Reading
Exploits of a Mom
Additional Material
SQL Injection by Example
Online Resources
Multiple MySQL Statements
Reference Books

14: PHP Review (Nov 5)
Required Reading
Additional Material
alex1.php (run)
alex2.php (run)
alex3.php (run)
alex4.php (run)

dbinfo.php
01.query.php (run)
02.query.php (run)
03.queryp.php (run)
04.queryc.php (run)

Online Resources
Programming PHP, 3rd Edition By Kevin Tatroe, Peter MacIntyre, Rasmus Lerdorf (O'Reilly Media, February 2013)

PHP Cookbook, 3rd Edition By David Sklar, Adam Trachtenberg (O'Reilly Media, June 2014)

PHP Tutorial

MySQL dual procedural and OO interface

Reference Books
PHP: The good parts: by Peter MacIntyre (O'Reilly Media, April 2010)

13: SQL Review (Oct 22, 31, Nov 3)
Required Reading
(17) Intro to SQL

(18) SQL - II

(19) SQL - III

Additional Material
(17) Intro to SQL - Handout

(19) SQL III - Handout

Online Resources
SQL Tutorial
Reference Books

12: XQuery (Oct 15, 20)
Required Reading
(15) XQuery

(16) XQuery II

Additional Material
(15) XQuery Handout (Solutions)

Examples - Chapter 1:
1.5 1.6 1.7 1.8 1.9 1.10 1.11 1.12

Examples - Chapter 3:
3.1 3.2 3.3

(16) XQuery II Handout

Examples - Chapter 6:
6.1 6.2 6.3 6.4 6.5 6.6 6.7 6.8 6.9 6.10 6.11 6.12 6.13 6.14 6.15 6.16

Examples - Chapter 7:
7.1 7.2 7.4 7.5 7.9 7.10 7.11 7.12

Online Resources
XQuery tutorial

OxygenXML Editor

Reference Books
XQuery, by Priscilla Walmsley, O'Reilly, 2007

11: XML/XPath (Oct 13)
Required Reading
(14) XML and XPath

Additional Material
(14) Training vs Test Data Handout (Solutions)

(14) XML and XPath Handout (Solutions)

Online Resources
XPath tutorial

XPath Tester

Reference Books

10: Midterm Review (Oct 6)
Required Reading
Additional Material
Online Resources
Reference Books

09: Data-Driven Presentations (Oct 3)
Required Reading
(12) Data-Driven Presentations
Additional Material
Online Resources
Reference Books

08: Data Summarization and Visualization (Oct 1)
Required Reading
(11) Data Summarization and Visualization
Additional Material
(11) Data Summarization and Visualization Handout
Online Resources
Reference Books

07: Data Warehousing (Sep 29)
Required Reading
(10) Data Warehousing
Additional Material
(10) Data Warehousing Handout
Online Resources
Reference Books
Data Mining Concepts and Techniques (3rd Edition), 2012 (Chapter 4)

06: Recommender Systems (Sep 22, 24)
Required Reading
(08) Recommendation Systems

(09) Recommendation Systems - II

Additional Material
(08) Recommendation Systems Handout

(09) Recommendation Systems II Handout

Online Resources
A Programmer's Guide to Data Mining (Chapter 2, 3)
Reference Books

05: Introduction to Network Analysis (Sep 17)
Required Reading
(07) Network Analysis

Additional Material
(07) Network Analysis Handout

Online Resources
Wikipedia's entry on Probability
Reference Books
Analyzing The Social Web/ by Jennifer Goldbeck, Morgan Kaufmann, 2013

04: Web Information Retrieval (Sep 15)
Required Reading
(06) Web Crawling and Google's PageRank

Additional Material
Online Resources
The Google Pagerank Algorithm and How It Works

PageRank Calculator

PageRank explained

Reference Books

03: Information Retrieval (Sep 8, 10)
Required Reading
(04) Information Retrieval - I

(05) Information Retrieval - II

Additional Material
(04) Information Retrieval I Class Handout

(05) Information Retrieval II Class Handout

Online Resources
Online Log Base 2 Calculator
Reference Books
Modern Information Retrieval (2nd Edition), 2011

02: Data Mining (Aug 27, Sep 3)
Required Reading
(02) Introduction to Data Mining (Clustering)

(03) Association Rule Mining

Additional Material
(03) Association Rule Mining Class Handout
Online Resources
How Target Figured Out A Teen Girl Was Pregnant Before Her Father Did, Forbes, Feb 16, 2012
Reference Books
Data Mining Concepts and Techniques (3rd Edition), 2012

Mining of Massive Datasets (Sec 6.1, 6.2, 7.1.1, 7.1.2, 7.2.1, 7.3.1, 7.3.2)

01: Introduction to Big Data (Aug 25)
Required Reading
(01) Introduction to Big Data
Additional Material
Online Resources
Big Data and Its Technical Challenges in Communications of the ACM (July 2014)
Reference Books
The Fourth Paradigm: Data-Intensive Scientific Discovery