Tentative Schedule

- Administrative details and Introduction [slides]
- Introduction to cloud computing and AWS (Dr. He)
- Cloud platform practices: Examples (Dr. He)
- Data Representation from structured to unstructured (Dr. Dragut) [slides]
- Finding Similar Items (Dr. Dragut) [slides]
- Midterm Exam
- Distributed file systems: Hadoop and Google’s GFS (Dr. He)
- Map/Reduce Programming Model (Dr. Dragut) [slides]
- Virtualization: KVM and Xen (Dr. He)
- Data Ethics & Privacy (Dr. Dragut) [slides]
- Streaming Data [slides]
- Playing with Data [slides]
- Big Data as Matrices
- GPU Computing with Application to NLP
- Final Exam