To do
- Revise streaming and watermarks
- Go through the contents again
- Do PyP
Notes
- What is Data Science?
- Principles of Big Data Systems
- MapReduce
- Performance analysis
- Relational databases
- Data mining
- Spark
- Streaming
- Graphs and PageRank
- NoSQL
Assignments
- Assignment 1: Due 17 March
- Assignment 2: Due 21 April
Exams
- Finals: 29 April, 5pm
- Open book, hard copy
References
- https://lintool.github.io/MapReduceAlgorithms/MapReduce-book-final.pdf