Outline of CSE6331




| Date | Topic | Deadlines and Notes | Reading Material |
| --- | --- | --- | --- |
| Thursday 08/24 | Course organization | | |
| Tuesday 08/29 | Cloud Computing | | bigdata-l01.pdf; Read: A view of cloud computing, VMWare: Virtualization Overview |
| Thursday 08/31 | Cloud Computing | | |
| Tuesday 09/05 | Map-Reduce Fundamentals | | bigdata-l02.pdf; Read: MapReduce: A Flexible Data Processing Tool, MapReduce: Simplified Data Processing on Large Clusters |
| Thursday 09/07 | Map-Reduce Fundamentals | | |
| Tuesday 09/12 | Map-Reduce Fundamentals | | Read: Sections 2.1 and 2.2 from Map-Reduce and the New Software Stack |
| Thursday 09/14 | Map-Reduce Fundamentals | | |
| Tuesday 09/19 | Map-Reduce Fundamentals | | |
| Thursday 09/21 | Map-Reduce Fundamentals | | |
| Tuesday 09/26 | Map-Reduce Design Patterns | | bigdata-l03.pdf; Read: Section 2.3 from Map-Reduce and the New Software Stack |
| Thursday 09/28 | Map-Reduce Design Patterns | Final project explained | |
| Tuesday 10/03 | Map-Reduce Design Patterns | Project 1 is due | |
| Thursday 10/05 | Map-Reduce Design Patterns | | |
| Tuesday 10/10 | Apache Spark | | bigdata-l04.pdf; Read: Spark Tutorial, Resilient Distributed Datasets |
| Thursday 10/12 | NO CLASS | Deadline to choose a final project | |
| Tuesday 10/17 | Apache Spark | Project 2 is due | Read: Spark Programming Guide |
| Thursday 10/19 | Apache Spark | | |
| Tuesday 10/24 | Apache Pig | | bigdata-l05.pdf; Read: Pig Latin |
| Thursday 10/26 | Apache Hive | Project 3 is due | Read: Hive |
| Tuesday 10/31 | Apache Hive | | |
| Thursday 11/02 | Spark SQL | | Read: Spark SQL |
| Tuesday 11/07 | Graph Processing with Pregel and Giraph | | bigdata-l06.pdf; Read: Pregel |
| Thursday 11/09 | BigTable | Project 4 is due | bigdata-l07.pdf |
| Tuesday 11/14 | Dynamo | | |
| Thursday 11/16 | MRQL | | bigdata-l08.pdf |
| Tuesday 11/21 | DIQL | Project 5 is due | |
| Thursday 11/23 | NO CLASS (Thanksgiving) | | |
| Tuesday 11/28 | Student Presentations | | |
| Thursday 11/30 | Student Presentations | Project 6 is due | |
| Tuesday 12/05 | Student Presentations | | |
| Thursday 12/07 | Student Presentations | Deadline to submit the final project | |
| Tuesday 12/12 | Final Exam | | |

Last modified: 11/16/17 by Leonidas Fegaras