Outline of cse6331




Reading Material

Tuesday 1/15 Course organization    
Thursday 1/17 Cloud Computing   bigdata-l01.pdf
Read: A view of cloud computing,
VMWare: Virtualization Overview
Tuesday 1/22 Cloud Computing   data scientist job outlook
Thursday 1/24 Cloud Computing    
Tuesday 1/29 Map-Reduce Fundamentals   bigdata-l02.pdf
Read: MapReduce: A Flexible Data Processing Tool,
MapReduce: Simplified Data Processing on Large Clusters
Thursday 1/31 Map-Reduce Fundamentals    
Tuesday 2/5 Map-Reduce Fundamentals   Read: Sections 2.1 and 2.2 from Map-Reduce and the New Software Stack
Thursday 2/7 Map-Reduce Fundamentals    
Tuesday 2/12 Map-Reduce Design Patterns   bigdata-l03.pdf
Read: Section 2.3 from Map-Reduce and the New Software Stack
Thursday 2/14 Map-Reduce Design Patterns    
Tuesday 2/19 Map-Reduce Design Patterns Project 1 is due  
Thursday 2/21 Map-Reduce Design Patterns    
Tuesday 2/26 Map-Reduce Design Patterns Project 2 is due  
Thursday 2/28 Apache Spark   bigdata-l04.pdf
Read: Spark Tutorial
Resilient Distributed Datasets
Tuesday 3/5 Apache Spark Project 3 is due  
Thursday 3/7 Midterm Exam    
Tuesday 3/12 NO CLASS (Spring Break)    
Thursday 3/14 NO CLASS (Spring Break)    
Tuesday 3/19 Apache Spark   Read: Spark Programming Guide
Thursday 3/21 Apache Spark Project 4 is due  
Tuesday 3/26 Apache Spark    
Thursday 3/28 Apache Pig   bigdata-l05.pdf
Read: Pig Latin
Tuesday 4/2 Apache Pig Project 5 is due  
Thursday 4/4 Apache Hive   Read: Hive
Tuesday 4/9 Spark SQL Project 6 is due Read: Spark SQL
Thursday 4/11 Spark SQL    
Tuesday 4/16 Graph Processing with Pregel and Giraph Project 7 is due bigdata-l06.pdf
Read: Pregel
Thursday 4/18 BigTable   bigdata-l07.pdf
Tuesday 4/23 Cassandra    
Thursday 4/25      
Tuesday 4/30   Project 8 is due  
Thursday 5/2      
Tuesday May 7
cse6331-004 Final Exam    
Thursday May 9
cse6331-005 Final Exam    

Last modified: 05/01/18 by Leonidas Fegaras