24/4/2014
Midterm 2 Week 1/5/2014
23-24 8/5/201415/5/2014
1. Data

...

Massive Data sets)Sets)
2. Module 5Link Analysis (Section 5.1 and 5.2 of EMCMining of Massive Data Sets)
3. Kaggle Challenge Discussion (AllState Data)
25-26 15/5/2014
1. Link Analysis (Section 5.1 and 5.2 of Mining of Massive Data sets)
2. Module 6 of EMC
27-28
22/5/2014

...

Massive Data sets)Sets)
2. Matrix/Vector + Database Operations in MapReduce (Chapter 2 of Mining of Massive Data Sets)
27-28
24/5/2014
1. Module 5 of EMC and Tutorials by Hortonworks (Pig, Hive, HBase)
29-30 1. Advanced Python
31-32
29/5/2014

1. Kaggle Challenge Discussion (Walmart Data)
2. MapReduce programming (Units 3-4 of UdaCity Course)
3. EMC Practice Test
24/4/2014
Midterm 2 Week 19-20
1/5/2014 Kaggle Wal-Mart Challenge Submission (Due Date: May 05, 2014)
21-22
8/5/2014
1. Section 5.1 and 5.2 (Mining of Massive Data sets)
23-24 15/5/2014
Kaggle Allstate8/5/2014
1. Data Streams Mining (Section 4.1 and 4.2 of Mining of Massive Data sets)
2. Module 5 of EMC
3. Kaggle Challenge Submission (Due Date: May 19, 2014)Discussion (AllState Data)
25-26 22/5/201415/5/2014
1. Section 8.1, 8.2Link Analysis (Section 5.1 and 8.4 (Mining5.2 of Mining of Massive Data sets)
2. Module 6 of EMC
27-28
22/5/2014
1. Web Advertising (Section 8.1, 8.2 and 8.4 of Mining of Massive Data sets)
2.
29-30
1. Advanced Python
31-32
29/5/2014
1. SectionsRecommendation Systems (Sections 9.1, 9.2 and 9.3 (Miningof Mining of Massive Data sets)
2.

Lectures
edited
... Thursday
1. Discussion on KNIME Implementation of Text Aanlytics, Regression and Association …

...

Thursday
1. Discussion on KNIME Implementation of Text Aanlytics, Regression and Association Rule Mining

...

and MapReduce (First 2 Units(Units 1-2 of UdaCity
11-12
29/3/2014
Saturday

...

2 of EMC Certification (Data
13-14
3/4/2014

...

5/4/2014
Saturday
1. Kaggle Challenge Discussion (Walmart Data)
2. Quick Overview of Python
3. Recap of the following topics for Data Analysiscertification (Decision Trees, Naive Bayes, K-Means and Association Rules)
17-18
10/4/2014
Thursday
1. Section 4.1 and 4.2 (MiningRecap of Massive Data sets)
2. Discussion onthe following topics for certification (Regression Analysis, Logistic Regression, Time Series Analysis and Text Analytics)
19-20
12/4/2014
Saturday
1. Certification Lab
21-22
17/4/2014
Thursday
1. Kaggle Wal-Mart Challenge Discussion (Walmart Data)
2. MapReduce programming (Units 3-4 of UdaCity Course)
24/4/2014
Midterm 2 Week

Lectures
edited
... 2. Introduction to Hadoop and MapReduce (First 2 Units of UdaCity Course)
11-12
29/3/2013 …

...

2. Introduction to Hadoop and MapReduce (First 2 Units of UdaCity Course)
11-12 29/3/201329/3/2014
Saturday

...

Big Data Analytics)Analytics Fundamentals)
13-14
3/4/2014
Thursday
1. Time Series Analysis
2. Module 3 of EMC Certification (EDA using R)
15-16
5/4/2014
Saturday
1. Python for Data Analysis
13-14
3/4/2014

Midterm 1 Week
9-10 20/3/2014
Kaggle PAKDD2014 Challenge Discussion (Due Date: April 1, 2014)
11-12
27/3/2014
Thursday
1. Time Series AnalysisDiscussion on KNIME Implementation of Text Aanlytics, Regression and Association Rule Mining
2. Kaggle PAKDD2014 SubmissionIntroduction to Hadoop and MapReduce (First 2 Units of UdaCity Course)
11-12
29/3/2013
Saturday
1. Modules 1 and 2 of EMC Certification (Data Science & Big Data Analytics)
13-14
3/4/2014

home
edited
... Spring 2014
Instructor: Dr. Sajjad Haider
Class Timings: 7:45 PM 11:00AM - 9:00 2:00 P…

...

Spring 2014
Instructor: Dr. Sajjad Haider
Class Timings: 7:45 PM11:00AM - 9:002:00 PM (Tuesdays and Thursdays)(Thursdays)
E-mail Address: sajjad.haider@khi.iba.edu.pk
Phone: 111-422-422 (Ext. 1612 and 1621)

...

Syllabus
Lectures
Unlike my other courses, this course is making extensive use of SAKAI and hence all the materials and updates are posted on the IBA SAKAI implementation. I would still update the lecture page from time to time to highlight the topics covered in each session.