Big Data Analytics

Learning Outcomes: 
Upon completion of the course, students will be able to:
Identify and distinguish big data analytics applications
Describe big data analytics tools
Explain big data analytics techniques
Present cases involving big data analytics in solving practical problems
Conduct big data analytics using system tools
Suggest appropriate solutions to big data analytics problems
Syllabus: 
Unit No.
Topics
1
Introducing Big Data
Introduction: Big Data and its importance, the 4 Vs, structured, semi-structured and unstructured data, the big data problem in the future
2
Introduction to Hadoop Architecture
Features, RDBMS vs. Hadoop, versions, Introduction to some Hadoop distributions, Hadoop technology stack
3
Hadoop Distributed File System (HDFS)
Introduction, HDFS Daemons, Different methods of accessing HDFS
4
MapReduce
Introduction, Phases involved in MapReduce, Architecture, Working of MapReduce using an Example Program
5
HBase
Limitations of Hadoop, Introduction to HBase, HBase vs. RDBMS, Features, HBase components, General commands
6

Introduction, Overview, and History of NoSQL Databases, Definition of the Four Types of NoSQL Database, Why NoSQL?, The Value of Relational Databases, Getting at Persistent Data, Concurrency, Integration, Impedance Mismatch, Application and Integration Databases, Attack of the Clusters, The Emergence of NoSQL, Key Points

7
Comparison of relational databases to new NoSQL stores, MongoDB, Cassandra, HBase, Neo4j use and deployment, Application, RDBMS approach, Challenges, NoSQL approach, Key-Value and Document Data Models, Column-Family Stores, Aggregate-Oriented Databases
Replication and sharding, MapReduce on databases. Distribution Models, Single Server, Sharding, Master-Slave Replication, Peer-to-Peer Replication, Combining Sharding and Replication
8

NoSQL document databases using MongoDB, Document Databases, What Is a Document Database, Features, Consistency, Transactions, Availability, Query Features, Scaling

9

Column-oriented NoSQL databases using Apache HBase, Column-oriented NoSQL databases using Apache Cassandra, Architecture of HBase, Features, Consistency, Transactions, Availability, Query Features, Scaling

10

NoSQL Key/Value databases using Riak, Key-Value Databases, What Is a Key-Value Store, Key-Value Store Features, Consistency, Transactions, Query Features, Structure of Data

11

Graph NoSQL databases using Neo4j, NoSQL database development tools and programming languages, Graph Databases

Reference Books: 
Name: 
Hadoop: The Definitive Guide
Author: 
Tom White
Publication: 
O’Reilly Media
Edition: 
3rd, 2012
Name: 
The Big Data Tidal Wave: Finding Opportunities in Huge Data Streams with Advanced Analytics
Author: 
Bill Franks
Publication: 
Wiley, 2012
Name: 
Big Data Analytics: Disruptive Technologies for Changing the Game
Author: 
Arvind Sathi
Publication: 
MC Press, 2012
Syllabus PDF: 
BDA.pdf (230.24 KB)
branch: 
BDA
Course: 
2018
Stream: 
B.Tech