Big Data & Analytics detailed syllabus scheme for Computer Engineering (CS), 2019 regulation has been taken from the MU official website and presented for the Bachelor of Engineering students. For Course Code, Course Title, Test 1, Test 2, Avg, End Sem Exam, Team Work, Practical, Oral, Total, and other information, do visit full semester subjects post given below.
For 7th Sem Scheme of Computer Engineering (CS), 2019 Pattern, do visit CS 7th Sem Scheme, 2019 Pattern. For the Department Level Optional Course-3 scheme of 7th Sem 2019 regulation, refer to CS 7th Sem Department Level Optional Course-3 Scheme 2019 Pattern. The detail syllabus for big data & analytics is as follows.
Big Data & Analytics Syllabus for Computer Engineering BE 7th Sem 2019 Pattern Mumbai University
Course Objectives:
For the complete Syllabus, results, class timetable, and many other features kindly download the iStudy App
It is a lightweight, easy to use, no images, and no pdfs platform to make students’s lives easier..
Course Outcomes:
Learner will be able to..
- Understand the key issues in big data management and its associated applications for business decisions and strategy.
- Develop problem solving and critical thinking skills in fundamental enabling techniques like Hadoop, Mapreduce and NoSQL in big data analytics.
- Collect, manage, store, query and analyze various forms of Big Data.
- Interpret business models and scientific computing paradigms, and apply software tools for big data analytics.
- Adapt adequate perspectives of big data analytics in various applications like recommender systems, social media applications etc.
- Solve Complex real world problems in various applications like recommender systems, social media applications, health and medical systems, etc.
Prerequisites:
Some prior knowledge about Java programming, Basics of SQL, Data mining and machine learning methods would be beneficial.
Module 1
For the complete Syllabus, results, class timetable, and many other features kindly download the iStudy App
It is a lightweight, easy to use, no images, and no pdfs platform to make students’s lives easier..
Module 2
Hadoop HDFS and MapReduce
- Distributed File Systems: Physical Organization of Compute Nodes, Large-Scale File-System Organization.
- MapReduce: The Map Tasks, Grouping by Key, The Reduce Tasks, Combiners, Details of MapReduce Execution, Coping With Node Failures.
- Algorithms Using MapReduce: Matrix-Vector Multiplication by MapReduce, Relational-Algebra Operations, Computing Selections by MapReduce, Computing Projections by MapReduce, Union, Intersection, and Difference by MapReduce
- Hadoop Limitations 10
Module 3
NoSQL
- Introduction to NoSQL, NoSQL Business Drivers,
- NoSQL Data Architecture Patterns: Key-value stores, Graph stores, Column family (Bigtable)stores, Document stores, Variations of NoSQL architectural patterns, NoSQL Case Study
- NoSQL solution for big data, Understanding the types of big data problems; Analyzing big data with a shared-nothing architecture; Choosing distribution models: master-slave versus peer-to-peer; NoSQL systems to handle big data problems. 06
Module 4
For the complete Syllabus, results, class timetable, and many other features kindly download the iStudy App
It is a lightweight, easy to use, no images, and no pdfs platform to make students’s lives easier..
Module 5
Finding Similar Items and Clustering
- Distance Measures: Definition of a Distance Measure, Euclidean Distances, Jaccard Distance, Cosine Distance, Edit Distance, Hamming Distance.
- CURE Algorithm, Stream-Computing , A Stream-Clustering Algorithm, Initializing & Merging Buckets, Answering Queries 08
Real-Time Big Data Models
- PageRank Overview, Efficient computation of PageRank: PageRank Iteration Using MapReduce, Use of Combiners to Consolidate the Result Vector.
- A Model for Recommendation Systems, Content-Based Recommendations, Collaborative Filtering.
- Social Networks as Graphs, Clustering of Social-Network Graphs, Direct Discovery of Communities in a social graph.
Text Books:
- CreAnand Rajaraman and Jeff Ullman Mining of Massive Datasets, Cambridge University Press,
- Alex Holmes Hadoop in Practice, Manning Press, Dreamtech Press.
- Dan Mcary and Ann Kelly Making Sense of NoSQL – A guide for managers and the rest of us, Manning Press.
Reference Books:
For the complete Syllabus, results, class timetable, and many other features kindly download the iStudy App
It is a lightweight, easy to use, no images, and no pdfs platform to make students’s lives easier..
Suggested Practical List:
- Hadoop HDFS Practical:
- HDFS Basics, Hadoop Ecosystem Tools Overview.
- Installing Hadoop.
- Copying File to Hadoop.
- Copy from Hadoop File system and deleting file.
- Moving and displaying files in HDFS.
- Programming exercises on Hadoop.
- Use of Sqoop tool to transfer data between Hadoop and relational database servers.
- Sqoop – Installation.
- To execute basic commands of Hadoop eco system component Sqoop.
- To install and configure MongoDB/ Cassandra/ HBase/ Hypertable to execute NoSQL commands.
- Experiment on Hadoop Map-Reduce / PySpark:
- Implementing simple algorithms in Map-Reduce: Matrix multiplication, Aggregates, Joins, Sorting, Searching, etc.
- Create HIVE Database and Descriptive analytics-basic statistics, visualization using Hive/PIG/R.
- Write a program to implement word count program using MapReduce.
- Implementing DGIM algorithm using any Programming Language/ Implement Bloom Filter using any programming language.
- Implementing any one Clustering algorithm (A-Means/CURE) using Map-Reduce.
- Streaming data analysis – use flume for data capture, HIVE/PYSpark for analysis of twitter data, chat data, weblog analysis etc.
- Implement PageRank using Map-Reduce.
- Implement predictive Analytics techniques (regression / time series, etc) using R/ Scilab/ Tableau/ Rapid miner.
- Mini Project: One real life large data application to be implemented (Use standard Datasets available on the web).
The Experiments for this course are required to be performed and to be evaluated in Computational Lab-1.
For detail Syllabus of all subjects of Computer Engineering (CS) 7th Sem 2019 regulation, visit CS 7th Sem Subjects of 2019 Pattern.