CSD

CCS334: Big Data Analytics syllabus for CSD 2021 regulation (Professional Elective-I)

Big Data Analytics detailed syllabus for Computer Science & Design (CSD) for 2021 regulation curriculum has been taken from the Anna Universities official website and presented for the CSD students. For course code, course name, number of credits for a course and other scheme related information, do visit full semester subjects post given below.

For Computer Science & Design 5th Sem scheme and its subjects, do visit CSD 5th Sem 2021 regulation scheme. For Professional Elective-I scheme and its subjects refer to CSD Professional Elective-I syllabus scheme. The detailed syllabus of big data analytics is as follows.

Course Objectives:

Download the iStudy App for all syllabus and other updates.
Get it on Google Play

Unit I

UNDERSTANDING BIG DATA
Introduction to big data – convergence of key trends – unstructured data – industry examples of big data – web analytics – big data applications- big data technologies – introduction to Hadoop -open source technologies – cloud and big data – mobile business intelligence – Crowd sourcing analytics – inter and trans firewall analytics.

Unit II

NOSQL DATA MANAGEMENT
Introduction to NoSQL – aggregate data models – key-value and document data models -relationships – graph databases – schemaless databases – materialized views – distribution models – master-slave replication – consistency – Cassandra – Cassandra data model -Cassandra examples – Cassandra clients

Unit III

Download the iStudy App for all syllabus and other updates.
Get it on Google Play

Unit IV

BASICS OF HADOOP
Data format – analyzing data with Hadoop – scaling out – Hadoop streaming – Hadoop pipes -design of Hadoop distributed file system (HDFS) – HDFS concepts – Java interface – data flow -Hadoop I/O – data integrity – compression – serialization – Avro – file-based data structures -Cassandra – Hadoop integration.

Unit V

HADOOP RELATED TOOLS
Hbase – data model and implementations – Hbase clients – Hbase examples – praxis. Pig – Grunt – pig data model – Pig Latin – developing and testing Pig Latin scripts. Hive – data types and file formats – HiveQL data definition – HiveQL data manipulation – HiveQL queries.

Course Outcomes:

After the completion of this course, students will be able to:

  1. Describe big data and use cases from selected business domains.
  2. Explain NoSQL big data management.
  3. Install, configure, and run Hadoop and HDFS.
  4. Perform map-reduce analytics using Hadoop.
  5. Use Hadoop-related tools such as HBase, Cassandra, Pig, and Hive for big data analytics.

List of Experiments:

Download the iStudy App for all syllabus and other updates.
Get it on Google Play

Text Books:

  1. Michael Minelli, Michelle Chambers, and AmbigaDhiraj, “Big Data, BigAnalytics: Emerging Business Intelligence and Analytic Trends for Today’s Businesses”, Wiley, 2013.
  2. Eric Sammer, “Hadoop Operations”, O’Reilley, 2012.
  3. Sadalage, Pramod J. “NoSQL distilled”, 2013

Reference Books:

  1. E. Capriolo, D. Wampler, and J. Rutherglen, “Programming Hive”, O’Reilley, 2012.
  2. Lars George, “HBase: The Definitive Guide”, O’Reilley, 2011.
  3. Eben Hewitt, “Cassandra: The Definitive Guide”, O’Reilley, 2010.
  4. Alan Gates, “Programming Pig”, O’Reilley, 2011.

For detailed syllabus of all the other subjects of Computer Science & Design 5th Sem, visit CSD 5th Sem subject syllabuses for 2021 regulation.

For all Computer Science & Design results, visit Anna University CSD all semester results direct link.

Leave a Reply

Your email address will not be published. Required fields are marked *

*

This site uses Akismet to reduce spam. Learn how your comment data is processed.