Big Data Analytics detailed syllabus for Computer Science & Design (CSD) for 2021 regulation curriculum has been taken from the Anna Universities official website and presented for the CSD students. For course code, course name, number of credits for a course and other scheme related information, do visit full semester subjects post given below.
For Computer Science & Design 5th Sem scheme and its subjects, do visit CSD 5th Sem 2021 regulation scheme. For Professional Elective-I scheme and its subjects refer to CSD Professional Elective-I syllabus scheme. The detailed syllabus of big data analytics is as follows.
Course Objectives:
Download the iStudy App for all syllabus and other updates.

Unit I
UNDERSTANDING BIG DATA
Introduction to big data – convergence of key trends – unstructured data – industry examples of big data – web analytics – big data applications- big data technologies – introduction to Hadoop -open source technologies – cloud and big data – mobile business intelligence – Crowd sourcing analytics – inter and trans firewall analytics.
Unit II
NOSQL DATA MANAGEMENT
Introduction to NoSQL – aggregate data models – key-value and document data models -relationships – graph databases – schemaless databases – materialized views – distribution models – master-slave replication – consistency – Cassandra – Cassandra data model -Cassandra examples – Cassandra clients
Unit III
Download the iStudy App for all syllabus and other updates.

Unit IV
BASICS OF HADOOP
Data format – analyzing data with Hadoop – scaling out – Hadoop streaming – Hadoop pipes -design of Hadoop distributed file system (HDFS) – HDFS concepts – Java interface – data flow -Hadoop I/O – data integrity – compression – serialization – Avro – file-based data structures -Cassandra – Hadoop integration.
Unit V
HADOOP RELATED TOOLS
Hbase – data model and implementations – Hbase clients – Hbase examples – praxis. Pig – Grunt – pig data model – Pig Latin – developing and testing Pig Latin scripts. Hive – data types and file formats – HiveQL data definition – HiveQL data manipulation – HiveQL queries.
Course Outcomes:
After the completion of this course, students will be able to:
- Describe big data and use cases from selected business domains.
- Explain NoSQL big data management.
- Install, configure, and run Hadoop and HDFS.
- Perform map-reduce analytics using Hadoop.
- Use Hadoop-related tools such as HBase, Cassandra, Pig, and Hive for big data analytics.
List of Experiments:
Download the iStudy App for all syllabus and other updates.

Text Books:
- Michael Minelli, Michelle Chambers, and AmbigaDhiraj, “Big Data, BigAnalytics: Emerging Business Intelligence and Analytic Trends for Today’s Businesses”, Wiley, 2013.
- Eric Sammer, “Hadoop Operations”, O’Reilley, 2012.
- Sadalage, Pramod J. “NoSQL distilled”, 2013
Reference Books:
- E. Capriolo, D. Wampler, and J. Rutherglen, “Programming Hive”, O’Reilley, 2012.
- Lars George, “HBase: The Definitive Guide”, O’Reilley, 2011.
- Eben Hewitt, “Cassandra: The Definitive Guide”, O’Reilley, 2010.
- Alan Gates, “Programming Pig”, O’Reilley, 2011.
For detailed syllabus of all the other subjects of Computer Science & Design 5th Sem, visit CSD 5th Sem subject syllabuses for 2021 regulation.
For all Computer Science & Design results, visit Anna University CSD all semester results direct link.