Big Data Analytics detailed syllabus for Artificial Intelligence & Data Science (AI&DS) for 2021 regulation curriculum has been taken from the Anna Universities official website and presented for the AI&DS students. For course code, course name, number of credits for a course and other scheme related information, do visit full semester subjects post given below.
For Artificial Intelligence & Data Science 5th Sem scheme and its subjects, do visit AI&DS 5th Sem 2021 regulation scheme. For Professional Elective-I scheme and its subjects refer to AI&DS Professional Elective-I syllabus scheme. The detailed syllabus of big data analytics is as follows.
Course Objectives:
Download the iStudy App for all syllabus and other updates.

Unit I
UNDERSTANDING BIG DATA
Introduction to big data – convergence of key trends – unstructured data – industry examples of big data – web analytics – big data applications- big data technologies – introduction to Hadoop -open source technologies – cloud and big data – mobile business intelligence – Crowd sourcing analytics – inter and trans firewall analytics.
Unit II
NOSQL DATA MANAGEMENT
Introduction to NoSQL – aggregate data models – key-value and document data models -relationships – graph databases – schemaless databases – materialized views – distribution models – master-slave replication – consistency – Cassandra – Cassandra data model -Cassandra examples – Cassandra clients
Unit III
Download the iStudy App for all syllabus and other updates.

Unit IV
BASICS OF HADOOP
Data format – analyzing data with Hadoop – scaling out – Hadoop streaming – Hadoop pipes -design of Hadoop distributed file system (HDFS) – HDFS concepts – Java interface – data flow -Hadoop I/O – data integrity – compression – serialization – Avro – file-based data structures -Cassandra – Hadoop integration.
Unit V
HADOOP RELATED TOOLS
Hbase – data model and implementations – Hbase clients – Hbase examples – praxis.
Pig – Grunt – pig data model – Pig Latin – developing and testing Pig Latin scripts.
Hive – data types and file formats – HiveQL data definition – HiveQL data manipulation – HiveQL queries.
Course Outcomes:
After the completion of this course, students will be able to:
- Describe big data and use cases from selected business domains.
- Explain NoSQL big data management.
- Install, configure, and run Hadoop and HDFS.
- Perform map-reduce analytics using Hadoop.
- Use Hadoop-related tools such as HBase, Cassandra, Pig, and Hive for big data analytics.
List of Experiments:
Download the iStudy App for all syllabus and other updates.

Text Books:
- Michael Minelli, Michelle Chambers, and AmbigaDhiraj, “Big Data, Big Analytics: Emerging Business Intelligence and Analytic Trends for Today’s Businesses”, Wiley, 2013.
- Eric Sammer, “Hadoop Operations”, O’Reilley, 2012.
- Sadalage, Pramod J. “NoSQL distilled”, 2013
Reference Books:
- E. Capriolo, D. Wampler, and J. Rutherglen, “Programming Hive”, O’Reilley, 2012.
- Lars George, “HBase: The Definitive Guide”, O’Reilley, 2011.
- Eben Hewitt, “Cassandra: The Definitive Guide”, O’Reilley, 2010.
- Alan Gates, “Programming Pig”, O’Reilley, 2011.
For detailed syllabus of all the other subjects of Artificial Intelligence & Data Science 5th Sem, visit AI&DS 5th Sem subject syllabuses for 2021 regulation.
For all Artificial Intelligence & Data Science results, visit Anna University AI&DS all semester results direct link.