M.Tech, Syllabus

JNTUH M.Tech 2017-2018 (R17) Detailed Syllabus Web Mining

Web Mining Detailed Syllabus for Web Technology M.Tech first year first sem is covered here. This gives the details about credits, number of hours and other details along with reference books for the course.

The detailed syllabus for Web Mining M.Tech 2017-2018 (R17) first year first sem is as follows.

M.Tech. I Year I Sem.

Course Objectives:

  • To describe web mining and understand the need for web mining
  • To differentiate between Web mining and data mining
  • To understand the different application areas for web mining
  • To understand the different methods to introduce structure to web-based data
  • To describe Web mining, its objectives, and its benefits
  • To understand the methods of Web usage mining

UNIT – I : Introduction to Web Data Mining and Data Mining Foundations, Introduction – World Wide Web (WWW), A Brief History of the Web and the Internet, Web Data Mining-Data Mining, Web Mining. Data Mining Foundations – Association Rules and Sequential Patterns – Basic Concepts of Association Rules, Apriori Algorithm- Frequent Itemset Generation, Association Rule Generation, Data Formats for Association Rule Mining, Mining with multiple minimum supports – Extended Model, Mining Algorithm, Rule Generation, Mining Class Association Rules, Basic Concepts of Sequential Patterns, Mining Sequential Patterns on GSP, Mining Sequential Patterns on PrefixSpan, Generating Rules from Sequential Patterns.

UNIT – II : Supervised and Unsupervised Learning Supervised Learning – Basic Concepts, Decision Tree Induction – Learning Algorithm, Impurity  Function, Handling of Continuous Attributes, Classifier Evaluation, Rule Induction – Sequential Covering, Rule Learning, Classification Based on Associations, Naïve Bayesian Classification , Naïve Bayesian Text Classification – Probabilistic Framework, Naïve Bayesian Model . Unsupervised Learning – Basic Concepts , K-means Clustering – K-means Algorithm, Representation of Clusters, Hierarchical Clustering – Single link method, Complete link Method, Average link method, Strength and Weakness.

UNIT – III : Information Retrieval and Web Search: Basic Concepts of Information Retrieval, Information Retrieval Methods – Boolean Model, Vector Space Model and Statistical Language Model, Relevance Feedback, Evaluation Measures, Text and Web Page Preprocessing – Stopword Removal, Stemming, Web Page Preprocessing, Duplicate
Detection, Inverted Index and Its Compression – Inverted Index, Search using Inverted Index, Index Construction, Index Compression, Latent Semantic Indexing – Singular Value Decomposition, Query and Retrieval, Web Search, Meta Search, Web Spamming.

UNIT – IV : Link Analysis and Web Crawling: Link Analysis – Social Network Analysis, Co-Citation and Bibliographic Coupling, Page Rank Algorithm, HITS Algorithm, Community Discovery-Problem Definition, Bipartite Core Communities, Maximum Flow Communities, Email Communities. Web Crawling – A Basic Crawler Algorithm- Breadth First Crawlers, Preferential Crawlers, Implementation Issues – Fetching, Parsing, Stopword Removal, Link Extraction, Spider Traps, Page Repository, Universal Crawlers, Focused Crawlers, Topical Crawlers, Crawler Ethics and Conflicts.

UNIT – V:  Opinion Mining and Web Usage Mining Opinion Mining – Sentiment Classification – Classification based on Sentiment Phrases, Classification Using Text Classification Methods, Feature based Opinion Mining and Summarization – Problem Definition, Object feature extraction, Feature Extraction from Pros and Cons of Format1, Feature Extraction from Reviews of Format 2 and 3, Comparative Sentence and Relation Mining, Opinion Search and Opinion Spam. Web Usage Mining – Data Collection and Preprocessing- Sources and Types of Data, Key Elements of Web usage Data Preprocessing, Data Modeling for Web Usage Mining, Discovery and Analysis of Web usage Patterns -Session and Visitor Analysis, Cluster Analysis and Visitor Segmentation, Association and Correlation Analysis, Analysis of Sequential and Navigation Patterns.

TEXT BOOK:

  • Web Data Mining: Exploring Hyperlinks, Contents, and Usage Data by Bing Liu (Springer Publications)

REFERENCES BOOKS:

  • Data Mining: Concepts and Techniques, Second Edition Jiawei Han, Micheline Kamber (Elsevier Publications)
  • Web Mining:: Applications and Techniques by Anthony Scime
  • Mining the Web: Discovering Knowledge from Hypertext Data by Soumen Chakrabarti

For all other M.Tech 1st Year 1st Sem syllabus go to JNTUH M.Tech Web Technology 1st Year 1st Sem Course Structure for (R17) Batch.

All details and yearly new syllabus will be updated here time to time. Subscribe, like us on facebook and follow us on google plus for all updates.

Do share with friends and in case of questions please feel free drop a comment.

Leave a Reply

Your email address will not be published. Required fields are marked *

*

This site uses Akismet to reduce spam. Learn how your comment data is processed.