IT478
Course Name:
Data Mining (IT478)
Programme:
Category:
Credits (L-T-P):
Content:
Introduction to data mining: Motivation and significance of data mining, Data mining on what kind of data? , data mining functionalities, interestingness measures, classification of data mining system, major issues in data mining; Data pre-processing: Need, data summarization, data cleaning, data integration and transformation, Attribute subset selection methods: filter based and wrapper based methods, Information gain based, correlation based, Minimum redundancy maximum relevance based methods, data discretization and concept hierarchy generalization. Data warehouse and OLAP technology: multidimensional data model(s), data warehouse architecture, OLAP server types, data warehouse implementation, on-line analytical processing and mining; Data cube computation and data generalization. Mining frequent patterns, associations and correlations: Basic concepts, efficient and scalable frequent itemset mining algorithms: A-priori and FP Tree methods, mining various kinds of association rules – multilevel and multidimensional, association rule mining versus correlation analysis, constraint based association mining; Colossal item set Mining: Enumeration methods, Dynamic switching method, parallel method, sequential pattern mining; Bayesian classification, associative classification, lazy learners, grid based and density based clustering methods, Clustering high dimensional data; Data mining on complex data and applications: Algorithms for mining multimedia data, text data, multimodal data, biological sequence data; Data mining applications and trends in data mining.