IT368

Course Name: 

Data Analytics

Programme: 

B.Tech (IT)

Semester: 

Sixth

Category: 

Programme Specific Electives (PSE)

Credits (L-T-P): 

3-0-0

Content: 

Introduction to Data Analysis: statistical modeling, total information awareness, Bonferroni's Principle; Distributed File Systems: MapReduce and Spark; Dimensionality Reduction: PCA, SVD, IGA; Finding Similar Items: distance measures, near neighbour search, similarity-based search algorithms; Mining Data Streams: stream data model, sampling, filtering, counting distinct elements; Link Analysis: Hubs and Authorities, PageRank, Personalized PageRank; Recommendation Systems: content-based, collaborative filtering, hybrid models, evaluating recommender systems; Mining Social-Network Graphs: graph centrality concepts, clustering, community detection, partitioning, overlapping community detection, SimRank; Applications of Large-Scale Machine Learning; Current trends and research.

References: 

Jure Leskovec, Anand Rajaraman and Jeffrey David Ullman, "Mining of Massive Datasets", Cambridge University Press, 2014.
Gareth James, Daniela Witten, Trevor Hastie and Robert Tibshirani, "An Introduction to Statistical Learning with Applications in R", Springer, 2013.
Nina Zumel and John Mount, "Practical Data Science with R", Manning Publishers, 2014.

Department: 

Information Technology
 

Contact us

G. Ram Mohana Reddy

Professor and Head,
Department of Information Technology, NITK, Surathkal,
P. O. Srinivasnagar, Mangalore - 575 025
Karnataka, India.
Ph.: +91-824-2474056
Email: infotech[AT]nitk[DOT]ac[DOT]in, infotech[AT]nitk[DOT]edu[DOT]in
