Diploma Website   Student Corner   Pay Fees   NIRF

Data Mining and Business Intelligence (2170715)

Teaching Scheme (in Hours)

Theory Tutorial Practical Total
3 0 2 5

Subject Credit :  5

Examination Scheme (in marks)

PA (M)
ESE Viva (V)
PA (I)
70 30 30 20 150

Syllabus Content    Download

Unit-1:  Overview and concepts Data Warehousing and Business Intelligence

Why reporting and Analysing data, Raw data to valuable informationLifecycle of Data - What is Business Intelligence - BI and DW in today’s perspective - What is data warehousing - The building Blocks: Defining Features - Data warehouses and data 1marts - Overview of the components - Metadata in the data warehouse - Need for data warehousing - Basic elements of data warehousing - trends in data warehousing.

Unit-2:  The Architecture of BI and DW

BI and DW architectures and its types - Relation between BI and DW - OLAP (Online analytical processing) definitions - Difference between OLAP and OLTP - Dimensional analysis - What are cubes? Drill-down and roll-up - slice and dice or rotation - OLAP models - ROLAP versus MOLAP - defining schemas: Stars, snowflakes and fact constellations

Unit-3:  Introduction to data mining (DM)

Motivation for Data Mining - Data Mining-Definition and Functionalities – Classification of DM Systems - DM task primitives - Integration of a Data Mining system with a Database or a Data Warehouse - Issues in DM – KDD Process

Unit-4:  Data Pre-processing

Why to pre-process data? - Data cleaning: Missing Values, Noisy Data - Data Integration and transformation - Data Reduction: Data cube aggregation, Dimensionality reduction - Data Compression - Numerosity Reduction - Data Mining Primitives - Languages and System Architectures: Task relevant data - Kind of Knowledge to be mined - Discretization and Concept Hierarchy

Unit-5:  Concept Description and Association Rule Mining

What is concept description? - Data Generalization and summarization-based characterization - Attribute relevance - class comparisons Association Rule Mining: Market basket analysis - basic concepts - Finding frequent item sets: Apriori algorithm - generating rules – Improved Apriori algorithm – Incremental ARM – Associative Classification – Rule Mining

Unit-6:  Classification and Prediction

What is classification and prediction? – Issues regarding Classification and prediction: Classification methods: Decision tree, Bayesian Classification, Rule based, CART, Neural Network Prediction methods: Linear and nonlinear regression, Logistic Regression Introduction of tools such as DB Miner /WEKA/DTREG DM Tools

Unit-7:  Data Mining for Business Intelligence Applications

Data mining for business Applications like Balanced Scorecard, Fraud Detection, Clickstream Mining, Market Segmentation, retail industry, telecommunications industry, banking & finance and CRM etc., Data Analytics Life Cycle: Introduction to Big data Business Analytics - State of the practice in analytics role of data scientists Key roles for successful analytic project - Main phases of life cycle - Developing core deliverables for stakeholders.

Unit-8:  Advance topics

Introduction and basic concepts of following topics. Clustering, Spatial mining, web mining, text mining, Big Data: Introduction to big data: distributed file system – Big Data and its importance, Four Vs, Drivers for Big data, Big data analytics, Big data applications. Algorithms using map reduce, Matrix-Vector Multiplication by Map Reduce. Introduction to Hadoop architecture: Hadoop Architecture, Hadoop Storage: HDFS, Common Hadoop Shell commands , Anatomy of File Write and Read., NameNode, Secondary NameNode, and DataNode, Hadoop MapReduce paradigm, Map and Reduce tasks, Job, Task trackers - Cluster Setup – SSH & Hadoop Configuration – HDFS Administering – Monitoring & Maintenance.

Reference Books

Sr. Title Author Publication Amazon Link
1 Data Mining Concepts and Techniques J. Han, M. Kamber Morgan Kaufmann
2 Data mining: Concepts, models, methods and algorithms M. Kantardzic John Wiley &Sons Inc.
3 Data Warehousing Fundamentals Paulraj Ponnian John Willey
4 Data Mining: Introductory and Advanced Topics M. Dunham Pearson Education
5 Data Mining for Business Intelligence:Concepts, Techniques, and Applications in Microsoft Office G. Shmueli, N.R. Patel, P.C. Bruce Wiley India

About Us

Darshan Institute of Engineering & Technology is a leading institute offering undergraduate (B.E.), postgraduate (M.E.) and Diploma programs in engineering.

Our Contacts

At Hadala, Rajkot - Morbi Highway,
Gujarat-363650, INDIA

(+91) 97277 47310
(+91) 97277 47311

More contact details