What are the Steps Involved in data mining?

Write the steps of the KDD process: Explain them with the help of an example.

Ans. Many people treat data mining as a synonym for another popularly Used term Knowledge Discovery from Data, or KDD. Alternatively, others view data mining as simply essential steps in the process of knowledge discovery. So, steps involved in data mining or in the KDD process are depicted in fig. 1.1 and consist of an iterative sequence of the following steps —

Fig. 1:1 Data Mining as a Step in the Process of Knowledge Discovery

data-minig

(1) Data Cleaning — To remove noise and inconsistent data

(2) Data Integration Where multiple– data sources may be combined.

(3) Data Selection — Where data relevant to the analysis task are retrieved from the database.

(4) Data Transformation — Where data are transformed or consolidated into forms appropriate for mining by performing summary or aggregation operation, for instance

(5) Data Mining– An essential process where intelligent methods are applied in order to extract data patterns.

(6) Pattern Evaluation – To -identify the truly interesting pattern representing knowledge based on some interestingness measure.

(7) Knowledge Presentation – Where visualization and knowledge representation techniques are used to present the mined knowledge to the user.

Steps I to 4 are different forms of data processing where data are prepared for mining. The data mining step may interact with the user or, a knowledge base. The interesting patterns are presented to the user and may be stored as new knowledge in the knowledge base. Note that according to this view, data -mining is only one step in the entire process, albeit an essential one because it uncovers hidden patterns for evaluation.

We agree that data mining is a step in the knowledge discovery process. However, In industry, in media, and in the database research milieu, the term data mining is becoming more popular than the longer-term of, knowledge discovery from data

.

Leave a Reply

%d bloggers like this: