The Most Generally Used Techniques In The Data
Mining Are:
Predictive modeling: It is used when the goal is to estimate the value of a
particular target attribute and there exist sample training data for which
values of that attribute are known. An example is a classification, which takes
a set of data already divided into predefined groups and searches for patterns
in the data that differentiate those groups.
Decision Trees: Tree-shaped systems that represent sets of
selections. These decisions generate regulations for the class of a dataset.
The unique selection tree techniques include Classification and Regression
Trees (CART) and Chi-square automatic interaction Detection (CHAID).
Rule induction: The extraction and the usage of if-then
guidelines from the data primarily based on statistical importance.
Nearest neighbor method: A way that classifies every report in a
dataset based on the combination of classes of the k file(s) most similar to it
in an ancient dataset (where k ³ 1). It is rather known as the k-nearest
neighbor method
Descriptive modeling: This is also known as clustering, it also
divides data into groups. With clustering, however, the exact groups are not
known in advance but the patterns that are being discovered by analyzing the
data are used to determine the different groups
Pattern mining: This mostly concentrates on identifying the
rules that are described by a specific pattern within the data. Market-basket
analysis, which identifies the items that typically occur together in
a purchase transaction, was one of the first application of the data mining.
Synthetic Neural
Networks: Non-linear predictive
replica that is learned through immense training and resemble organically
neural networks in structure.
Genetic algorithms: Optimization strategies that use strategies
which include genetic combination, mutation, and herbal selection in a design
which are totally based on the ideas of evolution.
No comments:
Post a Comment