22
Cleveland State University Dhruv R Patel Lab Assignment 4 CIS 660 CSU ID : 2652790 Designing and Building a Data Mining Model with Decision Tree Algorithm 1) Creating Data Source View :

Designing and Building a Data Mining Model with Decision ...eecs.csuohio.edu/.../CIS660_LAB4MicroSoftDTDhruv.pdf · Dhruv R Patel Lab As signment 4 CIS 660 CSU ID : 2652790 Compare

  • Upload
    others

  • View
    5

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Designing and Building a Data Mining Model with Decision ...eecs.csuohio.edu/.../CIS660_LAB4MicroSoftDTDhruv.pdf · Dhruv R Patel Lab As signment 4 CIS 660 CSU ID : 2652790 Compare

Cleveland State University Dhruv R Patel Lab Assignment 4 CIS 660

CSU ID : 2652790

Designing and Building a Data Mining Model with Decision Tree

Algorithm

1) Creating Data Source View :

Page 2: Designing and Building a Data Mining Model with Decision ...eecs.csuohio.edu/.../CIS660_LAB4MicroSoftDTDhruv.pdf · Dhruv R Patel Lab As signment 4 CIS 660 CSU ID : 2652790 Compare

Cleveland State University Dhruv R Patel Lab Assignment 4 CIS 660

CSU ID : 2652790

Page 3: Designing and Building a Data Mining Model with Decision ...eecs.csuohio.edu/.../CIS660_LAB4MicroSoftDTDhruv.pdf · Dhruv R Patel Lab As signment 4 CIS 660 CSU ID : 2652790 Compare

Cleveland State University Dhruv R Patel Lab Assignment 4 CIS 660

CSU ID : 2652790

2) Creating and deploying Mining Structure :

Page 4: Designing and Building a Data Mining Model with Decision ...eecs.csuohio.edu/.../CIS660_LAB4MicroSoftDTDhruv.pdf · Dhruv R Patel Lab As signment 4 CIS 660 CSU ID : 2652790 Compare

Cleveland State University Dhruv R Patel Lab Assignment 4 CIS 660

CSU ID : 2652790

Page 5: Designing and Building a Data Mining Model with Decision ...eecs.csuohio.edu/.../CIS660_LAB4MicroSoftDTDhruv.pdf · Dhruv R Patel Lab As signment 4 CIS 660 CSU ID : 2652790 Compare

Cleveland State University Dhruv R Patel Lab Assignment 4 CIS 660

CSU ID : 2652790

Page 6: Designing and Building a Data Mining Model with Decision ...eecs.csuohio.edu/.../CIS660_LAB4MicroSoftDTDhruv.pdf · Dhruv R Patel Lab As signment 4 CIS 660 CSU ID : 2652790 Compare

Cleveland State University Dhruv R Patel Lab Assignment 4 CIS 660

CSU ID : 2652790

Page 7: Designing and Building a Data Mining Model with Decision ...eecs.csuohio.edu/.../CIS660_LAB4MicroSoftDTDhruv.pdf · Dhruv R Patel Lab As signment 4 CIS 660 CSU ID : 2652790 Compare

Cleveland State University Dhruv R Patel Lab Assignment 4 CIS 660

CSU ID : 2652790

Page 8: Designing and Building a Data Mining Model with Decision ...eecs.csuohio.edu/.../CIS660_LAB4MicroSoftDTDhruv.pdf · Dhruv R Patel Lab As signment 4 CIS 660 CSU ID : 2652790 Compare

Cleveland State University Dhruv R Patel Lab Assignment 4 CIS 660

CSU ID : 2652790

Page 9: Designing and Building a Data Mining Model with Decision ...eecs.csuohio.edu/.../CIS660_LAB4MicroSoftDTDhruv.pdf · Dhruv R Patel Lab As signment 4 CIS 660 CSU ID : 2652790 Compare

Cleveland State University Dhruv R Patel Lab Assignment 4 CIS 660

CSU ID : 2652790

Page 10: Designing and Building a Data Mining Model with Decision ...eecs.csuohio.edu/.../CIS660_LAB4MicroSoftDTDhruv.pdf · Dhruv R Patel Lab As signment 4 CIS 660 CSU ID : 2652790 Compare

Cleveland State University Dhruv R Patel Lab Assignment 4 CIS 660

CSU ID : 2652790

· Default Algorithm Parameters :

COMPLEXITY_PENALTY: 0

MINIMUM_SUPPORT: 10

SPLIT_METHOD: 3

Page 11: Designing and Building a Data Mining Model with Decision ...eecs.csuohio.edu/.../CIS660_LAB4MicroSoftDTDhruv.pdf · Dhruv R Patel Lab As signment 4 CIS 660 CSU ID : 2652790 Compare

Cleveland State University Dhruv R Patel Lab Assignment 4 CIS 660

CSU ID : 2652790

Here we got bellow decision tree.

Page 12: Designing and Building a Data Mining Model with Decision ...eecs.csuohio.edu/.../CIS660_LAB4MicroSoftDTDhruv.pdf · Dhruv R Patel Lab As signment 4 CIS 660 CSU ID : 2652790 Compare

Cleveland State University Dhruv R Patel Lab Assignment 4 CIS 660

CSU ID : 2652790

· Update Algorithm Parameters :

COMPLEXITY_PENALTY: 0.5

MINIMUM_SUPPORT: 10

SPLIT_METHOD: 2

Page 13: Designing and Building a Data Mining Model with Decision ...eecs.csuohio.edu/.../CIS660_LAB4MicroSoftDTDhruv.pdf · Dhruv R Patel Lab As signment 4 CIS 660 CSU ID : 2652790 Compare

Cleveland State University Dhruv R Patel Lab Assignment 4 CIS 660

CSU ID : 2652790

Compare to default tree, here total levels of the tree were reduced to 8 levels.

Page 14: Designing and Building a Data Mining Model with Decision ...eecs.csuohio.edu/.../CIS660_LAB4MicroSoftDTDhruv.pdf · Dhruv R Patel Lab As signment 4 CIS 660 CSU ID : 2652790 Compare

Cleveland State University Dhruv R Patel Lab Assignment 4 CIS 660

CSU ID : 2652790

Page 15: Designing and Building a Data Mining Model with Decision ...eecs.csuohio.edu/.../CIS660_LAB4MicroSoftDTDhruv.pdf · Dhruv R Patel Lab As signment 4 CIS 660 CSU ID : 2652790 Compare

Cleveland State University Dhruv R Patel Lab Assignment 4 CIS 660

CSU ID : 2652790

· Update Algorithm Parameters :

COMPLEXITY_PENALTY: 0.9

MINIMUM_SUPPORT: 10

SPLIT_METHOD: 2

Page 16: Designing and Building a Data Mining Model with Decision ...eecs.csuohio.edu/.../CIS660_LAB4MicroSoftDTDhruv.pdf · Dhruv R Patel Lab As signment 4 CIS 660 CSU ID : 2652790 Compare

Cleveland State University Dhruv R Patel Lab Assignment 4 CIS 660

CSU ID : 2652790

Compare to default tree, here total levels of the tree were reduced to 5 levels but not able to

predict properly up to last level.

Page 17: Designing and Building a Data Mining Model with Decision ...eecs.csuohio.edu/.../CIS660_LAB4MicroSoftDTDhruv.pdf · Dhruv R Patel Lab As signment 4 CIS 660 CSU ID : 2652790 Compare

Cleveland State University Dhruv R Patel Lab Assignment 4 CIS 660

CSU ID : 2652790

· Update Algorithm Parameters :

COMPLEXITY_PENALTY: 0.5

MINIMUM_SUPPORT: 50

SPLIT_METHOD: 2

Page 18: Designing and Building a Data Mining Model with Decision ...eecs.csuohio.edu/.../CIS660_LAB4MicroSoftDTDhruv.pdf · Dhruv R Patel Lab As signment 4 CIS 660 CSU ID : 2652790 Compare

Cleveland State University Dhruv R Patel Lab Assignment 4 CIS 660

CSU ID : 2652790

Compare to default tree, here total levels of the tree were reduced to 6 levels. And ability for

prediction of the bike buyers were good.

Page 19: Designing and Building a Data Mining Model with Decision ...eecs.csuohio.edu/.../CIS660_LAB4MicroSoftDTDhruv.pdf · Dhruv R Patel Lab As signment 4 CIS 660 CSU ID : 2652790 Compare

Cleveland State University Dhruv R Patel Lab Assignment 4 CIS 660

CSU ID : 2652790

· Update Algorithm Parameters :

COMPLEXITY_PENALTY: 0.9

MINIMUM_SUPPORT: 50

SPLIT_METHOD: 2

Page 20: Designing and Building a Data Mining Model with Decision ...eecs.csuohio.edu/.../CIS660_LAB4MicroSoftDTDhruv.pdf · Dhruv R Patel Lab As signment 4 CIS 660 CSU ID : 2652790 Compare

Cleveland State University Dhruv R Patel Lab Assignment 4 CIS 660

CSU ID : 2652790

Compare to default tree, here the levels of the tree were reduce to 6 level.

Page 21: Designing and Building a Data Mining Model with Decision ...eecs.csuohio.edu/.../CIS660_LAB4MicroSoftDTDhruv.pdf · Dhruv R Patel Lab As signment 4 CIS 660 CSU ID : 2652790 Compare

Cleveland State University Dhruv R Patel Lab Assignment 4 CIS 660

CSU ID : 2652790

· Update Algorithm Parameters :

COMPLEXITY_PENALTY: 0.9

MINIMUM_SUPPORT: 100

SPLIT_METHOD: 2

Page 22: Designing and Building a Data Mining Model with Decision ...eecs.csuohio.edu/.../CIS660_LAB4MicroSoftDTDhruv.pdf · Dhruv R Patel Lab As signment 4 CIS 660 CSU ID : 2652790 Compare

Cleveland State University Dhruv R Patel Lab Assignment 4 CIS 660

CSU ID : 2652790

Compare to default tree, here total levels of the tree were reduced to 4 levels but not able to

predict properly. Here because of the large minimum support some important data for the

decision were missing.

3) Conclusion: By comparing all above trees of different parameters, below parameters

were best as the MINIMUM_SUPPORT was not too small and too large, so it not

exclude high priority data and not include low priority data.

COMPLEXITY_PENALTY: 0.5

MINIMUM_SUPPORT: 50

SPLIT_METHOD: 2