14
Intelligent Data Mining to Verify IKM Curriculum Patrick Knowlan Mark Ostrander Chris Jackson Rob Katich ISAT/CS 344 – Intelligent Systems

Intelligent Data Mining to Verify IKM Curriculum Patrick Knowlan Mark Ostrander Chris Jackson Rob Katich ISAT/CS 344 – Intelligent Systems

Embed Size (px)

Citation preview

Page 1: Intelligent Data Mining to Verify IKM Curriculum Patrick Knowlan Mark Ostrander Chris Jackson Rob Katich ISAT/CS 344 – Intelligent Systems

Intelligent Data Mining to Verify IKM Curriculum

Patrick Knowlan

Mark Ostrander

Chris Jackson

Rob Katich

ISAT/CS 344 – Intelligent Systems

Page 2: Intelligent Data Mining to Verify IKM Curriculum Patrick Knowlan Mark Ostrander Chris Jackson Rob Katich ISAT/CS 344 – Intelligent Systems

Introduction• What is the current IKM curriculum?

• Current technical market• Constantly changing, Recruit-A-Duke job postings

• IS to search job criteria• Derive high-quality information from text.• Appearance and frequency of key words

• Reference with technical/functional descriptions to reevaluate IKM course

Page 3: Intelligent Data Mining to Verify IKM Curriculum Patrick Knowlan Mark Ostrander Chris Jackson Rob Katich ISAT/CS 344 – Intelligent Systems

Background/ Purpose• Text Mining using R

• Search technology

• Compare and contrasts short text strings

• Define a relationship

• Looks for frequent key words

• Adjust IKM curriculum

Page 4: Intelligent Data Mining to Verify IKM Curriculum Patrick Knowlan Mark Ostrander Chris Jackson Rob Katich ISAT/CS 344 – Intelligent Systems

Size and Scope• Mine data to make useful for faculty and students of the

ISAT department

• Future curriculums for the IKM concentration

• Possibly other curriculums

Page 5: Intelligent Data Mining to Verify IKM Curriculum Patrick Knowlan Mark Ostrander Chris Jackson Rob Katich ISAT/CS 344 – Intelligent Systems

Specifications• The R Project – statistical computing

• Multiple packages• tm library

• NLP• Depends on eight additional packages for functionality

• Text mining to analyze recruitment data

Page 6: Intelligent Data Mining to Verify IKM Curriculum Patrick Knowlan Mark Ostrander Chris Jackson Rob Katich ISAT/CS 344 – Intelligent Systems

Methodology• Text mining using R

• Create readable file (.csv MS-DOS)

• Create a corpus

• Format and filter text

• Search frequency of key words• Create dictionaries of appropriate terms

• Compare and contrast searches

• Make suggestion based on results

Page 7: Intelligent Data Mining to Verify IKM Curriculum Patrick Knowlan Mark Ostrander Chris Jackson Rob Katich ISAT/CS 344 – Intelligent Systems

Discoveries• Capabilities

• Reduces time• Organization & formatting• Frequency analysis• Term grouping / association

• Limitations• Program compatibility

• Mac vs. PC

• Search capabilities• “C++” “C#”

Page 8: Intelligent Data Mining to Verify IKM Curriculum Patrick Knowlan Mark Ostrander Chris Jackson Rob Katich ISAT/CS 344 – Intelligent Systems

Demonstration

IKM Curriculum

Page 9: Intelligent Data Mining to Verify IKM Curriculum Patrick Knowlan Mark Ostrander Chris Jackson Rob Katich ISAT/CS 344 – Intelligent Systems

Results

0

10

20

30

40

50

60

70

80

ISAT Matching Curriculum

dbskillsmsofficeooprogvisualbasicwebdev

Term

Fre

qu

ency

Page 10: Intelligent Data Mining to Verify IKM Curriculum Patrick Knowlan Mark Ostrander Chris Jackson Rob Katich ISAT/CS 344 – Intelligent Systems

Results

0

10

20

30

40

50

60

Non ISAT Curriculum

androidERPiosjavajquerymysqloracleperlphppythonrorrubysharepointunix

Term

Fre

qu

ency

Page 11: Intelligent Data Mining to Verify IKM Curriculum Patrick Knowlan Mark Ostrander Chris Jackson Rob Katich ISAT/CS 344 – Intelligent Systems

Results

ISAT Matching Cirriculum

dbskills

msoffice

ooprog

visualbasic

webdev

Not Matched

Page 12: Intelligent Data Mining to Verify IKM Curriculum Patrick Knowlan Mark Ostrander Chris Jackson Rob Katich ISAT/CS 344 – Intelligent Systems

Results

Non ISAT Curriculum

android

ERP

ios

java

jquery

mysql

oracle

perl

php

python

ror

ruby

sharepoint

unix

Not Matched

Page 13: Intelligent Data Mining to Verify IKM Curriculum Patrick Knowlan Mark Ostrander Chris Jackson Rob Katich ISAT/CS 344 – Intelligent Systems

Conclusions• Summary

• VB is OUT

• JAVA is IN

• Stress web technology

• Web application development

• Database skills

• Oracle

• Microsoft Office necessity

• Unix systems

• Business technology classes

Page 14: Intelligent Data Mining to Verify IKM Curriculum Patrick Knowlan Mark Ostrander Chris Jackson Rob Katich ISAT/CS 344 – Intelligent Systems

Questions?