The use of an intelligent forum crawler for data retrieval from e-learning portals
Miloš Pavković and Jelica Protić, University of Belgrade, School of Electrical Engineering, Belgrade, Serbia
6th International Conference on Education and New Learning Technologies, Barcelona, 7th - 9th of July 2014



Introduction
- A large number of forums covering different topics
- Forums are often used by students during their studies
- A large amount of relevant information is scattered across different forums inside one university domain
- Forums are based on different technologies


Issues
- The same topic can appear across different forums inside one university domain
- Official school forums vs. independent department forums
- The same documents can be uploaded as post attachments to several different web forums
- Similar courses at different schools


Solution: Specialized crawler
- Specialized forum crawler
- Aggregation of crawled data from multiple forums of a single university domain
- Storing data into a database
- Forum modules that use this database to help students
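The aggregation and storage steps on this slide could be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the table layout, the function names `open_store` and `record_attachment`, and the idea of keying attachments by a SHA-256 content hash are all assumptions introduced here to show how one database can collect data from several forums of one domain.

```python
import hashlib
import sqlite3


def open_store(path=":memory:"):
    """Open (or create) a small SQLite store for crawled attachments.

    The schema is hypothetical: one row per distinct file, plus one row
    per place (forum and post) where that file was seen.
    """
    conn = sqlite3.connect(path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS attachments ("
        " sha256 TEXT PRIMARY KEY,"
        " filename TEXT NOT NULL)"
    )
    conn.execute(
        "CREATE TABLE IF NOT EXISTS sightings ("
        " sha256 TEXT NOT NULL REFERENCES attachments(sha256),"
        " forum TEXT NOT NULL,"
        " post_url TEXT NOT NULL,"
        " UNIQUE (sha256, post_url))"
    )
    return conn


def record_attachment(conn, forum, post_url, filename, content):
    """Store an attachment once per distinct content, and note where it appeared."""
    digest = hashlib.sha256(content).hexdigest()
    # INSERT OR IGNORE keeps the first filename seen for a given content hash.
    conn.execute(
        "INSERT OR IGNORE INTO attachments (sha256, filename) VALUES (?, ?)",
        (digest, filename),
    )
    conn.execute(
        "INSERT OR IGNORE INTO sightings (sha256, forum, post_url) VALUES (?, ?, ?)",
        (digest, forum, post_url),
    )
    conn.commit()
    return digest
```

With a layout like this, a forum module could look up every post in the domain where a given document was attached by querying `sightings` for its hash.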

Forum structure
- Always defined by the presented implicit paths


Example of (a) a forum, (b) a thread, and (c) attachments inside a post.

Crawler algorithm
- FCbRE: Forum Crawler based on Regular Expressions
- Automated system
- Identifying DOM structure and basic forum elements with regular expressions
- Identifying forum implicit paths using regex
- Example: >>index\.php\?showforum\==\digit+!>+>\P=!
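As an illustration of identifying implicit paths with regular expressions, the sketch below classifies links found in a page as forum, thread, or attachment links. It is not the FCbRE implementation. Only the `index.php?showforum=` fragment comes from the slide; the thread and attachment patterns, and the names `PATH_PATTERNS`, `extract_links` and `classify_link`, are hypothetical and would differ for each forum technology.

```python
import re

# Hypothetical URL patterns. Only "index.php?showforum=" appears in the
# slides; the thread and attachment patterns are assumptions.
PATH_PATTERNS = {
    "forum": re.compile(r"index\.php\?showforum=(\d+)"),
    "thread": re.compile(r"index\.php\?showtopic=(\d+)"),
    "attachment": re.compile(r"attachment\.php\?id=(\d+)"),
}


def extract_links(html):
    """Pull href values out of anchor tags using a regular expression."""
    return re.findall(r'<a\s[^>]*href="([^"]+)"', html, flags=re.IGNORECASE)


def classify_link(url):
    """Return (kind, numeric id) for the first matching pattern, else None."""
    for kind, pattern in PATH_PATTERNS.items():
        match = pattern.search(url)
        if match:
            return kind, int(match.group(1))
    return None


if __name__ == "__main__":
    page = (
        '<a href="index.php?showforum=12">Algorithms</a>'
        '<a href="index.php?showtopic=345">Exam questions</a>'
        '<a href="about.html">About</a>'
    )
    for link in extract_links(page):
        print(link, classify_link(link))
```

A crawler built this way would follow forum links to reach threads and thread links to reach posts and their attachments, ignoring links that match none of the patterns.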