Upload
mbruemmer
View
109
Download
0
Tags:
Embed Size (px)
DESCRIPTION
Heiko Ehrig (Neofonie) introduced the company shortly. They shifted from developing search engines web and mobile application development and consulting, including interaction design, testing and data analytics. Neofonie developed a German text mining API that performs classification, keyword detection, entity detection, date detection, NER, and quotes (API key http://bit.ly/txtwerk). From their experience with NLP and linked data they point to the examination of the following issues: -extension of entity types -building more individual customer lexica and sentiment detection. -broaden LD and NLP for more languages than English. -development of a gold standard of German (N)ER. - discussion of standardized text mining API. -support of open data and open licenses.
Citation preview
Resources! Resources! Resources!
Heiko Ehrig (Head of Research)
2
Berlin, 1998, 1st german search engine, 180 pl, 2 companys
3
What we offer
3
4
✱ 12 Computer Scientists, Linguists, Mathematicians
✱ Text Mining and Analytics, Search ✱ Text Classification ✱ Named Entities and Concept Tagging ✱ Topic Detection and Tracking ✱ Sentiment Analysis (Customer‘s Voice)
✱ Data Analytics & Consulting
✱ Individual Projects
5
Research Department
✱ Works On German Texts ✱ Department Classification ✱ Keyword Detection ✱ Dates Detection ✱ Entity Detection (person, location, organisation) ✱ Concepts with Links to Freebase ✱ Named Entities with Links to Freebase ✱ Quotes
✱ Get Your API Key : http://bit.ly/txtwerk
6
txt werk - a Textmining API
✱ German Resources are rare!
✱ Example Named Entity Linking ✱ We did not find a Gold Standard ✱ Manual Labeling
✱ ERD Challenge 2014 (SIGIR'14 workshop) ✱ Googlers manually reannotated some hundred texts
from ClueWeb (data set not public)
7
Resources! Resources! Resources!
✱ More Entity Types (companies, products, brands)
✱ Individual Customer Lexica
✱ Sentiment Detection
✱ English and more languages
8
Roadmap
✱ Share your resources!
✱ Corporate-friendly licensing!
✱ If you leave the academia, share your resources!
✱ Lobby for resources @EC!
✱ Lobby for maintaining resources servers (like meta-share, datahub.io)
✱ Don‘t forget the Non-English Speaking World!
9
Wishes to the community