

Applications of Natural Language Processing

Course 1 - 23 February 2012

„Al. I. Cuza” University of Iasi, Romania

Faculty of Computer Science

NLP Systems Evaluation

Information Retrieval

Information Extraction

Question Answering

◦ Introduction

◦ System components

Background knowledge indexing

Index creation and information retrieval

Answer extraction

◦ Results

◦ Error analysis

Conclusions

2

“An important recent development in NLP has been the use of much more rigorous standards for the evaluation of NLP systems”

Manning and Schütze

To be published, all research must:

◦ establish a baseline, and

◦ quantitatively show that it improves on the baseline

3

“How well does the system work?”

Possible domains for evaluation

◦ Processing time of the system

◦ Space usage of the system

◦ Human satisfaction

◦ Correctness of results

Measures: (Accuracy, Error), (Precision, Recall, F-measure)

4

A gold standard can specify what is correct

The results of a system are marked as:

◦ Correct: matches the gold standard

◦ Incorrect: otherwise

5

Accuracy = 66.66 %

Error = 33.33 %
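For concreteness, here is a minimal sketch in plain Python (the labels below are invented, not taken from the slides) in which 4 of the 6 system outputs match the gold standard, giving roughly the accuracy and error quoted above.

```python
def accuracy_and_error(system_output, gold):
    """Accuracy = correct / total; Error = incorrect / total."""
    correct = sum(1 for out, ref in zip(system_output, gold) if out == ref)
    total = len(gold)
    return correct / total, (total - correct) / total

# Invented example: 4 of the 6 outputs match the gold standard.
acc, err = accuracy_and_error(["A", "B", "B", "C", "A", "D"],
                              ["A", "B", "C", "C", "A", "A"])
print(f"Accuracy = {acc:.2%}, Error = {err:.2%}")   # Accuracy = 66.67%, Error = 33.33%
```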

6

Precision and Recall are set-based measures

They evaluate the quality of some set membership, based on a reference set membership

Precision: what proportion of the retrieved documents is relevant?

Recall: what proportion of the relevant documents is retrieved?

7

8

[Diagram: relevant vs. retrieved documents]

Precision = 4 / 10 = 40 %

Recall = 4 / 14 = 28.57 %

9

[Diagram: relevant vs. retrieved documents]

Precision = 14 / 20 = 70 %

Recall = 14 / 14 = 100 %

10

[Diagram: relevant vs. retrieved documents]

Precision = 0 / 6 = 0 %

Recall = 0 / 14 = 0 %
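A minimal sketch of the two set-based measures in plain Python; the document-id sets below are an assumption chosen so that the numbers reproduce the first example above (10 retrieved documents, 4 of them relevant, 14 relevant in total).

```python
def precision_recall(retrieved, relevant):
    """Precision = |retrieved ∩ relevant| / |retrieved|; Recall = |retrieved ∩ relevant| / |relevant|."""
    hits = len(retrieved & relevant)
    precision = hits / len(retrieved) if retrieved else 0.0
    recall = hits / len(relevant) if relevant else 0.0
    return precision, recall

retrieved = set(range(1, 11))    # documents 1..10 were retrieved
relevant = set(range(7, 21))     # documents 7..20 are the 14 relevant ones
p, r = precision_recall(retrieved, relevant)
print(f"Precision = {p:.2%}, Recall = {r:.2%}")   # Precision = 40.00%, Recall = 28.57%
```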

F-measure is a measure of a test's accuracy, and it considers both the precision p and the recall r

General formula: Fβ = (1 + β²) · p · r / (β² · p + r)

F1-measure: F1 = 2 · p · r / (p + r)

F2-measure = ?
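A short sketch of the general formula in plain Python; plugging in the precision and recall of the second example above shows that F2 simply weights recall more heavily than precision.

```python
def f_measure(precision, recall, beta=1.0):
    """General F-measure: F_beta = (1 + beta^2) * p * r / (beta^2 * p + r)."""
    if precision == 0.0 and recall == 0.0:
        return 0.0
    return (1 + beta ** 2) * precision * recall / (beta ** 2 * precision + recall)

p, r = 0.70, 1.00                           # precision and recall of the second example above
print(round(f_measure(p, r), 2))            # 0.82  (F1: harmonic mean of p and r)
print(round(f_measure(p, r, beta=2.0), 2))  # 0.92  (F2: recall weighted higher than precision)
```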

11

12

Information Retrieval is the “science of search”

Information retrieval (IR) is the science of searching for documents, for information within documents, and for metadata about documents, as well as that of searching relational databases and the World Wide Web

The study of systems for indexing, searching, and recalling data, particularly text or other unstructured forms

13

An information retrieval process begins when a user enters a query into the system

Queries are formal statements of information needs, for example search strings in web search engines

In information retrieval a query does not uniquely identify a single object in the collection. Instead, several objects may match the query, perhaps with different degrees of relevancy

14

IR systems compute a numeric score on how well each object in the database matches the query, and rank the objects according to this value

The top ranking objects are then shown to the user

The process may then be iterated if the user wishes to refine the query
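A toy sketch of this score-and-rank loop in plain Python (not Lucene or any real IR engine); the term-overlap score used here is only an illustrative assumption.

```python
def score(document, query):
    """Toy relevance score: how many query terms occur in the document."""
    doc_terms = set(document.lower().split())
    return sum(1 for term in query.lower().split() if term in doc_terms)

def retrieve(collection, query, top_k=3):
    """Rank every document in the collection by its score and return the top ones."""
    ranked = sorted(collection.items(), key=lambda item: score(item[1], query), reverse=True)
    return [(doc_id, score(text, query)) for doc_id, text in ranked[:top_k]]

collection = {
    "d1": "information retrieval is the science of search",
    "d2": "question answering produces ranked answers",
    "d3": "searching for documents and metadata about documents",
}
print(retrieve(collection, "science of search"))   # d1 is the top-ranking document
```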

15

Precision = how many of the retrieved documents are relevant

Recall = how many of the relevant documents are retrieved

16

17

18

A type of IR whose goal is to extract structured information from unstructured collections using NLP techniques

Automatic annotation and concept extraction from images/audio/video

An early commercial system, from the mid-1980s, was JASPER, built for Reuters by the Carnegie Group with the aim of providing real-time financial news to financial traders.

19

Text simplification in order to create a structured view of the information present in free text (see the named-entity sketch after this list):

◦ Named Entity Extraction

◦ Coreference Resolution

◦ Relations between NE

◦ Comments Extraction

◦ Audio Extraction
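As referenced above, a minimal sketch of the first step, Named Entity Extraction, using a naive capitalisation pattern; the regular expression is an illustrative assumption, not the method actually used.

```python
import re

def extract_named_entities(text):
    """Very rough NE extraction: maximal runs of capitalised words (toy pattern only)."""
    return re.findall(r"\b[A-Z][a-z]+(?:\s+[A-Z][a-z]+)*\b", text)

print(extract_named_entities("Annie Lennox performed in Bucharest according to Reuters."))
# ['Annie Lennox', 'Bucharest', 'Reuters']
```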

20

Question Answering (QA) can be defined as the task which takes a question in natural language and produces one or more ranked answers from a collection of documents

The QA research area emerged as a result of the monolingual English QA track introduced at TREC (Text REtrieval Conference: http://trec.nist.gov/)

21

QA systems normally adhere to a pipeline architecture composed of three main modules (Harabagiu and Moldovan, 2003), sketched in code after this list:

◦ question analysis – the results are keywords, the answer type and question type, and the question focus

◦ paragraph retrieval - the results are a set of relevant candidate paragraphs/sentences from the document collection

◦ answer extraction – the results are a set of candidate answers ranked using likelihood measures
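A skeleton of this three-module pipeline in plain Python, sketched as announced above; the module bodies are simplistic placeholders (my assumptions), only the inputs and outputs follow the description of Harabagiu and Moldovan (2003).

```python
STOP = {"what", "who", "when", "where", "did", "is", "the", "a"}

def question_analysis(question):
    """Module 1: produce keywords, an expected answer type and the question focus."""
    keywords = [w for w in question.rstrip("?").split() if w.lower() not in STOP]
    answer_type = "PERSON" if question.lower().startswith("who") else "OTHER"
    focus = keywords[0] if keywords else None
    return {"keywords": keywords, "answer_type": answer_type, "focus": focus}

def paragraph_retrieval(analysis, collection):
    """Module 2: keep the candidate paragraphs that contain at least one keyword."""
    return [p for p in collection if any(k.lower() in p.lower() for k in analysis["keywords"])]

def answer_extraction(analysis, paragraphs):
    """Module 3: rank candidate answers by a (toy) likelihood measure, keyword overlap."""
    scored = [(sum(k.lower() in p.lower() for k in analysis["keywords"]), p) for p in paragraphs]
    return [p for _, p in sorted(scored, reverse=True)]

collection = ["Oxygen was discovered by Carl Wilhelm Scheele.", "Hawaii became a state in 1959."]
analysis = question_analysis("Who discovered oxygen?")
print(answer_extraction(analysis, paragraph_retrieval(analysis, collection))[0])
```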

22

Harabagiu and Moldovan, 2003:

◦ Factoid – “Who discovered oxygen?”, “When did Hawaii become a state?” or “What football team won the World Cup in 1992?”

◦ List – “What countries export oil?” or “What are the regions preferred by Americans for holidays?”

◦ Definition – “What is a quasar?” or “What is a question-answering system?”

Other types: How, Why, hypothetical, semantically constrained, polar (Yes/No) and cross-lingual questions

23

Person - "What”, "Who”, "Whom", "With who"

Location (City, Country, and Region) - "What state/city“, "From where”, "Where“

Organization - "Who produced“, "Who made“

Temporal (Date and Year) – “When”

Measure (Length, Surface and Other) – “How many/much”

Count - "How many/much“

Yes/No – “Did you fear that?”, “Are you blue?”
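A subset of the trigger words above can be turned into a small rule-based classifier; the matching order and the fallback type below are my own assumptions.

```python
PATTERNS = [
    ("TEMPORAL",     ["when"]),
    ("LOCATION",     ["what state", "what city", "from where", "where"]),
    ("ORGANIZATION", ["who produced", "who made"]),
    ("PERSON",       ["who", "whom", "with who"]),
    ("COUNT",        ["how many", "how much"]),
    ("YES/NO",       ["did ", "are ", "is "]),
]

def classify_question(question):
    """Return the expected answer type of a question using simple trigger-word rules."""
    q = question.lower()
    for answer_type, triggers in PATTERNS:
        if any(t in q for t in triggers):
            return answer_type
    return "OTHER"

print(classify_question("When did Hawaii become a state?"))   # TEMPORAL
print(classify_question("Who discovered oxygen?"))            # PERSON
print(classify_question("Did you fear that?"))                # YES/NO
```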

24

Local collections, internal organization documents, newspapers, Internet

Closed-domain - deals with questions from a specific domain (medical, baseball, etc.). Can exploit domain-specific knowledge (ontologies, rules, disambiguation)

Open-domain – general questions about anything. Can use general knowledge about the world

25

The first QA systems were created in the 1960s:

BASEBALL (Green 1963) – answered questions about baseball games

LUNAR (Woods, 1977) – geological analysis of rocks returned by the Apollo moon missions

IURES (Cristea, Tufiş, Mihaiescu, 1985) – medical domain

26

Powerset: http://www.powerset.com/ (http://www.bing.com/)

Assimov the chat bot: http://talkingrobot.org/b/

AnswerBus: http://www.answerbus.com/index.shtml

NSIR: http://tangra.si.umich.edu/clair/NSIR/html/nsir.cgi

START (the first Web-based question answering system): http://start.csail.mit.edu/

27

28

29

30

31

CLEF (Cross-Language Evaluation Forum) – started in 2000 – http://www.clef-campaign.org/
European languages in both monolingual and cross-language contexts

◦ Coordination: Istituto di Scienza e Tecnologie dell'Informazione, Pisa, Italy

◦ Romanian Institute for Computer Science, Romania

TREC (Text REtrieval Conference) – started in 1992 – http://trec.nist.gov/

◦ National Institute of Standards and Technology (NIST), Gaithersburg, Maryland, USA

32

33

Our group has participated in the CLEF exercises since 2006:

◦ 2006 – Ro–En (English collection) – 9.47% right answers

◦ 2007 – Ro–Ro (Romanian Wikipedia) – 12 %

◦ 2008 – Ro–Ro (Romanian Wikipedia) – 31 %

◦ 2009 – Ro–Ro, En–En (JRC-Acquis) – 47.2 % (48.6%)

◦ 2010 – Ro-Ro, En-En, Fr-Fr (JRC-Acquis, Europarl) – 47.5% (42.5%, 27 %)

[Chart: evolution of the percentage of right answers at CLEF, 2006-2010]

34

[Architecture diagram: the background knowledge is indexed into Lucene index 1; the test data (documents, questions, possible answers) goes through questions processing and answers processing (lemmatization, stop words elimination, NEs identification, Lucene query building); relevant documents are identified and indexed into Lucene indexes 2; partial and global scores are computed per answer]

35

The Romanian background knowledge has 161,279 documents in text format:

◦ 25,033 correspond to the AIDS topic

◦ 51,130 to Climate Change topic

◦ 85,116 to Music and Society topic

The indexing component considers the name of the file and the text from it => Lucene index 1
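The real indexing component uses Lucene; purely to illustrate what indexing "the name of the file and the text from it" means, here is a toy inverted index in plain Python (the file names below are invented).

```python
from collections import defaultdict

def build_index(documents):
    """Toy inverted index over both the file name and the file's text: term -> file names."""
    index = defaultdict(set)
    for file_name, text in documents.items():
        for term in (file_name + " " + text).lower().replace(".", " ").split():
            index[term].add(file_name)
    return index

documents = {
    "aids_0001.txt": "A document about the AIDS topic.",
    "music_0042.txt": "A document about Music and Society.",
}
index = build_index(documents)
print(sorted(index["music"]))   # ['music_0042.txt']
```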

36

The test data was an XML file with 12 test documents:

◦ 4 documents for each of the three topics (12 in total)

◦ 10 questions for each document (120 in total)

◦ 5 possible answers for each question (600 in total)

Test data processing involved 3 operations:

◦ extracting documents

◦ processing questions

◦ processing possible answers

37

The content of each <doc> is saved under <topic id>\<reading test id>\1..10

[Diagram: example path showing the topic id and the reading test id]

38

Stop words elimination

Lemmatization

Named Entity identification

Lucene query building
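A hedged sketch of these four steps in plain Python; the stop-word list, the "lemmatizer" and the boosting of named entities are simplified stand-ins for the real Romanian/English resources and Lucene query syntax.

```python
import re

STOP_WORDS = {"in", "which", "has", "the", "a", "an", "of", "to"}   # tiny illustrative list

def lemmatize(word):
    """Placeholder lemmatizer; the real system used proper morphological resources."""
    return word.lower().rstrip("s")

def build_query(question):
    """Stop-word elimination + lemmatization + NE identification -> list of query terms."""
    named_entities = [ne for ne in re.findall(r"\b[A-Z][a-z]+(?:\s+[A-Z][a-z]+)*\b", question)
                      if ne.lower() not in STOP_WORDS]
    lemmas = [lemmatize(w) for w in re.findall(r"\w+", question) if w.lower() not in STOP_WORDS]
    # Named entities are kept as boosted phrases, ordinary lemmas as plain terms.
    return ['"{}"^2'.format(ne) for ne in named_entities] + lemmas

print(build_query("In which European cities has Annie Lennox performed?"))
# ['"European"^2', '"Annie Lennox"^2', 'european', 'citie', 'annie', 'lennox', 'performed']
```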

39

Similar to the processing of questions, plus:

We use an ontology (Iftene and Balahur, 2008) to eliminate possible answers with a low probability of being the final answer (the [is_located_in] relation)

In which European cities has Annie Lennox performed?

We eliminate from the list of possible answers those containing non-European cities (non-European cities are replaced with the value XXXXX)
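A sketch of this filtering idea with a hypothetical, hand-written [is_located_in] table (the real system used the ontology of Iftene and Balahur, 2008); all names and values below are illustrative.

```python
# Hypothetical fragment of an [is_located_in] table (illustrative values only).
IS_LOCATED_IN = {"Bucharest": "Europe", "London": "Europe", "Tokyo": "Asia", "Chicago": "North America"}

def filter_candidates(candidates, required_region="Europe"):
    """Replace candidate answers whose city is not in the required region with XXXXX."""
    return [c if IS_LOCATED_IN.get(c) == required_region else "XXXXX" for c in candidates]

# "In which European cities has Annie Lennox performed?"
print(filter_candidates(["London", "Tokyo", "Bucharest", "Chicago"]))
# ['London', 'XXXXX', 'Bucharest', 'XXXXX']
```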

40

We used Lucene to retrieve, for every question, the relevant documents from the background knowledge

The result of this step is a list of documents d for every query q, with associated values:

◦ Score1(d, q) – the relevance score for a document d when we search the background knowledge with the Lucene query associated to question q

41

Relevant files are copied to a relative path <topic id>\<reading test id>\<question id>

42

For every question, we index the relevant documents returned by Lucene at the previous step together with the relevant documents saved from the initial test file

43

Then, in every index, we perform searches using the Lucene queries associated to the possible answers

For every answer, we obtained a list of documents with Lucene relevance scores

Score2(d, a) is the relevance score for document d when we search with the Lucene query associated to the answer a

44

We combine Score1 and Score2

In the end, we consider the answer with the highest value as being the most probable answer

We submitted different classes of runs, based on the thresholds used to decide on a NOA (no answer) response (see the scoring sketch after this list):

◦ for the Ro-Ro task, we used three thresholds (0, 0.2, 0.5)

◦ for the En-En task, we used one threshold (0.2)
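A hedged sketch of this final step: the slides do not give the exact combination formula, so the product-and-sum below is an assumption; the NOA decision uses one of the thresholds mentioned above.

```python
def global_score(score1, score2_for_answer):
    """Combine Score1(d, q) and Score2(d, a) over the documents supporting one answer.

    The combination (product per document, then sum) is an assumption, not the slides' formula.
    """
    return sum(score1.get(d, 0.0) * s2 for d, s2 in score2_for_answer.items())

def choose_answer(score1, score2, threshold=0.2):
    """Pick the highest-scoring answer, or return NOA if its score is below the threshold."""
    scored = {answer: global_score(score1, docs) for answer, docs in score2.items()}
    best = max(scored, key=scored.get)
    return (best, scored[best]) if scored[best] >= threshold else ("NOA", scored[best])

score1 = {"doc1": 0.8, "doc2": 0.3}                              # Score1(d, q)
score2 = {"answer A": {"doc1": 0.5}, "answer B": {"doc2": 0.2}}  # Score2(d, a) per answer
print(choose_answer(score1, score2, threshold=0.2))              # ('answer A', 0.4)
```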

45

Results of UAIC’s runs at question answering level

                   Ro-Ro   Ro-Ro   Ro-Ro   En-En
answered right        30      11      19      25
answered wrong        85      19      43      47
total answered       115      30      62      72
unanswered right       0      19      11      12
unanswered wrong       0      66      42      34
unanswered empty       5       5       5       2
total unanswered       5      90      58      48
Overall accuracy    0.25    0.09    0.16    0.21
C@1 measure         0.26    0.16    0.23    0.29

46
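The c@1 values in the last row reward leaving a question unanswered rather than answering it wrongly; the standard c@1 formula reproduces the values in the table above, for example 0.26 for the first Ro-Ro run and 0.29 for the En-En run.

```python
def c_at_1(n_right, n_unanswered, n_total):
    """c@1 = (n_right + n_unanswered * n_right / n_total) / n_total."""
    return (n_right + n_unanswered * n_right / n_total) / n_total

print(round(c_at_1(30, 5, 120), 2))    # 0.26 - first Ro-Ro run: 30 right, 5 unanswered
print(round(c_at_1(25, 48, 120), 2))   # 0.29 - En-En run: 25 right, 48 unanswered
```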

Results of UAIC’s runs at reading test level

                   Ro-Ro    Ro-Ro    Ro-Ro    En-En
Topic1 median       0.10     0.00     0.07     0.23
Topic2 median       0.40     0.00     0.29     0.31
Topic3 median       0.30     0.32     0.33     0.36
Overall median      0.20     0.00     0.16     0.31
Topic1 mean         0.10     0.04     0.08     0.25
Topic2 mean         0.39     0.08     0.26     0.27
Topic3 mean         0.29     0.30     0.31     0.32
Overall mean        0.26     0.14     0.22     0.28

47

One of the most common error sources arises from our attempt to take into account all of the supporting snippets (Sum) that the information retrieval procedure returns => Possible future solution: use Max or Avg instead (see the sketch below)
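To make the Sum / Max / Avg alternatives concrete, a small sketch with invented snippet scores: with Sum, many weak supporting snippets can outvote a single strong one, which Max and Avg avoid.

```python
def aggregate(snippet_scores, strategy="sum"):
    """Aggregate the supporting-snippet scores of one candidate answer."""
    if strategy == "sum":
        return sum(snippet_scores)
    if strategy == "max":
        return max(snippet_scores)
    if strategy == "avg":
        return sum(snippet_scores) / len(snippet_scores)
    raise ValueError(strategy)

wrong_candidate = [0.2, 0.2, 0.2, 0.2]   # many weak supporting snippets
right_candidate = [0.7]                  # a single strong supporting snippet
for strategy in ("sum", "max", "avg"):
    print(strategy,
          round(aggregate(wrong_candidate, strategy), 2),
          round(aggregate(right_candidate, strategy), 2))
# sum 0.8 0.7  <- the wrong candidate wins only under Sum
# max 0.2 0.7
# avg 0.2 0.7
```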

When two candidates have identical scores, we choose the first candidate => Possible future solution: use the question focus and perform an additional step of determining the distance between each candidate and the focus

48

Another problem appears when the top-scoring snippet is obtained for an entity name that has the highest TF-IDF value => Possible future solution: use the question focus

For the En-En task, a problem was that we did not use background information.

Numbers are also a major cause of errors, mainly because they can be written either with letters or with digits

49

1) Create a consumer service for the following Web Service: http://instrumente.infoiasi.ro/WebQuestionAnswering/

Links: http://jax-ws.java.net/articles/jaxws-netbeans/

2) Lucene: Use the following archive in order to index and to search in a collection of texts http://thor.info.uaic.ro/~adiftene/Scoala/2012/APLN/Resurse/

50

Yes–no question: http://en.wikipedia.org/wiki/Yes%E2%80%93no_question

Question Answering: http://en.wikipedia.org/wiki/Question_answering

Information Extraction: http://en.wikipedia.org/wiki/Information_extraction

Information Retrieval: http://en.wikipedia.org/wiki/Information_retrieval

Lecture 13: Evaluation: Precision and Recall http://courses.washington.edu/ling473/Lecture13.pdf

Precision and Recall of Five Search Engines for Retrieval of Scholarly Information in the Field of Biotechnology: http://www.webology.org/2005/v2n2/a12.html

51

52
