15

Intro To The Calais Web Service @ OpenCalais.com

Embed Size (px)

DESCRIPTION

This is a publisher-centric look at version 4.0 of the free Calais Web Service and open API at OpenCalais.com

Citation preview

Page 1: Intro To The Calais Web Service @ OpenCalais.com
Page 2: Intro To The Calais Web Service @ OpenCalais.com

Introducing the Calais Web service (4.0)

• A Thomson Reuters initiative designed to power next

generation publishing solutions

• A free API that anyone can use at www.OpenCalais.com

• The fastest way to categorize & metatag the people, places,

companies, facts and events in your content

• An easy way to connect to the open data

sources in the Linked Data Cloud, including

Wikipedia, DBPedia, Shopping.com, the

Internet Movie Database (IMDB) and more

Page 3: Intro To The Calais Web Service @ OpenCalais.com

Why? Tagging text is costly and time-consuming

We help in areas where:– The economics don’t

support metadata creation

– The value of metadata is potentially high

– The value of aggregated metadata is potentially extremely high

Seco

nds

Year

s

Seconds

Years

Tweets

Blogs

News

Scient. Pubs

Great Novels

Latency

Sh

elf

Lif

e

Page 4: Intro To The Calais Web Service @ OpenCalais.com

Why? What Calais can help you do…

• Automate: Automatically tag the people, places, companies, facts and events in your content to increase its value and interoperability.

• Enhance: Enrich your content with open data from Wikipedia, the Internet Movie Database (IMDB), Shopping.com and more.

• Engage: Optimize your user experience, increase engagement and drive repeat visits with topic pages, personalized filtering and real-time alerts.

• Extend: Increase your syndication to next generation search engines, news aggregators, ‘related stories’ applications and others.

• Connect: Enter the emerging Linked Content Economy. Compete in a rapidly evolving ecosystem of enriched and interconnected content.

Page 5: Intro To The Calais Web Service @ OpenCalais.com

How it works:

• A semantic metadata generation service that extracts entities, facts and events from unstructured text

• Creates linkages from extracted entities to linked data ecosystem

• Provides a transportation layer for rich semantic metadata from producers to consumers

Page 6: Intro To The Calais Web Service @ OpenCalais.com

<Topic>M&A</Topic>

<Acquisition offset="494" length="130">  <Company_Acquirer>Reuters</Company_Acquirer>   <Company_Acquired>ClearForest Ltd.</Company_Acquired>   <Status>Planned</Status> </Acquisition>

<Company>Reuters</Company>

<Company>ClearForest Ltd.</Company> Reuters Announced the Acquisition of ClearForest

New York - April 30, 2007

Reuters, the global information company, has entered into an agreement to acquire all of the outstanding shares of ClearForest Ltd., a privately held provider of Text Analytics solutions, whose tagging platform and analytical products allow clients to derive precise business information from huge amounts of textual content.

ClearForest has received sufficient shareholder approval to complete the transaction, which is expected to close in approximately 30 days, subject to customary closing conditions. The financial terms were not disclosed. Reuters plans to retain and continue to work with the existing management team and their highly skilled workforces in the US and Israel. It also plans to continue to support existing products and customers.

Reuters believes that search will be a pivotal element to the future of how financial information is sourced and consumed. As part of its drive into this space, Reuters has created a new strategic group and appointed Gerry Campbell, who will oversee the integration of ClearForest and drive this innovation.

<Product>Text Analytic Solution </Product>

<Company>ClearForest Ltd.</Company>

<Company>Reuters</Company>

<Country>United States</Country>

<Country>Israel</Country>

<Company>Reuters</Company>

<Person>Gerry Campbell</Person>

<ManagementChange offset="2789" length="92"> <Person>Gerry Campbell</Person> <Company>Reuters</Company> <Action>Enters</Position> </ManagementChange>

Text markup by Calais

Page 7: Intro To The Calais Web Service @ OpenCalais.com

NEW!

NEW!

The Linked Data Cloud with new OpenCalais and Thomson Reuters information assets

Page 8: Intro To The Calais Web Service @ OpenCalais.com

Unstructured Text

Unstructured Text

Calais extracts

entities, facts and events

Calais extracts

entities, facts and events

Metadata returned to

the user with keys

Metadata returned to

the user with keys

Keys provide access to the Calais Linked

Data cloud

Keys provide access to the Calais Linked

Data cloud

Which provides information and

other Linked Data pointers

Which provides information and

other Linked Data pointers

To a range of open and partner Linked

data assets, including Thomson Reuters

To a range of open and partner Linked

data assets, including Thomson Reuters

11

22

33

44

55

66

The Process

Page 9: Intro To The Calais Web Service @ OpenCalais.com

Quick online demo1. Copy and paste the text of a business news article into the viewer here:

http://viewer.opencalais.com and press submit. The article is sent to the Calais engine which tags the content and returns it, marked-up.

2. The tags appear on the left hand rail, and you can click on the plus (+) sign to see the tags expand. (Note that the Calais Viewer is not the Calais service. It is merely a demonstration of how the service works.)

3. Since we are now on Calais 4.0, you can also use the viewer to see the Linked Data assets related to the tags Calais returns.

For example, here is the Calais summary page for IBM: http://d.opencalais.com/er/company/ralg-tr1r/9e3f6c34-aa6b-3a3b-b221-a07aa7933633.html

And here is the summary page for IBM in DBPedia (the Wikipedia translated into computer language): http://dbpedia.org/page/IBM

Page 10: Intro To The Calais Web Service @ OpenCalais.com

Calais progress to date• Launched in late January, 2008

• Already, 9,500 developers have joined OpenCalais.com

• 1-3 million content ‘transactions’ per day

• Delivered four major update releases

• Lots of interesting apps & integrations– Drupal

– WordPress

– Afresco

– Many others

Page 11: Intro To The Calais Web Service @ OpenCalais.com

What’s coming• French language support – DONE!

• Linked Data Integration – DONE!

• Spanish language support – in process…

• Social tags – simple topical tags for “aboutness”

• ? Tell us what you’d like to see

@ OpenCalais.com

Page 12: Intro To The Calais Web Service @ OpenCalais.com

Sample Calais Applications

Page 13: Intro To The Calais Web Service @ OpenCalais.com

Example: The Mail & Guardian Online, South African Newspaper

Using Calais to metatag new and historical articles, and:1. Build an index or topics A-Z2. Pull out automatic related articles or pictures3. Create news alerts on companies or people 4. Pull up maps for the countries named in articles5. Predict readers’ interests based on browsing habits 6. Create tag clouds to show popular subjects, people, etc.

Using Calais to optimize search and navigation; drive consumer engagement

Page 14: Intro To The Calais Web Service @ OpenCalais.com

Example: Gist - today’s news filtered by people, places & events

GIST uses Calais to prioritize stories, rank newsmakers & reveal trends / reader demand. It automatically aggregates multiple news sources and slots them into topic.

Page 15: Intro To The Calais Web Service @ OpenCalais.com

Example: The Powerhouse Museum in Sydney

Using Calais to tag historical archives & using tags as search terms