Information Extraction Research @ Yahoo! Labs Bangalore Rajeev Rastogi Yahoo! Labs Bangalore

Information Extraction Research @ Yahoo! Labs Bangalore

Rajeev RastogiYahoo! Labs Bangalore

The most visited site on the internet

• 600 million+ users per month

• Super popular properties– News, finance, sports– Answers, flickr,

del.icio.us– Mail, messaging– Search

Unparalleled scale

• 25 terabytes of data collected each day– Over 4 billion clicks every day– Over 4 billion emails per day– Over 6 billion instant messages per day

• Over 20 billion web documents indexed• Over 4 billion images searchable

No other company on the planet processes as much data as we do!

Yahoo! Labs Bangalore

• Focus is on basic and applied research– Search– Advertizing– Cloud computing

• University relations– Faculty research grants– Summer internships– Sharing data/computing

infrastructure– Conference sponsorships– PhD co-op program

What does search look like today?

Search results of the future: Structured abstracts

yelp.com

babycenter

epicurious

answers.com

New York Times

Gawker

Rank by price

Search results of the future: Intelligent ranking

A key technology for enabling search transformation

Information extraction (IE)

Reviews

Information extraction (IE)

• Goal: Extract structured records from Web pages

AddressCategory

PhonePrice

Multiple verticals

• Business, social networking, video, ….

Information Extraction Research @ Yahoo! Labs Bangalore Rajeev Rastogi Yahoo! Labs Bangalore

Documents

WEB WORKERS 1 Amitesh Madhur (amitesh@yahoo-inc.com) (Exceptional Performance, Bangalore)

BOSS: Yahoo HackU IIIT Bangalore

Architecture for Measuring Ad - DeveloperMarch · AdTech Data and Measurement April 25, 2019 Yahoo Bangalore (8 years) Yahoo Sunnyvale (2.5 years) Apply Cupertino (2.5 years)

Yamacraw-Yahoo Falls Backcountry Route Yahoo …...Yahoo Falls Scenic Area Yahoo Falls' Yahoo Arch Markers

Deep Learning with Theano (with a case study) - Yahoo … · Liangliang Cao 1 Deep Learning with Theano (with a case study) Liangliang “Lyon” Cao Yahoo! Labs

TCS Innovation Labs, Bangalore, India

Strategies for Human-Human Interaction Laura M. Haas, IBM Research – Almaden Margaret Martonosi, Princeton University Amanda Stent, Yahoo Labs

Scaling Concurrent Log-Structured Data Stores Concurrent Log-Structured Data Stores Guy Golan-Gueta Yahoo Labs Haifa, Israel ggolan@yahoo-inc.com Edward Bortnikov Yahoo Labs Haifa,

Pig Latin: A Not-So-Foreign Language for Data Processing - Yahoo! Labs

Vol. 3, Issue 2, February 2014 Phytochemical … bioprocess, Biozeen-Bangalore Biotech Labs Pvt Ltd, Bangalore, Karnataka, India1 Research scholar, Department of Biotechnology, IIT-Guwahati,

Loupe: A Handheld Near-Eye Display - Home | … A Handheld Near-Eye Display Kent Lyons Yahoo Labs 701 First Ave. Sunnyvale CA 94089 klyons@yahoo-inc.com Seung Wook Kim, Shigeyuki Seko,

Chomsky’s Spell Checker Cohan Sujay Carlos, Aiaioo Labs, Bangalore

Modeling item item similarities for personalized ... · FRONT PAGE By Deepak Agarwal,Liang ZhangandRahulMazumder Yahoo! Labs, Yahoo! Labs and Stanford University We consider the problem

Personalized Recommendation on Dynamic Content Using Predictive Bilinear Models Wei ChuSeung-Taek Park WWW 2009 Audience Science Yahoo! Labs

Axpert™ from Agile Labs, Bangalore, India

Payman Mohassel Yahoo Labs

Conference Program Location Map Internet Access …comad/2008/comad08_brochure.pdfand Cloud Computing Rajeev Rastogi, Yahoo! Labs Bangalore Building Internet Scale Applications using

Building Knowledge Bases from the Web Rajeev Rastogi Yahoo! Labs Bangalore

1 Internet Advertising Ramana Yerneni, Yahoo! Labs yerneni@yahoo-inc.com August 17, 2010

Lessons from the Netflix Prize Robert Bell AT&T Labs-Research In collaboration with Chris Volinsky, AT&T Labs-Research & Yehuda Koren, Yahoo! Research