Graduate School of Informatics
Kyoto University, November 21, 2001

Technologies of the Interspace
Peer-Peer Semantic Indexing

Bruce Schatz
CANIS Laboratory
Graduate School of Library and Information Science
University of Illinois at Urbana-Champaign
www.canis.uiuc.edu, [email protected]



THE THIRD WAVE OF NET EVOLUTION

PACKETS

OBJECTS

CONCEPTS

SCALABLE SEMANTICS

Automatic indexing
Domain-Independent indexing
Statistical clustering

Compute Context of
  concepts within documents
  documents within repositories

CROSS-OVERS IN SEMANTIC INDEXING

COMPUTING CONCEPTS

1992: 4,000 (molecular biology)
1993: 40,000 (molecular biology)
1995: 400,000 (electrical engineering)
1996: 4,000,000 (engineering)
1998: 40,000,000 (medicine)

SIMULATING A NEW WORLD

Obtain discipline-scale collection
  MEDLINE from NLM, 10M bibliographic abstracts
  human classification: Medical Subject Headings

Partition discipline into Community Repositories
  4 core terms per abstract for MeSH classification
  32K nodes with core terms (classification tree)

Community is all abstracts classified by core term
  40M abstracts containing 280M concepts
  concept spaces took 2 days on NCSA Origin 2000

Simulating World of Medical Communities
  10K repositories with > 1K abstracts (1K with > 10K)
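
The partitioning step on this slide can be sketched as follows. This is a minimal illustration, not the CANIS implementation: the function name `partition_by_core_term` and the toy abstracts are hypothetical. It also shows one reading of the slide's numbers, namely that the 40M figure is consistent with each of the 10M abstracts being counted once in each of its 4 core-term communities.

```python
from collections import defaultdict

def partition_by_core_term(abstracts):
    """Group abstract ids into one community repository per core term.

    `abstracts` maps an abstract id to its list of core terms
    (hypothetical structure, assumed for illustration).
    """
    communities = defaultdict(list)
    for abstract_id, core_terms in abstracts.items():
        for term in core_terms:
            # An abstract joins every community named by one of its core terms.
            communities[term].append(abstract_id)
    return dict(communities)

# Toy data: three abstracts, four core terms each (made-up values).
abstracts = {
    "a1": ["Neoplasms", "Genes", "Mutation", "Mice"],
    "a2": ["Neoplasms", "Drug Therapy", "Mice", "Survival"],
    "a3": ["Genes", "Mutation", "Drug Therapy", "Survival"],
}

communities = partition_by_core_term(abstracts)

# Each abstract is counted once per core term, so total memberships
# equal (number of abstracts) x (core terms per abstract).
total_memberships = sum(len(ids) for ids in communities.values())
```

With 3 abstracts and 4 core terms each, `total_memberships` is 12, the same multiplication that takes 10M abstracts to 40M community memberships.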

COMMUNITY PROCESSING

Existing Technologies

Extracting Concepts (AI)
  Canonical noun phrases
  Generic statistical parser

Computing Context (IR)
  Co-occurrence frequency, in collection
  Useful interactively, not strict ordering
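
The "Computing Context" step can be illustrated with a small co-occurrence sketch. It assumes the concepts (noun phrases) have already been extracted from each document; the function names and sample documents are hypothetical, and plain document-level counts stand in for whatever weighting the actual concept-space computation used.

```python
from collections import Counter
from itertools import combinations

def cooccurrence_counts(docs):
    """Count, for each unordered concept pair, the documents containing both."""
    pair_counts = Counter()
    for concepts in docs:
        # Deduplicate and sort so each pair has one canonical key per document.
        for a, b in combinations(sorted(set(concepts)), 2):
            pair_counts[(a, b)] += 1
    return pair_counts

def related_concepts(pair_counts, concept, top_n=3):
    """Rank concepts that co-occur with `concept`, most frequent first."""
    scores = Counter()
    for (a, b), n in pair_counts.items():
        if a == concept:
            scores[b] += n
        elif b == concept:
            scores[a] += n
    return scores.most_common(top_n)

# Toy collection: each document is a list of already-extracted concepts.
docs = [
    ["gene expression", "protein binding", "dna sequence"],
    ["gene expression", "dna sequence"],
    ["protein binding", "cell membrane"],
]

pair_counts = cooccurrence_counts(docs)
suggestions = related_concepts(pair_counts, "gene expression")
# suggestions == [("dna sequence", 2), ("protein binding", 1)]
```

The ranked list is meant as a set of suggestions for interactive browsing, in line with the slide's note that the ordering is useful interactively rather than strict.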

CONCEPT NAVIGATION

Semantic Indexes for Community Repositories

Navigating Abstractions within Repository
  concept space
  category map

Interactive browsing by Community experts

Category Map

Category Navigation

Concept Navigation

CONCEPT SWITCHING

“Concept” versus “Term”
  set of “semantically” equivalent terms

Concept switching
  region to region (set to set) match

[Figure labels: term, semantic region, concept space]
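
One way to picture a region-to-region (set-to-set) match is as an overlap score between term sets. The sketch below uses Jaccard similarity as a stand-in; the slide does not say which matching measure is used, and the function names and toy regions are hypothetical.

```python
def jaccard(a, b):
    """Overlap between two term sets: |intersection| / |union|."""
    a, b = set(a), set(b)
    union = a | b
    return len(a & b) / len(union) if union else 0.0

def switch_concept(source_region, target_space):
    """Return (region_name, score) for the best-matching region in the target space.

    `target_space` maps a region name to its set of terms
    (hypothetical structure, assumed for illustration).
    """
    return max(
        ((name, jaccard(source_region, terms)) for name, terms in target_space.items()),
        key=lambda item: item[1],
    )

# A semantic region from one community's concept space (made-up terms).
source_region = {"fatigue", "stress", "failure"}

# Regions in another community's concept space (made-up terms).
target_space = {
    "bone fracture": {"stress", "fracture", "failure", "bone"},
    "chronic fatigue": {"fatigue", "sleep"},
}

best = switch_concept(source_region, target_space)
# best == ("bone fracture", 0.4)
```

Matching whole regions rather than single terms lets "stress" and "failure" together pull the match toward one target region, even though "fatigue" alone would point to another.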

Medicine Session

Categories and Concepts

Concept Switching

Document Retrieval

Future Technologies

Concept Switching
  Spreading activation, similarity clusters

Path Matching
  Aggregating indexes, many repositories

Dynamic Indexing
  On-the-fly collections, during session
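
Spreading activation, named above as a technique for concept switching, can be sketched over a weighted concept graph. This is a generic textbook-style version under stated assumptions: the graph, the edge weights, the `decay` factor and the fixed number of steps are all hypothetical and not taken from the talk.

```python
def spread_activation(graph, seeds, decay=0.5, steps=2):
    """Propagate activation from seed concepts along weighted edges.

    `graph` maps a concept to {neighbor: weight}; `seeds` maps a concept
    to its initial activation. Each hop multiplies the passed energy by
    the edge weight and by `decay`.
    """
    activation = dict(seeds)
    frontier = dict(seeds)
    for _ in range(steps):
        next_frontier = {}
        for node, energy in frontier.items():
            for neighbor, weight in graph.get(node, {}).items():
                passed = energy * weight * decay
                next_frontier[neighbor] = next_frontier.get(neighbor, 0.0) + passed
        # Accumulate newly received energy into the running totals.
        for node, energy in next_frontier.items():
            activation[node] = activation.get(node, 0.0) + energy
        frontier = next_frontier
    return activation

# Toy concept graph with made-up co-occurrence weights.
graph = {
    "asthma": {"inhaler": 1.0, "allergy": 0.5},
    "allergy": {"pollen": 1.0},
}

result = spread_activation(graph, {"asthma": 1.0})
# result == {"asthma": 1.0, "inhaler": 0.5, "allergy": 0.25, "pollen": 0.125}
```

Concepts two hops from the seed ("pollen") still receive some activation, which is what lets a query reach related concepts that never co-occur with the seed directly.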

Peer-Peer Computations

Local Interaction
  Your PC does small computations
  e.g. screensaver for SETI

Global Merging
  Partition computation into small parts
  Each local forms part of global whole

Large-Scale Distribution
  3M users of SETI@Home
  Public Health. www.intel.com/cure
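
The local-computation and global-merging pattern can be sketched for semantic indexing itself: each peer counts concept co-occurrences on its own share of documents, and the partial counts are summed into one index. The peers are simulated in a single process here, and the function names and sample data are hypothetical.

```python
from collections import Counter
from itertools import combinations

def local_counts(docs):
    """Work done on one peer: co-occurrence counts for its own documents."""
    counts = Counter()
    for concepts in docs:
        counts.update(combinations(sorted(set(concepts)), 2))
    return counts

def partition(items, n_parts):
    """Split the collection into n_parts roughly equal shares."""
    return [items[i::n_parts] for i in range(n_parts)]

def merge_counts(partials):
    """Global merging: sum the partial counts from all peers."""
    total = Counter()
    for partial in partials:
        total.update(partial)
    return total

# Toy collection of already-extracted concepts (made-up values).
docs = [
    ["heart", "valve", "surgery"],
    ["heart", "surgery"],
    ["valve", "infection"],
    ["heart", "infection", "surgery"],
]

# Two simulated peers each process their share, then results are merged.
partials = [local_counts(share) for share in partition(docs, 2)]
merged = merge_counts(partials)
```

Because the counts are additive across documents, `merged` equals what a single machine would compute over the whole collection, which is the property that makes this kind of computation easy to distribute.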

THE NET OF THE 21st CENTURY

Beyond Objects to Concepts
Beyond Search to Analysis
Problem Solving via Cross-Correlating Multimedia Information across the Net

Every community has its own special library
Every community does semantic indexing

Zen of Information Retrieval

Searching without Searching
  Navigate concepts into documents
  Based on interactive recognition

Indexing without Indexing
  Compute context on dynamic collections
  Based on distributed extraction

Sharing without Sharing
  Record paths during user sessions
  Based on community practices