28
AuthorLink: AuthorLink: Instant Author Co- Instant Author Co- Citation Mapping for Citation Mapping for Online Searching Online Searching Xia Lin Howard D. White Jan Buzydlowski [email protected] Drexel University Philadelphia, PA, USA

AuthorLink: Instant Author Co-Citation Mapping for Online Searching Xia Lin Howard D. White Jan Buzydlowski [email protected] Drexel University Philadelphia,

Embed Size (px)

Citation preview

Page 1: AuthorLink: Instant Author Co-Citation Mapping for Online Searching Xia Lin Howard D. White Jan Buzydlowski Xlin@drexel.edu Drexel University Philadelphia,

AuthorLink: AuthorLink: Instant Author Co-Citation Instant Author Co-Citation

Mapping for Online SearchingMapping for Online Searching

Xia Lin

Howard D. White

Jan [email protected]

Drexel University

Philadelphia, PA, USA

Page 2: AuthorLink: Instant Author Co-Citation Mapping for Online Searching Xia Lin Howard D. White Jan Buzydlowski Xlin@drexel.edu Drexel University Philadelphia,

Presented at the National Online Meeting Online 2001 At New York, May 15-17, 2001.

Page 3: AuthorLink: Instant Author Co-Citation Mapping for Online Searching Xia Lin Howard D. White Jan Buzydlowski Xlin@drexel.edu Drexel University Philadelphia,

Author SearchAuthor SearchA tradition from library catalogs– Card Catalog– Online Catalog– Bibliographical Databases– Full text Databases

Two basic approaches for author searching– String matching in the author field– Alphabetical indexing/browsing

Page 4: AuthorLink: Instant Author Co-Citation Mapping for Online Searching Xia Lin Howard D. White Jan Buzydlowski Xlin@drexel.edu Drexel University Philadelphia,

Problems of Author SearchingProblems of Author SearchingHow to search for related authors?– There are no easy solutions in most

current systems.The searcher usually needs to do a

lot of intellectual work to get to other related authors’ works

• Follow the citations• Follow the subjects

Page 5: AuthorLink: Instant Author Co-Citation Mapping for Online Searching Xia Lin Howard D. White Jan Buzydlowski Xlin@drexel.edu Drexel University Philadelphia,

Our ApproachOur ApproachAlways show related authors during the

author search– Put the targeted author among relevant related

authors– Visualize how these authors are related to each

other– Use the author groupings to reveal subject

areas

Page 6: AuthorLink: Instant Author Co-Citation Mapping for Online Searching Xia Lin Howard D. White Jan Buzydlowski Xlin@drexel.edu Drexel University Philadelphia,

A Map of Information Scientists A Map of Information Scientists

Page 7: AuthorLink: Instant Author Co-Citation Mapping for Online Searching Xia Lin Howard D. White Jan Buzydlowski Xlin@drexel.edu Drexel University Philadelphia,

PlatoPlato

Page 8: AuthorLink: Instant Author Co-Citation Mapping for Online Searching Xia Lin Howard D. White Jan Buzydlowski Xlin@drexel.edu Drexel University Philadelphia,

The AuthorLink SystemThe AuthorLink SystemBuilt on a significantly large database– ISI Arts and Humanities Database (AHCI)• 1988 - 1997• 1.26 million records

– Real time mapping and visualizingBased on two key methodologies – Author Co-Citation Analysis– Information Visualization

Page 9: AuthorLink: Instant Author Co-Citation Mapping for Online Searching Xia Lin Howard D. White Jan Buzydlowski Xlin@drexel.edu Drexel University Philadelphia,

Co-CitationCo-Citation Co-citation is the mentioning of any two

earlier documents in the bibliographic references of a later third document.

Later Document 3

Document 1 cites

Document 2cites

Page 10: AuthorLink: Instant Author Co-Citation Mapping for Online Searching Xia Lin Howard D. White Jan Buzydlowski Xlin@drexel.edu Drexel University Philadelphia,

Co-Citation AnalysisCo-Citation Analysis The count of mentions may grow over time

as new writings appear. Thus, co-citation counts can reflect citers’ changing perceptions of documents as more or less strongly related.

Documents shown to be related by their co-citation counts can be mapped as proximate in intellectual space.

Page 11: AuthorLink: Instant Author Co-Citation Mapping for Online Searching Xia Lin Howard D. White Jan Buzydlowski Xlin@drexel.edu Drexel University Philadelphia,

Co-Citation MappingCo-Citation MappingDetects patterns in the frequency

with which any works by any two authors are jointly cited in later works.

Only recurrent co-citation is significant: The more times authors are cited together, the more strongly related they are in the eyes of citers.

Page 12: AuthorLink: Instant Author Co-Citation Mapping for Online Searching Xia Lin Howard D. White Jan Buzydlowski Xlin@drexel.edu Drexel University Philadelphia,

ExampleExampleIf Ben Shneiderman and Shakespeare are cited

together in one article, it probably means little.If Ben Shneiderman and Stuart Card are cited

together in 205 articles,* it means a lot: their conjoined names have come to symbolize something like “interactive interfaces for digital libraries.” Possibly no subject heading captures this concept.

*Actual count, 7/10/00In a cited-author (CA) search on Dialog, SELECT CA=SHNEIDERMAN B AND CA=CARD SKwould retrieve the 205 citing articles.

Page 13: AuthorLink: Instant Author Co-Citation Mapping for Online Searching Xia Lin Howard D. White Jan Buzydlowski Xlin@drexel.edu Drexel University Philadelphia,

Use DIALOG for ACAUse DIALOG for ACASelection of authorsRetrieval of co-citation frequenciesCompilation of raw co-citation

matrixConversion to a correlation matrixMultivariate analysis of correlation

matrix (using principle components analysis, cluster analysis, and multidimensional scaling).

Page 14: AuthorLink: Instant Author Co-Citation Mapping for Online Searching Xia Lin Howard D. White Jan Buzydlowski Xlin@drexel.edu Drexel University Philadelphia,

The Old InterfaceThe Old Interface

Page 15: AuthorLink: Instant Author Co-Citation Mapping for Online Searching Xia Lin Howard D. White Jan Buzydlowski Xlin@drexel.edu Drexel University Philadelphia,

The AuthorLink SystemThe AuthorLink System An integrated system that, in seconds,

– Finds and ranks 24 authors most often cited with seed author

– Pairs all ranked authors systematically, performs co-citation searches for all pairs, and generates a data matrix containing the results.

– Maps the co-citation counts in the matrix and generates interface maps for the user.• Kohonen self-organizing maps (SOMs)• Pathfinder Networks (PFNETs)

Live interface can be used to retrieve documents from AHCI that cite paired authors

Page 16: AuthorLink: Instant Author Co-Citation Mapping for Online Searching Xia Lin Howard D. White Jan Buzydlowski Xlin@drexel.edu Drexel University Philadelphia,

Architecture of AuthorLinkArchitecture of AuthorLink

Front tier .. Middle tier .. Back tier

BRS Search EngineWeb Server

Java Servlets

Web-basedMap Interface

Java Applet

MappingProcedures

Application Server

OracleDatabase

MYSQL Database

Page 17: AuthorLink: Instant Author Co-Citation Mapping for Online Searching Xia Lin Howard D. White Jan Buzydlowski Xlin@drexel.edu Drexel University Philadelphia,

Live System DemoLive System DemoAuthorLink

Page 18: AuthorLink: Instant Author Co-Citation Mapping for Online Searching Xia Lin Howard D. White Jan Buzydlowski Xlin@drexel.edu Drexel University Philadelphia,

More Features of AuthorLinkMore Features of AuthorLinkAuthorLink presents an overview of a

field or a subject area.AuthorLink can distinguish similar

author names that are otherwise conflated in ISI data.

AuthorLink makes it easy for the user to explore intellectual territories from a single seed name, which minimizes cognitive load.

Page 19: AuthorLink: Instant Author Co-Citation Mapping for Online Searching Xia Lin Howard D. White Jan Buzydlowski Xlin@drexel.edu Drexel University Philadelphia,

Overview Features of AuthorLink Overview Features of AuthorLink

Page 20: AuthorLink: Instant Author Co-Citation Mapping for Online Searching Xia Lin Howard D. White Jan Buzydlowski Xlin@drexel.edu Drexel University Philadelphia,

Einstein-A and Mozart (Music)Einstein-A and Mozart (Music)

Page 21: AuthorLink: Instant Author Co-Citation Mapping for Online Searching Xia Lin Howard D. White Jan Buzydlowski Xlin@drexel.edu Drexel University Philadelphia,

Einstein-A and Niels Bohr (Physics)Einstein-A and Niels Bohr (Physics)

Page 22: AuthorLink: Instant Author Co-Citation Mapping for Online Searching Xia Lin Howard D. White Jan Buzydlowski Xlin@drexel.edu Drexel University Philadelphia,

AuthorLink helps to explore new territoriesAuthorLink helps to explore new territories

Page 23: AuthorLink: Instant Author Co-Citation Mapping for Online Searching Xia Lin Howard D. White Jan Buzydlowski Xlin@drexel.edu Drexel University Philadelphia,

Beyond AuthorLinkBeyond AuthorLinkConceptLink–Maps medical subject headings (MeSH)– Uses PUBMED as the backend search

engine– Uses UMLS co-occurrence counts

JournalLink– Developed in the same database as

AuthorLink to visualize journal relationships

Page 24: AuthorLink: Instant Author Co-Citation Mapping for Online Searching Xia Lin Howard D. White Jan Buzydlowski Xlin@drexel.edu Drexel University Philadelphia,

ConceptLinkConceptLink

Page 25: AuthorLink: Instant Author Co-Citation Mapping for Online Searching Xia Lin Howard D. White Jan Buzydlowski Xlin@drexel.edu Drexel University Philadelphia,
Page 26: AuthorLink: Instant Author Co-Citation Mapping for Online Searching Xia Lin Howard D. White Jan Buzydlowski Xlin@drexel.edu Drexel University Philadelphia,

Query: “back pain”Query: “back pain”

Page 27: AuthorLink: Instant Author Co-Citation Mapping for Online Searching Xia Lin Howard D. White Jan Buzydlowski Xlin@drexel.edu Drexel University Philadelphia,

Live System DemoLive System DemoConceptLink

Page 28: AuthorLink: Instant Author Co-Citation Mapping for Online Searching Xia Lin Howard D. White Jan Buzydlowski Xlin@drexel.edu Drexel University Philadelphia,

Future DevelopmentFuture Development

Stress browsability– AuthorLink and ConceptLink are not only

search tools but also exploration and discovery tools

Develop middleware and interfaces that can be linked to any search engine– All ISI databases– DIALOG databases– Web search engines