40
CitiViz: A Visual User Interface to the CITIDEL System ECDL 2004, Bath, England, September 2004 Nithiwat Kampanya, Rao Shen, Seonho Kim, Chris North, and Edward A. Fox [email protected] http://fox.cs.vt.edu

CitiViz: A Visual User Interface to the CITIDEL System ECDL 2004, Bath, England, September 2004 Nithiwat Kampanya, Rao Shen, Seonho Kim, Chris North, and

Embed Size (px)

Citation preview

Page 1: CitiViz: A Visual User Interface to the CITIDEL System ECDL 2004, Bath, England, September 2004 Nithiwat Kampanya, Rao Shen, Seonho Kim, Chris North, and

CitiViz:A Visual User Interface to the

CITIDEL System

ECDL 2004, Bath, England, September 2004

Nithiwat Kampanya, Rao Shen, Seonho Kim, Chris North, and

Edward A. [email protected] http://fox.cs.vt.edu

Page 2: CitiViz: A Visual User Interface to the CITIDEL System ECDL 2004, Bath, England, September 2004 Nithiwat Kampanya, Rao Shen, Seonho Kim, Chris North, and

Acknowledgements (Selected) Sponsors: ACM, NLM, NSF (esp. grants CDA-9303152,

9312611; DUE-0121679, 0136690; IRI-9116991) Faculty/Staff: Lillian Cassel, Debra Dudley, C. Lee Giles,

Lenwood Heath, John Impagliazzo, Deborah Knox, JAN Lee, Manuel Perez, Naren Ramakrishnan, …

VT (Former) Students: Abhishek Agrawal, Supriya Angle, Guillermo Averboch, Anil Bazaz, Dennis Brueni, Robert France, Debby Hix, Marcos Goncalves, Aaron Krowne, Paul Mather, Kate McDevitt, Fernando Das Neves, Lucy Nowell, Durgesh Rao, Ryan Richardson, Hussein Suleman, Bill Wake, Jun Wang, Baoping Zhang, Jianxin Zhao

Page 3: CitiViz: A Visual User Interface to the CITIDEL System ECDL 2004, Bath, England, September 2004 Nithiwat Kampanya, Rao Shen, Seonho Kim, Chris North, and

Outline

Envision CITIDEL Other Related Works Research Questions CitiViz Homepage, Architecture Visualization Strategies, Examples Evaluation Conclusions, Future Work

Page 4: CitiViz: A Visual User Interface to the CITIDEL System ECDL 2004, Bath, England, September 2004 Nithiwat Kampanya, Rao Shen, Seonho Kim, Chris North, and

ENVISION

NSF “A User-Centered Database from the Computer Science Literature” (1991-93) With ongoing support from ACM

Collected bib/typesetter data, converted to SGML Scanned thousands of page images MARIAN search engine –

also applied to the Virginia Tech library catalog used as part of a prototype object-based DL with tailored visualization interface (L. Nowell dissertation)

Page 5: CitiViz: A Visual User Interface to the CITIDEL System ECDL 2004, Bath, England, September 2004 Nithiwat Kampanya, Rao Shen, Seonho Kim, Chris North, and

Envision Results Window

Page 6: CitiViz: A Visual User Interface to the CITIDEL System ECDL 2004, Bath, England, September 2004 Nithiwat Kampanya, Rao Shen, Seonho Kim, Chris North, and
Page 7: CitiViz: A Visual User Interface to the CITIDEL System ECDL 2004, Bath, England, September 2004 Nithiwat Kampanya, Rao Shen, Seonho Kim, Chris North, and
Page 8: CitiViz: A Visual User Interface to the CITIDEL System ECDL 2004, Bath, England, September 2004 Nithiwat Kampanya, Rao Shen, Seonho Kim, Chris North, and
Page 9: CitiViz: A Visual User Interface to the CITIDEL System ECDL 2004, Bath, England, September 2004 Nithiwat Kampanya, Rao Shen, Seonho Kim, Chris North, and
Page 10: CitiViz: A Visual User Interface to the CITIDEL System ECDL 2004, Bath, England, September 2004 Nithiwat Kampanya, Rao Shen, Seonho Kim, Chris North, and

Envision – Newer Version

Page 11: CitiViz: A Visual User Interface to the CITIDEL System ECDL 2004, Bath, England, September 2004 Nithiwat Kampanya, Rao Shen, Seonho Kim, Chris North, and

Envision – Newer Version – w. clusters

Page 12: CitiViz: A Visual User Interface to the CITIDEL System ECDL 2004, Bath, England, September 2004 Nithiwat Kampanya, Rao Shen, Seonho Kim, Chris North, and

Computing and Information Technology Interactive Digital Educational Library (CITIDEL)

Domain: computing / information technology

Genre: one-stop-shopping for teachers & learners: courseware (CSTC, JERIC), leading DLs (ACM, IEEE-CS, DB&LP, CiteSeer), PlanetMath.org, NCSTRL (technical reports), …

Submission & Collection: sub/partner collections www.citidel.org www.nsdl.org

Page 13: CitiViz: A Visual User Interface to the CITIDEL System ECDL 2004, Bath, England, September 2004 Nithiwat Kampanya, Rao Shen, Seonho Kim, Chris North, and

www.CITIDEL.org

Led by Virginia Tech, with co-PIs: Fox (director, DL systems) Lee (history) Perez (user interface, Spanish support)

Partners College of New Jersey (Knox) Hofstra (Impagliazzo) Villanova (Cassel) Penn State (Giles)

Page 14: CitiViz: A Visual User Interface to the CITIDEL System ECDL 2004, Bath, England, September 2004 Nithiwat Kampanya, Rao Shen, Seonho Kim, Chris North, and

CITIDEL Technology Features•Component architecture (Open Digital Library – Hussein Suleman)

•Re-use and compose re-deployable digital library components.

•Built Using Open Standards & Technologies

•OAI: Used to collect DL Resources and DL Interoperability

•XSL and XML: Interface rendering with multi-lingual community based translation of screens and content (Spanish, …)

•Perl: Component Integration

•ESSEX: Search Engine Functionality

•Fast, in-memory processing, snap-shots for persistence

•Multi-scheming

•Integrates multiple classifications / views through maps, closure

Page 15: CitiViz: A Visual User Interface to the CITIDEL System ECDL 2004, Bath, England, September 2004 Nithiwat Kampanya, Rao Shen, Seonho Kim, Chris North, and
Page 16: CitiViz: A Visual User Interface to the CITIDEL System ECDL 2004, Bath, England, September 2004 Nithiwat Kampanya, Rao Shen, Seonho Kim, Chris North, and
Page 17: CitiViz: A Visual User Interface to the CITIDEL System ECDL 2004, Bath, England, September 2004 Nithiwat Kampanya, Rao Shen, Seonho Kim, Chris North, and
Page 18: CitiViz: A Visual User Interface to the CITIDEL System ECDL 2004, Bath, England, September 2004 Nithiwat Kampanya, Rao Shen, Seonho Kim, Chris North, and

Related Works

1st type of visualization techniques Predefined document attributes (e.g., author,

date, …) Envision

Semantic information (e.g., categories assigned to each document) Cougar, Cat-a Cone, Map.net ……

Document-query relevance TileBars, VIBE

Page 19: CitiViz: A Visual User Interface to the CITIDEL System ECDL 2004, Bath, England, September 2004 Nithiwat Kampanya, Rao Shen, Seonho Kim, Chris North, and

Related Works

2nd type of visualization techniques Automatically derive a collection overview via the

use of text mining Based on inter-document similarities

Scatter/Gather Grouper Galaxy of News Vivisimo Kartoo ……

Page 20: CitiViz: A Visual User Interface to the CITIDEL System ECDL 2004, Bath, England, September 2004 Nithiwat Kampanya, Rao Shen, Seonho Kim, Chris North, and

Research Questions

How to combine the two different types of visualization techniques for CITIDEL?

What text mining technology to use for post-retrieval analysis?

What are the key insights, how to support them?

What interaction and navigation strategies should be used to facilitate visual browsing and analysis?

Page 21: CitiViz: A Visual User Interface to the CITIDEL System ECDL 2004, Bath, England, September 2004 Nithiwat Kampanya, Rao Shen, Seonho Kim, Chris North, and

Addressing the Questions

1. Developed clustering components to discovery documents relationships and to identify subject categories for retrieved documents.

2. Developed a new visual interface:

Page 22: CitiViz: A Visual User Interface to the CITIDEL System ECDL 2004, Bath, England, September 2004 Nithiwat Kampanya, Rao Shen, Seonho Kim, Chris North, and

CitiViz HomePagehttp://feathers.dlib.vt.edu/CitiViz/index.html

Page 23: CitiViz: A Visual User Interface to the CITIDEL System ECDL 2004, Bath, England, September 2004 Nithiwat Kampanya, Rao Shen, Seonho Kim, Chris North, and

System Architecture

Component based design Communication between components is

XML based.

Page 24: CitiViz: A Visual User Interface to the CITIDEL System ECDL 2004, Bath, England, September 2004 Nithiwat Kampanya, Rao Shen, Seonho Kim, Chris North, and

System Architecture

Visualizing Components

Data Source Components Clustering Components

Java Servlets

Page 25: CitiViz: A Visual User Interface to the CITIDEL System ECDL 2004, Bath, England, September 2004 Nithiwat Kampanya, Rao Shen, Seonho Kim, Chris North, and

CitiViz Visualization Strategies

Overview strategy Aggregation by document clustering to show all the

retrieved documents Navigation strategies

Overview + detail Focus + context (Fish-eye view: hyperbolic tree) Combine tree graphs with scatter plot graphs. Integrate 2D scatter plot graph with a network of citations. Apply the aggregate towers technique to solve occlusion

problems of document visualized in the scatter plot graph.

Page 26: CitiViz: A Visual User Interface to the CITIDEL System ECDL 2004, Bath, England, September 2004 Nithiwat Kampanya, Rao Shen, Seonho Kim, Chris North, and

CitiViz Display of Detailed Information for a Selected Document: A Tower of Cylinders (to solve occlusion problem)

Page 27: CitiViz: A Visual User Interface to the CITIDEL System ECDL 2004, Bath, England, September 2004 Nithiwat Kampanya, Rao Shen, Seonho Kim, Chris North, and

CitiViz initial interface

Page 28: CitiViz: A Visual User Interface to the CITIDEL System ECDL 2004, Bath, England, September 2004 Nithiwat Kampanya, Rao Shen, Seonho Kim, Chris North, and

1. Show me retrieved results from ACM DL

Page 29: CitiViz: A Visual User Interface to the CITIDEL System ECDL 2004, Bath, England, September 2004 Nithiwat Kampanya, Rao Shen, Seonho Kim, Chris North, and

1. Show me retrieved results from ACM DL

Page 30: CitiViz: A Visual User Interface to the CITIDEL System ECDL 2004, Bath, England, September 2004 Nithiwat Kampanya, Rao Shen, Seonho Kim, Chris North, and

2. “algorithm analysis”, by “Donald Knuth”

Page 31: CitiViz: A Visual User Interface to the CITIDEL System ECDL 2004, Bath, England, September 2004 Nithiwat Kampanya, Rao Shen, Seonho Kim, Chris North, and

2. “algorithm analysis”, by “Donald Knuth”

Clustering results

Page 32: CitiViz: A Visual User Interface to the CITIDEL System ECDL 2004, Bath, England, September 2004 Nithiwat Kampanya, Rao Shen, Seonho Kim, Chris North, and

2. “algorithm analysis”, by “Donald Knuth”

Page 33: CitiViz: A Visual User Interface to the CITIDEL System ECDL 2004, Bath, England, September 2004 Nithiwat Kampanya, Rao Shen, Seonho Kim, Chris North, and

3. “data compression”

Page 34: CitiViz: A Visual User Interface to the CITIDEL System ECDL 2004, Bath, England, September 2004 Nithiwat Kampanya, Rao Shen, Seonho Kim, Chris North, and

3. “data compression”

Page 35: CitiViz: A Visual User Interface to the CITIDEL System ECDL 2004, Bath, England, September 2004 Nithiwat Kampanya, Rao Shen, Seonho Kim, Chris North, and

Evaluation Tasks

1. Given an author and a topic, find a document published by that author and belonging to that topic.

2. Given an author and a publication year, find a document published by that author and in that year.

3. Given a title, find a document having that title.

4. Find the most recently published paper.

Page 36: CitiViz: A Visual User Interface to the CITIDEL System ECDL 2004, Bath, England, September 2004 Nithiwat Kampanya, Rao Shen, Seonho Kim, Chris North, and

Evaluation Results

Page 37: CitiViz: A Visual User Interface to the CITIDEL System ECDL 2004, Bath, England, September 2004 Nithiwat Kampanya, Rao Shen, Seonho Kim, Chris North, and

Discussion

Users performed chosen tasks faster with CitiViz than with the standard interface for CITIDEL.

No significant difference for tasks 1 and 3 between CitiViz when using clustering versus when using ACM classification

Possible explanation of differences observed with tasks 2 and 4: the clustering yields one level towers, and some users were confused about the multi-level

towers resulting from the ACM classification

Page 38: CitiViz: A Visual User Interface to the CITIDEL System ECDL 2004, Bath, England, September 2004 Nithiwat Kampanya, Rao Shen, Seonho Kim, Chris North, and

Conclusions

Text mining + information visualization Document clustering provides insights for users. Overview of document attributes in the 2D scatter

plot Overview of hierarchical concept map displayed as

a hyperbolic tree supports “focus+context” navigation.

Integrated the 2D scatter plot space with a network of citations.

Online tutorial and system – also animation.

Page 39: CitiViz: A Visual User Interface to the CITIDEL System ECDL 2004, Bath, England, September 2004 Nithiwat Kampanya, Rao Shen, Seonho Kim, Chris North, and

Future Work

Add more Data Source Components (DSC) Current DSC for CITIDEL = DSC for all its member DLs DSC: send query, parse HTML to XML, cluster result data Develop other DSCs for different DLs (e.g., NDLTD)

Improve clustering component (S. Kim) Extend CITIDEL content Test usability of CitiViz with broad base of users

Page 40: CitiViz: A Visual User Interface to the CITIDEL System ECDL 2004, Bath, England, September 2004 Nithiwat Kampanya, Rao Shen, Seonho Kim, Chris North, and

Summary

Envision CITIDEL Other Related Works Research Questions CitiViz Homepage, Architecture Visualization Strategies, Examples Evaluation Conclusions, Future Work