Upload
jamese
View
26
Download
1
Embed Size (px)
DESCRIPTION
Impact of Information Architecture on Content Digitization and SEO. ASIDIC Spring 2007 Meeting S. Gurke SVP, Knovel Corp. Topics. Transforming e-book collection into reference database Why transform? Impact on content digitization Impact on search engine optimization (SEO) - PowerPoint PPT Presentation
Citation preview
Impact of Information Architecture on Content
Digitization and SEO
ASIDIC Spring 2007 MeetingS. Gurke
SVP, Knovel Corp.
Topics
• Transforming e-book collection into reference database
• Why transform?
• Impact on content digitization
• Impact on search engine optimization (SEO)
• Impact on pricing and revenue
Transforming E-Book Collection into Reference Database
• Collection– Formatted content is outside database– Book presentation – Search full text and metadata indexes
• Database (XML)– Unformatted content is inside database– Database presentation (chunks)– Search tagged content chunks with metadata
Why Transform?
• Better reflect use patterns
• Streamline search and improve relevancy ranking
• Increase usage
Content Digitization Now
• Content– Text (PDF, HTML)– Metadata (TOC, etc.) – Interactive tables, graphs, equations
• Process (outsourced)– Scanning/OCRing– Keying– Special techniques (e.g., graph calibration)
• Tools– CMS– SQL database
Challenges
• Conversion from relational to XML database– Metadata– Interactive content – Text
• Chunking and tagging text– Who does it?– TOC and Subject Index as chunk metadata
Impact on SEO
• Exposing secure content
• Making metadata work– Title and author– TOC and Subject Index
• Benefits for users– Finding information made easier– Comprehensive search– Improved relevancy
Impact on Pricing and Revenue
• Pricing models– Subscription– Usage based
• Transaction• By the drink
– Hybrid/Enterprise
• Ad-based STM revenue
• Usage based royalties