Upload
others
View
6
Download
0
Embed Size (px)
Citation preview
Competence Center ELAN Fraunhofer FOKUS
Competence Center ELAN Fraunhofer FOKUS
OOXML Validation and Document Test LibraryTowards Document Interoperability
Dr. Klaus-Peter Eckert
Document Interoperability Initiative DIIBrusselsNovember 12th, 2009
2© Fraunhofer FOKUS, Berlin
Competence Center ELAN Fraunhofer FOKUS
Agenda
Fraunhofer and the FOKUS competence center ELAN
FOKUS document interoperability lab
– Document and test library
– Online validator
– White paper on document interoperability
Document and test library
– Document standards
– Translation between standards
– Translation and conformity
– Document categories
Document validation
– Types of document conformance
– Validating semantic constraints
3© Fraunhofer FOKUS, Berlin
Competence Center ELAN Fraunhofer FOKUS
The FOKUS Competence Center ELANInteroperability and Innovation – Future of eGovernment
4© Fraunhofer FOKUS, Berlin
Competence Center ELAN Fraunhofer FOKUS
Fraunhofer in Germany
At present Fraunhofer maintains
80 research units
including 57 Fraunhofer institutes
at over 40 different locations
with a staff of 15.000 scientists & engineers
1.3 billion € annual budget
5© Fraunhofer FOKUS, Berlin
Competence Center ELAN Fraunhofer FOKUS
Competence Center ELAN
Cooperative and process-oriented eGovernment
Our Mission:
– One-Stop-Government
– Interoperability
– Innovative und secure architectures
– Infrastructures and business models at the interface to state and public administrations
Our Experience: eGovernment projects since 1995
– On federal, state and communal level
– International: EU (eTEN, IST), Japan, Brazil…
Facts & Figures: Competence Center ELAN operating since July 2004
– Approx. 50 employees
– More than 50 partners from industry and administration
8© Fraunhofer FOKUS, Berlin
Competence Center ELAN Fraunhofer FOKUS
From cradle to grave... “Forms, Forms, Forms“ & „ Standards, Standards, Standards“….
9© Fraunhofer FOKUS, Berlin
Competence Center ELAN Fraunhofer FOKUS
OOXML Validation and Document Test Library Document Interoperability Lab
12© Fraunhofer FOKUS, Berlin
Competence Center ELAN Fraunhofer FOKUS
Document Interoperability Lab in a nutshell
Fraunhofer FOKUS has initiated its Document Interoperability Lab
http://www.fokus.fraunhofer.de/go/is29500-validation
The lab is going to test the validity of documents regarding the document-standard ISO/IEC 29500 known as Office Open XML.
A document and test library consisting of valid template documents will be offered. Word processing, presentation as well as spreadsheet documents belonging to selected application domains and document categories can be retrieved and uploaded.
The online validator office-o-tron, developed by Alex Brown, is accessible via the document and test library. Fraunhofer FOKUS contributes to the definition of conformance rules.
Fraunhofer FOKUS has published a white paper on interoperability between ODF and OOXML, extending the initial DIN contribution to ISO SC34 WG5http://www.fokus.fraunhofer.de/go/wpdocinterop-en
13© Fraunhofer FOKUS, Berlin
Competence Center ELAN Fraunhofer FOKUS
Document Test Library in a nutshell
Main goals:
– Create a suite of documents for testing and verifying IS29500 interoperability
Create and maintain web site for document repository
– Up-/download documents
– Assign category tags to documents
– Maintain category tags
– Validate documents considering IS29500 and document categories
The document and test library is available at http://www.is29500-interoperability.com
14© Fraunhofer FOKUS, Berlin
Competence Center ELAN Fraunhofer FOKUS
IS29500 Validator in a nutshell
Main goals:
– Develop a tool to validate schema, package conventions and „semantic“ properties and categories of documents considering IS29500
Contribute to the open source project Office-o-tron that develops and maintains an online validation tool
– Define compliance rules
– Evaluate compliance rules
– Produce comprehensible error messages
– Accessible for users via the document library web page
Office-o-tron project will be lead by Griffin Brown Digital Publishing Ltd. and hosted by Fraunhofer FOKUS
15© Fraunhofer FOKUS, Berlin
Competence Center ELAN Fraunhofer FOKUS
OOXML Validation and Document Test LibraryDocument Test Library – Towards Document Categories
18© Fraunhofer FOKUS, Berlin
Competence Center ELAN Fraunhofer FOKUS
Why should a document format be standardized?What is the document?
Different representations
of a document exist
What is „the“ document?
What is „the“ standard?
19© Fraunhofer FOKUS, Berlin
Competence Center ELAN Fraunhofer FOKUS
Why should a document format be standardized?Continued
Different document
standards exist
Standards depend
on tasks
20© Fraunhofer FOKUS, Berlin
Competence Center ELAN Fraunhofer FOKUS
Reasons for translation of document formats
ISO 26300
ODF
ISO 29500
OOXML
21© Fraunhofer FOKUS, Berlin
Competence Center ELAN Fraunhofer FOKUS
Prerequisites for the translation of document formats
ISO 26300
ODF
ISO 29500
OOXML
Translation rules are defined
on the document standard
Translation rules are executed
on documents (instances)
Translation rules are not
standardized
Translation rules are not
comprehensive
23© Fraunhofer FOKUS, Berlin
Competence Center ELAN Fraunhofer FOKUS
Prerequisites for the translation of documents
ISO 26300
ODF
ISO 29500
OOXML
Translation rules are executed
on documents
Documents have to be
standard conform
Document must not use
extensions of the standards
Documents have to be build
of translatable parts
24© Fraunhofer FOKUS, Berlin
Competence Center ELAN Fraunhofer FOKUS
Document categories
Reduce the complexity and features to be used within documents
Definition of translatable document templates for special purposes
Improve the translatability of documents
Additional conformity tests are necessary – membership in a category
25© Fraunhofer FOKUS, Berlin
Competence Center ELAN Fraunhofer FOKUS
Classification hierarchy – classification example
26© Fraunhofer FOKUS, Berlin
Competence Center ELAN Fraunhofer FOKUS
Document categoriesInitial classification schema
Data Interchange, Data Manipulation (Aggregate/ETL/Cleansing), Financial Statements / Balance Sheets, Forms and Calculators (Expense Report, Input Forms, Invoices, Purchase Orders, Product Selection, etc.), Misc (Faxes, Memos, Stationary, etc.), Pitch Book, Scheduling (Budgets, Schedules, etc.), Stochastic Modelling/Simulation (Factory Floor, Financial, Process Flow, Valuation, etc.), Tracking List (Agenda, Calendar, Inventories, Receipts, Statements, Time Sheet, etc.), ...
– Architecture and Design, Conference Presentation ( aka “keynote” ), Informal, Management Meetings, Note Taking, Photo Book, Pitch Book, Sales Presentation, Self-Running, Signage, Training, ...
Additional attributes (optional, unique)
– Conformance type
– Language
Document type (mandatory, unique)
– Word processing
– Spreadsheets
– Presentation
Application domain (optional, ambiguous)
– Private
– Business
– Academic
– Legal
Category / subcategories (mandatory, ambiguous, type depending)
– Application (EU project app.), Article, Book, Essay, Flyer, Label, Legal Document, Letter, Meeting Notes, Memo, Newsletter, Notes, Outline, Proposal, Research Paper, Report, Résumé, Structured Document/Form, ...
– Application Add-ins, Budgeting (Forecasting/Actuals), Data Analysis,
27© Fraunhofer FOKUS, Berlin
Competence Center ELAN Fraunhofer FOKUS
Sample document categorizationUpload and retrieval
1. Word–Processing Spreadsheet Presentation
2. Word–Processing Spreadsheet Presentation
Private Business Academic Legal
3. Word–Processing Spreadsheet Presentation
Private Business Academic Legal
Poster Memo Meeting Note Research-Paper Report Proposal
Resume Essay Book Photo-Book Letter Data …
Spreadsheet Presentation
Private Business Legal
Poster Memo Meeting Note Proposal
Resume Essay Book Photo-Book Letter Data …
28© Fraunhofer FOKUS, Berlin
Competence Center ELAN Fraunhofer FOKUS
Document and Test LibraryHome
29© Fraunhofer FOKUS, Berlin
Competence Center ELAN Fraunhofer FOKUS
Document and Test LibraryLibrary
30© Fraunhofer FOKUS, Berlin
Competence Center ELAN Fraunhofer FOKUS
Document and Test LibraryBrowse
31© Fraunhofer FOKUS, Berlin
Competence Center ELAN Fraunhofer FOKUS
Document and Test LibraryCategories
32© Fraunhofer FOKUS, Berlin
Competence Center ELAN Fraunhofer FOKUS
Document and Test LibrarySearch
33© Fraunhofer FOKUS, Berlin
Competence Center ELAN Fraunhofer FOKUS
Sample presentation documentPhoto book
34© Fraunhofer FOKUS, Berlin
Competence Center ELAN Fraunhofer FOKUS
Sample wordprocessing documentTest document
35© Fraunhofer FOKUS, Berlin
Competence Center ELAN Fraunhofer FOKUS
Document and Test LibraryUpload
36© Fraunhofer FOKUS, Berlin
Competence Center ELAN Fraunhofer FOKUS
Document and test libraryLink to online validator
37© Fraunhofer FOKUS, Berlin
Competence Center ELAN Fraunhofer FOKUS
Online validation using Office-o-tronTest document
38© Fraunhofer FOKUS, Berlin
Competence Center ELAN Fraunhofer FOKUS
Online validation using Office-o-tronTest document (cont.)
39© Fraunhofer FOKUS, Berlin
Competence Center ELAN Fraunhofer FOKUS
Document Test LibraryConclusion
Collection of IS29500 compliant documents
Lib can be used to evaluate document interoperability of different applications
Lib can be used to evaluate translation between document standards
Upload and download of categorized documents
Definition of document categories
Support for different document languages
Close link to IS29500 validator
Challenges
– Different versions of document standards
– Adequate definition of the term „IS29500 compliance“ – what is a valid document?
– Relation between test library (categories) and test suite (features)
40© Fraunhofer FOKUS, Berlin
Competence Center ELAN Fraunhofer FOKUS
Dr. Klaus-Peter [email protected]
Fraunhofer Institute for Open Communication Systems
Kaiserin-Augusta-Allee 3110589 Berlin, Germany
www.fokus.fraunhofer.de
Thank you !
Any questions ?
54© Fraunhofer FOKUS, Berlin