Lead: Michael Brauer
Co-Lead: Svante Schubert
These are the Wiki pages of the OpenOffice.org XML Project.
The following information can be found on these pages:
- 1 Current Efforts of the XML Project
- 2 Contribution Areas
- 3 Articles and Books
- 4 Reports and Research Documents covering the OpenDocument format
- 5 OpenDocument Tools at OpenOffice.org
- 6 External OpenDocument Tools
Current Efforts of the XML Project
- OpenDocument: Contributions to the formula specification that is developed by the OASIS OpenDocument Formula Subcommittee.
- OpenDocument: Contributions to the metadata enhancements for OpenDocument v1.2 that are developed by the OASIS OpenDocument Metadata Subcommittee.
- OpenDocument: Contributions to the OpenDocument v1.2 specification, including reviews of the specification documents.
The OpenOffice.org XML project is looking for contributions in the areas listed below. If you are interested in working in one of these areas, please contact us at email@example.com.
XML Based Filters
The following suggestions are related to the framwork for XML based filters.
Detection of the file type
Every time a document is being imported into OpenOffice.org the so called filter detection chooses between the existing filter implementations to find the one matching the type of document that shall be loaded. For filters that load XML documents using the filter framework, the filter detection only searches for a string with the first 1000 characters of a document. This string can be specified in the XML Filter Settings dialog.
A better solution, that we would like to have instead, would be to check the name and or XML namespace of the root node of the XML document.
Access to images
Currently XML based filters have only access to the BASE64 encoded data of embedded images an objects. Having also access to the binary data (for instance by unpacking the zip file of an OpenDocument file) or having a simple way to convert the BASE64 data into a binary file would simplify the developmenmt of filters.
Currently error logging is comparable weak for the XML based / XSLT filters. The only way to enable logging is to set a Java environment variable (e.g. -DXSLTransformer.statsfile=/usr/local/offices/xslt_debug.txt) in the Office options for Java.
A few new features are imaginable, such as:
- Customizing the filter logging via GUI
- Usage of defined log level (analog to Java Logging)
- GUI flag for a cumulative log file (instead replacing log for every transformation)
Currently, no XML validation takes place during the import or export of documents. Validation is only possible from a test dialog. This test dialog might be dropped in favor of an external development tools and/or (optional) validation during runtime. To achieve this, it is not sufficient to reuse the existing functionality, because it supports only DTDs, but no Relax-NG or other XML schema langauges like Schematron.
Articles and Books
|Articles and Books|
|OASIS OpenDocument Essentials||OASIS OpenDocument Essentials introduces you to the XML that serves as the native internal format of OpenOffice.org. You should read this book if you want to extract data from an OpenDocument document, convert your data to an OpenDocument document, or simply find out how OpenOffice.org stores its data under the hood. The book is written by J. David Eisenberg for O'Reilly & Associates; the content is currently licensed under a Creative Commons License.|
|Adventures with OpenOffice and XML||An article on XML.com, which shows how to use the XML File Format within the AxKit content management system.|
|Thinking XML: The open office file format||A nice introduction to the OpenOffice.org XML File Format; also goes into standardization at OASIS. Introduces the general document structure and demonstrates how to extract content.|
|From Open Source to Open Standard: The OASIS OpenDocument Format||Paper for the "From Open Source to Open Standard: The OASIS OpenDocument Format" XTech 2005 presentation by Michael Brauer.|
More articles and book on OpenDocument can be found at opendocument.xml.org.
Reports and Research Documents covering the OpenDocument format
More reports and resarch document can be found on opendocument.xml.org.
OpenDocument Tools at OpenOffice.org
|Tools at OpenOffice.org|
|SAXEcho||SAXEcho is a tool to be used with the OpenOffice.org application. It allows to view the XML representation of a document at runtime. It can either display the native XML file format representation of an in-memory document, or the output of an an XSLT transformation on said document. It is written in Java.|
|xfilter tool||This page documents the
External OpenDocument Tools
|ODFDOM||ODFDOM is an OpenDocument (ODF) framework. It's purpose is to provide an easy common way to create, access and manipulate ODF files, without requiring detailed knowledge of the ODF specification.|
|ODF Validator||ODF Validator is a Java tool that validates OpenDocument files and checks them for certain conformance criteria.|
|ODF Online Validator||ODF Online Validator is an online validation and conformance check tool for ODF documents.|
|ODF XSLT Runner||ODF XSLT Runner is a small Java command line application that allows you to apply XSLT stylesheets to XML streams included in ODF packages without extracting them from the package.|
|ODF XSLT Runner Task||ODF XSLT Runner Task is a task definition for Ant which allows to apply XSLT stylesheets to ODF documents similar to Ant's build-in <xslt> task.|
|The Perl OpenDocument Connector||This toolkit allows direct OpenDocument file update and creation from Perl scripts. It works with the main document classes (texts, spreadsheets, presentations, and drawings), and provides an easy access to a large set of content and presentation elements.|
|CMFOODocument||OpenOffice Documents for icoya Content & Collaboration Management - With CMFOODocument OpenOffice Writer files can easily be integrated inside your Plone site. The OO-Documents are converted by XSL transformation into HTML.|
|OpenOffice.org Perl Library||The OpenOffice.org Perl Library (ooolib) can be used to create simple OpenOffice.org Calc spreadsheet and Writer text documents.|