Office Binary <-> OpenXML translator project on SourceForge
The following milestones are planned in the Office Binary (doc, xls, ppt) Translator to Open XML project
-
Milestone 0 (February 15th, 2008): Launching the project on SourceForge
- outline of the anatomy of binary Office documents (Compound file format, streams inside, ...)
- brief description of the streams in Word documents
- initial doc/docx mappings, e.g. for the main character and paragraph attributes
-
Milestone 1 (April 1st, 2008): First results
- technical specification how to read and parse Office documents, in particular Word documents
- a first version of a parser (command line tool) being able to read and interprete the individual components (streams) of a binary Word document
- enhanced doc/docx mapping documentation
- detailed scope of the translator to be implemented for the next milestone
-
Milestone 2 (June 30th, 2008): Final Word translator and more
- translator which reads binary Word documents and creates Open XML Word documents
- final documentation on binary Word documents and doc/docx mapping
- preliminary documentation on binary PowerPoint and Excel architecture and ppt/pptx,xls/xlsx mapping
