Chem4Word Project
In partnership with Dr. Peter Murray-Rust and his team at the Unilever Centre for Molecular Science Informatics, Microsoft Research is investigating the introduction of chemistry-related features in Microsoft Office Word, including authoring and semantic annotations. Our approach to chemistry authoring will be modeled after the mathematic equation authoring in Word 2007 and will leverage many of the user-interface and XML extensibility options that are provided by Office 2007.
The Microsoft External Research vision includes support for the scholarly communication lifecycle through software and services so that data and information flow in a coordinated and seamless fashion, from authoring through publication to long term information preservation. The Chem4Word project aims to simplify the authoring of chemical information in Word - specifically the inclusion of chemical structures. This project will also demonstrate how semantic information can be captured at authoring time as the way to more accurately represent the chemical content, create high quality depictions, contribute to simpler pre-publication processes and richer information discovery scenarios, as well as to preserve chemical information for archival purposes.
Microsoft Office Word 2007 introduced the ability to add, edit and display mathematical equations. Significantly, these equations are not only of high visual quality on the screen and in print, but are also stored within the OOXML document using structured XML markup (OMML), which creates opportunities for rich data mining scenarios. For example, it is now possible to leverage the semantics of mathematical equations, and not just the (end-user) perceived character strings. You can read more about Math in Office on Murray Sargent's blog.
The goal of the Chem4Word project is to enable similar authoring, display, and mining scenarios for chemistry-related information within Office Word. Specifically, we aim to:
- Provide easy authoring of chemical information within Microsoft Office Word 2007 documents
- Allow end-user denotation of inline "chemical zones"
- Render high-quality, print-ready visual depictions of chemical structures
- Store and expose chemical information in a semantically rich manner to support publishing and mining scenarios, for authors, readers, publishers, and other vendors across the broad chemical information community
People
- Savas Parastatidis
- Rudy Potenzone
- Murray Sargent
- Geraldine Wade
- Alex Wade
Collaborations
Events
|
|
|