Word Processing Data Type

The native data type of MS WORD is called MS WORD format, where the term "format" applies at the level of the whole file. Its file extension is .doc. This is an example of a company-specified data type.

[Figure: "Save As" dialog box]

Most word processors, including WORD, will also import from and export to a variety of foreign data types, including their own native data type as used in earlier versions of themselves.

The ability of a program to import and export data created in previous versions of itself is called downward compatibility (also known as backward compatibility).

 

Embedded or Referenced Inclusions

Images and other non-character data in a word processing file are typically embedded. This means that the data is present in the file itself, rather than being stored in a separate file and merely "pointed at" (referenced) from the word processing file. We saw an example of referencing in HTML files that include an image.

Typically, word processors (e.g. WORD) embed their alien data objects. Desktop publishing programs (e.g. QuarkXPress) are more likely to reference them.
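The difference between referencing and embedding can be sketched in a few lines of Python, using HTML as the container since that is where referencing was seen earlier. This is only an illustration, not how WORD stores its objects: the function names and the file name photo.gif are hypothetical, and the embedded variant uses a base64 "data:" URL as a stand-in for "the image bytes live inside the document itself".

```python
import base64


def referenced_img(filename: str) -> str:
    """Reference: the document stores only a pointer to a separate file."""
    return f'<img src="{filename}">'


def embedded_img(image_bytes: bytes, mime_type: str) -> str:
    """Embed: the image bytes themselves are carried inside the document."""
    payload = base64.b64encode(image_bytes).decode("ascii")
    return f'<img src="data:{mime_type};base64,{payload}">'


# Hypothetical stand-in for real image data: just the 6-byte GIF signature.
fake_gif = b"GIF89a"

by_reference = referenced_img("photo.gif")       # breaks if photo.gif is moved
by_embedding = embedded_img(fake_gif, "image/gif")  # travels with the document
```

The trade-off is visible in the strings themselves: the referenced form stays small but depends on a second file being present, while the embedded form grows with the image but is self-contained.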

Rich Text Format

For portability, files can be exported to a company-independent data type called Rich Text Format (RTF), which is a text-based standard for coding word processing files in a manner analogous to how HTML is used to code web pages. RTF is a typographical standard, that is, one destined for the printed page.
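To make the analogy with HTML concrete, here is a minimal Python sketch that writes out a tiny RTF document as ordinary text. The helper names escape_rtf and make_rtf are hypothetical, the font choice is arbitrary, and the sketch assumes ASCII-only input; it covers only a header, one plain run, and one bold run.

```python
def escape_rtf(text: str) -> str:
    """Escape the three characters that have special meaning in RTF."""
    # Backslash must be handled first so later escapes are not doubled.
    return text.replace("\\", "\\\\").replace("{", "\\{").replace("}", "\\}")


def make_rtf(plain: str, bold: str) -> str:
    """Build a tiny RTF document: a plain run followed by a bold run."""
    # Header: RTF version 1, ANSI character set, default font 0, font table.
    header = r"{\rtf1\ansi\deff0{\fonttbl{\f0 Times New Roman;}}"
    # Body: font 0 at 12 pt (\fs is in half-points); {\b ...} is a bold group.
    body = r"\f0\fs24 " + escape_rtf(plain) + r"{\b " + escape_rtf(bold) + r"}\par"
    return header + body + "}"


doc = make_rtf("RTF is plain text with ", "markup")
# Saving doc to a file named with the .rtf extension should let a word
# processor open it; that step is left out here to keep the sketch side-effect free.
```

As with HTML tags, the formatting instructions (control words such as \b and \par, and groups delimited by braces) sit in the same text stream as the content, which is why the result can be inspected in any text editor.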

RTF cannot represent everything possible in WORD, but it covers the basics. For more information on RTF, see the Rich Text Format (RTF) Version 1.5 Specification.