Reading Microsoft Word XML files with SAS®

From sasCommunity

Larry Hoyle, Institute for Policy and Social Research, University of Kansas


In 2005, Microsoft announced that the new default format for documents created in Microsoft Office would be XML-based. Because SAS can read XML, it offers a convenient way to extract structured information from Microsoft Word documents. This paper examines three scenarios in which information from a Word document is read into SAS data sets: extracting text along with its associated properties (styles and attributes), extracting all data from tables, and extracting the coordinates of objects in drawings.
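To make the first scenario concrete, here is a minimal sketch of what "text along with associated properties" looks like in Word XML. It is not the paper's SAS code: it uses Python only as a neutral way to show the document structure. It assumes the Word 2003 WordprocessingML flavor of the format, and the sample document and the helper name `paragraphs` are invented for illustration.

```python
import xml.etree.ElementTree as ET

# Word 2003 WordprocessingML namespace (assumed document flavor).
W = "http://schemas.microsoft.com/office/word/2003/wordml"

# Tiny hand-written sample; a real Word file carries far more markup.
SAMPLE = f"""<w:wordDocument xmlns:w="{W}">
  <w:body>
    <w:p>
      <w:pPr><w:pStyle w:val="Heading1"/></w:pPr>
      <w:r><w:t>Results</w:t></w:r>
    </w:p>
    <w:p>
      <w:r><w:t>Plain </w:t></w:r>
      <w:r><w:rPr><w:b/></w:rPr><w:t>text</w:t></w:r>
    </w:p>
  </w:body>
</w:wordDocument>"""


def paragraphs(xml_text):
    """Return one (style, text) row per w:p element (hypothetical helper)."""
    root = ET.fromstring(xml_text)
    rows = []
    for p in root.iter(f"{{{W}}}p"):
        # The paragraph style, if any, is stored as w:pPr/w:pStyle/@w:val.
        style = p.find(f"{{{W}}}pPr/{{{W}}}pStyle")
        name = style.get(f"{{{W}}}val") if style is not None else None
        # A paragraph's text is split across runs (w:r), each holding w:t.
        text = "".join(t.text or "" for t in p.iter(f"{{{W}}}t"))
        rows.append((name, text))
    return rows


rows = paragraphs(SAMPLE)
```

Each tuple in `rows` corresponds to one observation in the kind of data set the paper describes, with the style name as one variable and the paragraph text as another.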

Online Materials

View the PDF of Reading Microsoft Word XML files with SAS®.

Files for this and a related paper are available at:

Contact Info