As the first step in the decommissioning of sasCommunity.org the site has been converted to read-only mode.


Here are some tips for How to share your SAS knowledge with your professional network.


Reading Microsoft Word XML files with SAS®

From sasCommunity
Jump to: navigation, search

Author

Larry Hoyle, Institute for Policy and Social Research, University of Kansas

Abstract

In 2005 Microsoft announced that the new default format for documents created in Microsoft Office will be XML-based. The ability of SAS to read XML offers a convenient method for extracting structured information from Microsoft Word documents. This paper examines three scenarios where information from a Word document is read into SAS datasets: extracting text along with associated properties (styles and attributes), extracting all data from tables, and extracting coordinates of objects in drawings.

Online Materials

View the pdf for [http://www2.sas.com/proceedings/sugi31/019-31.pdf Reading Microsoft Word XML files with SAS®.


Files for this and a related paper are available at: http://www.ipsr.ku.edu/ksdata/sashttp/sugi31/

Contact Info

User:LarryHoyle