As the first step in the decommissioning of sasCommunity.org the site has been converted to read-only mode.


Here are some tips for How to share your SAS knowledge with your professional network.


Name Pursing Algorithm

From sasCommunity
Jump to: navigation, search

by Sata Hackenbruck

This paper presents name pursing algorithm that can be used to split a long character string into meaningful columns.

The researchers were presented with datasets that contained a person's full name in one long character field. However, since this was a primary matching field, the name needed to be split into a first name, last name, middle initial, etc. To do this, the researchers developed a name pursing algorithm which helped split the long character field into meaningful columns. The specific example uses a patient name string to try to figure out the last name, first name, middle name, and many things in between that need to be cleaned.


Example datasets:


Presented at a State of Oregon SAS Users Group meeting.

Find more SOSUG presentations, as well as other tips, tricks, and tools created by or recommended by SOSUG members here.