← Seminar by Tiff Macklem: Lessons Learned from the Global Financial Crisis – Monday, March 30, 2015

Using PROC SQL to Find Uncommon Observations Between 2 Data Sets in SAS →

Separating Unique and Duplicated Observations Using PROC SORT in SAS 9.3 and Newer Versions

April 10, 2015 5 Comments

As Fareeza Khurshed commented in my previous blog post, there is a new option in SAS 9.3 and later versions that allows sorting and the identification of duplicates to be done in one step. My previous trick uses FIRST.variable and LAST.variable to separate the unique observations from the duplicated observations, but that requires sorting the data set first before using the DATA step to do the separation. If you have SAS 9.3 or a newer version, here is an example of doing it in one step using PROC SORT.

There is a data set called ADOMSG in the SASHELP library that is built into SAS. It has an identifier called MSGID, and there are duplicates by MSGID. Let’s create 2 data sets out of SASHELP.ADOMSG:

DUPLICATES for storing the duplicated observations
SINGLES for storing the unique observations

proc sort
     data = sashelp.adomsg
          out = duplicates
          uniqueout = singles
          nouniquekey;
     by msgid;
run;

Here is the log:

NOTE: There were 459 observations read from the data set SASHELP.ADOMSG.
NOTE: 300 observations with unique key values were deleted.
NOTE: The data set WORK.DUPLICATES has 159 observations and 6 variables.
NOTE: The data set WORK.SINGLES has 300 observations and 6 variables.
NOTE: PROCEDURE SORT used (Total process time):
real time 0.28 seconds
cpu time 0.00 seconds

Note that the number of observations in WORK.DUPLICATES and WORK.SINGLES add to 459, the total number of observations in the original data set.

In addition to Fareeza, I also thank CB for sharing this tip.

Filed under Data Analysis, SAS Programming Tagged with data analysis, data manipulation, duplicate, first.variable, last.variable, nouniquekey, PROC SORT, SAS, sas programming, sorting, unique, uniqueout

5 Responses to Separating Unique and Duplicated Observations Using PROC SORT in SAS 9.3 and Newer Versions

Ron Walker says:

May 19, 2016 at 1:18 pm

My thanks as well for the first post and for this tip – it worked flawlessly with my data. I’m relatively new to writing SAS code, and I was able to show this tip to my SAS mentor and teach him a cool new trick (for once). Much appreciated!

Reply
- Eric Cai - The Chemical Statistician says:
  
  May 20, 2016 at 4:41 pm
  
  You’re welcome, Ron! Thanks for visiting my blog!
  
  Reply
Alex Feng says:

May 31, 2016 at 8:03 pm

Great tips. Thanks Eric!

Reply
- Eric Cai - The Chemical Statistician says:
  
  June 26, 2016 at 2:41 pm
  
  You’re welcome, Alex!
  
  Reply
PEPGRA Healthcare says:

June 26, 2020 at 5:51 am

Thanks for writing this blog. It is very much informative and at the same time useful for me

Reply

	Eric Cai - The Chemi… on Convert multiple variables bet…
	Jack on Convert multiple variables bet…
	Eric Cai - The Chemi… on Getting the names, types, form…
	Emily V on Getting the names, types, form…
	Lauren McClain on Convert multiple variables bet…
	Eric Cai - The Chemi… on Convert multiple variables bet…
	Lauren McClain on Convert multiple variables bet…
	Eric Cai - The Chemi… on Exploratory Data Analysis: Com…
	CK on Exploratory Data Analysis: Com…
	Eric Cai - The Chemi… on Video Tutorial: Breaking Down…

The Chemical Statistician

Separating Unique and Duplicated Observations Using PROC SORT in SAS 9.3 and Newer Versions

5 Responses to Separating Unique and Duplicated Observations Using PROC SORT in SAS 9.3 and Newer Versions

Your thoughtful comments are much appreciated! Cancel reply

Eric’s Twitter Feed (@chemstateric)

Recent Comments

Popular Topics

Recent Posts

About Eric

Blogs and Web Sites That I Like to Read

Archives

Categories

The Chemical Statistician

Separating Unique and Duplicated Observations Using PROC SORT in SAS 9.3 and Newer Versions

Share this:

Related

5 Responses to Separating Unique and Duplicated Observations Using PROC SORT in SAS 9.3 and Newer Versions

Your thoughtful comments are much appreciated! Cancel reply

Eric’s Twitter Feed (@chemstateric)

Recent Comments

Popular Topics

Recent Posts

About Eric

Blogs and Web Sites That I Like to Read

Archives

Categories