jmp-l

Re: How to remove a row which contains a duplicate column entry?

To: jmp-l@lists.biostat.wustl.edu
Subject: Re: How to remove a row which contains a duplicate column entry?
From: bdahman@vcu.edu
Date: Wed, 11 May 2005 13:02:44 -0400
Cc: "James T Metz" <james.metz@abbott.com>, jmp-l@lists.biostat.wustl.edu, jmp-l-owner@lists.biostat.wustl.edu
In-reply-to: <OF2D957BD6.514E1C95-ON86256FFE.00508329@northamerica.intra.abbott.com>

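Try this code. It joins the table to itself on Compound ID and drops multiple matches, so the output table keeps one row per Compound ID: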

Data Table("yourtablename") << Join(
    With(Data Table("yourtablename")),
    By Matching Columns(:Compound ID = :Compound ID),
    Drop Multiples(1, 1),
    Include Non Matches(0, 0),
    Output Table("your outputTable name")
);
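
If you would rather mark the repeated rows than build a new table, something along these lines may also work. This is only a sketch and was not part of the original reply: it assumes the table is open under the placeholder name "yourtablename", that the column is called "Compound ID", and the variable names (seen, dupRows) are made up for illustration. It treats the first occurrence of each ID as the one to keep and collects the row numbers of every later occurrence.

```jsl
// Sketch only: table and column names are placeholders taken from the thread.
dt = Data Table( "yourtablename" );
col = Column( dt, "Compound ID" );

seen = Associative Array();   // IDs encountered so far
dupRows = {};                 // row numbers of repeat occurrences

For( i = 1, i <= N Rows( dt ), i++,
	id = col[i];
	If( seen << Contains( id ),
		Insert Into( dupRows, i ),   // 2nd, 3rd, ... occurrence of this ID
		seen[id] = 1                 // first occurrence: remember it
	);
);

// Select (mark) the repeats so they can be inspected before anything is deleted.
If( N Items( dupRows ) > 0,
	dt << Select Rows( dupRows )
);
// To remove them instead of selecting them: dt << Delete Rows( dupRows );
```

Selecting first rather than deleting is a cautious choice: with 25,000 rows it lets you check that the repeated rows really do carry the same information before they are removed.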

Bassam


"James T Metz" <james.metz@abbott.com>
Sent by: jmp-l-owner@lists.biostat.wustl.edu

05/11/2005 10:56 AM
Please respond to
jmp-l@lists.biostat.wustl.edu

To
jmp-l@lists.biostat.wustl.edu
cc
"James T Metz" <james.metz@abbott.com>
Subject
[jmp-l] How to remove a row which contains a duplicate column entry?

JMP Users,


Can someone please help?

I am looking for a procedure, formula, or JSL script that will either mark or remove rows from a data table when one of the columns, e.g., "Compound ID", contains the same entry more than once (duplicates, triplicates, etc.).

For example, I have a data table.  One of my (character) columns is named "Compound ID".  I have 3 rows out of 25,000 rows which have the same Compound ID, "JTM-00001A".  I also have other duplicates, such as 2 rows with Compound ID "JTM-12345B", 2 rows with Compound ID "JTM-67895A", and so forth.  I want to generate another table which contains single, unique Compound IDs, i.e., there are no duplicate names.  You may also assume that you can either mark or remove any of the duplicate or triplicate or ... rows, i.e., the information in the other columns is the same.  I want to retain all of the rows which are not duplicates, triplicates, etc.

Does anyone have an intelligent way to do this?  I can search for (and remove) duplicates using the Histogram platform (with character data), but this is very tedious and inefficient!

I would be most grateful if someone could help.  Thank you!


       Regards,

       Jim Metz



James T. Metz, Ph.D.
Research Investigator Chemist

GPRD R46Y AP10-2
Abbott Laboratories
100 Abbott Park Road
Abbott Park, IL  60064-6100
U.S.A.

Office (847) 936 - 0441
FAX    (847) 935 - 0548

james.metz@abbott.com

