From: Dennis Lee Bieber
Newsgroups: comp.lang.ada
Subject: Re: Some advice required [OT]
Date: Mon, 27 Dec 2021 14:51:55 -0500
Organization: IISS Elusive Unicorn

On Mon, 27 Dec 2021 10:24:21 -0800 (PST), Laurent declaimed the following:

>
>Yes those are the cases which are annoying me.
>
>That's why I came up with the idea of multiplying the value of the result (S=1, I=2 and R=3) with the position of the value.
>Tried it with triplets but there will still be cases where different results will give the same numeric value.
>Ignoring empty cells for the moment.
>

Multiplying by column position inherently gives priority to the column with the highest position. If the columns are not, of themselves, significant, your algorithm needs to ignore column position (reordering the columns should not change the final selection).

You could just about as easily do a multi-column sort giving priority to the most significant column.

>Strain F: SSR (1*1+2*1+3*3) =12 and Strain G: RRS (1*3+ 2*3+3*1) = 12 will be the same numerical value but they are different resistance profiles
>I would in this case keep both.

So far as I can make out -- ANY collision qualifies as "different resistance profiles". In that example, the count of Ss vs the count of Rs differ, but...

>The results are way longer than only 3 values so the possibilities for collisions are higher.
>
>R R R R R S R R R S S S R S S => numeric:1812180608
>R R R R R S R R R R S S S S S => numeric:1812180806
>
>I have to keep both and that was an easy one. Only 2 to compare not 5.
>

In this example the count of Ss and the count of Rs are the same between the two. And, again, you've applied an arbitrary ranking of the columns (changing the order of the columns will tend to produce wildly different sums).

>Yes there is a hierarchy in the drugs but that information is not available in the exported results I work with.

In that situation I would complain to the provider that the exported data is incompletely defined. At the very least, the columns should be in ascending (or descending) order of significance, justifying use of column position as a weight (even better would be to have a row of the data containing the weight to be used for a given column, which makes the column order irrelevant).

-- 
Wulfraed                 Dennis Lee Bieber         AF6VN
wlfraed@ix.netcom.com    http://wlfraed.microdiversity.freeddns.org/
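
For illustration only, here is a rough Python sketch of the points made above (Python rather than Ada purely for brevity). It uses the S=1, I=2, R=3 coding and the SSR/RRS strains quoted in the thread. The function names and the `priority` list are hypothetical, since the export does not say which drug column matters most.

```python
from collections import Counter

VALUE = {"S": 1, "I": 2, "R": 3}  # coding quoted in the thread


def position_weighted(profile):
    """Sum of value * 1-based column position (the scheme under discussion)."""
    return sum(VALUE[c] * pos for pos, c in enumerate(profile, start=1))


def counts(profile):
    """Order-independent summary: number of S, I and R results."""
    tally = Counter(profile)
    return tuple(tally[k] for k in "SIR")


def sort_key(profile, priority):
    """Multi-column key: compare columns in the given priority order."""
    return tuple(VALUE[profile[i]] for i in priority)


f, g = "SSR", "RRS"

# Different profiles collide under position weighting.
print(position_weighted(f), position_weighted(g))  # 12 12

# Reordering the columns changes the sum for the same set of results.
print(position_weighted(f[::-1]))  # 8

# Counts ignore column order, but they separate only some profiles.
print(counts(f), counts(g))  # (2, 0, 1) (1, 0, 2)

# Multi-column sort; this priority order is an assumption for the example.
priority = [2, 0, 1]
print(sorted([f, g], key=lambda p: sort_key(p, priority), reverse=True))
```

Counts alone do not distinguish the two 15-column profiles quoted above (both have six S and nine R), so if every distinct profile has to be kept, the profile itself is the only collision-free key.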