comp.lang.ada
 help / color / mirror / Atom feed
From: Dennis Lee Bieber <wlfraed@ix.netcom.com>
Subject: Re: Some advice required [OT]
Date: Mon, 27 Dec 2021 14:51:55 -0500	[thread overview]
Message-ID: <k05ksg9487nd13shdjgogl2j9eg6tq5c85@4ax.com> (raw)
In-Reply-To: 31332c61-a370-43a5-bbe0-efe338ee6d8fn@googlegroups.com

On Mon, 27 Dec 2021 10:24:21 -0800 (PST), Laurent <lutgenl@icloud.com>
declaimed the following:

>
>Yes those are the cases which are annoying me.
>
>That's why I came up withe idea of multiplying the value of the result (S=1, I=2 and R=3) with the position of the value. 
>Tried it with triplets but there will still be cases where different results will give the same numeric value.
>Ignoring empty cells for the moment.
>
	Multiplying by column position inherently gives priority to the column
with the highest position. If the columns are, of themselves, not
significant, your algorithm needs to ignore column (reordering the columns
should not make a change in final selection). You could just about as
easily do a multi-column sort giving priority to the most significant
column.

>Strain F: SSR (1*1+2*1+3*3) =12 and Strain G: RRS (1*3+ 2*3+3*1) = 12 will be the same numerical value but they are different resistance profiles
>I would in this case keep both.

	So far as I can make out -- ANY collision qualifies as "different
resistance profiles".  In that example, the count of Ss vs the count of Rs
differ, but...

>The results are way longer than only 3 values so the possibilities for collisions are higher.
>
>R	R	R	R	R	S	R	R	R	S	S	S	R	S	S  => numeric:1812180608
>R	R	R	R	R	S	R	R	R	R	S	S	S	S	S  => numeric:1812180806
>
>I have to keep both and that was an easy one. Only 2 to compare not 5.
>

	In this example the count of Ss and count of Rs is the same between the
two. And, again, you've applied an arbitrary ranking of the columns
(changing the order of the columns will tend to produce wildly different
sums).

>Yes there is a hierarchy in the drugs but that information is not available in the exported results I work with.

	In that situation I would complain to the provider that the exported
data is incompletely defined. At the very least, the columns should be in
ascending (or descending) order of significance, justifying use of column
position as a weight (even better would be to have a row of the data
containing the weight to be used for a given column, which makes the column
order irrelevant).


-- 
	Wulfraed                 Dennis Lee Bieber         AF6VN
	wlfraed@ix.netcom.com    http://wlfraed.microdiversity.freeddns.org/

  reply	other threads:[~2021-12-27 19:51 UTC|newest]

Thread overview: 25+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2021-12-27  9:21 Some advice required [OT] Laurent
2021-12-27 11:16 ` Niklas Holsti
2021-12-27 12:29   ` Laurent
2021-12-27 13:14     ` Ben Bacarisse
2021-12-27 18:24       ` Laurent
2021-12-27 19:51         ` Dennis Lee Bieber [this message]
2021-12-27 20:49         ` Ben Bacarisse
2021-12-27 22:09           ` Laurent
2021-12-28  0:29             ` Ben Bacarisse
2021-12-28  7:48               ` Laurent
2021-12-28  9:05                 ` Laurent
2021-12-28 12:54                   ` Laurent
2021-12-28 13:57                     ` Ben Bacarisse
2021-12-28 18:19                       ` Laurent
2021-12-28 13:43                 ` Ben Bacarisse
2021-12-28 16:49                 ` Dennis Lee Bieber
2021-12-29  4:20                   ` Randy Brukardt
2021-12-27 17:41     ` Dennis Lee Bieber
2021-12-27 18:56       ` Niklas Holsti
2021-12-27 19:44         ` Laurent
2021-12-28  2:10     ` Randy Brukardt
2021-12-28  6:02       ` Laurent
2021-12-29  3:58         ` Randy Brukardt
2021-12-27 17:18 ` Simon Wright
2021-12-27 18:30   ` Laurent
replies disabled

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox