Topic Summary

Posted by: hsei
« on: April 22, 2011, 17:41:12 »

To make my recommendations clearer:
If I use the precise algorithm alone, I get less than 1% false pairs at a threshold of 70%.
If I additionally use the standard algorithm ("content") at a threshold of about 90%, I get a few percent more pairs (with a "precise" rating close to 0), but with a higher probability of false pairs.
I don't trust auto-marking; I just use it as a "click-saver".
Posted by: Springdream
« on: April 22, 2011, 11:24:34 »

Well, if you go below 85%, many live versions or different-language versions are treated as the same...
But I now use 85%. That yields 1% more findings than using 90%.
Posted by: hsei
« on: April 07, 2011, 21:16:56 »

I agree partially:
A "precise threshold" of 85-90% is reasonable. But I additionally use a standard threshold of 80% (the two are combined by OR), since the precise algorithm sometimes misses similar pairs completely. Of course these "standard" candidates have to be checked by listening, since the standard algorithm is much less reliable, but quite often I have found "true" similar pairs this way.
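
As a rough illustration of the OR combination described above, here is a minimal Python sketch. It is not the program's actual code or API: the `Pair` class, the `classify` function, and the label strings are hypothetical names made up for this example. Only the two thresholds (85% precise, 80% standard) come from the post.

```python
from dataclasses import dataclass


@dataclass
class Pair:
    """A candidate pair with both similarity ratings, in percent (hypothetical structure)."""
    name: str
    precise: float   # rating from the "precise" algorithm
    standard: float  # rating from the standard ("content") algorithm


def classify(pair: Pair,
             precise_threshold: float = 85.0,
             standard_threshold: float = 80.0) -> str:
    """Combine both thresholds by OR, but keep track of which one matched."""
    if pair.precise >= precise_threshold:
        # The precise algorithm is the more reliable one.
        return "likely duplicate"
    if pair.standard >= standard_threshold:
        # Found only by the less reliable standard algorithm.
        return "check by listening"
    return "no match"


# Made-up example ratings, for illustration only.
pairs = [
    Pair("A vs. B", precise=92.0, standard=95.0),
    Pair("C vs. D", precise=0.0, standard=88.0),
    Pair("E vs. F", precise=40.0, standard=60.0),
]
for p in pairs:
    print(p.name, "->", classify(p))
```

A pair is reported if either rating reaches its threshold, but pairs found only by the standard rating are flagged for manual checking rather than trusted outright.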
Posted by: Springdream
« on: April 07, 2011, 17:30:45 »

I think the default settings are not good with a premium subscription.
The best results may come from checking ONLY for PRECISE >90% and nothing else.
The same applies to auto-marking files...

If I am correct, please add that information at least to the "First Step" guide.