Author Topic: Good default settings  (Read 27671 times)

hsei

Re: Good default settings
« Reply #15 on: April 07, 2011, 21:16:56 »
I partially agree:
A "precise" threshold of 85-90% is reasonable. But I also set a "standard" threshold of 80% (the two are combined by OR), since the precise algorithm sometimes misses similar pairs completely. Of course these "standard" candidates have to be checked by listening, since the standard algorithm is much less reliable, but fairly often I found "true" similar pairs among them.
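
A rough sketch of the OR combination described above, assuming each compared pair carries two scores between 0 and 1. The `Pair` class, its field names and `select_candidates` are hypothetical names made up for illustration; they are not part of the program itself.

```python
from dataclasses import dataclass


@dataclass
class Pair:
    a: str
    b: str
    precise: float  # score from the "precise" algorithm, 0.0 to 1.0 (assumed scale)
    content: float  # score from the standard ("content") algorithm, 0.0 to 1.0


def select_candidates(pairs, precise_min=0.85, content_min=0.80):
    """Return (pair, needs_listening) tuples for pairs passing either threshold."""
    selected = []
    for p in pairs:
        by_precise = p.precise >= precise_min
        by_content = p.content >= content_min
        if by_precise or by_content:
            # Pairs found only by the less reliable standard score are flagged
            # so they can be checked by listening before anything is deleted.
            selected.append((p, not by_precise))
    return selected


pairs = [
    Pair("song1.mp3", "song1_copy.mp3", precise=0.90, content=0.50),
    Pair("song2.mp3", "song2_rip.mp3", precise=0.10, content=0.82),
    Pair("song3.mp3", "other.mp3", precise=0.30, content=0.40),
]
result = select_candidates(pairs)
```

Here the first pair passes on the precise score, the second passes only on the standard score and is flagged for listening, and the third is dropped.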

Springdream

Re: Good default settings
« Reply #16 on: April 22, 2011, 11:24:34 »
Well, if you go below 85%, many live versions or different-language versions are treated as the same...
But by now I use 85%. That results in 1% more findings than using 90%.

hsei

Re: Good default settings
« Reply #17 on: April 22, 2011, 17:41:12 »
To make my recommendations clearer:
If I use the precise algorithm alone, I get less than 1% false pairs at a threshold of 70%.
If I additionally use the standard algorithm ("content") at a threshold of about 90%, I get a few percent more pairs (with a "precise" rating close to 0), but with a higher probability of false pairs.
I don't trust auto-marking; I just use it as a "click-saver".