Show Posts



Messages - AntiBotQuestion

Pages: [1]
1
General / Re: Similarity seems not optimized for 300,000+ mp3 files
« on: February 15, 2016, 08:29:23 »
It takes far fewer than 300k tracks for this to get annoying, I tell you. Similarity keeps telling me I've got some twenty hours left, give or take - it can do that for days. It is down to sixteen now, two days after I completed a scan and started a new one, just to test.
My disk is USB2 only, but if the bottleneck were read speed, it would not be using 90 percent CPU all the time. For days.

2
Wishlist / Re: A clearer reporting of "100.0" percent matches
« on: February 22, 2015, 20:42:52 »
It seems that I can get a "Content" match of "100.0%" even for two files with different length and different bitrate. It is, I think, unreasonable - it is not the same content if it is not the same content (different tags are OK as long as tags are a separately-reported thing).

It might be that the Precise algorithm can tell the difference

Obviously not. It just reported "100.0%" Content and "100.0%" Precise for two files with different word lengths (20 bits used vs 16 bits used).

Too bad, this piece of software looked promising. Now I have to manually check each possible match, and that takes multiple operations per pair - I cannot even select them and drag them to an application that actually can compare bit by bit.

3
Wishlist / A clearer reporting of "100.0" percent matches
« on: February 15, 2015, 22:32:02 »
It seems that I can get a "Content" match of "100.0%" even for two files with different length and different bitrate. It is, I think, unreasonable - it is not the same content if it is not the same content (different tags are OK as long as tags are a separately-reported thing).

It might be that the Precise algorithm can tell the difference, but (1) then disclose that, although you like to keep the Precise algorithm a secret, and (2) deduplication based on audio content is an even less sophisticated method than the non-Premium mode; not only is it available in less sophisticated deduplicators, but from a user point of view it is counterintuitive to switch to the more sophisticated method to get a less sophisticated method.

Suggestion: For streams that decode to the truly bit-exact same signal, display "exact" rather than "100.0%". Takes the same space in the table.

(That does mean that you might need to round off decoded mp3s somewhere around -150 dB if you do not calculate with very high precision, if you do not want a CBR and a VBR with the same signal to be reported as different.)
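
To make the suggestion concrete, here is a minimal Python sketch of what such an "exact" check could look like. It is only an illustration, not how Similarity works internally: the function names, the float sample format (full scale = 1.0) and the -150 dB floor are all my own assumptions.

```python
import math

def peak_difference_db(a, b):
    """Peak absolute sample difference in dB relative to full scale (1.0).

    Assumes a and b are equal-length, non-empty sequences of float samples.
    """
    peak = max(abs(x - y) for x, y in zip(a, b))
    return -math.inf if peak == 0.0 else 20.0 * math.log10(peak)

def match_label(a, b, floor_db=-150.0):
    """Return "exact" or "not exact" for two decoded signals (hypothetical helper)."""
    # Different length can never be the same signal.
    if len(a) != len(b):
        return "not exact"
    if not a:
        return "exact"
    # Differences at or below the floor are treated as decoder rounding noise.
    if peak_difference_db(a, b) <= floor_db:
        return "exact"
    return "not exact"
```

With these assumptions, a difference of one 20-bit step (about -120 dB) is still reported as "not exact", while rounding noise around 1e-9 (about -180 dB) is not.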

4
I just started to play around with and evaluate a few deduplication applications (thus far Similarity seems to be at least in top-2 if not in top-1 :-) ) so I hope I don't make a fool out of myself asking for something which is actually supported in the premium version.

(1) I wish for a grouping of duplicate folders (with an option to choose "all music matching", "all pictures matching" and both) which tells me if folder1 and folder2 contain all the same audio.
Those could e.g. be highlighted in different colour, put on top when sorted by match, or something like that.
Also an option to tell me if folder1 contains everything folder2 does (and possibly more); that could tilt the scales as to which one to keep.
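
A rough Python sketch of the folder comparison I have in mind for (1), just to illustrate the idea. Everything here is an assumption on my part: `fingerprint` is a hypothetical stand-in that hashes decoded integer PCM samples, and each folder is reduced to a set of such fingerprints.

```python
import hashlib
from array import array

def fingerprint(samples):
    """Hypothetical audio fingerprint: SHA-256 over decoded integer PCM samples."""
    return hashlib.sha256(array("i", samples).tobytes()).hexdigest()

def compare_folders(first, second):
    """Classify two folders, each given as a set of audio fingerprints."""
    if first == second:
        return "identical"
    if first > second:   # proper superset: first has everything second has, and more
        return "first contains second"
    if first < second:   # proper subset
        return "second contains first"
    if first & second:
        return "partial overlap"
    return "disjoint"
```

For example, if folder1 holds two tracks and folder2 holds only one of them, `compare_folders` returns "first contains second", which would be the hint that folder2 is the one to delete.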

(2) I would like to be able to drag files from Similarity over to an application of my choice (like if it were an Explorer window).
(E.g. a tagger like mp3tag, or an integrity verifier like audiotester.exe or a different media player.)

To push (2) further, I would like an option to drag files from Similarity so that their entire folders are dumped into the other application. For example, Similarity matches and groups together file c:\folderX\song07.mp3 with c:\folderY\song06.m4a, and I drag that group over to a media player to have the entire contents of both folders displayed.

