Messages - Admin

Wishlist / Re: Clarify and expand Automatically Mark Files options
« on: February 29, 2016, 16:29:52 »
We are thinking about how to make the auto-mark dialog more user-friendly.
"Keep or delete files with a certain keyword/string in their filenames" can easily be solved with scripts.
Date modified was added in 2.1.
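
As a rough illustration of the keyword case, here is a minimal sketch that runs outside Similarity. The `markByKeyword` helper and the sample items are hypothetical; only the `path` and `marked` field names are borrowed from the team's sample script, and the way a real script obtains its list of items is not shown.

```javascript
// Hypothetical helper: mark every item whose file name contains a keyword.
// `items` is assumed to be an array of objects with `path` and `marked` fields;
// this is a stand-in for illustration, not the real Similarity scripting API.
function markByKeyword(items, keyword) {
    var needle = keyword.toLowerCase();
    var count = 0;
    for (var idx = 0; idx < items.length; ++idx) {
        // take only the file name, so folder names do not trigger a match
        var name = items[idx].path.split(/[\\\/]/).pop().toLowerCase();
        if (name.indexOf(needle) !== -1) {
            items[idx].marked = true;
            ++count;
        }
    }
    return count;
}

// made-up sample data
var items = [
    { path: "C:\\Music\\live\\Song (live).mp3", marked: false },
    { path: "C:\\Music\\live\\Song.mp3", marked: false },
    { path: "C:\\Music\\Other (Live).flac", marked: false }
];
var markedCount = markByKeyword(items, "(live)");
console.log(markedCount, items.map(function (it) { return it.marked; }));
```

Matching on the file name only (not the whole path) is a design choice in this sketch; drop the `split(...).pop()` step to match folder names as well.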

Wishlist / Re: Add edit option to image files
« on: February 29, 2016, 16:27:33 »
I think that is simply called a rename function, not edit.

Wishlist / Re: More information in status bar
« on: February 29, 2016, 16:26:16 »
We are planning to add a logging window in the future. All permanent actions are logged to a text log file (located in the %appdata%\Similarity folder).

Wishlist / Re: A clearer reporting of "100.0" percent matches
« on: February 29, 2016, 16:25:13 »
Similarity doesn't compare files byte by byte. It uses the audio data from decoders, and some decoders (FLAC, Apple QuickTime) never return raw data, only data converted to some other format. If we also read each file a second time to calculate a digital signature, scanning would take twice as long, all for a very marginal benefit. You can always use any byte-by-byte duplicate finder with hashes for that task.
Also, Similarity's content algorithms compare only the first 30-60 seconds of a file, not more.
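
For the byte-by-byte case, here is a minimal Node.js sketch of hash-based exact duplicate detection. It is not part of Similarity; `groupExactDuplicates` and the sample buffers are hypothetical, and in real use the data would be read from disk (for example with `fs.readFileSync`).

```javascript
var crypto = require("crypto");

// Group byte-identical contents by their SHA-256 digest.
// `files` is a hypothetical array of { path, data } objects where `data` is a Buffer.
function groupExactDuplicates(files) {
    var groups = {};
    files.forEach(function (file) {
        var digest = crypto.createHash("sha256").update(file.data).digest("hex");
        (groups[digest] = groups[digest] || []).push(file.path);
    });
    // keep only digests shared by more than one file
    return Object.keys(groups)
        .map(function (digest) { return groups[digest]; })
        .filter(function (paths) { return paths.length > 1; });
}

// made-up sample data: a.mp3 and b.mp3 have identical bytes
var sample = [
    { path: "a.mp3", data: Buffer.from([1, 2, 3, 4]) },
    { path: "b.mp3", data: Buffer.from([1, 2, 3, 4]) },
    { path: "c.mp3", data: Buffer.from([1, 2, 3, 5]) }
];
var exactGroups = groupExactDuplicates(sample);
console.log(exactGroups);
```

A hash match like this only finds files with identical bytes; the same recording saved in two formats or with different tags would not be grouped.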

Wishlist / Re: Delete complete folder
« on: February 29, 2016, 16:15:11 »
This feature was added in version 2.1.

Wishlist / Re: Player that continues, to compare
« on: February 29, 2016, 16:13:36 »
It's hard to implement: each song can have its own delays, and synchronizing the starting point isn't a simple task. Similarity can already play and move the cursor on several players; just use Ctrl and double-click.

Wishlist / Re: Auto-mark : folder priorities
« on: February 29, 2016, 16:09:33 »
This will be implemented in a future version.

Sorry for the delay.
1) You can use folder groups for comparing exact groups (i.e. a new one and an old one).
2) We will implement drag & drop in the future.

General / Re: Similarity seems not optimized for 300,000+ mp3 files
« on: February 29, 2016, 15:29:24 »
Similarity's algorithms are not linear, let alone logarithmic; they are quadratic. The complexity of Similarity's content-based algorithm is N^2. When calculated directly, each new file needs to be compared with all previous ones (it can't be looked up through an index as in a relational database, because fingerprints can't be sorted into greater or lower). For example, for 300K files we have the sum of an arithmetic progression, (1 + 300000) * 300000 / 2 = 90000300000 / 2 comparisons; compare that with 100K files, (1 + 100000) * 100000 / 2 = 10000100000 / 2. As you can see, comparing 300K files takes 9 times longer than comparing 100K, not just 3 times. Even worse, if your computer (all CPUs and GPUs) could compare 1 million fingerprints per second (a very, very fast computer), processing 300K files would still take about 12.5 hours (45000150000 comparisons at 1 million per second).
To optimize this we added a duration check, which dramatically decreases the comparison count.
We are also already working on a new algorithm that can be used to compare 1 million files and will be linear, but it is still far from completion.
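
To make the growth concrete, here is a minimal sketch of the counting argument, plus one possible form of a duration pre-filter. The `comparisonCount` function uses the same arithmetic-progression sum as the post. `countPairsWithDurationCheck` and its `tolerance` parameter are assumptions for illustration only; how Similarity's actual duration check works is not described here.

```javascript
// Comparisons when every new file is compared with the ones before it,
// using the arithmetic-progression sum (1 + n) * n / 2.
function comparisonCount(n) {
    return (1 + n) * n / 2;
}

var c100k = comparisonCount(100000);
var c300k = comparisonCount(300000);
var ratio = c300k / c100k; // how much more work 300K files are than 100K

// Hypothetical duration pre-filter: count only pairs whose durations
// differ by at most `tolerance` seconds (sorted sliding window).
function countPairsWithDurationCheck(durations, tolerance) {
    var sorted = durations.slice().sort(function (a, b) { return a - b; });
    var pairs = 0;
    var start = 0;
    for (var i = 0; i < sorted.length; ++i) {
        // move the window start until it is within tolerance of sorted[i]
        while (sorted[i] - sorted[start] > tolerance) ++start;
        pairs += i - start;
    }
    return pairs;
}

// made-up durations in seconds
var durations = [100, 101, 200, 202, 300];
var allPairs = durations.length * (durations.length - 1) / 2;
var filteredPairs = countPairsWithDurationCheck(durations, 2);
console.log(c100k, c300k, ratio.toFixed(2), allPairs, filteredPairs);
```

In the made-up sample, only pairs with nearly equal durations survive the filter, which is the kind of reduction a duration check can give when track lengths are spread out.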

General / Re: How to scan files placed on NAS?
« on: February 29, 2016, 15:11:46 »
That is a setting of your NAS server (i.e. its Samba server). Can you email us screenshots of Similarity's Folders tab with the network location expanded?

General / Re: Analysis Rating in Automark script?
« on: February 29, 2016, 15:08:50 »
Sorry, but the automark dialog already has such a field: see the "rating" field, which is the rating from the Analysis tab. From each pair of duplicates, Similarity marks the file with the lowest rating value, if it has been calculated (you need to analyse the files beforehand; otherwise it falls back to the next priority).

General / Re: Change cache location?
« on: February 29, 2016, 15:04:19 »
Sorry for the delay. We will add such a setting in the next version.

General / Re: Script for deleting duplicates
« on: February 29, 2016, 14:58:26 »
Sorry for the delay. You don't need a script; just use automark with the tags algorithm > 95%. The tags algorithm compares exactly album, artist and title, and nothing else.

If you need a specific tag restriction, here is a sample script that marks only files with the same album and artist, and marks the file with the worse analysis.rating.
Code:
// Author: Similarity Team
// Version: 1.0
// Mark files after scan only if the tag restrictions are satisfied. Priority by rating (you can simply change it).
// Warning! Using analysis.rating forces the file to be analysed. If you didn't analyse the files before, it takes a long time (because this script analyses files on only 1 CPU core).
// We suggest performing the analysis before launching this script ("Analyse all" in the context menu).

// album, artist, title - comment/uncomment a line to disable/enable that restriction
// threshold - threshold for the string comparing algorithm, scores in [0...1]: 0 - absolutely different, 1 - absolutely the same
// minlength - minimal length of text in the tag for comparing; pairs that don't fit the criterion are skipped
var myProperties = {
    album: { threshold: 0.9, minlength: 2 },
    artist: { threshold: 0.9, minlength: 2 },
    // title: { threshold: 0.9, minlength: 2 },
};

// unmark all files;

function checkRestrictions(item1, item2) {
    // enumerate all selected tag fields
    for (var property in myProperties) {
        // check each field
        var minlength = myProperties[property].minlength;
        var threshold = myProperties[property].threshold;
        var text1 = item1[property];
        var text2 = item2[property];
        // skip short tags
        if (minlength > 0 && (text1.length < minlength || text2.length < minlength)) return false;
        // check similarity of the text strings (tag values)
        if (threshold > 0.0 && text.calculate(text1, text2) < threshold) return false;
    }
    return true;
}

// simple mark by lower analysis rating
var dups =;
for (var idx = 0; idx < dups.length; ++idx) {
    // skip counter-pair (1-2 and 2-1), process each pair only once
    if (dups[idx].item1.path > dups[idx].item2.path) continue;
    // check for our special tag restrictions
    if (!checkRestrictions(dups[idx].item1, dups[idx].item2)) continue;
    // ok, now we select by our priority: mark the file with the lower rating
    if (dups[idx].item1.analysis.rating > dups[idx].item2.analysis.rating) dups[idx].item2.marked = true;
    else dups[idx].item1.marked = true;
}
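
To see how the threshold and minlength restriction behaves, the check can be tried outside Similarity with a stand-in for text.calculate. The sketch below is only an assumption: `similarity` uses a case-insensitive normalized Levenshtein distance, which may differ from what Similarity actually computes, and `passes` is a hypothetical helper that mirrors the two checks for a single tag.

```javascript
// Stand-in for text.calculate (the real algorithm may differ):
// 1 - normalized Levenshtein distance, giving a score in [0...1].
function similarity(a, b) {
    a = a.toLowerCase();
    b = b.toLowerCase();
    if (a.length === 0 && b.length === 0) return 1;
    var prev = [];
    var i, j;
    for (j = 0; j <= b.length; ++j) prev[j] = j;
    for (i = 1; i <= a.length; ++i) {
        var cur = [i];
        for (j = 1; j <= b.length; ++j) {
            var cost = a.charAt(i - 1) === b.charAt(j - 1) ? 0 : 1;
            cur[j] = Math.min(prev[j] + 1, cur[j - 1] + 1, prev[j - 1] + cost);
        }
        prev = cur;
    }
    return 1 - prev[b.length] / Math.max(a.length, b.length);
}

// Hypothetical helper: apply the minlength and threshold checks to one tag pair.
function passes(text1, text2, rule) {
    // skip short tags
    if (rule.minlength > 0 && (text1.length < rule.minlength || text2.length < rule.minlength)) return false;
    // require the similarity score to reach the threshold
    if (rule.threshold > 0.0 && similarity(text1, text2) < rule.threshold) return false;
    return true;
}

var rule = { threshold: 0.9, minlength: 2 };
var sameAlbum = passes("Abbey Road", "abbey road", rule);  // identical after lowercasing
var typoAlbum = passes("Abbey Road", "Abbey Roda", rule);  // 2 edits in 10 characters
var shortTag = passes("A", "A", rule);                     // shorter than minlength
console.log(sameAlbum, typoAlbum, shortTag);
```

With this stand-in, a threshold of 0.9 rejects a ten-character tag that differs in two characters, so lower the threshold if tags with small typos should still count as the same.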

General / Re: How to find serial
« on: February 29, 2016, 14:05:08 »
If you have lost your key or changed your email, just write to us to resolve this situation.

General / Re: Automatically mark and delete duplicates
« on: February 29, 2016, 12:49:50 »
Use the automark dialog and set the top priority to "Group".
