Setting to remove duplicated results from index

Have a suggestion for "Everything"? Please post it here.
Post Reply
sugoro
Posts: 5
Joined: Sat Nov 29, 2014 9:19 pm

Setting to remove duplicated results from index

Post by sugoro » Tue Jun 20, 2017 10:00 pm

Use-case: when remapping paths for software like DrivePool (viewtopic.php?t=1572 and viewtopic.php?f=4&p=17171), we will often have "duplicated" results in the database. I say "duplicated" because they are not technically duplicated entries, as they belong to different drives. But, with remapping, they will map to exact duplicates in the db.

For example, we could have two drives mounted to folders Drive1 and Drive2, in a "mirror" configuration, where File.txt is duplicated to both drives. Like so
C:\Drive1\File.txt
C:\Drive2\File.txt

We then remap those, to point to the actual pooled drive, say, at D:\
Then, File.txt is accessed with D:\File.txt

In the db, we'll have D:\File.txt twice.


This setting toggle would remove the duplicates (possibly after sorting) and the db would not contain any exact duplicate entries.


Thanks for reading!

void
Site Admin
Posts: 5731
Joined: Fri Oct 16, 2009 11:31 pm

Re: Setting to remove duplicated results from index

Post by void » Wed Jun 21, 2017 6:54 am

Is excluding one of the mirror drives possible? eg: C:\Drive2

To exclude a folder in Everything:
  • In Everything, from the Tools menu, click Options.
  • Click the Exclude tab on the left.
  • Click Add Folder....
  • Select c:\drive 2 and click OK.
  • Click OK.

sugoro
Posts: 5
Joined: Sat Nov 29, 2014 9:19 pm

Re: Setting to remove duplicated results from index

Post by sugoro » Wed Jun 21, 2017 11:11 am

void wrote:Is excluding one of the mirror drives possible? eg: C:\Drive2

To exclude a folder in Everything:
  • In Everything, from the Tools menu, click Options.
  • Click the Exclude tab on the left.
  • Click Add Folder....
  • Select c:\drive 2 and click OK.
  • Click OK.

Yes, for simple cases. It won't work very well for more complicated duplication scenarios. I have specific duplication rules, to maximize space (no point duplicating backups that are already stored in another location, offsite).
Some folders are in 4 drives, others in 3, others in 2.

Also, you set rules like "keep 3 copies of this folder's contents" but you usually don't tell the program to "keep those folders in those 3 drives". It will place the files in whichever drive it determines to be the best, and files can be moved to other drives during its balancing routine.

Because of this, there's no to ignore "this folder, on these drives, except this one", since parts if folder will live in different drives, depending on how many drives you have in the pool and your duplication/placement rules.



Thanks for the reply!

dlong500
Posts: 3
Joined: Mon Sep 14, 2020 6:49 pm

Re: Setting to remove duplicated results from index

Post by dlong500 » Mon Sep 14, 2020 7:53 pm

@void Adding a feature to hide duplicate full paths would be extremely useful in a complex configuration using a pooling software like DrivePool. Excluding specific disks won't help because DrivePool handles it's own duplication algorithms (disks aren't simple mirrors). But it seems like it should be fairly simple to track duplicated index entries in such a scenario because the full path, size, and date will be exactly the same for duplicate files on drives that have been mapped to a virtual pooled drive.

For example, let's say we have drive P: and drive Q: representing volumes on physical disks, and we remap both of those to a virtual drive X:

If we have a file (test.txt) that exists on:
P:\PoolPart.xxx\test.txt
Q:\PoolPart.xxx\test.txt

the everything index will show:
X:\test.txt
X:\test.txt

Couldn't there a way to be able to detect a duplicated index entry so we could hide one (or more) of the same rows in the GUI?

Post Reply