Do you have suggestions for duplicate file finders?
Free Fast Duplicate File Finder - Remove Duplicates
A tool to locate duplicate files quickly. It helps identify similar images and content.
www.mindgems.com
I’ve heard positive reviews about this app, but I haven’t tried it myself. There seem to be both free and paid options available.
My experience with duplicate finders is mostly limited to photos, which isn’t what you need for your situation.
Sorry I can’t help more myself... that sounds like a really frustrating issue.
I’m not sure how that particular program works, but the underlying solution is file hashes. Exact duplicates produce identical hashes, so you generate a hash for each file, sort by hash, and delete the duplicates. There’s almost certainly a tool that automates this. Renaming is another matter entirely; sorting and renaming thousands of files isn’t easy. I once cleaned up around 20k mp3 files, and it took a long time.
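To make the hash-sort-dedupe idea above concrete, here’s a minimal sketch in Python. The function names (`file_hash`, `find_duplicates`) are just illustrative, and it only reports duplicates rather than deleting anything, which is safer when you’re working with restored data:

```python
import hashlib
import os
from collections import defaultdict

def file_hash(path, chunk_size=1 << 20):
    """Return the SHA-256 hex digest of a file, read in 1 MiB chunks."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()

def find_duplicates(root):
    """Walk `root`, group files by content hash, keep groups with >1 file."""
    by_hash = defaultdict(list)
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            try:
                by_hash[file_hash(path)].append(path)
            except OSError:
                pass  # unreadable file; skip it
    return {h: paths for h, paths in by_hash.items() if len(paths) > 1}
```

Each group in the result holds paths whose contents are byte-for-byte identical; you’d then review and delete all but one from each group.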
I've experienced many issues with similar "apps" in the past. I strongly recommend ensuring you have a solid and secure backup of all your files elsewhere before proceeding.
I've tried Dupeguru at work before
We used it on our servers to clear space. People tend to throw things everywhere 😡
Hashes are extremely sensitive: altering even a single bit of the input produces an entirely different hash value. That’s how password checks work: your password is hashed and the result is compared against the stored hash. It’s also how authorities can scan drives for prohibited material, using large databases of known files and their hashes. And so on.
The filename has no effect on the hash; it depends entirely on the file’s content.
Experiment with it for fun: take a .doc file, compute its hash, then append a single space to the text, save, and rehash. The resulting hash will be completely different. Or change just one pixel in an image and watch what happens. It’s striking how thoroughly the output changes every time.
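You can run the same experiment from a few lines of Python instead of editing a .doc file; here the “edit” is appending a single space to a byte string before hashing:

```python
import hashlib

original = b"The quick brown fox jumps over the lazy dog"
modified = original + b" "  # the tiniest possible edit: one trailing space

h1 = hashlib.sha256(original).hexdigest()
h2 = hashlib.sha256(modified).hexdigest()

print(h1)
print(h2)
# The two digests share essentially nothing in common: any change to the
# input scrambles the entire output (the "avalanche effect").
```

The same holds for any hash function a duplicate finder would use: identical content gives identical hashes, and anything else gives a wildly different one.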
Hello there.
Following up on what was mentioned, I’m proceeding with Dupeguru.
As mentioned in my initial message, I’ve just finished recovering a large amount of data that I accidentally deleted from a 14TB drive. The recovered files are now spread across four smaller drives.
As suggested, I’m using Dupeguru to eliminate duplicates, but I’m not entirely clear on which settings to apply.
I’ve selected the standard content scan mode, though I’m unsure about the additional features:
- Apply regular expressions during filtering
- Partially hash files exceeding a certain size
- Disregard duplicate hardlinking of identical files
These options are left unchecked by default. Could you explain what each means and whether enabling them is necessary?
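I can’t speak for Dupeguru’s internals, but the general idea behind the “partially hash large files” option is easy to sketch: hash only the first chunk of each file as a cheap pre-filter, and fall back to a full hash only when the prefixes match. The names and the 1 MiB threshold below are hypothetical, not Dupeguru’s actual values:

```python
import hashlib

PARTIAL_SIZE = 1 << 20  # hypothetical threshold: hash only the first 1 MiB

def quick_hash(path, limit=PARTIAL_SIZE):
    """Hash only the first `limit` bytes -- fast even for huge files."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        h.update(f.read(limit))
    return h.hexdigest()

def full_hash(path, chunk=1 << 20):
    """Hash the entire file contents, read in chunks."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for block in iter(lambda: f.read(chunk), b""):
            h.update(block)
    return h.hexdigest()

def probably_identical(a, b):
    """Cheap prefix check first; confirm with a full hash only on a match."""
    if quick_hash(a) != quick_hash(b):
        return False  # prefixes differ, so the files definitely differ
    return full_hash(a) == full_hash(b)
```

The payoff is that most non-duplicates are ruled out after reading one chunk, which matters a lot when you’re scanning terabytes across four drives.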