F5F Stay Refreshed Software General Software Do you have suggestions for duplicate file finders?

Do you have suggestions for duplicate file finders?

Do you have suggestions for duplicate file finders?

J
Julia7789
Junior Member
12
05-09-2016, 09:59 PM
#1
Hi there.
I don’t have any specific recommendations, but you might explore free tools like PhotoRec or other recovery utilities that can locate and remove exact duplicate files.
J
Julia7789
05-09-2016, 09:59 PM #1

Hi there.
I don’t have any specific recommendations, but you might explore free tools like PhotoRec or other recovery utilities that can locate and remove exact duplicate files.

S
SedentarySauS
Senior Member
411
05-14-2016, 11:43 PM
#2
Free Fast Duplicate File Finder - Remove Duplicates
A tool to locate duplicate files quickly. It helps identify similar images and content.
www.mindgems.com
I’ve heard positive reviews about this app, but I haven’t tried it myself. There seem to be both free and paid options available.
My experience with duplicate finders is mostly limited to photos, which isn’t what you need for your situation.
Don’t compare yourself... that sounds like a really frustrating issue.
S
SedentarySauS
05-14-2016, 11:43 PM #2

Free Fast Duplicate File Finder - Remove Duplicates
A tool to locate duplicate files quickly. It helps identify similar images and content.
www.mindgems.com
I’ve heard positive reviews about this app, but I haven’t tried it myself. There seem to be both free and paid options available.
My experience with duplicate finders is mostly limited to photos, which isn’t what you need for your situation.
Don’t compare yourself... that sounds like a really frustrating issue.

S
Sapphire01
Junior Member
7
05-15-2016, 01:34 AM
#3
Uncertain about the program's function, but the solution lies in file hashes. Exact duplicates will generate identical hashes. Generate hashes for each file, sort them by hash, and remove the duplicates. It’s likely a tool exists that can handle this automatically. Renaming is quite another matter. Sorting and renaming thousands of files isn’t easy. I once managed around 20k mp3 files—took a long time.
S
Sapphire01
05-15-2016, 01:34 AM #3

Uncertain about the program's function, but the solution lies in file hashes. Exact duplicates will generate identical hashes. Generate hashes for each file, sort them by hash, and remove the duplicates. It’s likely a tool exists that can handle this automatically. Renaming is quite another matter. Sorting and renaming thousands of files isn’t easy. I once managed around 20k mp3 files—took a long time.

D
DarthBotto
Junior Member
6
05-24-2016, 09:32 PM
#4
I've experienced many issues with similar "apps" in the past. I strongly recommend ensuring you have a solid and secure backup of all your files elsewhere before proceeding.
D
DarthBotto
05-24-2016, 09:32 PM #4

I've experienced many issues with similar "apps" in the past. I strongly recommend ensuring you have a solid and secure backup of all your files elsewhere before proceeding.

L
lukeysky14
Member
65
05-30-2016, 09:18 AM
#5
I've tried Dupeguru at work before
Link
We used it on our servers to clear space. People tend to throw things everywhere 😡
L
lukeysky14
05-30-2016, 09:18 AM #5

I've tried Dupeguru at work before
Link
We used it on our servers to clear space. People tend to throw things everywhere 😡

X
XAcmedX
Member
60
06-09-2016, 03:42 AM
#6
Checking if a file hash is completely unique or if similar files can share the same hash.
X
XAcmedX
06-09-2016, 03:42 AM #6

Checking if a file hash is completely unique or if similar files can share the same hash.

C
CHadek
Member
62
06-09-2016, 05:40 AM
#7
Hashes are completely distinct. Altering even a single bit produces an entirely different hash value. This process secures your password by converting it into a unique code that matches the stored version. It also enables authorities to analyze drives for prohibited material using extensive databases of recognized files and their corresponding hashes. And so on.
C
CHadek
06-09-2016, 05:40 AM #7

Hashes are completely distinct. Altering even a single bit produces an entirely different hash value. This process secures your password by converting it into a unique code that matches the stored version. It also enables authorities to analyze drives for prohibited material using extensive databases of recognized files and their corresponding hashes. And so on.

S
SashaUHC
Member
101
06-16-2016, 06:51 AM
#8
The question asks whether the filename affects the hash value when comparing identical files with different names.
S
SashaUHC
06-16-2016, 06:51 AM #8

The question asks whether the filename affects the hash value when comparing identical files with different names.

B
blehhh
Member
50
06-16-2016, 08:35 AM
#9
No filename affects the hash. It relies entirely on the file content itself.
Experiment with it for fun. Take a .doc file, compute its hash, then append a space to the end of the text, save and rehash. The resulting hash will be completely different. Or modify just one pixel in an image and observe the change. It's really surprising how unpredictable the hash becomes each time.
B
blehhh
06-16-2016, 08:35 AM #9

No filename affects the hash. It relies entirely on the file content itself.
Experiment with it for fun. Take a .doc file, compute its hash, then append a space to the end of the text, save and rehash. The resulting hash will be completely different. Or modify just one pixel in an image and observe the change. It's really surprising how unpredictable the hash becomes each time.

J
Jelmerro
Member
202
06-17-2016, 01:58 AM
#10
Hello there.
Following up on what was mentioned, I’m proceeding with Dupeguru.
According to my initial message, I’ve just completed restoring large amounts of data from a 14TB drive that I accidentally deleted. The files are distributed across four smaller drives.
As suggested, I’m using Dupeguru to eliminate duplicates, but I’m not entirely clear on which settings to apply.
I’ve selected the standard content scan mode, though I’m unsure about the additional features:
- Apply regular expressions during filtering
- Partially hash files exceeding a certain size
- Disregard duplicate hardlinking of identical files
These options are left unchecked by default. Could you explain what each means and whether enabling them is necessary?
J
Jelmerro
06-17-2016, 01:58 AM #10

Hello there.
Following up on what was mentioned, I’m proceeding with Dupeguru.
According to my initial message, I’ve just completed restoring large amounts of data from a 14TB drive that I accidentally deleted. The files are distributed across four smaller drives.
As suggested, I’m using Dupeguru to eliminate duplicates, but I’m not entirely clear on which settings to apply.
I’ve selected the standard content scan mode, though I’m unsure about the additional features:
- Apply regular expressions during filtering
- Partially hash files exceeding a certain size
- Disregard duplicate hardlinking of identical files
These options are left unchecked by default. Could you explain what each means and whether enabling them is necessary?