Question

79

uniq command not working properly?

rated 0 times [ 79] [ 0] / answers: 1 / hits: 42018 / 3 Years ago, mon, may 31, 2021, 11:17:24

So I'm checking the md5 hash of my files with this as my output:

657cf4512a77bf47c39a0482be8e41e0  ./dupes2.txt

657cf4512a77bf47c39a0482be8e41e0  ./dupes.txt

8d60a927ce0f411ec94ac26a4785f749  ./derpina.txt

15f63928b8a1d5337137c38b5d66eed3  ./foo.txt

8d60a927ce0f411ec94ac26a4785f749  ./derp.txt

However, after running find . -type f -exec md5sum '{}' ';' | uniq -w 33 to find the unique hashes I get this:

657cf4512a77bf47c39a0482be8e41e0  ./dupes2.txt

8d60a927ce0f411ec94ac26a4785f749  ./derpina.txt

15f63928b8a1d5337137c38b5d66eed3  ./foo.txt

8d60a927ce0f411ec94ac26a4785f749  ./derp.txt

From my understanding, only one of either derpina.txt or derp.txt should be showing up since their hashes are the same. Am I missing something? Can anyone enlighten me as to why it outputs like this?

Answers

Only authorized users can answer the question. Please sign in first, or register a free account.

ingsta

Add To Favorites

Follow

Total Points: 391

Total Questions: 103

Total Answers: 124

Location: Bonaire

Member since Wed, Mar 29, 2023

1 Year ago

ingsta questions

1 How to kill chromium snap from CLI?

Sun, Oct 23, 22, 01:42, 2 Years ago

1 Zoom client crashes on Ubuntu Studio 22.04 LTS

Sat, Oct 30, 21, 11:27, 3 Years ago

1 22.04 is suggested on Ubuntu's website but not in the repository

Tue, Aug 2, 22, 13:35, 2 Years ago

1 Get DKMS offline?

Sun, Nov 28, 21, 12:49, 2 Years ago

1 keyboard backlight Acer Nitro5 AN515-55 Ubuntu 20.10 works but 2 question

Sun, Jan 16, 22, 05:17, 2 Years ago

View All

answered 3 Years ago istictroubli · Accepted Answer

You need to use sort before uniq:

find . -type f -exec md5sum {} ';' | sort | uniq -w 33

uniq only removes repeated lines. It does not re-order the lines looking for repeats. sort does that part.

This is documented in man uniq:

Note: uniq does not detect repeated lines unless they are adjacent. You may want to sort the input first, or use sort -u without uniq.