dupeGuru: find duplicate or similar music files on Linux

dupeGuruI posted earlier on how to search visually similar images on Ubuntu. Here’s a solution to find similar music files – dupeGuru. You can search different file formats. Check it out if you have a pretty messed up music or photo collection. In addition to the standard edition, there are two versions of dupeGuru dedicated towards music (dupeGuru Music Edition) and photo (dupeGuru Picture Edition).

dupeGuru uses a fuzzy search algorithm to find files with similar names or data. You can customize your search further from the preferences section.

dupeGuru supports Linux, WIndows and Mac. To install dupeGuru on Ubuntu:

$ sudo apt-add-repository ppa:hsoft/ppa
$ sudo apt-get update
//Standard Edition
$ sudo apt-get install dupeguru-se
//Music Edition
$ sudo apt-get install dupeguru-me
//Picture Edition
$ sudo apt-get install dupeguru-pe

Webpage: dupeGuru

findimagedupes: find visually similar images on Ubuntu

image_editor_compI was trying to order my camera images when I found that I have already copied, resized, renamed many of them previously and now I can’t find a way to get rid of duplicates. What I needed was a tool to find visually similar images. After searching a while I found findimagedupes which is available in synaptic. A dry run showed promising results and soon I found it can also detect similar images with different aspect ratios, more than 2 similar images etc. Performance is quite good considering I ran it on a directory having around 7000 files in it. Now going through the output list to delete the extra copies.

Usage

$ findimagedupes Pictures/ > dups
//get the output in dups file

Installation

To install on Ubuntu:

$ sudo apt-get install findimagedupes

Webpage: findimagedupes

Similar software

Some of my favourite Ubuntu utilities

ubuntu_apps_compThere are some excellent applications on Ubuntu which come handy frequently. I have listed some:

  1. Baobab: Graphical disk usage analyzer.
  2. BleachBit: Free up your system by removing temp files, history and other cruft.
  3. eboard: If you play on FICS eboard is indispensable.
  4. FSlint: Remove duplicate files. It can find things like duplicate files, problematic filenames, temporary files, bad symlinks, empty directories and nonstripped binaries.
  5. galculator: A better alternative to the gnome-calculator which still seems buggy/user-unfriendly to me when converting between Hex to Dec etc.
  6. GCstar: A collection manager for almost everything you collect.
  7. gFTP: Tiny multithreaded FTP, FTPS (control connection only), HTTP, HTTPS, SSH and FSP transfer client.
  8. Glipper: A very handy clipboard manager that can remember entries across reboot. It can be accessed using a keyboard shortcut beside your pointer (like right click) and you can choose the copied text.
  9. Gpick: Pick any color from the screen.
  10. GtkOrphan / RpmOrphan: The omnipotent clean-up utility of to get rid of unused libraries, packages etc.
  11. HandBrake: The best video encoder ever written!
  12. Leafpad: A great relief the bloated gedit editor is removable on Ubuntu 12.04. I use the lightweight Leafpad. When I need power, I use vi.
  13. localepurge: remove unnecessary locale files.
  14. Marlin: Lightweight and superfast file manager. To make it faster:
    $ sudo mv /usr/lib/x86_64-linux-gnu/tumbler-1/tumblerd /usr/lib/x86_64-linux-gnu/tumbler-1/tumblerd.bak
  15. nautilus-image-converter: Resize and rotate images from nautilus.
    To customize it, edit:
    /usr/share/nautilus-image-converter/nautilus-image-resize.ui
  16. PCManFM: Ultra-lightweight and fast file manager.
  17. qBittorrent: µTorrent for Linux. Supports torrents, magnets and DHT.
  18. Subtitle Editor: Do anything with movie subtitles.
  19. rar & unrar: Frequently needed to extract downloaded rar files.
  20. wget: A very very powerful cmdline downloader (a good multi-threaded alternative is axel).
    As an example of the power of wget, a whole website can be downloaded using the following command (instead of installing HTTrack):

    $ wget -r -mirror -p -convert-links -P ./mydir URL
    

    -r: recursive
    -mirror : turn on options suitable for mirroring.
    -p : download all files that are necessary to properly display a given HTML page.
    -convert-links : after the download, convert the links in document for local viewing.
    -P ./mydir : save all the files and directories in ./mydir. To resume a partial downloads (if server supports), run the following command in the same directory:

    $ wget -c file_URL
  21. xpad: A tiny sticky notes application. (#9a99ff is my preferred bg colour)

I used to use wine 3 years back to run some Windows applications but Google helped me find much better alternatives for all my needs on Ubuntu.