#Chromaprint

Attila Györffy 🦜attila@fosstodon.org
2025-06-01

Building a system that deduplicates audio files based on acoustic #fingerprinting. The algorithm used is a normalized Hamming distance over two vectors of 32-bit integers, which represent the raw AcoustID #Chromaprint fingerprint.

When testing the identification of the same audio in different formats, here are my findings:

WAV <> AIFF <> M4A → (identical, no surprise here, all lossless)
MP3 320kbps (CBR) <> WAV → 99.9% similarity.
MP3 VBR (Q5) <> WAV → 99.3%
MP3 (128kbps CBR) <> WAV → 99.3%

グレェ「grey」byterhymer
2022-11-11

chromaprint documentation:

cmake -DFFT_LIB=vDSP .
-- Could not find FFMPEG
CMake Error at CMakeLists.txt:155 (message):
No FFT library found

Sadly, it does not appear as if nor have any developer Mastodon presence nor IRC channel on liberachat.

Their README.md seems to indicate that on macOS, vDSP can be utilized instead of FFmpeg, but in practice? I do not know if that is true.

Meanwhile, the MacPort for chromaprint has an ungodly amount of dependencies.

グレェ「grey」byterhymer
2022-11-01

z0mg, some of the dependencies for e.g. or for are *obscene*.

kerberos5? (my audio library or transcoder needs to authenticate? da fuk?!)

Rust? (ffmpeg is at least a decade older than Rust, such a fuggin plague among techbros chasing another thneed language)

Srsly?

I don't think that is true.

I think that perhaps (almost, certainly), some dependencies, have too many other dependencies, but hasn't done a dependency audit.

This is exasperating.

Client Info

Server: https://mastodon.social
Version: 2025.04
Repository: https://github.com/cyevgeniy/lmst