42

百万量级的汉明距离的数据有没有什么快速计算接近的方法呢?

 4 years ago
source link: https://www.v2ex.com/t/665885
Go to the source link to view the article. You can view the picture content, updated content and better typesetting reading experience. If the link is broken, please click the button below to view the snapshot at that time.
问与答 - @phpfpm - 目前有近 100w 图片需要判重,挑了几个 hash 算法,正在跑 hamming code,都是 128bit 的 binary这些图片都是经过 md5 与判重之后的图片了,所以需要找出来一

About Joyk


Aggregate valuable and interesting links.
Joyk means Joy of geeK