识别相似图像的好方法？

I've developed a simple and fast algorithm in PHP to compare images for similarity.

Its fast (~40 per second for 800x600 images) to hash and a unoptimised search algorithm can go through 3,000 images in 22 mins comparing each one against the others (3/sec).

The basic overview is you get a image, rescale it to 8x8 and then convert those pixels for HSV. The Hue, Saturation and Value are then truncated to 4 bits and it becomes one big hex string.

Comparing images basically walks along two strings, and then adds the differences it finds. If the total number is below 64 then its the same image. Different images are usually around 600 - 800. Below 20 and extremely similar.

Are there any improvements upon this model I can use? I havent looked at how relevant the different components (hue, saturation and value) are to the comparison. Hue is probably quite important but the others?

To speed up searches I could probably split the 4 bits from each part in half, and put the most significant bits first so if they fail the check then the lsb doesnt need to be checked at all. I dont know a efficient way to store bits like that yet still allow them to be searched and compared easily.

I've been using a dataset of 3,000 photos (mostly unique) and there havent been any false positives. Its completely immune to resizes and fairly resistant to brightness and contrast changes.

写回答
好问题 0 提建议
追加酬金
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

3条回答默认最新

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
dorbmd1177 2011-01-30 21:50
关注
What you want to use is:

Feature extraction

Hashing

Locally aware bloom hashing.

Most people use SIFT features, although I've had better experiences with not scale-invariant ones. Basically you use an edge detector to find interesting points and then center your image patches around those points. That way you can also detect sub-images.

What you implemented is a hash method. There's tons to try from, but yours should work fine :)

The crucial step to making it fast is to hash your hashes. You convert your values into unary representation and then take a random subset of the bits as the new hash. Do that with 20-50 random samples and you get 20-50 hash tables. If any feature matches 2 or more out of those 50 hash tables, the feature will be very similar to one you already stored. This allows you to convert the abs(x-y)

Hope it helps, if you'd like to try out my self-developed image similarity search, drop me a mail at hajo at spratpix
本回答被题主选为最佳回答 , 对您是否有帮助呢?

解决无用
评论打赏
分享
举报

评论

按下Enter换行，Ctrl+Enter发表内容

查看更多回答(2条)

报告相同问题？

关注问题

识别相似图像的好方法？ php
2010-05-15 03:11

回答 3 已采纳 What you want to use is: Feature extraction Hashing Locally aware bloom hashing. Most peopl
图像分割后进行识别，和直接进行图像识别那个效果好点？机器学习深度学习神经网络
2021-05-29 12:55

回答 1 已采纳这是两种不一样的解决问题的思路，在很多领域都存在。你导师说的那种思路是Pipeline，你说的那种思路叫做end2end，各有优缺点。Pipeline是将一个问题拆解成若干个子问题一次解决，然后串在一
学习图像识别技术需要用到什么知识？图像处理
2022-08-23 12:11

回答 1 已采纳机器学习开始学起，中间穿插一下数字图像处理（一般是学opencv），最后转到深度学习，OpenBR，yolo是开源仓库，前者是cpp库，用来做生物特征识别的，类似opencv一样的仓库，后者是神经网络
php图片检索,php 图像识别搜索
2021-03-23 13:10

孤灯苦狗的博客根据Neal Krawetz博士的解释，原理非常简单易懂。我们可以用一个快速算法，就达到基本的...结果越接近，就说明图片越相似。下面是一个最简单的实现：第一步，缩小尺寸。将图片缩小到8×8的尺寸，总共64个像素。这...
基于c++与opencv实现图像识别定位？ c++
2020-02-28 11:01

回答 3 已采纳 1. 霍夫直线识别出四个直线(先预处理图片) 2. 取同一直线上的较远的两个点(霍夫直线出来后相当于4个点阵每个点阵就是一条直线),算出直线方程, 4条操作相同 3. 算出两个十字标的交点(第二
细粒度图像识别到底是什么？ python 人工智能有问必答神经网络
2021-09-17 09:36

回答 1 已采纳人脸识别应该包含条件触发、抓取图像、人脸检测、图像预处理、特征提取、特征匹配、活体检测、条件判断及产生动作几个主要动作，其中人脸检测（face detection）、特征提取（feature extr
关于CNN图像识别的简单问题？ tensorflow 深度学习神经网络
2021-03-04 17:57

回答 2 已采纳 1. Tensorflow 是一个用于深度学习的开源库，它帮你封装好了各种深度学习的算法，所以非常容易上手使用，支持python和C/C++ 2. tensorflow, pytorch等库都可以用
如何用php代码实现人脸识别,PHP实现人脸识别技术
2021-04-24 15:42

顾不得的博客这次人脸识别技术，是实现在微信端的，也就是说利用公众微信平台，调用第三的API来实现人脸识别这项技术的。实现的思路：首先呢，将收集的照片，建立一个照片库，然后利用在微信平台发送的照片，去到照片库进行匹配...
php识别验证码问题？ php
2017-08-24 12:03

回答 1 已采纳 PHP识别验证码（适合大部分验证码） 'recognize', 'softID'=>'3', 'softKey'=>'623527b90698a47ec626043d
百度easyDL的图像识别原理是？人工智能深度学习
2019-03-28 08:24

回答 1 已采纳 http://www.sohu.com/a/245070517_115978 可以参考这篇文章
php自动识别验证码问题？ php
2017-08-21 12:41

回答 1 已采纳 https://jingyan.baidu.com/article/456c463b66e5320a583144b7.html
php 比对两张图片,Python+Opencv识别两张相似图片
2021-03-23 19:55

余曉波的博客在网上看到python做图像识别的相关文章后，真心感觉python的功能实在太强大，因此将这些文章总结一下，建立一下自己的知识体系。当然了，图像识别这个话题作为计算机科学...相关背景要识别两张相似图像，我们从感性...
sql语句无法识别，怎么解决啊？ eclipse sql
2022-03-11 20:27

回答 1 已采纳你表里面是"passsword" ,但你sql里写的是 "password" ,这两个不一样,当然会报错了,如果你看不出差别,就一个一个字母去比较
图像识别 php_用PHP查找图像差异
2020-08-30 07:56

culi4814的博客图像识别 phpI recently stumbled across a fascinating question: how could I tell whether an image had changed significantly? As PHP developers, the most troublesome image problem we have to deal with ...
图像识别和机器视觉区别,比较两幅图像的相似度
2022-09-02 11:21

快乐的小蓝猫的博客机器视觉需要用到图像处理库，有很多免费且开源的第三方图像库可以用，如十分著名的OpenCV，有C++，JAVA,PYTHON的版本，它包含了很多现成的函数，可以处理图像的形状，颜色，大小，图像文件保存，找相似图像，物体...
没有解决我的问题, 去提问

悬赏问题

¥15 使用Jdk8自带的算法，和Jdk11自带的加密结果会一样吗，不一样的话有什么解决方案，Jdk不能升级的情况
¥15 画两个图 python或R
¥15 在线请求openmv与pixhawk 实现实时目标跟踪的具体通讯方法
¥15 八路抢答器设计出现故障
¥15 请教一下c语言的代码里有一个地方不懂
¥15 opencv 无法读取视频
¥15 用matlab 实现通信仿真
¥15 按键修改电子时钟，C51单片机
¥60 Java中实现如何实现张量类，并用于图像处理(不运用其他科学计算库和图像处理库）)
¥20 5037端口被adb自己占了

识别相似图像的好方法？

3条回答 默认 最新

悬赏问题

3条回答默认最新