在PHP，Node和Golang中找到两个数组之间的差异

Here is a typical example of what I need to do

$testArr = array(2.05080E6,29400,420);

$stockArrays =  array(
                      array(2.05080E6,29400,0),
                      array(2.05080E6,9800,420),
                      array(1.715E6,24500,280),
                      array(2.05080E6,29400,140),
                      array(2.05080E6,4900,7));

I need to identify the stockArray that is the least different. A few clarifications

The numeric values of array elements at each position are guaranteed not to overlap. (i.e. arr[0] will always have the biggest values, arr1 will be at least an order of 10 magnitude smaller etc).
The absolute values of the differences do not count when determining least different. Only, the number of differing array indices matter.
Positional differences do have a weighting. Thus in my example stockArr1 is "more different" thought it too - like its stockArr[0] & stockArr[3] counterparts - differs in only one index position because that index position is bigger.
The number of stockArrays elements will typically be less than 10 but could potentially be much more (though never into 3 figures)
The stock arrays will always have the same number of elements. The test array will have the same or fewer elements. However, when fewer testArr would be padded out so that potentially matching elements are always in the same place as the stockArray. e.g.

$testArray(29400,140)

would be transformed to

$testArray(0,29400,140);

prior to being subjected to difference testing.

Finally, a tie is possible. For instance my example above the matches would be stockArrays[0] and stockArrays[3].

In my example the result would be

$result = array(0=>array(0,0,1),3=>array(0,0,1));

indicating that the least different stock arrays are at indices 0 & 3 with the differences being at position 2.

In PHP I would handle all of this with array_diff as my starting point. For Node/JavaScript I would probably be tempted to the php.js array_diff port though I would be inclined to explore a bit given that in the worst cast scenario it is an O(n2) affair.

I am a newbie when it comes to Golang so I am not sure how I would implement this problem there. I have noted that Node does have an array_diff npm module.

One off-beat idea I have had is converting the array to a padded string (smaller array elements are 0 padded) and effectively do an XOR on the ordinal value of each character but have dismissed that as probably a rather nutty thing to do.

I am concerned with speed but not at all costs. In an ideal world the same solution (algorithm) would be used in each target language though in reality the differences between them might mean that is not possible/not a good idea.

Perhaps someone here might be able to point me to less pedestrian ways of accomplishing this - i.e. not just array_diff ports.

写回答
好问题 0 提建议
追加酬金
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

1条回答默认最新

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
duancuan7057 2015-02-13 22:10
关注
Here's the equivalent of the array_diff solution: (assuming I didn't make a mistake)

package main import "fmt" func FindLeastDifferent(needle []float64, haystack [][]float64) int { if len(haystack) == 0 { return -1 } var currentIndex, currentDiff int for i, arr := range haystack { diff := 0 for j := range needle { if arr[j] != needle[j] { diff++ } } if i == 0 || diff < currentDiff { currentDiff = diff currentIndex = i } } return currentIndex } func main() { idx := FindLeastDifferent( []float64{2.05080E6, 29400, 420}, [][]float64{ {2.05080E6, 29400, 0}, {2.05080E6, 9800, 420}, {1.715E6, 24500, 280}, {2.05080E6, 29400, 140}, {2.05080E6, 4900, 7}, {2.05080E6, 29400, 420}, }, ) fmt.Println(idx) }

Like you said its O(n * m) where n is the number of elements in the needle array, and m is the number of arrays in the haystack.

If you don't know the haystack ahead of time, then there's probably not much you can do to improve this. But if, instead, you're storing this list in a database, I think your intuition about string search has some potential. PostgreSQL for example supports string similarity indexes. (And here's an explanation of a similar idea for regular expressions: http://swtch.com/~rsc/regexp/regexp4.html)

One other idea: if your arrays are really big you can calculate fuzzy hashes (http://ssdeep.sourceforge.net/) which would make your n smaller.
本回答被题主选为最佳回答 , 对您是否有帮助呢?

解决无用
评论打赏
分享
举报

评论

按下Enter换行，Ctrl+Enter发表内容

报告相同问题？

关注问题

在PHP，Node和Golang中找到两个数组之间的差异 javascript node.js php
2015-02-13 09:11

回答 1 已采纳 Here's the equivalent of the array_diff solution: (assuming I didn't make a mistake) package main
Golang中找到两个数组的交点哪个更快？
2015-02-27 21:26

回答 1 已采纳 As was pointed out in the comments, you are not finding an intersection rather you are finding if
gin获取前端请求中的body的数组 golang
2022-07-20 00:14

回答 2 已采纳正常的定义 struct，并且用 BindJSON() 函数即可，例如： type Res struct { Messages []string `binding:"required` }
如何使用 Python、Node.js 和 Go 创建基于 YOLOv8 的对象检测 Web 服务
2024-01-05 18:30

guohuang的博客在本文中，展示了如何在不需要PyTorch和官方API的情况下使用 YOLOv8 模型，需要将模型部署在不同的端上，让模型使用的资源减少十倍，并且使用了如何在 Node.js、和 Go 上创建由 YOLOv8 的 Web 服务。
如何使用golang从数组中找到不在另一个数组中的元素？
2017-08-24 10:41

回答 1 已采纳 Here's a Playground solving your problem (Quick and Dirty, there may be better solutions out there
AES-256-CBC加密在golang和node / php之间不匹配 node.js php
2018-06-08 13:42

回答 1 已采纳 Okay, I managed to get the go output to match your other outputs, but I'm not 100% clear on all of
如何在golang中从不安全的数组创建数组或切片？
2018-07-05 09:36

回答 1 已采纳 Foreword: You should know: if you get the pointer as a value of uintptr type, that does not preve
Golang底层原理学习笔记（一）
2023-01-13 23:57

lcy～的博客 Golang底层原理
如何在golang中对定长数组进行排序？
2018-09-21 21:44

回答 3 已采纳 Notwithstanding this other answer, which provides guidance regarding appropriate slicing to use th
在Golang中创建数组文字数组
2015-10-14 05:00

回答 4 已采纳 You almost have the right thing however your syntax for the inner arrays is slightly off, needing
在golang中创建二维字符串数组
2018-08-14 19:55

回答 2 已采纳 You have a slice of slices, and the outer slice is nil until it's initialized: matrix := make([][
PHP程序员进阶之路
2022-11-11 15:34

Lambert-XG的博客该文是按照目前主流技术做了一个最基本的梳理而且假设PHP程序员不是基础非常扎实的情况进行的设定，并且所有设定都非常具体明确清晰。
如何在Golang中将字符串类型数组的值映射为int类型数组？
2016-08-10 04:22

回答 2 已采纳 Use int(s[0]-'A')*8 + int(s[1]-'0') like this working sample code: package main import "fmt"
PHP程序员的技术成长规划
2018-05-23 17:10

铁柱同学的博客按照了解的很多PHP/LNMP程序员的发展轨迹，结合个人经验体会，抽象出很多程序员对未来的迷漫，特别对技术学习的盲目和慌乱，简单梳理了这个每个阶段PHP程序员的技术要求，来帮助很多PHP程序做对照设定学习成长目标。...
go 面试题
2024-01-29 18:59

我但行好事莫问前程的博客上述代码中，goroutine会向channel中发送两个整数然后关闭它，主goroutine则会从channel中接收三次数据，前两次能成功接收到值并打印出来，第三次由于channel已关闭且无数据可读，因此会输出"Channel is closed and ...
没有解决我的问题, 去提问

悬赏问题

¥20 基于MSP430f5529的MPU6050驱动，求出欧拉角
¥20 Java-Oj-桌布的计算
¥15 请问如何在openpcdet上对KITTI数据集的测试集进行结果评估？
¥15 powerbuilder中的datawindow数据整合到新的DataWindow
¥20 有人知道这种图怎么画吗？
¥15 pyqt6如何引用qrc文件加载里面的的资源
¥15 安卓JNI项目使用lua上的问题
¥20 RL+GNN解决人员排班问题时梯度消失
¥60 要数控稳压电源测试数据
¥15 能帮我写下这个编程吗

在PHP，Node和Golang中找到两个数组之间的差异

1条回答 默认 最新

悬赏问题

1条回答默认最新