PHP函数的Big-O列表

After using PHP for a while now, I've noticed that not all built-in PHP functions are as fast as expected. Consider these two possible implementations of a function that finds if a number is prime using a cached array of primes.

//very slow for large $prime_array
$prime_array = array( 2, 3, 5, 7, 11, 13, .... 104729, ... );
$result_array = array();
foreach( $prime_array => $number ) {
    $result_array[$number] = in_array( $number, $large_prime_array );
}

//speed is much less dependent on size of $prime_array, and runs much faster.
$prime_array => array( 2 => NULL, 3 => NULL, 5 => NULL, 7 => NULL,
                       11 => NULL, 13 => NULL, .... 104729 => NULL, ... );
foreach( $prime_array => $number ) {
    $result_array[$number] = array_key_exists( $number, $large_prime_array );
}

This is because in_array is implemented with a linear search O(n) which will linearly slow down as $prime_array grows. Where the array_key_exists function is implemented with a hash lookup O(1) which will not slow down unless the hash table gets extremely populated (in which case it's only O(n)).

So far I've had to discover the big-O's via trial and error, and occasionally looking at the source code. Now for the question...

Is there a list of the theoretical (or practical) big O times for all* the built-in PHP functions?

*or at least the interesting ones

For example, I find it very hard to predict the big O of functions listed because the possible implementation depends on unknown core data structures of PHP: array_merge, array_merge_recursive, array_reverse, array_intersect, array_combine, str_replace (with array inputs), etc.

写回答
好问题 0 提建议
追加酬金
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

4条回答默认最新

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
dqotv26286 2010-03-20 19:43
关注
Since it doesn't seem like anyone has done this before I thought it'd be good idea to have it for reference somewhere. I've gone though and either via benchmark or code-skimming to characterize the array_* functions. I've tried to put the more interesting Big-O near the top. This list is not complete.

Note: All the Big-O where calculated assuming a hash lookup is O(1) even though it's really O(n). The coefficient of the n is so low, the ram overhead of storing a large enough array would hurt you before the characteristics of lookup Big-O would start taking effect. For example the difference between a call to array_key_exists at N=1 and N=1,000,000 is ~50% time increase.

Interesting Points:

isset/array_key_exists is much faster than in_array and array_search

+(union) is a bit faster than array_merge (and looks nicer). But it does work differently so keep that in mind.

shuffle is on the same Big-O tier as array_rand

array_pop/array_push is faster than array_shift/array_unshift due to re-index penalty

Lookups:

array_key_exists O(n) but really close to O(1) - this is because of linear polling in collisions, but because the chance of collisions is very small, the coefficient is also very small. I find you treat hash lookups as O(1) to give a more realistic big-O. For example the different between N=1000 and N=100000 is only about 50% slow down.

isset( $array[$index] ) O(n) but really close to O(1) - it uses the same lookup as array_key_exists. Since it's language construct, will cache the lookup if the key is hardcoded, resulting in speed up in cases where the same key is used repeatedly.

in_array O(n) - this is because it does a linear search though the array until it finds the value.

array_search O(n) - it uses the same core function as in_array but returns value.

Queue functions:

array_push O(∑ var_i, for all i)

array_pop O(1)

array_shift O(n) - it has to reindex all the keys

array_unshift O(n + ∑ var_i, for all i) - it has to reindex all the keys

Array Intersection, Union, Subtraction:

array_intersect_key if intersection 100% do O(Max(param_i_size)*∑param_i_count, for all i), if intersection 0% intersect O(∑param_i_size, for all i)

array_intersect if intersection 100% do O(n^2*∑param_i_count, for all i), if intersection 0% intersect O(n^2)

array_intersect_assoc if intersection 100% do O(Max(param_i_size)*∑param_i_count, for all i), if intersection 0% intersect O(∑param_i_size, for all i)

array_diff O(π param_i_size, for all i) - That's product of all the param_sizes

array_diff_key O(∑ param_i_size, for i != 1) - this is because we don't need to iterate over the first array.

array_merge O( ∑ array_i, i != 1 ) - doesn't need to iterate over the first array

+ (union) O(n), where n is size of the 2nd array (ie array_first + array_second) - less overhead than array_merge since it doesn't have to renumber

array_replace O( ∑ array_i, for all i )

Random:

shuffle O(n)

array_rand O(n) - Requires a linear poll.

Obvious Big-O:

array_fill O(n)

array_fill_keys O(n)

range O(n)

array_splice O(offset + length)

array_slice O(offset + length) or O(n) if length = NULL

array_keys O(n)

array_values O(n)

array_reverse O(n)

array_pad O(pad_size)

array_flip O(n)

array_sum O(n)

array_product O(n)

array_reduce O(n)

array_filter O(n)

array_map O(n)

array_chunk O(n)

array_combine O(n)

I'd like to thank Eureqa for making it easy to find the Big-O of the functions. It's an amazing free program that can find the best fitting function for arbitrary data.

EDIT:

For those who doubt that PHP array lookups are O(N), I've written a benchmark to test that (they are still effectively O(1) for most realistic values).

$tests = 1000000; $max = 5000001; for( $i = 1; $i <= $max; $i += 10000 ) { //create lookup array $array = array_fill( 0, $i, NULL ); //build test indexes $test_indexes = array(); for( $j = 0; $j < $tests; $j++ ) { $test_indexes[] = rand( 0, $i-1 ); } //benchmark array lookups $start = microtime( TRUE ); foreach( $test_indexes as $test_index ) { $value = $array[ $test_index ]; unset( $value ); } $stop = microtime( TRUE ); unset( $array, $test_indexes, $test_index ); printf( "%d,%1.15f ", $i, $stop - $start ); //time per 1mil lookups unset( $stop, $start ); }
本回答被题主选为最佳回答 , 对您是否有帮助呢?

解决无用
评论打赏
分享
举报

评论

按下Enter换行，Ctrl+Enter发表内容

查看更多回答(3条)

报告相同问题？

关注问题

PHP函数的Big-O列表 php
2010-03-18 23:12

回答 4 已采纳 Since it doesn't seem like anyone has done this before I thought it'd be good idea to have it for
PHP包含/ require内部函数 php
2014-02-05 19:09

回答 4 已采纳 While @Luceos answer is technically correct (the best kind of correct), it does not answer the que
PHP flock函数限制和txt缓存文件 linux php
2013-03-01 01:24

回答 1 已采纳 The only common case would be running Apache, with the some kind threaded (non-forking) npm. 99% o
php flv视频时间获取函数
2021-01-20 00:16

php function BigEndian2Int($byte_word, $signed = false) { $int_value = 0; $byte_wordlen = strlen($byte_word); for ($i = 0; $i < $byte_wordlen; $i++) { $int_value += ord($byte_word{$i}) * pow(256, ...
最好的PHP函数输出多个选项中的一个 php
2014-06-13 19:51

回答 1 已采纳 Create an array of the departments using their ID as their array key. Then you can access them usi
优化PHP代码 - 函数显示和PHP未定义变量的通知 php
2013-08-25 05:45

回答 1 已采纳 You're using a concatenation on variables that are not set. Replace $temp1.='<li><a hr
表单元素的html表单元素与php函数 html php
2015-03-06 17:17

回答 1 已采纳 It's not necessarily standard. It's how much do want to re-write the same code over again. The par
php中支持多种编码的中文字符串截取函数!
2020-12-17 15:53

支持多种编码的中文字符串截取函数! 复制代码代码如下:/* * @todo 中文截取，支持gb2312,gbk,utf-8,big5 * * @param string $str 要截取的字串 * @param int $start 截取起始位置 * @param int ...
禁用htaccess中的php函数 apache php
2013-01-26 01:47

回答 3 已采纳 According to the PHP documentation, you can't use the disable_functions setting anywhere other tha
PHP在更多函数之间传递变量 php
2011-02-15 13:24

回答 4 已采纳 You can try putting all the variables into an associative array and just passing this array betwee
JQuery函数没有发布到php ajax jquery php
2013-06-23 19:41

回答 2 已采纳 First I would make sure the AJAX--to--PHP system is working. You can do that test with two small
解析php获取字符串的编码格式的方法(函数)
2021-01-20 01:08

如果不清楚字符串的编码格式的话，就可以将这段字符这样检查：$encode = mb_detect_encoding($string, array... 您可能感兴趣的文章:php strstr查找字符串中是否包含某些字符的查找函数PHP字符转义相关函数小结(php下
具有文件句柄的Php函数循环 php
2012-07-11 08:20

回答 4 已采纳 Because $file_handle is boolean false (you can check for this with var_dump), which in turn happen
PHP编码转换函数utf-gb-big5
2007-10-31 16:20

支持在PHP中将编码转换为指定的编码方式 gb2big5 big52gb utf82u u2utf8 gb2utf8 utf82gb
PHP函数速查效率手册 source code
2013-10-05 15:00

sorry,video too big,deleted 脑动力：PHP函数速查效率手册 source code 张建辉　主编电子工业出版社　PHP是现在最流行的网站开发技术。PHP提供的内部函数功能强大，解决常见的各种PHP问题。但是PHP函数繁杂，...
没有解决我的问题, 去提问

悬赏问题

¥15 这是哪个作者做的宝宝起名网站
¥60 版本过低apk如何修改可以兼容新的安卓系统
¥25 由IPR导致的DRIVER_POWER_STATE_FAILURE蓝屏
¥50 有数据，怎么建立模型求影响全要素生产率的因素
¥50 有数据，怎么用matlab求全要素生产率
¥15 TI的insta-spin例程
¥15 完成下列问题完成下列问题
¥15 C#算法问题, 不知道怎么处理这个数据的转换
¥15 YoloV5 第三方库的版本对照问题
¥15 请完成下列相关问题！

PHP函数的Big-O列表

4条回答 默认 最新

悬赏问题

4条回答默认最新