doubi7739 2018-10-12 03:31
Accepted

Why is my Python code 100x slower than the same code in PHP?

I have two points (x1 and x2) and want to generate a normal distribution over a given number of steps, such that the y values for the x values between x1 and x2 sum to 1. Now to the actual problem:

I'm fairly new to Python and wonder why the following code produces the desired result but runs about 100x slower than the same program in PHP. There are about 2000 x1-x2 pairs and about 5 step values per pair.

I tried compiling with Cython and used multiprocessing, but that only improved things 2x, which is still 50x slower than PHP. Any suggestions on how to improve the speed to at least match PHP's performance?

from scipy.stats import norm
import numpy as np
import time

# Calculates normal distribution
def calculate_dist(x1, x2, steps, slope):
    points = []
    range = np.linspace(x1, x2, steps+2)

    for x in range:
        y = norm.pdf(x, x1+((x2-x1)/2), slope)
        points.append([x, y])

    sum = np.array(points).sum(axis=0)[1]

    norm_points = []
    for point in points:
        norm_points.append([point[0], point[1]/sum])

    return norm_points

start = time.time()
for i in range(0, 2000):
    for j in range(10, 15):
        calculate_dist(0, 1, j, 0.15)

print(time.time() - start) # Around 15 seconds or so

Edit, PHP Code:

$start = microtime(true);

for ($i = 0; $i<2000; $i++) {
    for ($j = 10; $j<15; $j++) {
        $x1 = 0; $x2 = 1; $steps = $j; $slope = 0.15;
        $step = abs($x2-$x1) / ($steps + 1);

        $points = [];
        for ($x = $x1; $x <= $x2 + 0.000001; $x += $step) {
            $y = stats_dens_normal($x, $x1 + (($x2 - $x1) / 2), $slope);
            $points[] = [$x, $y];
        }

        $sum = 0;
        foreach ($points as $point) {
            $sum += $point[1];
        }

        $norm_points = [];
        foreach ($points as &$point) {
            array_push($norm_points, [$point[0], $point[1] / $sum]);
        }
    }
}

return microtime(true) - $start; # Around 0.1 seconds or so

Edit 2: I profiled each line and found that norm.pdf() was taking 98% of the time, so I swapped in a custom normpdf function; now the time is around 0.67 s, which is considerably faster but still about 10x slower than PHP. Also, doesn't redefining common functions go against the idea of Python's simplicity?!

The custom function (from another Stack Overflow answer):

from math import sqrt, pi, exp
def normpdf(x, mu, sigma):
    u = (x-mu)/abs(sigma)
    y = (1/(sqrt(2*pi)*abs(sigma)))*exp(-u*u/2)
    return y
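As an aside, the scalar function above can be made to accept whole NumPy arrays simply by swapping the math-module calls for their NumPy counterparts (a sketch of that idea, not part of the original post; the name normpdf_vec is mine):

```python
import numpy as np

def normpdf_vec(x, mu, sigma):
    # Same formula as the scalar normpdf, but np.exp/np.sqrt broadcast
    # element-wise, so x may be a scalar or an array.
    u = (x - mu) / abs(sigma)
    return (1.0 / (np.sqrt(2 * np.pi) * abs(sigma))) * np.exp(-u * u / 2)

xs = np.linspace(0, 1, 12)
ys = normpdf_vec(xs, 0.5, 0.15)   # one call for all 12 points
```

This gives one function that serves both the per-point and the vectorized style.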

1 answer

  • douan4347 2018-10-12 08:39

    The answer is that you aren't using the right tools/data structures for the task in Python.

    Calling NumPy functionality has quite an overhead in Python (scipy.stats.norm.pdf uses NumPy under the hood), so one would never call these functions for a single element but rather for the whole array (so-called vectorized computation). That means instead of

    for x in range:
        y = norm.pdf(x, x1+((x2-x1)/2), slope)
        ys.append(y)
    

    one would rather use:

    ys = norm.pdf(x,x1+((x2-x1)/2), slope)
    

    calculating pdf for all elements in x and paying the overhead only once rather than len(x) times.

    For example, computing the pdf for 10^4 elements takes less than 10 times as long as for a single element:

    %timeit norm.pdf(0)   # 68.4 µs ± 1.62 µs
    %timeit norm.pdf(np.zeros(10**4))   # 415 µs ± 12.4 µs
    

    Using vectorized computation will not only make your program faster but often also shorter and easier to understand, for example:

    def calculate_dist_vec(x1, x2, steps, slope):
        x = np.linspace(x1, x2, steps+2)
        y = norm.pdf(x, x1+((x2-x1)/2), slope)
        ys = y/np.sum(y)
        return x,ys
    

    Using this vectorized version gives you a speed-up of around 10x.
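    That speed-up can be checked with a small harness like the one below (a sketch I am adding for illustration; calculate_dist_loop replays the original per-point approach, and the measured ratio will vary by machine):

```python
import timeit
import numpy as np
from scipy.stats import norm

def calculate_dist_loop(x1, x2, steps, slope):
    # Original style: one norm.pdf call per x value.
    xs = np.linspace(x1, x2, steps + 2)
    ys = np.array([norm.pdf(x, x1 + (x2 - x1) / 2, slope) for x in xs])
    return xs, ys / ys.sum()

def calculate_dist_vec(x1, x2, steps, slope):
    # Vectorized style: a single norm.pdf call over the whole array.
    xs = np.linspace(x1, x2, steps + 2)
    ys = norm.pdf(xs, x1 + (x2 - x1) / 2, slope)
    return xs, ys / ys.sum()

t_loop = timeit.timeit(lambda: calculate_dist_loop(0, 1, 12, 0.15), number=200)
t_vec = timeit.timeit(lambda: calculate_dist_vec(0, 1, 12, 0.15), number=200)
print(f"loop: {t_loop:.3f}s  vectorized: {t_vec:.3f}s")
```

    Both variants produce identical normalized distributions; only the call pattern differs.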

    The problem: norm.pdf is optimized for long vectors (nobody really cares how fast or slow it is for 10 elements if it is very fast for one million elements), but your test is biased against NumPy because it uses/creates only short arrays, so norm.pdf cannot shine.

    So if it is really about small arrays and you are serious about speeding it up, you will have to roll your own version of norm.pdf. Using Cython to create this fast, specialized function might be worth a try.
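    As a concrete illustration of "rolling your own" (a plain-Python sketch rather than the Cython route; the name calculate_dist_small is mine), a specialized version that skips both SciPy and NumPy avoids all array overhead, which pays off at sizes this small:

```python
from math import sqrt, pi, exp

SQRT_2PI = sqrt(2 * pi)

def calculate_dist_small(x1, x2, steps, slope):
    # Specialized for tiny inputs: plain floats and lists, no array machinery.
    n = steps + 2
    step = (x2 - x1) / (n - 1)
    mu = x1 + (x2 - x1) / 2
    coef = 1.0 / (SQRT_2PI * abs(slope))
    ys = []
    for i in range(n):
        u = (x1 + i * step - mu) / abs(slope)
        ys.append(coef * exp(-u * u / 2))
    total = sum(ys)
    # Normalize so the y values sum to 1, as in the original code.
    return [(x1 + i * step, y / total) for i, y in enumerate(ys)]

points = calculate_dist_small(0, 1, 12, 0.15)
```

    For the ~12-element arrays in the question, this kind of function is closer to what the PHP loop is doing, which is exactly why PHP looked fast here.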

