为什么我的Python代码比PHP中的相同代码慢100倍？

I have two points (x1 and x2) and want to generate a normal distribution in a given step count. The sum of y values for the x values between x1 and x2 is 1. To the actual problem:

I'm fairly new to Python and wonder why the following code produces the desired result, but about 100x slower than the same program in PHP. There are about 2000 x1-x2 pairs and about 5 step values per pair.

I tried to compile with Cython, used multiprocessing but it just improved things 2x, which is still 50x slower than PHP. Any suggestions how to improve speed to match at least PHP performance?

from scipy.stats import norm
import numpy as np
import time

# Calculates normal distribution
def calculate_dist(x1, x2, steps, slope):
    points = []
    range = np.linspace(x1, x2, steps+2)

    for x in range:
        y = norm.pdf(x, x1+((x2-x1)/2), slope)
        points.append([x, y])

    sum = np.array(points).sum(axis=0)[1]

    norm_points = []
    for point in points:
        norm_points.append([point[0], point[1]/sum])

    return norm_points

start = time.time()
for i in range(0, 2000):
    for j in range(10, 15):
        calculate_dist(0, 1, j, 0.15)

print(time.time() - start) # Around 15 seconds or so

Edit, PHP Code:

$start = microtime(true);

for ($i = 0; $i<2000; $i++) {
    for ($j = 10; $j<15; $j++) {
        $x1 = 0; $x2 = 1; $steps = $j; $slope = 0.15;
        $step = abs($x2-$x1) / ($steps + 1);

        $points = [];
        for ($x = $x1; $x <= $x2 + 0.000001; $x += $step) {
            $y = stats_dens_normal($x, $x1 + (($x2 - $x1) / 2), $slope);
            $points[] = [$x, $y];
        }

        $sum = 0;
        foreach ($points as $point) {
            $sum += $point[1];
        }

        $norm_points = [];
        foreach ($points as &$point) {
            array_push($norm_points, [$point[0], $point[1] / $sum]);
        }
    }
}

return microtime(true) - $start; # Around 0.1 seconds or so

Edit 2, profiled each line and found that norm.pdf() was taking 98% of time, so found a custom normpdf function and defined it, now time is around 0.67s which is considerably faster, but still around 10x slower than PHP. Also I think redefining common functions goes against the idea of Pythons simplicity?!

The custom function (source is some other Stackoverflow answer):

from math import sqrt, pi, exp
def normpdf(x, mu, sigma):
    u = (x-mu)/abs(sigma)
    y = (1/(sqrt(2*pi)*abs(sigma)))*exp(-u*u/2)
    return y

展开全部

写回答
好问题 0 提建议
追加酬金
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

1条回答默认最新

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
douan4347 2018-10-12 00:39
关注
The answer is, you aren't using the right tools/data structures for the tasks in python.

Calling numpy functionality has quite an overhead (scipy.stats.norm.pdf uses numpy under the hood) in python and thus one would never call this functions for one element but for the whole array (so called vectorized computation), that means instead of

for x in range: y = norm.pdf(x, x1+((x2-x1)/2), slope) ys.append(y)

one would rather use:

ys = norm.pdf(x,x1+((x2-x1)/2), slope)

calculating pdf for all elements in x and paying the overhead only once rather than len(x) times.

For example to calculate pdf for 10^4 elements takes less than 10 times more time than for one element:

%timeit norm.pdf(0) # 68.4 µs ± 1.62 µs %timeit norm.pdf(np.zeros(10**4)) # 415 µs ± 12.4 µs

Using vectorized computation will not only make your program faster but often also shorter/easier to understand, for example:

def calculate_dist_vec(x1, x2, steps, slope): x = np.linspace(x1, x2, steps+2) y = norm.pdf(x, x1+((x2-x1)/2), slope) ys = y/np.sum(y) return x,ys

Using this vectorized version gives you a speed-up around 10.

The problem: norm.pdf is optimized for long vectors (nobody really cares how fast/slow it is for 10 elements if it is very fast for one million elements), but your test is biased against numpy, because it uses/creates only short arrays and thus norm.pdf cannot shine.

So if it is really about small arrays and you are serious about speeding it up you will have to roll out your own version of norm.pdf Using cython for creating this fast and specialized function might be worth a try.
本回答被题主选为最佳回答 , 对您是否有帮助呢?

解决无用
评论打赏
分享
举报
编辑

预览
轻敲空格完成输入
显示为

卡片

标题

链接
评论

按下Enter换行，Ctrl+Enter发表内容

编辑

预览

报告相同问题？

关注问题

python语法中selenium浏览器驱动为什么我的代码中间有一个横线？ python
2022-07-17 11:48

回答 1 已采纳 selenium更新了怕
VScode中Python代码不高亮显示？？ python vscode 有问必答
2022-04-10 14:05

回答 2 已采纳安装这两个插件然后设置颜色主题或者你也可以安装其它你喜欢的然后颜色主题插件
Python代码，什么意思？ python
2023-04-15 05:39

回答 2 已采纳这段代码是一个二分查找算法。lst 是一个有序列表，num 是要在列表中查找的数字。high 减一是因为如果 lst[mid] 大于 num，那么 num 一定不在 lst[mid] 及其右边的位置，
python比php慢多了_为什么我的Python代码比PHP中的相同代码慢100倍？
2021-01-29 15:44

AI众智新媒体一阿荣的博客对于实际问题：我对Python相当陌生，不知道为什么下面的代码会产生预期的结果，但是比PHP中的同一个程序慢了大约100倍。大约有2000个x1-x2对，每对大约有5个阶跃值。在我试着用Cython编译，使用了多处理，但它只改进...
Python代码中的 break为什么汇报错 python 有问必答
2021-06-25 09:53

回答 2 已采纳 else 和上面的if缩进没对齐，所以break语句也报错了如果有帮助请点一下我回答右上方的采纳，谢谢！以后有什么问题可以互相交流。
python中print(),括号里为空，在代码末尾代表什么？ python
2021-05-12 13:11

回答 2 已采纳换行理解没错也没其他作用了
这几行python代码怎么缩减为一行呢？ python
2022-05-16 07:47

回答 2 已采纳 print(*[[["%d*%d=%2d" %(i, j, i*j) for j in range(i,10)] for k in range(1,i)] for i in range(1,10)]
python中收银台模拟器为什么if里的代码运行不起来？如图 python
2022-01-18 12:44

回答 1 已采纳啥啊，当然错了，怎么可能同时满足大于100和等于100？
从Python代码转换为程序？ python
2021-10-04 06:18

回答 4 已采纳你可能想找 pyinstaller
为什么相同的代码在不同的电脑上运行结果不同？ c# c++ c语言
2022-10-15 08:42

回答 3 已采纳代码里：sum 没有初始化。
python代码转成php代码的工具或者go转成php的代码，想把odoo改成成php swoole当成web服务+go的架构
2023-07-16 14:35

zhangfeng1133的博客 python代码转成php代码的工具或者go转成php的代码，想把odoo改成成php swoole当成web服务+go的架构
为什么Python运行不出结果，只有进程已结束退出代码为0 python
2023-01-14 08:09

回答 2 已采纳你的初始化函数写错了，导致初始化失败，r没有被赋值是 init 而不是 int def __init__(self, r):
国外开发者谈为何放弃PHP而改用Python
2020-10-29 09:59

在本文中，一位具有11年PHP开发经验的国外开发者，详细阐述了他放弃PHP转而使用Python的理由。在他的观点中，PHP被描述为一种复杂的插件结构，其API存在不一致性，语言的管理混乱，缺乏标准，且与现代编程语言相比...
Convert-Trained-ML-Models-To-Native-Code：如何使用m2gen python库将经过训练的机器学习模型转换为本地代码，例如python，php和javascript
2021-02-07 02:32

例如，如果你想要将模型转换为Python代码，可以这样操作： ```python from m2gen import to_python # 假设 `clf` 是你的训练好的Scikit-Learn分类器 native_code = to_python(clf) with open('model.py', 'w')...
没有解决我的问题, 去提问

悬赏问题

¥15 全志t113i启动qt应用程序提示internal error
¥15 ensp可以看看嘛.
¥80 51单片机C语言代码解决单片机为AT89C52是清翔单片机
¥60 优博讯DT50高通安卓11系统刷完机自动进去fastboot模式
¥15 minist数字识别
¥15 在安装gym库的pygame时遇到问题，不知道如何解决
¥20 uniapp中的webview 使用的是本地的vue页面，在模拟器上显示无法打开
¥15 网上下载的3DMAX模型，不显示贴图怎么办
¥15 关于#stm32#的问题：寻找一块开发版，作为智能化割草机的控制模块和树莓派主板相连，要求：最低可控制 3 个电机（两个驱动电机，1 个割草电机），其次可以与树莓派主板相连电机照片如下：
¥15 Mac(标签-IDE|关键词-File) idea

为什么我的Python代码比PHP中的相同代码慢100倍？

1条回答 默认 最新

悬赏问题

1条回答默认最新