Custom loss function has no gradient

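I'm writing a custom loss function to optimize an LSTM model. The model predicts the closing price of a stock for the next five days, and the predicted prices decide when to buy and when to sell within those five days. For example, if the next five days are [1,2,3,2,5], the best plan is to buy on day 1, sell on day 3, buy on day 4 and sell on day 5, which yields the maximum profit. So the predicted prices fix the buy and sell days, and executing those trades on the real prices gives the profit of the prediction. The true profit is obtained by applying the same strategy directly to the real prices.
The code is as follows: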

def strategy(fivedayprice):
    # Returns one action per day: 1 = buy, -1 = sell, 0 = do nothing
    actlist = []
    holding = False
    for i in range(4):
        if fivedayprice[i+1] > fivedayprice[i] and not holding:
            actlist.append(1)
            holding = True
        elif fivedayprice[i+1] < fivedayprice[i] and holding:
            actlist.append(-1)
            holding = False
        else:
            # No trade (includes the case of equal prices)
            actlist.append(0)
    # Close any open position on the last day
    actlist.append(-1 if holding else 0)
    return actlist

def get_profit(y_pred,y_true):
    labellist=[]
    profitlist=[]
    num=len(y_pred)
    for i in range(num):
        labellist.append(strategy(y_pred[i]))
    for j in range(len(labellist)):
        profit=0
        for n in range(5):
            if labellist[j][n]==1:
                profit=profit-y_true[j][n]
            elif labellist[j][n]==0:
                pass
            elif labellist[j][n]==-1:
                profit =profit+y_true[j][n]
        profitlist.append(profit)
    return profitlist

import tensorflow as tf
import tensorflow.keras.backend as K
def profitloss(y_true, y_pred):
    y_true = tf.convert_to_tensor(y_true)
    y_pred = tf.convert_to_tensor(y_pred)
    trueprofitlist = tf.convert_to_tensor(get_profit(y_true, y_true))
    predprofitlist = tf.convert_to_tensor(get_profit(y_pred, y_true))
    loss=K.mean(K.square(trueprofitlist-predprofitlist))           
    return loss

When I train the model, it fails with an error saying there are no gradients:

ValueError: No gradients provided for any variable: (['lstm_62/lstm_cell/kernel:0', 'lstm_62/lstm_cell/recurrent_kernel:0', 'lstm_62/lstm_cell/bias:0', 'lstm_63/lstm_cell/kernel:0', 'lstm_63/lstm_cell/recurrent_kernel:0', 'lstm_63/lstm_cell/bias:0', 'dense_62/kernel:0', 'dense_62/bias:0', 'dense_63/kernel:0', 'dense_63/bias:0'],). Provided `grads_and_vars` is ((None, <tf.Variable 'lstm_62/lstm_cell/kernel:0' shape=(9, 512) dtype=float32>), (None, <tf.Variable 'lstm_62/lstm_cell/recurrent_kernel:0' shape=(128, 512) dtype=float32>), (None, <tf.Variable 'lstm_62/lstm_cell/bias:0' shape=(512,) dtype=float32>), (None, <tf.Variable 'lstm_63/lstm_cell/kernel:0' shape=(128, 512) dtype=float32>), (None, <tf.Variable 'lstm_63/lstm_cell/recurrent_kernel:0' shape=(128, 512) dtype=float32>), (None, <tf.Variable 'lstm_63/lstm_cell/bias:0' shape=(512,) dtype=float32>), (None, <tf.Variable 'dense_62/kernel:0' shape=(128, 10) dtype=float32>), (None, <tf.Variable 'dense_62/bias:0' shape=(10,) dtype=float32>), (None, <tf.Variable 'dense_63/kernel:0' shape=(10, 5) dtype=float32>), (None, <tf.Variable 'dense_63/bias:0' shape=(5,) dtype=float32>)).

How can I fix this?


Answer by 专家-赤兔 (全栈领域优质创作者), 2024-04-25 18:01

Quoted from the free WeChat mini program 皆我百晓生

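The problem is that get_profit, which your custom loss profitloss calls, cannot be differentiated automatically. It runs as plain Python list code, and y_pred enters it only through comparisons that select discrete buy/sell actions, while the profit itself is summed from y_true. Backpropagation therefore has no path from the loss back to the model weights. A first step is to express the whole computation as TensorFlow tensor operations, which I'll walk through step by step. Keep in mind that this alone does not create a gradient: a hard comparison has zero gradient almost everywhere, so to actually train the model you will also need to replace the hard buy/sell decisions with a smooth approximation.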
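To see why nothing reaches the weights, it can help to check how the profit reacts to a small change in the predictions. The sketch below is plain Python for a single sample; the helper names actions_for and profit and the sample numbers are illustrative assumptions, and the rule is a simplified copy of the strategy in the question. Nudging any predicted price leaves the profit unchanged unless it flips a comparison, so the slope is zero.

```python
def actions_for(prices):
    """Hard buy/sell rule: +1 buy, -1 sell, 0 no trade."""
    acts, holding = [], False
    for i in range(len(prices) - 1):
        if prices[i + 1] > prices[i] and not holding:
            acts.append(1)
            holding = True
        elif prices[i + 1] < prices[i] and holding:
            acts.append(-1)
            holding = False
        else:
            acts.append(0)
    acts.append(-1 if holding else 0)  # close any open position on the last day
    return acts


def profit(pred, true):
    """Profit on the true prices when trading by the actions derived from pred."""
    return sum(-a * t for a, t in zip(actions_for(pred), true))


y_true = [1.0, 2.0, 3.0, 2.0, 5.0]
y_pred = [1.1, 1.9, 3.2, 2.4, 4.6]  # made-up predictions

base = profit(y_pred, y_true)
eps = 1e-3

# Finite-difference slope of the profit with respect to each predicted price
slopes = []
for k in range(len(y_pred)):
    bumped = list(y_pred)
    bumped[k] += eps
    slopes.append((profit(bumped, y_true) - base) / eps)

print(base)
print(slopes)  # every entry is 0.0: small changes in y_pred never move the profit
```

Automatic differentiation sees the same flat function, which is consistent with the None gradients in the error message.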

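First, let's refactor the strategy function into TensorFlow operations. We'll create a function named compute_actions that takes a batch of five-day prices as input and returns the actions (buy/sell signals) for each day.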

import tensorflow as tf

@tf.function
def compute_actions(fivedayprice):
    # fivedayprice has shape (batch, 5)
    # holding[b] is True while sample b has an open position
    holding = tf.fill(tf.shape(fivedayprice)[:1], False)
    actions = []

    # For each step, check whether the buy or sell condition is met
    for i in range(4):
        next_price = fivedayprice[:, i + 1]
        current_price = fivedayprice[:, i]

        # Buy: price is about to rise and we hold nothing
        buy_mask = tf.logical_and(next_price > current_price, tf.logical_not(holding))
        # Sell: price is about to fall and we hold a position
        sell_mask = tf.logical_and(next_price < current_price, holding)

        # +1 for buy, -1 for sell, 0 otherwise (the two masks never overlap)
        step_action = (tf.cast(buy_mask, fivedayprice.dtype)
                       - tf.cast(sell_mask, fivedayprice.dtype))
        actions.append(step_action)

        # Update the holding state
        holding = tf.logical_and(tf.logical_or(holding, buy_mask),
                                 tf.logical_not(sell_mask))

    # On the last day, sell whatever is still held
    actions.append(-tf.cast(holding, fivedayprice.dtype))

    # Result has shape (batch, 5)
    return tf.stack(actions, axis=1)

Next, we redefine get_profit using the compute_actions function we just created, making sure every operation is a TensorFlow operation.

@tf.function
def get_profit(y_pred, y_true):
    # Actions are chosen from y_pred, profit is realized on y_true
    actions = tf.cast(compute_actions(y_pred), y_true.dtype)

    # Buying (+1) costs the true price, selling (-1) earns it
    profits = tf.reduce_sum(-actions * y_true, axis=-1)

    return profits

Now we can rewrite profitloss so that it uses only TensorFlow operations.

def profitloss(y_true, y_pred):
    y_true = tf.convert_to_tensor(y_true)
    y_pred = tf.convert_to_tensor(y_pred)

    trueprofits = get_profit(y_true, y_true)
    predprofits = get_profit(y_pred, y_true)

    loss = tf.reduce_mean(tf.square(trueprofits - predprofits))

    return loss

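The loss function is now expressed entirely in TensorFlow. Be aware that y_pred still affects it only through the comparisons in compute_actions, so the gradient with respect to the model weights stays missing until those hard decisions are relaxed into something smooth. Once the loss is differentiable, and assuming you have a model named model, training would look like this: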

# Assumes X_train and y_train are your training data
model.compile(optimizer='adam', loss=profitloss)
model.fit(X_train, y_train, epochs=10, batch_size=32)

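With a differentiable version of this loss, the model learns to bring the profit earned from its predictions close to the best achievable profit, rather than to match the prices themselves. Note that this objective can be hard to optimize and slow to converge because it is built on discrete decisions. You may need to adjust the model architecture and the training hyperparameters to suit this kind of loss.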
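One possible way to get a usable gradient is to soften the buy/sell rule. The sketch below is a swapped-in technique, not the method given in this answer: it relies on the observation that the greedy strategy holds the stock over a day exactly when the price is expected to rise, and replaces that yes/no decision with a sigmoid weight. The name soft_profit_loss, the temperature parameter and its default of 1.0 are assumptions to be tuned to the scale of your prices.

```python
import tensorflow as tf


def soft_profit_loss(y_true, y_pred, temperature=1.0):
    """Smooth surrogate of the profit loss (illustrative sketch)."""
    y_true = tf.cast(y_true, y_pred.dtype)
    true_diff = y_true[:, 1:] - y_true[:, :-1]  # real day-to-day price changes
    pred_diff = y_pred[:, 1:] - y_pred[:, :-1]  # predicted day-to-day changes

    # Soft "hold" weight in (0, 1): close to 1 when a rise is predicted,
    # close to 0 when a fall is predicted.
    soft_hold = tf.sigmoid(pred_diff / temperature)

    # Profit realized on the real prices under the soft holding weights
    pred_profit = tf.reduce_sum(soft_hold * true_diff, axis=-1)
    # Best achievable profit: hold exactly over the days with a real rise
    true_profit = tf.reduce_sum(tf.nn.relu(true_diff), axis=-1)

    return tf.reduce_mean(tf.square(true_profit - pred_profit))


# Gradient check on made-up numbers
y_true = tf.constant([[1.0, 2.0, 3.0, 2.0, 5.0]])
y_pred = tf.constant([[1.1, 1.9, 3.2, 2.4, 4.6]])
with tf.GradientTape() as tape:
    tape.watch(y_pred)
    loss = soft_profit_loss(y_true, y_pred)
grad = tape.gradient(loss, y_pred)
print(loss.numpy(), grad.numpy())  # grad is a tensor, not None
```

Because the signature is (y_true, y_pred), it could be passed to model.compile(optimizer='adam', loss=soft_profit_loss). A smaller temperature follows the hard rule more closely but makes the gradients vanish sooner, so it is a trade-off worth experimenting with.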
