How do I replace an LSTM model with a WD_CNN_LSTM model?

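The code below uses an LSTM to search for the best model for predicting stock prices. How can I build on it to search for the best model with WD_CNN_LSTM instead?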


import pandas as pd
def parse_date(date_string):
    return pd.Timestamp(date_string.replace('_', '-'))
df = pd.read_csv('D:/LSTMdata.csv', index_col='Date', parse_dates=True, date_parser=parse_date)
df.sort_index(inplace=True)
def Stock_Price_LSTM_Data_Precesing(df,mem_his_days,pre_days):
    # Work on a copy so repeated calls do not keep trimming the caller's DataFrame
    df = df.dropna().sort_index().copy()
    df['label']= df['Close'].shift(-pre_days)
    from sklearn.preprocessing import StandardScaler
    scaler = StandardScaler()
    sca_X=scaler.fit_transform(df.iloc[:,:-1])
    
    
    # mem_his_days comes from the function argument so the grid search can vary it
    
    from collections import deque
    deq = deque(maxlen=mem_his_days)
    
    X = []
    for i in sca_X:
        deq.append(list(i))
        if len(deq)==mem_his_days:
            X.append(list(deq))
    X_lately = X[-pre_days:]
    X = X[:-pre_days]
    y = df['label'].values[mem_his_days-1:-pre_days]
    
    
    import numpy as np
    X = np.array(X)
    y = np.array(y)
    return X,y,X_lately
X,y,X_lately = Stock_Price_LSTM_Data_Precesing(df,5,10)
print(len(X))
print(len(y))
print(len(X_lately))
pre_days = 10
mem_days=[5,10,15]
lstm_layers=[1,2,3]
dense_layers=[1,2,3]
units = [16,32]
# mem_days=[10]
# lstm_layers=[1]
# dense_layers=[1]
# units = [32]

from tensorflow.keras.callbacks import ModelCheckpoint
for the_mem_days in mem_days:
    for the_lstm_layers in lstm_layers:
        for the_dense_layers in dense_layers:
            for the_units in units:
                filepath=f"./theLSTMbestmodel1/{{val_mape:.2f}}{{epoch:02d}}men{the_mem_days}lstm{the_lstm_layers}dense{the_dense_layers}unit{the_units}.keras"
                checkpoint = ModelCheckpoint(
                    filepath=filepath,
                    save_weights_only=False,
                    monitor='val_mape',
                    mode='min',
                    save_best_only=True)
                X,y,X_lately = Stock_Price_LSTM_Data_Precesing(df,the_mem_days,pre_days)
                from sklearn.model_selection import train_test_split
                X_train,X_test,y_train,y_test = train_test_split(X,y,shuffle=False,test_size=0.1)
                import tensorflow as tf
                from tensorflow.keras.models import Sequential
                from tensorflow.keras.layers import LSTM,Dense,Dropout
                model = Sequential()
                model.add(LSTM(the_units,input_shape=X.shape[1:],activation='relu',return_sequences=True))
                model.add(Dropout(0.1))
                for i in range(the_lstm_layers):
                    model.add(LSTM(the_units,activation='relu',return_sequences=True))
                    model.add(Dropout(0.1))
                
                model.add(LSTM(the_units,activation='relu'))
                model.add(Dropout(0.1))
                for i in range(the_dense_layers):
                    model.add(Dense(the_units,activation='relu'))
                    model.add(Dropout(0.1))
                
                model.add(Dense(1))
                model.compile(optimizer='adam',
                             loss='mse',
                             metrics=['mape'])
                model.fit(X_train,y_train,batch_size=32,epochs=50,validation_data=(X_test,y_test),callbacks=[checkpoint])

专家-赤兔 · Top creator (programming frameworks) · 2024-05-06 22:54

Quoted from the free WeChat mini program 皆我百晓生

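To replace the LSTM model with a WD_CNN_LSTM model, it helps to first look at the basic structure of WD_CNN_LSTM. It combines the local feature extraction of a convolutional neural network (CNN) with the sequence modeling of long short-term memory (LSTM) layers. Typically, a convolutional layer first captures local features of the input, and those features are then passed to the LSTM layers for temporal modeling.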
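As a rough illustration of what the convolutional step contributes, the sketch below slides one filter along the time axis of a single input window in plain NumPy. The helper name `conv1d_valid`, the toy window, and the averaging kernel are assumptions made for illustration; none of them come from the original code.

```python
import numpy as np


def conv1d_valid(window, kernel):
    """Slide `kernel` (k, features) along the time axis of `window` (timesteps, features)."""
    k = kernel.shape[0]
    steps = window.shape[0] - k + 1  # no padding, so the sequence gets shorter
    return np.array([np.sum(window[t:t + k] * kernel) for t in range(steps)])


# Toy window: 5 timesteps with 2 features each (hypothetical values)
window = np.arange(10, dtype=float).reshape(5, 2)
# One filter that averages 3 consecutive timesteps across both features
kernel = np.ones((3, 2)) / 6.0

local_features = conv1d_valid(window, kernel)
print(local_features.shape)  # one value per position of the filter
print(local_features)
```

The output is still ordered in time, which is why it can be handed to an LSTM layer afterwards.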

The modified code below replaces the original LSTM model with a WD_CNN_LSTM model:

import pandas as pd

def parse_date(date_string):
    return pd.Timestamp(date_string.replace('_', '-'))

df = pd.read_csv('D:/LSTMdata.csv', index_col='Date', parse_dates=True, date_parser=parse_date)
df.sort_index(inplace=True)

def Stock_Price_LSTM_Data_Precesing(df, mem_his_days, pre_days):
    # Work on a copy so repeated calls do not keep trimming the caller's DataFrame
    df = df.dropna().sort_index().copy()
    df['label'] = df['Close'].shift(-pre_days)
    from sklearn.preprocessing import StandardScaler
    scaler = StandardScaler()
    sca_X = scaler.fit_transform(df.iloc[:, :-1])

    # mem_his_days comes from the function argument so the grid search can vary it

    from collections import deque
    deq = deque(maxlen=mem_his_days)

    X = []
    for i in sca_X:
        deq.append(list(i))
        if len(deq) == mem_his_days:
            X.append(list(deq))
    X_lately = X[-pre_days:]
    X = X[:-pre_days]
    y = df['label'].values[mem_his_days - 1:-pre_days]

    import numpy as np
    X = np.array(X)
    y = np.array(y)
    return X, y, X_lately

X, y, X_lately = Stock_Price_LSTM_Data_Precesing(df, 5, 10)
print(len(X))
print(len(y))
print(len(X_lately))

pre_days = 10
mem_days = [5, 10, 15]
wd_cnn_lstm_layers = [1, 2, 3]
dense_layers = [1, 2, 3]
units = [16, 32]

# mem_days=[10]
# wd_cnn_lstm_layers=[1]
# dense_layers=[1]
# units = [32]

from tensorflow.keras.callbacks import ModelCheckpoint
for the_mem_days in mem_days:
    for the_wd_cnn_lstm_layers in wd_cnn_lstm_layers:
        for the_dense_layers in dense_layers:
            for the_units in units:
                filepath = f"./theLSTMbestmodel1/{{val_mape:.2f}}{{epoch:02d}}men{the_mem_days}wdclstm{the_wd_cnn_lstm_layers}dense{the_dense_layers}unit{the_units}.keras"
                checkpoint = ModelCheckpoint(
                    filepath=filepath,
                    save_weights_only=False,
                    monitor='val_mape',
                    mode='min',
                    save_best_only=True)
                X, y, X_lately = Stock_Price_LSTM_Data_Precesing(df, the_mem_days, pre_days)
                from sklearn.model_selection import train_test_split
                X_train, X_test, y_train, y_test = train_test_split(X, y, shuffle=False, test_size=0.1)
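                import tensorflow as tf
                from tensorflow.keras.models import Sequential
                from tensorflow.keras.layers import Conv1D, MaxPooling1D, LSTM, Dense, Dropout

                # Keras has no built-in WDCLSTM layer. The standard LSTM is aliased here as a
                # stand-in so the script runs; swap in your own WD_CNN_LSTM layer if you have one.
                WDCLSTM = LSTM

                # Use a convolutional layer as the input layer
                model = Sequential()
                model.add(Conv1D(filters=the_units, kernel_size=3, activation='relu', input_shape=X.shape[1:]))
                model.add(MaxPooling1D(pool_size=2))
                # No Flatten here: recurrent layers need 3D input (batch, timesteps, features)

                # Add the WD_CNN_LSTM layers
                for i in range(the_wd_cnn_lstm_layers):
                    model.add(WDCLSTM(units=the_units, return_sequences=True))  # WDCLSTM stands for a WD_CNN_LSTM layer defined elsewhere
                    model.add(Dropout(0.1))

                model.add(WDCLSTM(units=the_units))
                model.add(Dropout(0.1))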

                # Add the fully connected layers
                for i in range(the_dense_layers):
                    model.add(Dense(units=the_units, activation='relu'))
                    model.add(Dropout(0.1))

                model.add(Dense(1))
                model.compile(optimizer='adam',
                              loss='mse',
                              metrics=['mape'])

                model.fit(X_train, y_train, batch_size=32, epochs=50, validation_data=(X_test, y_test), callbacks=[checkpoint])

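# Note: this code assumes a layer named WDCLSTM; you need to implement this WD_CNN_LSTM layer yourself.
# You can consult the official Keras documentation or other libraries for layers with similar behavior, for example tf.keras.layers.ConvLSTM1D.
# Alternatively, use a pretrained module, such as one from TF-Hub.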

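In this code, I made the following changes:

Replaced the LSTM layers with WDCLSTM layers. WDCLSTM is a hypothetical layer that you need to implement for your own setup.
Added a convolutional layer (Conv1D) at the input to extract local features.
Changed the model name in the checkpoint file path so that the WD_CNN_LSTM models can be told apart.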
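To see how the added Conv1D and pooling steps affect the window length, the sketch below tracks the number of timesteps for each `mem_days` value. It assumes the `kernel_size=3` and `pool_size=2` settings from the code above, together with the Keras defaults of 'valid' padding and a pooling stride equal to the pool size. The helper `output_length` is hypothetical.

```python
def output_length(length, kernel_size, stride=1):
    """Timesteps left after a 'valid' (unpadded) 1D convolution or pooling step."""
    return (length - kernel_size) // stride + 1


lengths = {}
for mem_days in (5, 10, 15):
    after_conv = output_length(mem_days, kernel_size=3)              # Conv1D(kernel_size=3)
    after_pool = output_length(after_conv, kernel_size=2, stride=2)  # MaxPooling1D(pool_size=2)
    lengths[mem_days] = (after_conv, after_pool)
    print(f"mem_days={mem_days}: {after_conv} steps after Conv1D, {after_pool} after pooling")
```

With a 5-day window only a single timestep reaches the recurrent layers, so `padding='same'` or dropping the pooling layer may be worth trying for short windows.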

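Before running this code, make sure you have implemented the WDCLSTM layer or found a usable pretrained module. WD_CNN_LSTM is not a standard Keras layer, so you may need to search for an implementation or build one yourself. If no ready-made implementation is available, you can combine a convolutional layer with LSTM layers in a custom hybrid block to approximate WD_CNN_LSTM.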
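One way to follow this suggestion is sketched below, assuming that a plain Conv1D front end followed by stacked LSTM layers is an acceptable approximation of WD_CNN_LSTM. The helper name `build_cnn_lstm`, its parameters, and the random demo data are hypothetical and are not part of the original code. `padding='same'` is a choice made here so that short windows keep all their timesteps.

```python
import numpy as np
from tensorflow.keras import Input
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import Conv1D, Dense, Dropout, LSTM


def build_cnn_lstm(input_shape, n_lstm_layers=1, n_dense_layers=1, units=16):
    """Hypothetical helper: Conv1D front end, stacked LSTM layers, dense head."""
    model = Sequential()
    model.add(Input(shape=input_shape))
    # The convolution extracts local patterns; 'same' padding keeps every timestep
    model.add(Conv1D(filters=units, kernel_size=3, padding='same', activation='relu'))
    # Every LSTM except the last returns the full sequence for the next LSTM
    for _ in range(n_lstm_layers - 1):
        model.add(LSTM(units, return_sequences=True))
        model.add(Dropout(0.1))
    model.add(LSTM(units))
    model.add(Dropout(0.1))
    for _ in range(n_dense_layers):
        model.add(Dense(units, activation='relu'))
        model.add(Dropout(0.1))
    model.add(Dense(1))  # single regression output: the future closing price
    model.compile(optimizer='adam', loss='mse', metrics=['mape'])
    return model


# Random stand-in data: 8 windows of 5 days with 6 features each
X_demo = np.random.rand(8, 5, 6).astype('float32')
model = build_cnn_lstm(X_demo.shape[1:], n_lstm_layers=2, n_dense_layers=1, units=16)
pred = model.predict(X_demo, verbose=0)
print(pred.shape)
```

Inside the grid search, a call such as `build_cnn_lstm(X.shape[1:], the_wd_cnn_lstm_layers, the_dense_layers, the_units)` could stand in for the layer-by-layer construction, with the `model.fit(...)` call left unchanged.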

The asker selected this answer as the best answer.

Question events

Closed by the system: May 14
Answer accepted: May 6
Question created: May 6
