李 1 2021-06-09 09:28

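Is this the right way to build a dataset for model.fit?

I read the txt files with for filename in glob.glob(...), convert them to NumPy arrays, and pass the arrays to model.fit.

The network definition is omitted. Please help: with this setup the output of model.fit never changes from epoch to epoch, as shown in the screenshot.

What should I do?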

import glob

import numpy as np


def load_class(pattern, label):
    """Read every txt file matching pattern and return (samples, labels)."""
    samples = []
    for filename in glob.glob(pattern):
        with open(filename) as f:
            # split() with no arguments splits on any whitespace, newlines included
            samples.append(f.read().split())
    # Each file is expected to hold 2 * 2048 = 4096 numbers
    x = np.array(samples, dtype=float).reshape(-1, 2, 2048, 1)
    y = np.full((len(samples), 1), label, dtype=int)
    return x, y


# target_1 is labelled 1, target_2 is labelled 0
# (9000 training files and 1000 test files per class)
a_train, a_train_label = load_class('E:/datasets/demon/target_1/train/*.txt', 1)
b_train, b_train_label = load_class('E:/datasets/demon/target_2/train/*.txt', 0)
a_test, a_test_label = load_class('E:/datasets/demon/target_1/test/*.txt', 1)
b_test, b_test_label = load_class('E:/datasets/demon/target_2/test/*.txt', 0)

x_train = np.concatenate((a_train, b_train), axis=0)
y_train = np.concatenate((a_train_label, b_train_label), axis=0)
x_test = np.concatenate((a_test, b_test), axis=0)
y_test = np.concatenate((a_test_label, b_test_label), axis=0)

# Keras takes the validation_split samples from the end of the arrays, before
# shuffling. Without this permutation the last 15% would all carry label 0.
perm = np.random.default_rng(0).permutation(len(x_train))
x_train, y_train = x_train[perm], y_train[perm]

# model is the network omitted from this post
model.fit(x_train, y_train, epochs=200, verbose=2, validation_split=0.15)

2 answers

  • cbbjiayou 2021-06-09 11:11

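    This is only my own impression and I am not sure it is right: model.fit here works only with a training set and a validation set, so I would split off the validation set yourself first, as a chosen percentage of the training data. You can use the train_test_split function from the sklearn library. I hope that solves your problem; again, this is just my understanding.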
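
The suggestion above can be sketched roughly as follows. This is only an illustration on synthetic data, not the asker's actual pipeline: the array sizes, the variable names and the `random_state` are made up for the example, and it assumes `model` is a Keras model, which is what the `validation_split` argument suggests. Keras takes the `validation_split` fraction from the end of the arrays before any shuffling, so with the class-1 samples concatenated ahead of the class-0 samples the held-out part contains a single class. Splitting beforehand with `train_test_split` (shuffled and stratified) avoids that.

```python
import numpy as np
from sklearn.model_selection import train_test_split

# Synthetic stand-in for the question's data: 20 samples of class 1
# followed by 20 samples of class 0, in the same (N, 2, 2048, 1) layout.
x = np.random.default_rng(0).normal(size=(40, 2, 2048, 1))
y = np.concatenate((np.ones(20, dtype=int), np.zeros(20, dtype=int))).reshape(-1, 1)

# Roughly what validation_split=0.15 would hold out: the tail of the arrays,
# taken before any shuffling.
n_val = int(len(x) * 0.15)
tail_labels = y[-n_val:]

# Shuffled, stratified split: both classes appear in each part.
x_tr, x_val, y_tr, y_val = train_test_split(
    x, y, test_size=0.15, shuffle=True, stratify=y.ravel(), random_state=0
)

print("labels in tail slice:", np.unique(tail_labels))
print("labels in stratified validation set:", np.unique(y_val))

# With a Keras model, the split would then be passed explicitly (not run here):
# model.fit(x_tr, y_tr, epochs=200, verbose=2, validation_data=(x_val, y_val))
```

If the training output still does not change after this, the split is probably not the only issue, and the input scaling and the omitted network would be the next things to look at.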
