fashion_mnist识别准确率问题

fashion_mnist识别准确率一般为多少呢？我看好多人都是92%左右，但是我用一个网络达到了94%，想问问做过的小伙伴到底是多少？

#这是我的结果示意
x_shape: (60000, 28, 28)
y_shape: (60000,)
epoches:  0 val_acc:  0.4991 train_acc 0.50481665
epoches:  1 val_acc:  0.6765 train_acc 0.66735
epoches:  2 val_acc:  0.755 train_acc 0.7474
epoches:  3 val_acc:  0.7846 train_acc 0.77915
epoches:  4 val_acc:  0.798 train_acc 0.7936
epoches:  5 val_acc:  0.8082 train_acc 0.80365
epoches:  6 val_acc:  0.8146 train_acc 0.8107
epoches:  7 val_acc:  0.8872 train_acc 0.8872333
epoches:  8 val_acc:  0.896 train_acc 0.89348334
epoches:  9 val_acc:  0.9007 train_acc 0.8986
epoches:  10 val_acc:  0.9055 train_acc 0.90243334
epoches:  11 val_acc:  0.909 train_acc 0.9058833
epoches:  12 val_acc:  0.9112 train_acc 0.90868336
epoches:  13 val_acc:  0.9126 train_acc 0.91108334
epoches:  14 val_acc:  0.9151 train_acc 0.9139
epoches:  15 val_acc:  0.9172 train_acc 0.91595
epoches:  16 val_acc:  0.9191 train_acc 0.91798335
epoches:  17 val_acc:  0.9204 train_acc 0.91975
epoches:  18 val_acc:  0.9217 train_acc 0.9220333
epoches:  19 val_acc:  0.9252 train_acc 0.9234667
epoches:  20 val_acc:  0.9259 train_acc 0.92515
epoches:  21 val_acc:  0.9281 train_acc 0.9266667
epoches:  22 val_acc:  0.9289 train_acc 0.92826664
epoches:  23 val_acc:  0.9301 train_acc 0.93005
epoches:  24 val_acc:  0.9315 train_acc 0.93126667
epoches:  25 val_acc:  0.9322 train_acc 0.9328
epoches:  26 val_acc:  0.9331 train_acc 0.9339667
epoches:  27 val_acc:  0.9342 train_acc 0.93523335
epoches:  28 val_acc:  0.9353 train_acc 0.93665
epoches:  29 val_acc:  0.9365 train_acc 0.9379333
epoches:  30 val_acc:  0.9369 train_acc 0.93885
epoches:  31 val_acc:  0.9387 train_acc 0.9399
epoches:  32 val_acc:  0.9395 train_acc 0.9409
epoches:  33 val_acc:  0.94 train_acc 0.9417667
epoches:  34 val_acc:  0.9403 train_acc 0.94271666
epoches:  35 val_acc:  0.9409 train_acc 0.9435167
epoches:  36 val_acc:  0.9418 train_acc 0.94443333
epoches:  37 val_acc:  0.942 train_acc 0.94515
epoches:  38 val_acc:  0.9432 train_acc 0.9460667
epoches:  39 val_acc:  0.9443 train_acc 0.9468833
epoches:  40 val_acc:  0.9445 train_acc 0.94741666
epoches:  41 val_acc:  0.9462 train_acc 0.9482
epoches:  42 val_acc:  0.947 train_acc 0.94893336
epoches:  43 val_acc:  0.9472 train_acc 0.94946665
epoches:  44 val_acc:  0.948 train_acc 0.95028335
epoches:  45 val_acc:  0.9486 train_acc 0.95095
epoches:  46 val_acc:  0.9488 train_acc 0.9515833
epoches:  47 val_acc:  0.9492 train_acc 0.95213336
epoches:  48 val_acc:  0.9495 train_acc 0.9529833
epoches:  49 val_acc:  0.9498 train_acc 0.9537
val_acc:  0.9498

import tensorflow as tf
from tensorflow import keras
import numpy as np
import matplotlib.pyplot as plt

def to_onehot(y,num):
    lables = np.zeros([num,len(y)])
    for i in range(len(y)):
        lables[y[i],i] = 1
    return lables.T

# 预处理数据
mnist = keras.datasets.fashion_mnist
(train_images,train_lables),(test_images,test_lables) = mnist.load_data()

print('x_shape:',train_images.shape)
#(60000)
print('y_shape:',train_lables.shape)

X_train = train_images.reshape((-1,train_images.shape[1]*train_images.shape[1])) / 255.0
#X_train = tf.reshape(X_train,[-1,X_train.shape[1]*X_train.shape[2]])
Y_train = to_onehot(train_lables,10)
X_test = test_images.reshape((-1,test_images.shape[1]*test_images.shape[1])) / 255.0
Y_test = to_onehot(test_lables,10)

#双隐层的神经网络
input_nodes = 784
output_nodes = 10
layer1_nodes = 100
layer2_nodes = 50
batch_size = 100
learning_rate_base = 0.8
learning_rate_decay = 0.99
regularization_rate = 0.0000001
epochs = 50
mad = 0.99
learning_rate  = 0.005

# def inference(input_tensor,avg_class,w1,b1,w2,b2):
#     if avg_class == None:
#         layer1 = tf.nn.relu(tf.matmul(input_tensor,w1)+b1)
#         return tf.nn.softmax(tf.matmul(layer1,w2) + b2)
#     else:
#         layer1 = tf.nn.relu(tf.matmul(input_tensor,avg_class.average(w1)) + avg_class.average(b1))
#         return  tf.matual(layer1,avg_class.average(w2)) + avg_class.average(b2)

def train(mnist):
    X = tf.placeholder(tf.float32,[None,input_nodes],name = "input_x")
    Y = tf.placeholder(tf.float32,[None,output_nodes],name = "y_true")
    w1 = tf.Variable(tf.truncated_normal([input_nodes,layer1_nodes],stddev=0.1))
    b1 = tf.Variable(tf.constant(0.1,shape=[layer1_nodes]))
    w2 = tf.Variable(tf.truncated_normal([layer1_nodes,layer2_nodes],stddev=0.1))
    b2 = tf.Variable(tf.constant(0.1,shape=[layer2_nodes]))
    w3 = tf.Variable(tf.truncated_normal([layer2_nodes,output_nodes],stddev=0.1))
    b3 = tf.Variable(tf.constant(0.1,shape=[output_nodes]))

    layer1 = tf.nn.relu(tf.matmul(X,w1)+b1)
    A2 = tf.nn.relu(tf.matmul(layer1,w2)+b2)
    A3 = tf.nn.relu(tf.matmul(A2,w3)+b3)

    y_hat = tf.nn.softmax(A3)
#     y_hat = inference(X,None,w1,b1,w2,b2)

#     global_step = tf.Variable(0,trainable=False)
#     variable_averages = tf.train.ExponentialMovingAverage(mad,global_step)
#     varible_average_op = variable_averages.apply(tf.trainable_variables())

    #y = inference(x,variable_averages,w1,b1,w2,b2)
    cross_entropy = tf.reduce_mean(tf.nn.softmax_cross_entropy_with_logits_v2(logits=A3,labels=Y))
    regularizer = tf.contrib.layers.l2_regularizer(regularization_rate)

    regularization = regularizer(w1) + regularizer(w2) +regularizer(w3)
    loss = cross_entropy + regularization * regularization_rate

#     learning_rate = tf.train.exponential_decay(learning_rate_base,global_step,epchos,learning_rate_decay)

#     train_step = tf.train.GradientDescentOptimizer(learning_rate).minimize(loss,global_step=global_step)
    train_step = tf.train.GradientDescentOptimizer(learning_rate).minimize(loss)


#     with tf.control_dependencies([train_step,varible_average_op]):
#         train_op = tf.no_op(name="train")


    correct_prediction = tf.equal(tf.argmax(y_hat,1),tf.argmax(Y,1))
    accuracy = tf.reduce_mean(tf.cast(correct_prediction,tf.float32))
    total_loss = []
    val_acc = []
    total_train_acc = []
    x_Xsis = []

    with tf.Session() as sess:
        tf.global_variables_initializer().run()

        for i in range(epochs):
#             x,y = next_batch(X_train,Y_train,batch_size)
            batchs = int(X_train.shape[0] / batch_size + 1)
            loss_e = 0.
            for j in range(batchs):

                batch_x = X_train[j*batch_size:min(X_train.shape[0],j*(batch_size+1)),:]
                batch_y = Y_train[j*batch_size:min(X_train.shape[0],j*(batch_size+1)),:]
                sess.run(train_step,feed_dict={X:batch_x,Y:batch_y})
                loss_e += sess.run(loss,feed_dict={X:batch_x,Y:batch_y})
#             train_step.run(feed_dict={X:x,Y:y})
            validate_acc = sess.run(accuracy,feed_dict={X:X_test,Y:Y_test})
            train_acc = sess.run(accuracy,feed_dict={X:X_train,Y:Y_train})
            print("epoches: ",i,"val_acc: ",validate_acc,"train_acc",train_acc) 
            total_loss.append(loss_e / batch_size)
            val_acc.append(validate_acc)
            total_train_acc.append(train_acc)
            x_Xsis.append(i)
        validate_acc = sess.run(accuracy,feed_dict={X:X_test,Y:Y_test})
        print("val_acc: ",validate_acc)
    return (x_Xsis,total_loss,total_train_acc,val_acc)

result = train((X_train,Y_train,X_test,Y_test))

def plot_acc(total_train_acc,val_acc,x):
    plt.figure()
    plt.plot(x,total_train_acc,'--',color = "red",label="train_acc")
    plt.plot(x,val_acc,color="green",label="val_acc")
    plt.xlabel("Epoches")
    plt.ylabel("acc")
    plt.legend()
    plt.show()

写回答
好问题 0 提建议
追加酬金
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

1条回答默认最新

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
zqbnqsdsmd 2019-11-05 23:17
关注
https://blog.csdn.net/MANX98/article/details/102529145

解决无用
评论打赏
分享
举报

评论

按下Enter换行，Ctrl+Enter发表内容

报告相同问题？

关注问题

load_mnist(flatten=True, normalize=False)报错 python 深度学习神经网络
2021-03-22 22:08

回答 1 已采纳解决了，原来导入数据模块里有flatten和normalize函数的界定，load（）里不能直接用。
如何自己做一个类似Fashion-MNIST的数据集 python 深度学习神经网络
2019-09-03 16:43

回答 1 已采纳 https://blog.csdn.net/sdoddyjm68/article/details/78430209
做mnist识别时出现错误 AttributeError: module 'keras.api._v2.keras' has no attribute 'train' keras tensorflow 神经网络
2022-09-19 20:31

回答 1 已采纳现在用的是tf2吧，那应该要这样子写tf.keras.optimizers.Adam
Fashion_mnist数据集介绍
2019-11-24 14:42

静待花开s0的博客 Fashion-MNIST ](https://hanxiao.github.io/2018/09/28/Fashion-MNIST-Year-In-Review/) 目录为什么要做这个数据集？获取数据如何载入数据？基准测试数据可视化参与贡献联系在论文中引用...
关于mnist识别的问题：ValueError: Data cardinality is ambiguous 深度学习
2022-04-14 16:03

回答 1 已采纳数据的长度不一致，需要reshape一下，数据集修改过吗？
提升knn算法的准确率 python 人工智能机器学习
2022-09-30 18:14

回答 5 已采纳首先，手写识别的关键是特征描述，如果这一步没有做好，用什么方法，怎么调参，也不会有好的结果。将图像像素值直接作为输入向量，原则上是不适当的。推荐实现方法如下：（1）首先，样本均匀，标准化，归一化，这些
tensorflow CNN训练mnist数据集后识别自己写的数字效果不好 cnn tensorflow 神经网络
2018-04-15 16:32

回答 5 已采纳 MNIST数据集与你自己采集的图像，实际上是两个不同的数据集，你在MNIST上训练，然后在你的数据集上测试，测试性能不好是十分正常的。这实际上涉及在两个相似但是不同的域之间的迁移学习的问题。有三个办法
【人工智能项目】Fashion Mnist识别实验
2021-11-02 15:21

mind_programmonkey的博客【人工智能项目】Fashion Mnist识别实验本次主要通过四个方法对fashion mnist进行识别实验，主要为词袋模型、hog特征、mlp多层感知器和cnn卷积神经网络。那么话不多说，走起来瓷！！！ Fashion Mnist Fashion ...
关于使用tensoeflow2.0加载mnist数据集的问题 pycharm python tensorflow 有问必答
2021-08-11 17:48

回答 2 已采纳你这不就是pycharm没有导包成功吗，这些包我安装过很多遍去我博客看，我都有总结，对你有帮助的话采纳顺手点个赞
用Python实现MNIST字体识别语法错误 python tensorflow 人工智能深度学习神经网络
2020-05-06 13:20

回答 2 已采纳 print (sess.run(accuracy, feed_dict={x: mnist.test.images, y_: mnist.test.labels})) 加上括号看看
MNIST手写图片分类问题关于数据转换的问题 python sklearn 机器学习
2023-03-06 21:23

回答 3 已采纳 X[:,0]
神经网络衣服分类器详解（Fashion-MNIST数据集）
2021-01-30 15:31

仓鼠球球－O－的博客文章目录前言一、Fashion-MNIST是什么？二、代码实现1.引入库2.读取数据集3.数据预处理4.搭建神经网络5.编译和训练神经网络模型6.神经网络预测总结前言每个想要学习深度学习、图像识别的同学，想要用到神经网络...
FashionMNIST预加载的数据集为啥只有轮廓 python 深度学习
2023-03-02 09:40

回答 1 已采纳 FashionMNIST 是一个经典的图像分类数据集，它包含了一系列服装类别的灰度图像。这些图像的尺寸为 $28\times28$ 像素，每个像素的灰度值在 $0$ 到 $255$ 之间。当你使用预
基于卷积神经网络的Fashion-MNIST图像识别
2023-01-09 10:38

我是小石呀的博客基于卷积神经网络的Fashion-MNIST图像识别，通常指的是使用卷积神经网络来对Fashion-MNIST数据集中的图像进行分类。在这种情况下，我们需要训练一个卷积神经网络模型，让它能够根据图像的特征来预测图像所属的类别。
卷积神经网络（LeNet5实现对Fashion_MNIST分类
2024-04-14 12:37

全是头发的羊羊羊的博客在实验过程中，我逐步优化了模型架构，并对比了不同模型的性能表现，以达到更好的分类准确率。首先，Fashion_MNIST数据集包含10个类别的衣物和配饰图像，每个类别包含7000张28x28像素的灰度图像。我采用了LeNet-5...
没有解决我的问题, 去提问

悬赏问题

¥15 如何在scanpy上做差异基因和通路富集？
¥20 关于#硬件工程#的问题，请各位专家解答！
¥15 关于#matlab#的问题：期望的系统闭环传递函数为G(s)=wn^2/s^2+2¢wn+wn^2阻尼系数¢=0.707，使系统具有较小的超调量
¥15 FLUENT如何实现在堆积颗粒的上表面加载高斯热源
¥30 截图中的mathematics程序转换成matlab
¥15 动力学代码报错，维度不匹配
¥15 Power query添加列问题
¥50 Kubernetes&Fission&Eleasticsearch
¥15 報錯：Person is not mapped，如何解決？
¥15 c++头文件不能识别CDialog

fashion_mnist识别准确率问题

1条回答 默认 最新

悬赏问题

1条回答默认最新