寻找会深度学习，图像生成方面的大神（DCGAN）

https://github.com/eriklindernoren/Keras-GAN/blob/master/dcgan/dcgan.py

我想用github上的DCGAN的opensource来训练我自己的dataset，请问如何导入自己的dataset

写回答
好问题 0 提建议
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

3条回答默认最新

ProfSnail 2021-03-04 19:44

关注

我把你的链接中的代码下载下来了，运行了一遍，是可以用的代码。

dcgan.py在第109行用到了mnist.load_data()这个函数，读取的是自带的mnist.npz数据集。我看到mnist.load_data()函数的原文是这样的：

# Copyright 2015 The TensorFlow Authors. All Rights Reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
#     http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
# ==============================================================================
"""MNIST handwritten digits dataset.
"""
from __future__ import absolute_import
from __future__ import division
from __future__ import print_function

import numpy as np

from tensorflow.python.keras.utils.data_utils import get_file
from tensorflow.python.util.tf_export import keras_export


@keras_export('keras.datasets.mnist.load_data')
def load_data(path='mnist.npz'):
  """Loads the [MNIST dataset](http://yann.lecun.com/exdb/mnist/).

  This is a dataset of 60,000 28x28 grayscale images of the 10 digits,
  along with a test set of 10,000 images.
  More info can be found at the
  [MNIST homepage](http://yann.lecun.com/exdb/mnist/).


  Arguments:
      path: path where to cache the dataset locally
          (relative to `~/.keras/datasets`).

  Returns:
      Tuple of Numpy arrays: `(x_train, y_train), (x_test, y_test)`.

      **x_train, x_test**: uint8 arrays of grayscale image data with shapes
        (num_samples, 28, 28).

      **y_train, y_test**: uint8 arrays of digit labels (integers in range 0-9)
        with shapes (num_samples,).

  License:
      Yann LeCun and Corinna Cortes hold the copyright of MNIST dataset,
      which is a derivative work from original NIST datasets.
      MNIST dataset is made available under the terms of the
      [Creative Commons Attribution-Share Alike 3.0 license.](
      https://creativecommons.org/licenses/by-sa/3.0/)
  """
  origin_folder = 'https://storage.googleapis.com/tensorflow/tf-keras-datasets/'
  path = get_file(
      path,
      origin=origin_folder + 'mnist.npz',
      file_hash=
      '731c5ac602752760c8e48fbffcf8c3b850d9dc2a2aedcf2cc48468fc17b673d1')
  with np.load(path, allow_pickle=True) as f:
    x_train, y_train = f['x_train'], f['y_train']
    x_test, y_test = f['x_test'], f['y_test']

    return (x_train, y_train), (x_test, y_test)

可以根据该函数仿写一个读取数据的函数。经过查验，mnist.npz里面的样本是28*28的，需要缩放到28*28的样本。

最后的函数是这样的：

import numpy as np
import cv2
import cv
import os
import random

def get_image(image_index, path=r'C:\Coding\Python\CSDN\Image\bibimbap', img_predix="hed"):
	# 扩充前导0
	'''
	image_index 是数字，从0到999
	path是数据集的绝对路径。也可以换成相对路径。
	img_predix是数据集的前缀。
	'''
	image_index = "%04d" % image_index
	image_path = os.path.join(path, img_predix+image_index+'.png')
	# 转为灰度图
	img = cv2.imread(image_path, cv2.IMREAD_GRAYSCALE)
	img = Reduce(img)
	return img

def Reduce(image):
    shrink = cv2.resize(image, (28,28), interpolation=cv2.INTER_AREA)  
    return shrink

def load_data_mydatabase(path=r'C:\Coding\Python\CSDN\Image\bibimbap'):
	# 由于mnist.npz中的数据集是单色值，28*28像素的数据。
	# 因此需要选择预训练之后的bibimbap下head0000.png作为训练集和标签集合
	# 由于只有1000张图，可以采用前900张作为训练集，最后100张作为数据集。
	x_train = []
	x_test = []
	y_train = []
	y_test = []
	# 这个烤冷面我不太清楚你希望做成哪些类别，所以这里随机生成十个类别。
	for train_index in range(900):
		x_train.append(get_image(train_index))
		y_train.append(int(random.uniform(0,10)))
	x_train = np.array(x_train)
	for test_index in range(900, 1000):
		x_test.append(get_image(test_index))
		y_test.append(int(random.uniform(0,10)))
	x_test = np.array(x_test)
	y_test = np.array(y_test)
	return (x_train,y_train), (x_test,y_test)

因为我记得你之前是要做冷面的数据集，我还下载下来了一份。但是不清楚你这是要几分类。所以我随机生成了一个十分类的标签值，题主根据自己需要生成新的标签值。

使用的时候，将这片代码放到原代码中。并将dcgan.py中第109行换成

(X_train, _), (_, _) = load_data_mydatabase()

即可。

本回答被题主选为最佳回答 , 对您是否有帮助呢?

查看更多回答(2条)

报告相同问题？

关注问题

深度学习核心技术精讲100篇（十二）-DCGAN(对抗生成网络）算法应用及代码实现
2020-09-21 09:12

文宇肃然的博客原来背后有一个极为有意思的算法思想——对抗生成。今天笔者斗胆来介绍一下在学术界大名鼎鼎的GAN(Generative Adversarial Networks ),此网络结构由Ian J. Goodfellow大神在2014年提出，一经推出，就引爆了学术界。 ...
深度卷积生成对抗网络
2022-02-06 16:51

皇儒无上的博客理解与学习深度卷积生成对抗网络一.GAN 引言：生成对抗网络GAN，是当今的一大热门研究方向。在2014年，被Goodfellow大神提出来，当时的G神还是蒙特利尔大学的博士生。据有关媒体统计：CVPR2018的论文里，有三分之一...
深度学习论文精读【持续更新中】
2024-07-24 23:27

Donvink的博客包括已经精读完成和之后将要精读的论文，10年内深度学习里有影响力文章（必读文章），或者近期比较有意思的文章。总论文数 96，阅读完成数 79，博文完成数1。
superior哥深度学习系列（大纲）
2025-05-30 18:49

superior tigre的博客 ️ 完整知识图谱梳理不同应用场景的技术选型持续学习方法论推荐学习资源学习历程回顾成果展示指南 AI未来发展趋势个人发展建议。
Stable Diffusion：一种新型的深度学习AIGC模型！
2024-09-02 10:28

网络安全入门学习教程的博客随着生成型AI技术的能力提升，越来越多的注意力放在了通过AI模型提升研发效率上。业内比较火的AI模型有很多，比如画图神器Midjourney、用途多样的Stable Diffusion，以及OpenAI此前刚刚迭代的DALL-E 2。对于研发团队...
深度学习领域，你心目中 idea 最惊艳的论文是哪篇？
2024-10-19 20:29

AI大模型-王哥的博客：生成领域的新贵，比如OpenAI的DALL·E 2和Google的Imagen，引领文本生成图像领域的新风向，效果令人惊艳，甚至引发了AI绘画与画师之争！，当时还在搞物理，买了数学之美看着玩儿，被这个经典算法狠狠的惊艳到了，...
万字详解什么是生成对抗网络GAN
2021-12-09 15:26

华为云开发者联盟的博客摘要：这篇文章将详细介绍生成对抗网络GAN的基础知识，包括什么是GAN、常用算法（CGAN、DCGAN、infoGAN、WGAN）、发展历程、预备知识，并通过Keras搭建最简答的手写数字图片生成案。
【菜鸟窝免费视频】如何生成数字、人脸和二次元美少女头像（AI生成二次元头像）
2019-07-29 14:17

Bella人工智能爱好者的博客本视频课程是由菜鸟窝研发的，免费提供给AI人工智能爱好者学习的。具体视频、代码可以找菜鸟窝助教Bella（weixin：BT474849）无套路领取哦。关于有监督模型、无监督模型是什么？生产模型、生产对抗网络 gan，生成...
生成对抗网络（GAN）的数学原理全解
2020-10-04 18:35

PaperWeekly的博客 ©PaperWeekly 原创 ·作者｜孙裕道学校｜北京邮电大学博士生研究方向｜GAN图像生成、情绪对抗样本生成论文标题：A Mathematical Introduction to ...
深度学习之图像简史
2017-12-04 09:27

weixin_34368949的博客人，是感官的动物。我们的大脑，像一块复杂度极高的CPU，每天在接收着各种格式的数据，进行着无休止的计算...人用这样一双肉眼如何识别不同类别的图像（image classification and pattern recognition），如何在图...
没有解决我的问题, 去提问

码龄粉丝数原力等级 --

寻找会深度学习，图像生成方面的大神（DCGAN）

3条回答默认最新

码龄粉丝数原力等级 --

寻找会深度学习，图像生成方面的大神（DCGAN）

3条回答 默认 最新

3条回答默认最新