bad input shape (60000, 2)

本小白在看机器学习实战时，绘制精度、召回率相对阈值的函数图时报了错。

代码如下：

 from sklearn.datasets import fetch_mldata
import matplotlib
import matplotlib.pyplot as plt
import numpy as np
from sklearn.linear_model import SGDClassifier
from sklearn.model_selection import StratifiedKFold
from sklearn.base import clone
from sklearn.model_selection import cross_val_score
from sklearn.model_selection import cross_val_predict
from sklearn.metrics import confusion_matrix
from sklearn.metrics import precision_score,recall_score
from sklearn.metrics import f1_score
from sklearn.metrics import precision_recall_curve
from sklearn.metrics import roc_curve
from sklearn.metrics import roc_auc_score

#导入部分
mnist = fetch_mldata('MNIST original')
X,y = mnist["data"],mnist["target"]

#显现部分
some_digit = X[36000]
some_digit_image = some_digit.reshape(28,28)
plt.imshow(some_digit_image,cmap=matplotlib.cm.binary,interpolation="nearest")
plt.axis("off")
#plt.show()

#训练集和测试集
X_train,X_test,y_train,y_test=X[:60000],X[60000:],y[:60000],y[60000:]
shuffle_index = np.random.permutation(60000)
X_train,y_train = X_train[shuffle_index],y_train[shuffle_index]

#二分分类器
y_train_5 = (y_train == 5)
y_test_5 = (y_test == 5)

sgd_clf = SGDClassifier(random_state=42)
sgd_clf.fit(X_train,y_train_5)
predict1 = sgd_clf.predict([some_digit])
print(predict1)

#实施交叉验证
skfolds = StratifiedKFold(n_splits=3,random_state=42)
for train_index,test_index in skfolds.split(X_train,y_train_5):
    clone_clf = clone(sgd_clf)
    X_train_folds = X_train[train_index]
    y_train_folds = (y_train_5[train_index])
    X_test_fold = X_train[test_index]
    y_test_fold = (y_train_5[test_index])

    clone_clf.fit(X_train_folds,y_train_folds)
    y_pred = clone_clf.predict(X_test_fold)
    n_correct = sum(y_pred == y_test_fold)
    print(n_correct/len(y_pred))

#kfold方法
print(cross_val_score(sgd_clf,X_train,y_train_5,cv=3,scoring="accuracy"))
y_train_pred = cross_val_predict(sgd_clf,X_train,y_train_5,cv=3)
#print(confusion_matrix(y_train_5,y_train_pred))
#print(precision_score(y_train_5,y_pred))           #精度
#print(recall_score(y_train_5,y_train_pred))        #召回率
#print(f1_score(y_train_5,y_pred))                 #fi分数
y_scores = sgd_clf.decision_function([some_digit])
print(y_scores)
#threshold = 0
#y_some_digit_pred = (y_scores>threshold)
#print(y_some_digit_pred)
#提高阈值
threshold = 200000
y_some_digit_pred = (y_scores>threshold)
print(y_some_digit_pred)
#绘制阈值函数图



y_scores = cross_val_predict(sgd_clf,X_train,y_train_5,cv=3,method="decision_function")
precisions, recalls, thresholds = precision_recall_curve(y_train_5,y_scores)

def plot_precison_recall_vs_threshold(precisions,recalls,thresholds):
    plt.plot(thresholds,precisions[:-1],"b--",label="Precision")
    plt.plot(thresholds, recalls[:-1], "g-", label="Recall")
    plt.xlabel("Threshold")
    plt.legend(loc="upper left")
    plt.ylim([0,1])
plot_precison_recall_vs_threshold(precisions,recalls,thresholds)
plt.show()

报错信息如下：
Traceback (most recent call last):
File "F:/python项目/mnist.py", line 77, in
precisions, recalls, thresholds = precision_recall_curve(y_train_5,y_scores)
File "C:\Users\15701\Anaconda3\lib\site-packages\sklearn\metrics\ranking.py", line 417, in precision_recall_curve
sample_weight=sample_weight)
File "C:\Users\15701\Anaconda3\lib\site-packages\sklearn\metrics\ranking.py", line 304, in _binary_clf_curve
y_score = column_or_1d(y_score)
File "C:\Users\15701\Anaconda3\lib\site-packages\sklearn\utils\validation.py", line 583, in column_or_1d
raise ValueError("bad input shape {0}".format(shape))
ValueError: bad input shape (60000, 2)

不胜感激

写回答
好问题 0 提建议
追加酬金
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

2条回答

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
一叶扁舟。。。 2019-04-15 18:51
关注
print(y_train_5.shape) 结果为(60000,) print(y_scores.shape)结果为(60000, 2)，
print(y_scores)结果为[[ 0. -229600.48544944]
[ 0. -792845.57622101]
[ 0. -529311.13077603]
...,
[ 0. -806955.80116218]
[ 0. -199716.61091746]
[ 0. -499524.22190059]]
解决方案为： y_scores=y_score[:,1]

本回答被题主选为最佳回答 , 对您是否有帮助呢?

解决无用
评论打赏
分享
举报

评论

按下Enter换行，Ctrl+Enter发表内容

查看更多回答(1条)

报告相同问题？

关注问题

bad input shape (60000, 2) sklearn 机器学习
2018-11-24 01:21

回答 2 已采纳 print(y_train_5.shape) 结果为(60000,) print(y_scores.shape)结果为(60000, 2)， print(y_scores)结果为[[
宝塔面板pm2重启服务后出现502 Bad Gateway gateway 服务器运维
2023-01-17 19:18

回答 1 已采纳望采纳！！！点击回答右侧采纳即可！！我给些思路吧可以尝试检查一下服务器的配置文件，比如Apache或Nginx的配置文件，看看是否有任何错误或者不正确的配置。另外，可以检查一下服务器上的缓存服务，比如
ssm框架页面报400 bad requst错误 ajax html5 javascript 有问必答
2021-12-21 16:40

回答 2 已采纳 // 明显错误 var username=$("#username"); var password=$("#password"); var repassword=$("#re
【腾讯云 HAI域探秘】——即时职场生存指南小游戏以及【自行搭建Stable Diffusion图片AI绘制 | ChatGLM2-6B AI进行智能对话 | Pytorch2.0 AI框架视频处理】
2023-10-27 22:33

红目香薰的博客【腾讯云 HAI域探秘】——自行搭建Stable Diffusion模型服务用于生成AI图片 | 自行搭建ChatGL M26BAI模型服务用于AI对话自主创建AI对话工具，腾讯云有一套最新的HAI工具，我们一起来探秘吧。
502 bad gateway nginx
2022-04-18 09:34

回答 4 已采纳看上去有可能是程序死循环了，你把进程kill掉，如果可以重新访问，那就是了，检查代码就好了
400 bad request问题 spring boot vue.js
2022-08-05 18:51

回答 3 已采纳 ajax异步请求参数报空吧
cv2.error: OpenCV(4.5.5) :-1: error: (-5:Bad argument) in function 'pointPolygonTest' python
2022-06-07 11:23

回答 2 已采纳函数的第二个入参pt的类型错误，把maskpts[i，0]的值用print()打印出来看看，它们应该是数值类型才对
AI-新手玩转RKNN
2023-05-27 15:34

ansondroider的博客关于RKNN RKNN 是Rockchip npu 平台使用的模型类型，以.rknn后缀结尾的模型文件。... mask, anchors): anchors = [anchors[i] for i in mask] grid_h, grid_w = map(int, input.shape[0:2]) box_confidence ...
Linux中fstat函数报错：bad addreass c语言
2022-08-18 11:15

回答 1 已采纳 statbuf只是个指针，没有分配空间啊修改为： struct stat stabuf; if( fstat(fd, &stabuf) == -1)
PlotlyRequestError: Bad API key python 有问必答
2021-07-21 11:42

回答 1 已采纳 api_key='7408123tjz'，这个key的值是错的。
IOS新手，自定义UIView问题,EXC_BAD_ACCESS code=2 ios
2016-02-16 08:37

回答 2 已采纳好吧，最后自己找到问题了。调试时候在这一行上面加了一行NSLog,结果是NSLog无限输出。然后发现问题是在XIB里面，我把类关联了Class，这样在加载XIB时候将会调用aweakFromNib。而
人工智能与人类智能的相互作用：如何实现高效的协作
2024-01-06 01:04

禅与计算机程序设计艺术的博客 人工智能（Artificial Intelligence, AI）和人类智能（Human Intelligence, HI）之间的相互作用是一个令人兴奋的研究领域。随着计算能力的不断提高和数据量的不断增加，人工智能技术的发展已经取得了显著的进展。...
Yii2 CSRF令牌时间段增加 csrf php
2018-04-02 12:24

回答 1 已采纳 You can use frontend and backend as different cookie and session Cookie Backend 'identityCookie'
MMCVDeformConv2d转TensorRT记录
2023-03-25 22:36

天亮换季的博客 optimization_profile() for input_name, param in input_shapes.items(): min_shape = param['min_shape'] opt_shape = param['opt_shape'] max_shape = param['max_shape'] profile.set_shape(input_name, min_...
PSP - AlphaFold2 适配不同来源搜索的 MSA 接口
2023-05-10 09:52

SpikeKing的博客当使用 AlphaFold2 进行蛋白结构预测时，有时，比较复杂的序列，需要优化 MSA 搜索，再进行预测蛋白结构，即需要将 MSA 与结构推理两个部分解耦。
人工智能安全与光明时代
2024-04-19 08:47

肆壹柒Z的博客 If you control a powerful model that mediates all consumption and production of information,2 and it’s a proprietary secret, you can shape what people believe, how people act — and censor whatever...
没有解决我的问题, 去提问

悬赏问题

¥20 ML307A在使用AT命令连接EMQX平台的MQTT时被拒绝
¥20 腾讯企业邮箱邮件可以恢复么
¥15 有人知道怎么将自己的迁移策略布到edgecloudsim上使用吗？
¥15 错误 LNK2001 无法解析的外部符号
¥50 安装pyaudiokits失败
¥15 计组这些题应该咋做呀
¥60 更换迈创SOL6M4AE卡的时候，驱动要重新装才能使用，怎么解决？
¥15 让node服务器有自动加载文件的功能
¥15 jmeter脚本回放有的是对的有的是错的
¥15 r语言蛋白组学相关问题