- 读取数据
2、处理数据import csv f=open("C:\ProgramData\Anaconda3\Scripts\white_wine.csv","r") reader=csv.reader(f) data=[ ] for row in reader: data.append(row) for i in range(5): print(data[i]) f.close
1)查看白葡萄酒总共分为几种品质等级
2)统计每种等级的数量content_dict={} for row in data[1:]: quality=int(row[-1]) if quality not in content_dict.keys(): content_dict[quality]=[row] else: content_dict[quality].append(row) for key in content_dict: print(key,":",len(content_dict[key]))
3)计算每个数据集中fixed acidity的均值content_dict={} for row in data[1:]: quality=int(row[-1]) if quality not in content_dict.keys(): content_dict[quality]=[row] else: content_dict[quality].append(row) for key in content_dict: print(key,":",len(content_dict[key]))
mean_list=[]
for key,value in content_dict.items():
sum=0
for row in value:
sum+=float(row[0])
mean_list.append((key,sum/len(value)))
for item in mean_list:
print(item[0],":",item[1])
4、思考题
计算每个数据集中fixed acidity低于均值的数目。