熬了5个小时了,对于小白来讲太蒙了,有没有大神帮帮我。
import pandas as pd
import numpy as np
import statsmodels.api as sm
import pylab as pl
df = pd.read_csv(r'C:\Users\Administrator\Desktop\Application.csv')
print(df.head())
df.columns = ['admit','gre','gpa','sch_rank']
print(df.columns)
df.describe()
dummy_ranks = pd.get_dummies(df['sch_rank'],prefix = 'sch_rank')
print(dummy_ranks.head())
cols_to_keep = ['admit','gre','gpa']
data = df[cols_to_keep].join(dummy_ranks.loc[:, : 'sch_rank_3'])
print(data.head())
data['intercept'] = 1.0
print(data.head())
train_cols = data.columns[1:]
print(train_cols)
logit = sm.Logit(data['admit'],data[train_cols])
result = logit.fit()
我是按照一篇教程做的,但是我这怎么就过不了啊。教程地址放下边。