三大件实现基本数据分析

最新推荐文章于 2023-06-29 22:04:09 发布

Alanye_

最新推荐文章于 2023-06-29 22:04:09 发布

阅读量250

点赞数

分类专栏：实用零基础文章标签：数据分析 python

本文链接：https://blog.csdn.net/qq_43554763/article/details/106658277

版权

实用同时被 2 个专栏收录

4 篇文章 0 订阅

订阅专栏

零基础

2 篇文章 0 订阅

订阅专栏

背景
我们将建立一个逻辑回归模型来预测一个学生是否被大学录取。假设你是一个大学系的管理员，你想根据两次考试的结果来决定每个申请人的录取机会。

1导入库

#三大件
import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
%matplotlib inline

2数据读取（数据已上传）
https://download.csdn.net/download/qq_43554763/12509526?spm=1001.2014.3001.5501

pdData=pd.read_csv('data/LogiReg_data.txt',names=['Exam 1', 'Exam 2', 'Admitted'])
pdData.head()

在这里插入图片描述

4.画图

positive = pdData[pdData['Admitted'] == 1] # returns the subset of rows such Admitted = 1, i.e. the set of *positive* examples
negative = pdData[pdData['Admitted'] == 0] # returns the subset of rows such Admitted = 0, i.e. the set of *negative* examples

fig, ax = plt.subplots(figsize=(10,5))
ax.scatter(positive['Exam 1'], positive['Exam 2'], s=30, c='b', marker='o', label='Admitted')
ax.scatter(negative['Exam 1'], negative['Exam 2'], s=30, c='r', marker='x', label='Not Admitted')
ax.legend()
ax.set_xlabel('Exam 1 Score')
ax.set_ylabel('Exam 2 Score')