Tushare Day4——导入IPO新股列表new_share并分析基金和盈利

IPO新股列表new_share

1. 基础信息

在这里插入图片描述
在这里插入图片描述
在这里插入图片描述

import pandas as pd
import tushare as ts
from sqlalchemy import create_engine 
import pymysql
pymysql.install_as_MySQLdb()
date_1 = '19600101'
date_2 = '19800101'
### 在2000年之前没有数据
date_3 = '20000101'
date_4 = '20150101'
date_5 = '20210101'

2. 从数据接口取出new_share(由于限制,分两次取再合并)

2.1 20000101到20150101

# 由于每次限制取2000条,这里分两次取出来
pro = ts.pro_api()

df = pro.new_share(start_date = date_3, end_date = date_4)
df.head()
ts_codesub_codenameipo_dateissue_dateamountmarket_amountpricepelimit_amountfundsballot
0603889.SH732889新澳股份20141223201412312668.02401.217.9521.631.04.78910.43
1603017.SH732017园区设计20141222201412311500.01350.029.9722.880.64.49550.55
2603636.SH732636南威软件20141222201412302500.02250.014.9522.961.03.73750.40
3002736.SZ002736国信证券2014121920141229120000.060000.05.8322.9736.069.96001.80
4002738.SZ002738中矿资源20141219201412303000.02700.07.5722.981.22.27100.40
df.info()
<class 'pandas.core.frame.DataFrame'>
RangeIndex: 1082 entries, 0 to 1081
Data columns (total 12 columns):
 #   Column         Non-Null Count  Dtype  
---  ------         --------------  -----  
 0   ts_code        1082 non-null   object 
 1   sub_code       1082 non-null   object 
 2   name           1082 non-null   object 
 3   ipo_date       1082 non-null   object 
 4   issue_date     1082 non-null   object 
 5   amount         1082 non-null   float64
 6   market_amount  1082 non-null   float64
 7   price          1082 non-null   float64
 8   pe             1082 non-null   float64
 9   limit_amount   1082 non-null   float64
 10  funds          1082 non-null   float64
 11  ballot         1082 non-null   float64
dtypes: float64(7), object(5)
memory usage: 101.6+ KB

2.2 20150101到20210101

pro = ts.pro_api()

df_2 = pro.new_share(start_date = date_4, end_date = date_5)
df_2.info()
<class 'pandas.core.frame.DataFrame'>
RangeIndex: 1369 entries, 0 to 1368
Data columns (total 12 columns):
 #   Column         Non-Null Count  Dtype  
---  ------         --------------  -----  
 0   ts_code        1369 non-null   object 
 1   sub_code       1369 non-null   object 
 2   name           1369 non-null   object 
 3   ipo_date       1369 non-null   object 
 4   issue_date     1369 non-null   object 
 5   amount         1369 non-null   float64
 6   market_amount  1369 non-null   float64
 7   price          1369 non-null   float64
 8   pe             1369 non-null   float64
 9   limit_amount   1369 non-null   float64
 10  funds          1369 non-null   float64
 11  ballot         1369 non-null   float64
dtypes: float64(7), object(5)
memory usage: 128.5+ KB

2.3 将二者合并

frames = [df, df_2]

data = pd.concat(frames, ignore_index = True)
data.info()
<class 'pandas.core.frame.DataFrame'>
RangeIndex: 2451 entries, 0 to 2450
Data columns (total 12 columns):
 #   Column         Non-Null Count  Dtype  
---  ------         --------------  -----  
 0   ts_code        2451 non-null   object 
 1   sub_code       2451 non-null   object 
 2   name           2451 non-null   object 
 3   ipo_date       2451 non-null   object 
 4   issue_date     2451 non-null   object 
 5   amount         2451 non-null   float64
 6   market_amount  2451 non-null   float64
 7   price          2451 non-null   float64
 8   pe             2451 non-null   float64
 9   limit_amount   2451 non-null   float64
 10  funds          2451 non-null   float64
 11  ballot         2451 non-null   float64
dtypes: float64(7), object(5)
memory usage: 229.9+ KB
data.head()
ts_codesub_codenameipo_dateissue_dateamountmarket_amountpricepelimit_amountfundsballot
0603889.SH732889新澳股份20141223201412312668.02401.217.9521.631.04.78910.43
1603017.SH732017园区设计20141222201412311500.01350.029.9722.880.64.49550.55
2603636.SH732636南威软件20141222201412302500.02250.014.9522.961.03.73750.40
3002736.SZ002736国信证券2014121920141229120000.060000.05.8322.9736.069.96001.80
4002738.SZ002738中矿资源20141219201412303000.02700.07.5722.981.22.27100.40
df.info()
<class 'pandas.core.frame.DataFrame'>
RangeIndex: 1082 entries, 0 to 1081
Data columns (total 12 columns):
 #   Column         Non-Null Count  Dtype  
---  ------         --------------  -----  
 0   ts_code        1082 non-null   object 
 1   sub_code       1082 non-null   object 
 2   name           1082 non-null   object 
 3   ipo_date       1082 non-null   object 
 4   issue_date     1082 non-null   object 
 5   amount         1082 non-null   float64
 6   market_amount  1082 non-null   float64
 7   price          1082 non-null   float64
 8   pe             1082 non-null   float64
 9   limit_amount   1082 non-null   float64
 10  funds          1082 non-null   float64
 11  ballot         1082 non-null   float64
dtypes: float64(7), object(5)
memory usage: 101.6+ KB

3. 分析筹集基金

3.1 选出募集基金前20的公司

# 选出募集资金前20的公司

data.sort_values(by='funds', ascending=False).head(20)
ts_codesub_codenameipo_dateissue_dateamountmarket_amountpricepelimit_amountfundsballot
715601288.SH780288农业银行20100706201007152557058.801031032.52.6814.431000.0685.29189.29
1001601668.SH780668中国建筑20090722200907291200000.00600000.04.1851.29700.0501.60002.83
2276601211.SH780211国泰君安2015061820150626152500.00106750.019.7122.9945.7300.57751.57
1514601138.SH780138工业富联2018052420180608196953.00100610.013.7717.0941.3267.96000.34
1070601898.SH780898中煤能源2008012520080201152533.34118213.316.8343.719999.9256.71360.73
1061601186.SH780186中国铁建2008022620080310245000.00196000.09.0830.569999.9222.46000.64
686601818.SH780818光大银行2010081020100818700000.00245000.03.1016.40240.0217.00003.63
1120688981.SH787981中芯国际2020070720200716193846.0025284.027.460.0042.1202.85500.21
982601618.SH780618中国中冶2009090920090921350000.00226634.75.4241.90200.0189.70001.18
846601688.SH780688华泰证券201002092010022678456.1354919.320.0029.4130.0156.912314.43
926601989.SH780989中国重工2009120720091216199500.00119700.07.3842.14100.0147.23101.36
907601299.SH780299中国北车2009122120091229250000.00162500.05.5649.21150.0139.00001.33
325601669.SH780669中国水电2011092720111018300000.00150000.04.5015.00150.0135.00009.70
2306601985.SH780985中国核电2015060220150610389100.00272370.03.3922.29116.7131.90491.63
859601106.SH780106中国一重2010020120100209200000.00100000.05.7041.22100.0114.00004.98
1353003816.SZ003816中国广核2019081220190826504986.00184252.02.4914.6075.7111.67200.60
939600999.SH730999招商证券200911102009111735854.6128683.731.0056.2610.0111.14931.01
998601788.SH780788光大证券200908042009081852000.0034080.121.0858.5630.0109.61600.81
1382688009.SH787009中国通号2019071020190722180000.0037800.05.8518.8025.2106.84700.23
2071601229.SH780229上海银行201611022016111660045.0054041.017.778.2618.0106.70000.23

3.2 按基金数额画频数直方图

data.funds.hist()
<matplotlib.axes._subplots.AxesSubplot at 0x21436b3ebb0>

在这里插入图片描述

3.3 小于80的基金按数额画频数直方图

data[data.funds<80].funds.hist()
<matplotlib.axes._subplots.AxesSubplot at 0x21437341430>

在这里插入图片描述

3.4 小于50的基金按数额画频数直方图

data[data.funds<50].funds.hist()
<matplotlib.axes._subplots.AxesSubplot at 0x214373d6dc0>

在这里插入图片描述

3.5 小于30的基金按数额画频数直方图

data[data.funds<30].funds.hist() # 大部分funds都集中在5亿左右
<matplotlib.axes._subplots.AxesSubplot at 0x214373cad00>

在这里插入图片描述

发现大部分基金数额都集中在5亿左右。

4. 分析盈利

4.1 选出市盈率最小的前20公司

# 选出市盈率最小的前20公司

data.sort_values(by='pe').head(20)
ts_codesub_codenameipo_dateissue_dateamountmarket_amountpricepelimit_amountfundsballot
1084688185.SH787185康希诺20200731202008132480.0697.0209.710.000.4510.67000.03
1140688277.SH787277天智航20200622202007074190.01081.012.040.000.705.08300.04
1129688180.SH787180君实生物20200702202007158713.02120.055.500.001.3529.35600.04
1126688165.SH787165埃夫特202007032020071513045.03720.06.350.002.4512.66500.05
1120688981.SH787981中芯国际2020070720200716193846.025284.027.460.0042.10202.85500.21
1117688256.SH787256寒武纪20200708202007204010.0972.064.390.000.6029.27700.04
1110688561.SH787561奇安信202007142020072210194.02159.056.100.001.4047.76100.04
1199688126.SH787126沪硅产业202004092020042062007.013457.03.890.008.6526.44300.10
1086688339.SH787339亿华通20200729202008101763.0461.076.650.000.250.00000.04
1229688177.SH787177百奥泰20200211202002216000.01734.032.760.001.1520.89400.05
1083688313.SH787313仕佳光子20200731202008124600.01311.010.820.000.850.00000.03
1248688266.SH787266泽璟制药20200114202001236000.01722.033.760.001.1025.40900.05
1152688520.SH787520神州细胞20200609202006225000.01430.025.640.000.9521.01500.04
2142601997.SH780997贵阳银行201608042016081650000.045000.08.496.0815.0042.45000.22
100601225.SH780225陕西煤业2014011720140128100000.015000.04.006.2315.0040.00004.87
2196603528.SH732528多伦科技20160421201605035168.04651.29.456.411.904.88380.06
1486002936.SZ002936郑州银行201809072018091960000.054000.04.596.5018.0027.54000.25
1024002254.SZ002254烟台氨纶20080616200806253200.02560.018.596.672560.005.94880.08
1484601577.SH780577长沙银行201809122018092634216.030794.07.996.9710.2027.33800.17
1963601881.SH780881中国银河201701112017012360000.054000.06.817.0218.0040.86000.21

4.2 按盈利多少画频数直方图

data.pe.hist()
<matplotlib.axes._subplots.AxesSubplot at 0x214374f3880>

在这里插入图片描述

4.3 小于200的盈利画频数直方图

data[data.pe<200].pe.hist()
<matplotlib.axes._subplots.AxesSubplot at 0x214374ef5b0>

在这里插入图片描述

4.4 小于150的盈利画频数直方图

data[data.pe<150].pe.hist()
<matplotlib.axes._subplots.AxesSubplot at 0x214376d2790>

在这里插入图片描述

data.to_csv("new_share_20200821.csv")
  • 0
    点赞
  • 1
    收藏
    觉得还不错? 一键收藏
  • 0
    评论

“相关推荐”对你有帮助么?

  • 非常没帮助
  • 没帮助
  • 一般
  • 有帮助
  • 非常有帮助
提交
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值