任务2:论⽂文作者统计
2.1 任务说明
- 任务主题:论⽂文作者统计,统计所有论⽂文作者出现频率Top10的姓名;
- 任务内容:论⽂文作者的统计、使⽤用 Pandas 读取数据并使⽤用字符串操作;
- 任务成果:学习 Pandas 的字符串串操作;
2.2 数据处理理步骤
在原始arxiv数据集中论⽂文作者authors 字段是⼀个字符串格式,其中每个作者使用逗号进行分隔,所
以我们⾸先需要完成以下步骤:
- 使⽤用逗号对作者进⾏切分;
- 剔除单个作者中⾮常规的字符;
具体操作可以参考以下例子:
'''
C. Bal\\'azs, E. L. Berger, P. M. Nadolsky, C.-P. Yuan
# 切分为,其中\\为转义符
C. Ba'lazs
E. L. Berger
P. M. Nadolsky
C.-P. Yuan
'''
#当然在原始数据集中authors_parsed 字段已经帮我们处理理好了了作者信息,可以直接使⽤用该字段完成后续统计。
"\nC. Bal\\'azs, E. L. Berger, P. M. Nadolsky, C.-P. Yuan\n# 切分为,其中\\为转义符\nC. Ba'lazs\nE. L. Berger\nP. M. Nadolsky\nC.-P. Yuan\n"
2.3 字符串串处理理
在Python中字符串是最常用的数据类型,可以使用引号('或")来创建字符串串。Python中所有的字符都使
用字符串存储,可以使⽤方括号来截取字符串,如下实例:
var1 = 'Hello Datawhale!'
var2 = "Python Everwhere!"
print("var1[-10:]: ", var1[-10:])
print("var2[1:5]: ", var2[0:7])
var1[-10:]: Datawhale!
var2[1:5]: Python
Python中还内置了很多内置函数,非常方便使用:
- string.capitalize() 把字符串的第一个字符大写
- string.isalpha() 如果 string 至少有一个字符并且所有字符都是字母则返回 True,否则返回False
- string.title() 每个单词首字母大写
- string.upper()转换 string 中的小写字母为大写
2.4 具体代码实现以及讲解
2.4.1 数据读取
import os
os.getcwd()
'C:\\Users\\Administrator\\Desktop\\datawhale\\数据分析之学术前沿分析'
data = []
import pandas as pd
import json
with open('./arxiv-metadata-oai-2019.json/arxiv-metadata-oai-2019.json','r' ) as f:
for idx, line in enumerate(f):
d = json.loads(line)
d = {"authors": d["authors"],'categories': d['categories'],'authors_parsed': d['authors_parsed'] }
data.append(d)
data = pd.DataFrame(data)
# 为了方便处理数据,我们只选择了三个字段进行读取。
2.4.2 数据统计
- 统计所有作者姓名出现频率的Top10;
- 统计所有作者姓(姓名最后一个单词)的出现频率的Top10;
- 统计所有作者姓第⼀个字符的评率;
为了了节约计算时间,下面选择部分类别下的论文进行处理理:
# 选择类别为cs.CV下⾯面的论⽂文
data2 = data[data['categories'].apply(lambda x: 'cs.CV' in x)]
# 拼接所有作者
data2
authors | categories | authors_parsed | |
---|---|---|---|
531 | Mahesh Pal | cs.NE cs.CV | [[Pal, Mahesh, ]] |
1408 | Serguei A. Mokhov, Stephen Sinclair, Ian Cl\'e... | cs.SD cs.CL cs.CV cs.MM cs.NE | [[Mokhov, Serguei A., , for the MARF R&D Group... |
3231 | Chris Aholt, Bernd Sturmfels, Rekha Thomas | math.AG cs.CV | [[Aholt, Chris, ], [Sturmfels, Bernd, ], [Thom... |
4120 | Jos\'e I. Ronda, Antonio Vald\'es and Guillerm... | cs.CV | [[Ronda, José I., ], [Valdés, Antonio, ], [Gal... |
4378 | Tanaya Guha and Rabab K. Ward | cs.CV | [[Guha, Tanaya, ], [Ward, Rabab K., ]] |
... | ... | ... | ... |
167912 | Zilong Ji, Xiaolong Zou, Tiejun Huang, Si Wu | cs.CV cs.LG | [[Ji, Zilong, ], [Zou, Xiaolong, ], [Huang, Ti... |
167913 | Tristan Sylvain, Linda Petrini, Devon Hjelm | cs.CV cs.LG | [[Sylvain, Tristan, ], [Petrini, Linda, ], [Hj... |
167914 | Jonathan Ho, Nal Kalchbrenner, Dirk Weissenbor... | cs.CV | [[Ho, Jonathan, ], [Kalchbrenner, Nal, ], [Wei... |
167918 | Chia-Mu Yu, Ching-Tang Chang, Yen-Wu Ti | cs.CV | [[Yu, Chia-Mu, ], [Chang, Ching-Tang, ], [Ti, ... |
167964 | Dian Chen and Brady Zhou and Vladlen Koltun an... | cs.RO cs.AI cs.CV cs.LG | [[Chen, Dian, ], [Zhou, Brady, ], [Koltun, Vla... |
11168 rows × 3 columns
all_authors = sum(data2['authors_parsed'], [])
# 处理理完成后all_authors 变成了了所有⼀个list,其中每个元素为⼀个作者的姓名。我们⾸先来完成姓名频率的统计。
all_authors
[['Pal', 'Mahesh', ''],
['Mokhov', 'Serguei A.', '', 'for the MARF R&D Group'],
['Sinclair', 'Stephen', '', 'for the MARF R&D Group'],
['Clément', 'Ian', '', 'for the MARF R&D Group'],
['Nicolacopoulos', 'Dimitrios', '', 'for the MARF R&D Group'],
['Aholt', 'Chris', ''],
['Sturmfels', 'Bernd', ''],
['Thomas', 'Rekha', ''],
['Ronda', 'José I.', ''],
['Valdés', 'Antonio', ''],
['Gallego', 'Guillermo', ''],
['Guha', 'Tanaya', ''],
['Ward', 'Rabab K.', ''],
['Olaizola', 'Igor G.', ''],
['Quartulli', 'Marco', ''],
['Florez', 'Julian', ''],
['Sierra', 'Basilio', ''],
['Xie', 'Xiaohua', ''],
['Xu', 'Kai', ''],
['Mitra', 'Niloy J.', ''],
['Cohen-Or', 'Daniel', ''],
['Chen', 'Baoquan', ''],
['Guha', 'Tanaya', ''],
['Nezhadarya', 'Ehsan', ''],
['Ward', 'Rabab K', ''],
['Gao', 'Fei', ''],
['Tao', 'Dacheng', ''],
['Gao', 'Xinbo', ''],
['Li', 'Xuelong', ''],
['Sun', 'Yuli', ''],
['Tao', 'Jinxu', ''],
['Liu', 'Conggui', ''],
['Sun', 'Yuli', ''],
['Tao', 'Jinxu', ''],
['Soorma', 'Neha', '', 'M.TECH'],
['Singh',
'Jaikaran',
'',
'Department of Electronics and Communication, SSSIST, Sehore,\n M.P. India'],
['Tiwari',
'Mukesh',
'',
'Department of Electronics and Communication, SSSIST, Sehore,\n M.P. India'],
['Poling', 'Bryan', ''],
['Lerman', 'Gilad', ''],
['Szlam', 'Arthur', ''],
['Chung', 'Moo K.', ''],
['Hanson', 'Jamie L.', ''],
['Ye', 'Jieping', ''],
['Davidson', 'Richard J.', ''],
['Pollak', 'Seth D.', ''],
['Li', 'Junhua', ''],
['Struzik', 'Zbigniew', ''],
['Zhang', 'Liqing', ''],
['Cichocki', 'Andrzej', ''],
['Gilani', 'Syed Zulqarnain', ''],
['Mian', 'Ajmal', ''],
['Shafait', 'Faisal', ''],
['Reid', 'Ian', ''],
['Li', 'Junhua', ''],
['Li', 'Chao', ''],
['Cichocki', 'Andrzej', ''],
['van Gennip', 'Yves', ''],
['Athavale', 'Prashant', ''],
['Gilles', 'Jérôme', ''],
['Choksi', 'Rustum', ''],
['Vitanyi', 'P. M. B.', '', 'CWI and University of Amsterdam'],
['Vitale', 'Jonathan', ''],
['Williams', 'Mary-Anne', ''],
['Johnston', 'Benjamin', ''],
['Boccignone', 'Giuseppe', ''],
['Borji', 'Ali', ''],
['Cheng', 'Ming-Ming', ''],
['Hou', 'Qibin', ''],
['Jiang', 'Huaizu', ''],
['Li', 'Jia', ''],
['Strauß', 'Tobias', '', 'for the University of Rostock - CITlab'],
['Grüning', 'Tobias', '', 'for the University of Rostock - CITlab'],
['Leifert', 'Gundram', '', 'for the University of Rostock - CITlab'],
['Labahn', 'Roger', '', 'for the University of Rostock - CITlab'],
['Leifert', 'Gundram', '', 'for the University of Rostock - CITlab'],
['Grüning', 'Tobias', '', 'for the University of Rostock - CITlab'],
['Strauß', 'Tobias', '', 'for the University of Rostock - CITlab'],
['Labahn', 'Roger', '', 'for the University of Rostock - CITlab'],
['Cohen', 'Taco S.', ''],
['Welling', 'Max', ''],
['Torres', 'Wuilian', ''],
['Rueda-Toicen', 'Antonio', ''],
['Melo', 'E. F.', ''],
['de Oliveira', 'H. M.', ''],
['Oh', 'Tae-Hyun', ''],
['Tai', 'Yu-Wing', ''],
['Bazin', 'Jean-Charles', ''],
['Kim', 'Hyeongwoo', ''],
['Kweon', 'In So', ''],
['Li', 'Xiangru', ''],
['Lu', 'Yu', ''],
['Comte', 'Georges', ''],
['Luo', 'Ali', ''],
['Zhao', 'Yongheng', ''],
['Wang', 'Yongjun', ''],
['Lai', 'Hanjiang', ''],
['Pan', 'Yan', ''],
['Liu', 'Ye', ''],
['Yan', 'Shuicheng', ''],
['Felzenszwalb', 'Pedro F.', ''],
['Svaiter', 'Benar F.', ''],
['Arrigoni', 'Federica', ''],
['Fusiello', 'Andrea', ''],
['Rossi', 'Beatrice', ''],
['Fragneto', 'Pasqualina', ''],
['Mandal', 'Subhamoy', ''],
['Sudarshan', 'Viswanath Pamulakanty', ''],
['Nagaraj', 'Yeshaswini', ''],
['Ben', 'Xose Luis Dean', ''],
['Razansky', 'Daniel', ''],
['Boccignone', 'Giuseppe', ''],
['Parizi', 'Sobhan Naderi', ''],
['He', 'Kun', ''],
['Aghajani', 'Reza', ''],
['Sclaroff', 'Stan', ''],
['Felzenszwalb', 'Pedro', ''],
['Isikdogan', 'F.', ''],
['Bovik', 'A. C.', ''],
['Passalacqua', 'P.', ''],
['Bohi', 'Amine', ''],
['Prandi', 'Dario', ''],
['Guis', 'Vincente', ''],
['Bouchara', 'Frédéric', ''],
['Gauthier', 'Jean-Paul', ''],
['Hafemann', 'Luiz G.', ''],
['Sabourin', 'Robert', ''],
['Oliveira', 'Luiz S.', ''],
['Oh', 'Tae-Hyun', ''],
['Matsushita', 'Yasuyuki', ''],
['Tai', 'Yu-Wing', ''],
['Kweon', 'In So', ''],
['Tsakiris', 'Manolis C.', ''],
['Vidal', 'Rene', ''],
['Tsakiris', 'Manolis C.', ''],
['Vidal', 'Rene', ''],
['Mandal', 'Subhamoy', ''],
['Deán-Ben', 'Xosé Luís', ''],
['Razansky', 'Daniel', ''],
['Palmieri', 'Luigi', ''],
['Rudenko', 'Andrey', ''],
['Arras', 'Kai O.', ''],
['Ginosar', 'Shiry', ''],
['Rakelly', 'Kate', ''],
['Sachs', 'Sarah', ''],
['Yin', 'Brian', ''],
['Lee', 'Crystal', ''],
['Krahenbuhl', 'Philipp', ''],
['Efros', 'Alexei A.', ''],
['McClure', 'Patrick', ''],
['Kriegeskorte', 'Nikolaus', ''],
['Fu', 'Yanwei', ''],
['Huang', 'De-An', ''],
['Sigal', 'Leonid', ''],
['Huang', 'Shaoli', ''],
['Xu', 'Zhe', ''],
['Tao', 'Dacheng', ''],
['Zhang', 'Ya', ''],
['Khosravi', 'Mohammad Reza', ''],
['Sharif-Yazd', 'Mohammad', ''],
['Moghimi', 'Mohammad Kazem', ''],
['Keshavarz', 'Ahmad', ''],
['Rostami', 'Habib', ''],
['Mansouri', 'Suleiman', ''],
['Huttunen', 'Heikki', ''],
['Yancheshmeh', 'Fatemeh Shokrollahi', ''],
['Chen', 'Ke', ''],
['Iyer', 'Rahul Radhakrishnan', ''],
['Parekh', 'Sanjeel', ''],
['Mohandoss', 'Vikas', ''],
['Ramsurat', 'Anush', ''],
['Raj', 'Bhiksha', ''],
['Singh', 'Rita', ''],
['Sudarshan', 'Viswanath P', ''],
['Weiser', 'Tobias', ''],
['Chintala', 'Phalgun', ''],
['Mandal', 'Subhamoy', ''],
['Dutta', 'Rahul', ''],
['Gaya', 'Joel D. O.', ''],
['Codevilla', 'Felipe', ''],
['Duarte', 'Amanda C.', ''],
['Drews-Jr', 'Paulo L.', ''],
['Botelho', 'Silvia S.', ''],
['Abdulkhaev', 'Alisher', ''],
['Yilmaz', 'Ozgur', ''],
['Liu', 'Fuqiang', ''],
['Bi', 'Fukun', ''],
['Chen', 'Liang', ''],
['Markuš', 'Nenad', ''],
['Pandžić', 'Igor S.', ''],
['Ahlberg', 'Jörgen', ''],
['Granstrom', 'Karl', ''],
['Baum', 'Marcus', ''],
['Reuter', 'Stephan', ''],
['Savinov', 'Nikolay', ''],
['Haene', 'Christian', ''],
['Ladicky', 'Lubor', ''],
['Pollefeys', 'Marc', ''],
['Ponti', 'Moacir', ''],
['Riva', 'Mateus', ''],
['Barina', 'David', ''],
['Kula', 'Michal', ''],
['Zemcik', 'Pavel', ''],
['Granstrom', 'Karl', ''],
['Fatemi', 'Maryam', ''],
['Svensson', 'Lennart', ''],
['Chen', 'Yanxiang', ''],
['Hu', 'Yuxing', ''],
['Zhang', 'Luming', ''],
['Li', 'Ping', ''],
['Zhang', 'Chao', ''],
['Triki', 'Amal Rannen', ''],
['Blaschko', 'Matthew B.', ''],
['Chua', 'Jeroen', ''],
['Felzenszwalb', 'Pedro F.', ''],
['Konyushkova', 'Ksenia', ''],
['Sznitman', 'Raphael', ''],
['Fua', 'Pascal', ''],
['Zamzmi', 'Ghada', ''],
['Goldgof', 'Dmitry', ''],
['Kasturi', 'Rangachar', ''],
['Sun', 'Yu', ''],
['Ashmeade', 'Terri', ''],
['Gallego', 'Guillermo', ''],
['Lund', 'Jon E. A.', ''],
['Mueggler', 'Elias', ''],
['Rebecq', 'Henri', ''],
['Delbruck', 'Tobi', ''],
['Scaramuzza', 'Davide', ''],
['Arablouei', 'Reza', ''],
['Goan', 'Ethan', ''],
['Gensemer', 'Stephen', ''],
['Kusy', 'Branislav', ''],
['Al-Shabi', 'Mundher', ''],
['Cheah', 'Wooi Ping', ''],
['Connie', 'Tee', ''],
['Zha', 'Zhiyuan', ''],
['Wen', 'Bihan', ''],
['Zhang', 'Jiachao', ''],
['Zhou', 'Jiantao', ''],
['Zhu', 'Ce', ''],
['Spampinato', 'Concetto', ''],
['Palazzo', 'Simone', ''],
['Kavasidis', 'Isaak', ''],
['Giordano', 'Daniela', ''],
['Shah', 'Mubarak', ''],
['Souly', 'Nasim', ''],
['Aizenbud', 'Yariv', ''],
['Shkolnisky', 'Yoel', ''],
['Han', 'Lei', ''],
['Sun', 'Juanzhen', ''],
['Zhang', 'Wei', ''],
['Xiu', 'Yuanyuan', ''],
['Feng', 'Hailei', ''],
['Lin', 'Yinjing', ''],
['Coninx', 'Alexandre', ''],
['Bessière', 'Pierre', ''],
['Droulez', 'Jacques', ''],
['Clement', 'Lee', ''],
['Peretroukhin', 'Valentin', ''],
['Kelly', 'Jonathan', ''],
['Liu', 'Yi', ''],
['Liu', 'Jingwei', ''],
['Prangnell', 'Lee', ''],
['Peretroukhin', 'Valentin', ''],
['Clement', 'Lee', ''],
['Kelly', 'Jonathan', ''],
['Cai', 'Xiaohao', ''],
['Wallis', 'Christopher G. R.', ''],
['Chan', 'Jennifer Y. H.', ''],
['McEwen', 'Jason D.', ''],
['Oliveira', 'P. A. M.', ''],
['Cintra', 'R. J.', ''],
['Bayer', 'F. M.', ''],
['Kulasekera', 'S.', ''],
['Madanayake', 'A.', ''],
['Coutinho', 'V. A.', ''],
['Selvaraju', 'Ramprasaath R.', ''],
['Cogswell', 'Michael', ''],
['Das', 'Abhishek', ''],
['Vedantam', 'Ramakrishna', ''],
['Parikh', 'Devi', ''],
['Batra', 'Dhruv', ''],
['McCaig', 'Graeme', ''],
['DiPaola', 'Steve', ''],
['Gabora', 'Liane', ''],
['Liu', 'Min', ''],
['Shi', 'Yifei', ''],
['Zheng', 'Lintao', ''],
['Xu', 'Kai', ''],
['Huang', 'Hui', ''],
['Manocha', 'Dinesh', ''],
['Laga', 'Hamid', ''],
['Xie', 'Qian', ''],
['Jermyn', 'Ian H.', ''],
['Srivastava', 'Anuj', ''],
['Aksoy', 'Eren Erdal', ''],
['Orhan', 'Adil', ''],
['Woergoetter', 'Florentin', ''],
['Gewali', 'Utsav B.', ''],
['Monteiro', 'Sildomar T.', ''],
['Gewali', 'Utsav B.', ''],
['Monteiro', 'Sildomar T.', ''],
['Tang', 'Da', ''],
['Jebara', 'Tony', ''],
['McClure', 'Patrick', ''],
['Kriegeskorte', 'Nikolaus', ''],
['Mastriani', 'Mario', ''],
['Shah', 'Abhay', ''],
['Abramoff', 'Michael D.', ''],
['Wu', 'Xiaodong', ''],
['Zhang', 'Li', ''],
['Xiang', 'Tao', ''],
['Gong', 'Shaogang', ''],
['Iscen', 'Ahmet', ''],
['Tolias', 'Giorgos', ''],
['Avrithis', 'Yannis', ''],
['Furon', 'Teddy', ''],
['Chum', 'Ondrej', ''],
['Johnson', 'Jeremiah', ''],
['Emeršič', 'Žiga', ''],
['Štruc', 'Vitomir', ''],
['Peer', 'Peter', ''],
['Rozumnyi', 'Denys', ''],
['Kotera', 'Jan', ''],
['Sroubek', 'Filip', ''],
['Novotny', 'Lukas', ''],
['Matas', 'Jiri', ''],
['Lukežič', 'Alan', ''],
['Vojíř', 'Tomáš', ''],
['Čehovin', 'Luka', ''],
['Matas', 'Jiří', ''],
['Kristan', 'Matej', ''],
['Lu', 'Yuzhen', ''],
['Berenbaum', 'David', ''],
['Deighan', 'Dwyer', ''],
['Marlow', 'Thomas', ''],
['Lee', 'Ashley', ''],
['Frickel', 'Scott', ''],
['Howison', 'Mark', ''],
['Wu', 'Bichen', ''],
['Wan', 'Alvin', ''],
['Iandola', 'Forrest', ''],
['Jin', 'Peter H.', ''],
['Keutzer', 'Kurt', ''],
['Liu', 'Yun', ''],
['Cheng', 'Ming-Ming', ''],
['Hu', 'Xiaowei', ''],
['Wang', 'Kai', ''],
['Bai', 'Xiang', ''],
['Khoreva', 'Anna', ''],
['Perazzi', 'Federico', ''],
['Benenson', 'Rodrigo', ''],
['Schiele', 'Bernt', ''],
['Sorkine-Hornung', 'Alexander', ''],
['Wijmans', 'Erik', ''],
['Furukawa', 'Yasutaka', ''],
['Le', 'Hieu', ''],
['Yu', 'Chen-Ping', ''],
['Zelinsky', 'Gregory', ''],
['Samaras', 'Dimitris', ''],
['Dong', 'Qiulei', ''],
['Hu', 'Zhanyi', ''],
['Averbuch-Elor', 'Hadar', ''],
['Bar', 'Nadav', ''],
['Cohen-Or', 'Daniel', ''],
['Albarqouni', 'Shadi', ''],
['Fotouhi', 'Javad', ''],
['Navab', 'Nassir', ''],
['Rahimpour', 'Alireza', ''],
['Taalimi', 'Ali', ''],
['Qi', 'Hairong', ''],
['Cai', 'Deng', ''],
['Zhuang', 'Xiahai', ''],
['Connie', 'Tee', ''],
['Al-Shabi', 'Mundher', ''],
['Goh', 'Michael', ''],
['Borsoi', 'Ricardo A.', ''],
['Aya', 'Julio C. C.', ''],
['Costa', 'Guilherme H.', ''],
['Bermudez', 'José C. M.', ''],
['Barron', 'Jonathan T.', ''],
['Zhang', 'He', ''],
['Sindagi', 'Vishwanath', ''],
['Patel', 'Vishal M.', ''],
['Kortylewski', 'Adam', ''],
['Wieczorek', 'Aleksander', ''],
['Wieser', 'Mario', ''],
['Blumer', 'Clemens', ''],
['Parbhoo', 'Sonali', ''],
['Morel-Forster', 'Andreas', ''],
['Roth', 'Volker', ''],
['Vetter', 'Thomas', ''],
['Qi', 'Guo-Jun', ''],
['Dutta', 'Anjan', ''],
['Sahbi', 'Hichem', ''],
['Emeršič', 'Žiga', ''],
['Gabriel', 'Luka Lan', ''],
['Štruc', 'Vitomir', ''],
['Peer', 'Peter', ''],
['Rafegas', 'Ivet', ''],
['Vanrell', 'Maria', ''],
['Alexandre', 'Luis A.', ''],
['Arias', 'Guillem', ''],
['Minaee', 'Shervin', ''],
['Abdolrashidi', 'Amirali', ''],
['Wang', 'Yao', ''],
['Zuo', 'Xinxin', ''],
['Wang', 'Sen', ''],
['Zheng', 'Jiangbin', ''],
['Yang', 'Ruigang', ''],
['Rahmani', 'Mostafa', ''],
['Atia', 'George', ''],
['Guo', 'Hengkai', ''],
['Wang', 'Guijin', ''],
['Chen', 'Xinghao', ''],
['Zhang', 'Cairong', ''],
['Qiao', 'Fei', ''],
['Yang', 'Huazhong', ''],
['Takahashi', 'Ryo', ''],
['Matsubara', 'Takashi', ''],
['Uehara', 'Kuniaki', ''],
['Gupta', 'Saurabh', ''],
['Tolani', 'Varun', ''],
['Davidson', 'James', ''],
['Levine', 'Sergey', ''],
['Sukthankar', 'Rahul', ''],
['Malik', 'Jitendra', ''],
['Yao', 'Hantao', ''],
['Dai', 'Feng', ''],
['Zhang', 'Dongming', ''],
['Ma', 'Yike', ''],
['Zhang', 'Shiliang', ''],
['Zhang', 'Yongdong', ''],
['Tian', 'Qi', ''],
['Litjens', 'Geert', ''],
['Kooi', 'Thijs', ''],
['Bejnordi', 'Babak Ehteshami', ''],
['Setio', 'Arnaud Arindra Adiyoso', ''],
['Ciompi', 'Francesco', ''],
['Ghafoorian', 'Mohsen', ''],
['van der Laak', 'Jeroen A. W. M.', ''],
['van Ginneken', 'Bram', ''],
['Sánchez', 'Clara I.', ''],
['Zhang', 'Wei', ''],
['Hu', 'Shengnan', ''],
['Liu', 'Kan', ''],
['Zha', 'Zhengjun', ''],
['Sochor', 'Jakub', ''],
['Juránek', 'Roman', ''],
['Špaňhel', 'Jakub', ''],
['Maršík', 'Lukáš', ''],
['Široký', 'Adam', ''],
['Herout', 'Adam', ''],
['Zemčík', 'Pavel', ''],
['Mueggler', 'Elias', ''],
['Gallego', 'Guillermo', ''],
['Rebecq', 'Henri', ''],
['Scaramuzza', 'Davide', ''],
['Inoue', 'Hiroshi', ''],
['Mahbod', 'Amirreza', ''],
['Schaefer', 'Gerald', ''],
['Wang', 'Chunliang', ''],
['Ecker', 'Rupert', ''],
['Ellinger', 'Isabella', ''],
['Sochor', 'Jakub', ''],
['Špaňhel', 'Jakub', ''],
['Herout', 'Adam', ''],
['Xu', 'Sheng', ''],
['Wang', 'Ruisheng', ''],
['Zheng', 'Han', ''],
['Antonello', 'Morris', ''],
['Carraro', 'Marco', ''],
['Pierobon', 'Marco', ''],
['Menegatti', 'Emanuele', ''],
['Kawahara', 'Jeremy', ''],
['Hamarneh', 'Ghassan', ''],
['Li', 'Kun', ''],
['Yang', 'Jingyu', ''],
['Lai', 'Yu-Kun', ''],
['Guo', 'Daoliang', ''],
['Volkhonskiy', 'Denis', ''],
['Nazarov', 'Ivan', ''],
['Burnaev', 'Evgeny', ''],
['Baur', 'Christoph', ''],
['Albarqouni', 'Shadi', ''],
['Navab', 'Nassir', ''],
['Kim', 'Youngsung', ''],
['Yoo', 'ByungIn', ''],
['Kwak', 'Youngjun', ''],
['Choi', 'Changkyu', ''],
['Kim', 'Junmo', ''],
['Wu', 'Huikai', ''],
['Zheng', 'Shuai', ''],
['Zhang', 'Junge', ''],
['Huang', 'Kaiqi', ''],
['Lin', 'Yutian', ''],
['Zheng', 'Liang', ''],
['Zheng', 'Zhedong', ''],
['Wu', 'Yu', ''],
['Hu', 'Zhilan', ''],
['Yan', 'Chenggang', ''],
['Yang', 'Yi', ''],
['Zhao', 'Long', ''],
['Han', 'Fangda', ''],
['Peng', 'Xi', ''],
['Zhang', 'Xun', ''],
['Kapadia', 'Mubbasir', ''],
['Pavlovic', 'Vladimir', ''],
['Metaxas', 'Dimitris N.', ''],
['Lou', 'Jing', ''],
['Wang', 'Huan', ''],
['Chen', 'Longtao', ''],
['Xu', 'Fenglei', ''],
['Xia', 'Qingyuan', ''],
['Zhu', 'Wei', ''],
['Ren', 'Mingwu', ''],
['Khoreva', 'Anna', ''],
['Benenson', 'Rodrigo', ''],
['Ilg', 'Eddy', ''],
['Brox', 'Thomas', ''],
['Schiele', 'Bernt', ''],
['Wang', 'Zhiguang', ''],
['Yang', 'Jianbo', ''],
['Cannings', 'Timothy I.', ''],
['Berrett', 'Thomas B.', ''],
['Samworth', 'Richard J.', ''],
['Avola', 'Danilo', ''],
['Foresti', 'Gian Luca', ''],
['Martinel', 'Niki', ''],
['Pannone', 'Daniele', ''],
['Piciarelli', 'Claudio', ''],
['Shrikumar', 'Avanti', ''],
['Greenside', 'Peyton', ''],
['Kundaje', 'Anshul', ''],
['Wu', 'Zuxuan', ''],
['Davis', 'Larry S.', ''],
['Sigal', 'Leonid', ''],
['Meinhardt', 'Tim', ''],
['Moeller', 'Michael', ''],
['Hazirbas', 'Caner', ''],
['Cremers', 'Daniel', ''],
['Elgendy', 'Omar A.', ''],
['Chan', 'Stanley H.', ''],
['Arnold', 'Lukas On', '', 'for the SoLid collaboration'],
['Janai', 'Joel', ''],
['Güney', 'Fatma', ''],
['Behl', 'Aseem', ''],
['Geiger', 'Andreas', ''],
['Deniz', 'Cem M.', ''],
['Xiang', 'Siyuan', ''],
['Hallyburton', 'Spencer', ''],
['Welbeck', 'Arakua', ''],
['Babb', 'James S.', ''],
['Honig', 'Stephen', ''],
['Cho', 'Kyunghyun', ''],
['Chang', 'Gregory', ''],
['Carvalho', 'João', ''],
['Marques', 'Manuel', ''],
['Costeira', 'João P.', ''],
['Xu', 'Minmin', ''],
['Xu', 'Siyu', ''],
['Zhu', 'Jihua', ''],
['Li', 'Yaochen', ''],
['Wang', 'Jun', ''],
['Lu', 'Huimin', ''],
['Brogan', 'Joel', ''],
['Bestagini', 'Paolo', ''],
['Bharati', 'Aparna', ''],
['Pinto', 'Allan', ''],
['Moreira', 'Daniel', ''],
['Bowyer', 'Kevin', ''],
['Flynn', 'Patrick', ''],
['Rocha', 'Anderson', ''],
['Scheirer', 'Walter', ''],
['Pandey', 'Gaurav', ''],
['Dukkipati', 'Ambedkar', ''],
['Wang', 'Xiaosong', ''],
['Peng', 'Yifan', ''],
['Lu', 'Le', ''],
['Lu', 'Zhiyong', ''],
['Bagheri', 'Mohammadhadi', ''],
['Summers', 'Ronald M.', ''],
['Borkar', 'Tejas', ''],
['Karam', 'Lina', ''],
['Harangi', 'Balazs', ''],
['Bae', 'Sung-Ho', ''],
['Elgharib', 'Mohamed', ''],
['Hefeeda', 'Mohamed', ''],
['Matusik', 'Wojciech', ''],
['Zhang', 'Jing', ''],
['Li', 'Wanqing', ''],
['Ogunbona', 'Philip', ''],
['Xu', 'Dong', ''],
['Lu', 'Yao', ''],
['Yang', 'Zhirong', ''],
['Kannala', 'Juho', ''],
['Kaski', 'Samuel', ''],
['Wang', 'Zhengyang', ''],
['Yuan', 'Hao', ''],
['Ji', 'Shuiwang', ''],
['Dong', 'Xingping', ''],
['Shen', 'Jianbing', ''],
['Wu', 'Dongming', ''],
['Guo', 'Kan', ''],
['Jin', 'Xiaogang', ''],
['Porikli', 'Fatih', ''],
['Krishna', 'Onkar', ''],
['Aizawa', 'Kiyoharu', ''],
['Helo', 'Andrea', ''],
['Pia', 'Rama', ''],
['Goldman', 'Eran', ''],
['Goldberger', 'Jacob', ''],
['Dong', 'Xin', ''],
['Chen', 'Shangyu', ''],
['Pan', 'Sinno Jialin', ''],
['Khalili', 'A. M.', ''],
['Kiran', 'B Ravi', ''],
['Das', 'Arindam', ''],
['Yogamani', 'Senthil', ''],
['Herring', 'James', ''],
['Nagy', 'James', ''],
['Ruthotto', 'Lars', ''],
['Deza', 'Arturo', ''],
['Jonnalagadda', 'Aditya', ''],
['Eckstein', 'Miguel', ''],
['Veshki', 'Farshad G.', ''],
['Vorobyov', 'Sergiy A.', ''],
['Baisa', 'Nathanael L.', ''],
['Bhowmik', 'Deepayan', ''],
['Wallace', 'Andrew', ''],
['Baisa', 'Nathanael L.', ''],
['Wallace', 'Andrew', ''],
['Soleymani', 'Roghayeh', ''],
['Granger', 'Eric', ''],
['Fumera', 'Giorgio', ''],
['Tsakiris', 'Manolis C.', ''],
['Vidal', 'Rene', ''],
['Si-Yao', 'Li', ''],
['Ren', 'Dongwei', ''],
['Yin', 'Qian', ''],
['Wu', 'Jiqing', ''],
['Huang', 'Zhiwu', ''],
['Acharya', 'Dinesh', ''],
['Li', 'Wen', ''],
['Thoma', 'Janine', ''],
['Paudel', 'Danda Pani', ''],
['Van Gool', 'Luc', ''],
['Kragh', 'Mikkel', ''],
['Underwood', 'James', ''],
['Chen', 'Chong', ''],
['Öktem', 'Ozan', ''],
['Sun', 'Xu', ''],
['Ren', 'Xuancheng', ''],
['Ma', 'Shuming', ''],
['Wang', 'Houfeng', ''],
['Joshi', 'Sharad', ''],
['Khanna', 'Nitin', ''],
['Shao', 'Ruifeng', ''],
['Xu', 'Ning', ''],
['Geng', 'Xin', ''],
['Nagar', 'Rajendra', ''],
['Raman', 'Shanmuganathan', ''],
['Wang', 'Chaoyue', ''],
['Xu', 'Chang', ''],
['Wang', 'Chaohui', ''],
['Tao', 'Dacheng', ''],
['Zheng', 'Zhedong', ''],
['Zheng', 'Liang', ''],
['Yang', 'Yi', ''],
['Yao', 'Hantao', ''],
['Zhang', 'Shiliang', ''],
['Zhang', 'Yongdong', ''],
['Li', 'Jintao', ''],
['Tian', 'Qi', ''],
['Jund', 'Philipp', ''],
['Eitel', 'Andreas', ''],
['Abdo', 'Nichola', ''],
['Burgard', 'Wolfram', ''],
['Liao', 'Jun', ''],
['Jiang', 'Yutong', ''],
['Bian', 'Zichao', ''],
['Mahrou', 'Bahareh', ''],
['Nambiar', 'Aparna', ''],
['Magsam', 'Alexander W.', ''],
['Guo', 'Kaikai', ''],
['Cho', 'Yong Ku', ''],
['Zheng', 'Guoan', ''],
['Zeng', 'Zhiqiang', ''],
['Zhang', 'Jian', ''],
['Wang', 'Xiaodong', ''],
['Chen', 'Yuming', ''],
['Zhu', 'Chaoyang', ''],
['Yang', 'Weixin', ''],
['Lyons', 'Terry', ''],
['Ni', 'Hao', ''],
['Schmid', 'Cordelia', ''],
['Jin', 'Lianwen', ''],
['Vongkulbhisal', 'Jayakorn', ''],
['De la Torre', 'Fernando', ''],
['Costeira', 'João P.', ''],
['Guo', 'Tian', ''],
['Ji', 'Pan', ''],
['Reid', 'Ian', ''],
['Garg', 'Ravi', ''],
['Li', 'Hongdong', ''],
['Salzmann', 'Mathieu', ''],
['Aksoy', 'Yağız', ''],
['Aydın', 'Tunç Ozan', ''],
['Pollefeys', 'Marc', ''],
['Noyel', 'Guillaume', '', 'IPRI, SIGPH@iPRI'],
['Mees', 'Oier', ''],
['Eitel', 'Andreas', ''],
['Burgard', 'Wolfram', ''],
['Bojanowski', 'Piotr', ''],
['Joulin', 'Armand', ''],
['Lopez-Paz', 'David', ''],
['Szlam', 'Arthur', ''],
['Wojna', 'Zbigniew', ''],
['Ferrari', 'Vittorio', ''],
['Guadarrama', 'Sergio', ''],
['Silberman', 'Nathan', ''],
['Chen', 'Liang-Chieh', ''],
['Fathi', 'Alireza', ''],
['Uijlings', 'Jasper', ''],
['Benligiray', 'Burak', ''],
['Topal', 'Cihan', ''],
['Akinlar', 'Cuneyt', ''],
['Yu', 'Fisher', ''],
['Wang', 'Dequan', ''],
['Shelhamer', 'Evan', ''],
['Darrell', 'Trevor', ''],
['Zhao', 'Ningning', ''],
["O'Connor", 'Daniel', ''],
['Basarab', 'Adrian', ''],
['Ruan', 'Dan', ''],
['Hu', 'Peng', ''],
['Sheng', 'Ke', ''],
['Tong', 'Xin-Yi', ''],
['Xia', 'Gui-Song', ''],
['Hu', 'Fan', ''],
['Zhong', 'Yanfei', ''],
['Datcu', 'Mihai', ''],
['Zhang', 'Liangpei', ''],
['Rahimpour', 'Alireza', ''],
['Liu', 'Liu', ''],
['Taalimi', 'Ali', ''],
['Song', 'Yang', ''],
['Qi', 'Hairong', ''],
['Pontes', 'Jhony K.', ''],
['Kong', 'Chen', ''],
['Eriksson', 'Anders', ''],
['Fookes', 'Clinton', ''],
['Sridharan', 'Sridha', ''],
['Lucey', 'Simon', ''],
['Guo', 'Chunchao', ''],
['Lai', 'Jianhuang', ''],
['Xie', 'Xiaohua', ''],
['Prakash', 'Jaya', ''],
['Mandal', 'Subhamoy', ''],
['Razansky', 'Daniel', ''],
['Ntziachristos', 'Vasilis', ''],
['Xiao', 'Chang', ''],
['Zhang', 'Cheng', ''],
['Zheng', 'Changxi', ''],
['Phung', 'Manh Duong', ''],
['Hoang', 'Van Truong', ''],
['Dinh', 'Tran Hiep', ''],
['Ha', 'Quang', ''],
['Bali', 'Alexandre', ''],
['Ghiasi-Shirazi', 'Kamaledin', ''],
['Zhang', 'Chengyue', ''],
['Li', 'Zhiwei', ''],
['Cheng', 'Qing', ''],
['Li', 'Xinghua', ''],
['Shen', 'Huanfeng', ''],
['Baskin', 'Chaim', ''],
['Liss', 'Natan', ''],
['Zheltonozhskii', 'Evgenii', ''],
['Bronshtein', 'Alex M.', ''],
['Mendelson', 'Avi', ''],
['Peretroukhin', 'Valentin', ''],
['Clement', 'Lee', ''],
['Giamou', 'Matthew', ''],
['Kelly', 'Jonathan', ''],
['Zhang', 'He', ''],
['Sindagi', 'Vishwanath', ''],
['Patel', 'Vishal M.', ''],
['Lee', 'Minhyeok', ''],
['Seok', 'Junhee', ''],
['Park', 'Hyung Suk', ''],
['Lee', 'Sung Min', ''],
['Kim', 'Hwa Pyung', ''],
['Seo', 'Jin Keun', ''],
['Tixier', 'Antoine Jean-Pierre', ''],
['Nikolentzos', 'Giannis', ''],
['Meladianos', 'Polykarpos', ''],
['Vazirgiannis', 'Michalis', ''],
['Zeng', 'Yu', ''],
['Lu', 'Huchuan', ''],
['Borji', 'Ali', ''],
['Cho', 'Donghyeon', ''],
['Park', 'Jinsun', ''],
['Oh', 'Tae-Hyun', ''],
['Tai', 'Yu-Wing', ''],
['Kweon', 'In So', ''],
['Komorowski', 'Michal', ''],
['Trzcinski', 'Tomasz', ''],
['Pourkamali-Anaraki', 'Farhad', ''],
['Becker', 'Stephen', ''],
['Chen', 'Xinghao', ''],
['Wang', 'Guijin', ''],
['Guo', 'Hengkai', ''],
['Zhang', 'Cairong', ''],
['Yu', 'Zhou', ''],
['Yu', 'Jun', ''],
['Xiang', 'Chenchao', ''],
['Fan', 'Jianping', ''],
['Tao', 'Dacheng', ''],
['Zhang', 'Quanshi', ''],
['Wu', 'Ying Nian', ''],
['Zhang', 'Hao', ''],
['Zhu', 'Song-Chun', ''],
['Laloy', 'Eric', ''],
['Hérault', 'Romain', ''],
['Jacques', 'Diederik', ''],
['Linde', 'Niklas', ''],
['Lobos', 'Rodrigo A.', ''],
['Kim', 'Tae Hyung', ''],
['Hoge', 'W. Scott', ''],
['Haldar', 'Justin P.', ''],
['Mokari', 'Mozhgan', ''],
['Mohammadzade', 'Hoda', ''],
['Ghojogh', 'Benyamin', ''],
['Yi', 'Xin', ''],
['Babyn', 'Paul', ''],
['Yao', 'Yazhou', ''],
['Zhang', 'Jian', ''],
['Shen', 'Fumin', ''],
['Liu', 'Li', ''],
['Zhu', 'Fan', ''],
['Zhang', 'Dongxiang', ''],
['Shen', 'Heng-Tao', ''],
['Bas', 'Anil', ''],
['Smith', 'William A. P.', ''],
['Emeršič', 'Žiga', ''],
['Štepec', 'Dejan', ''],
['Štruc', 'Vitomir', ''],
['Peer', 'Peter', ''],
['George', 'Anjith', ''],
['Ahmad', 'Adil', ''],
['Omar', 'Elshibani', ''],
['Boult', 'Terrance E.', ''],
['Safdari', 'Reza', ''],
['Zhou', 'Yuxiang', ''],
['Zafeiriou', 'Stefanos', ''],
['Yaman', 'Dogucan', ''],
['Eyiokur', 'Fevziye I.', ''],
['Ekenel', 'Hazim K.', ''],
['Sakaridis', 'Christos', ''],
['Dai', 'Dengxin', ''],
['Van Gool', 'Luc', ''],
['Nguyen', 'Anh', ''],
['Do', 'Thanh-Toan', ''],
['Caldwell', 'Darwin G.', ''],
['Tsagarakis', 'Nikos G.', ''],
['Moolekamp', 'Fred', ''],
['Melchior', 'Peter', ''],
['Shen', 'Li', ''],
['Margolies', 'Laurie R.', ''],
['Rothstein', 'Joseph H.', ''],
['Fluder', 'Eugene', ''],
['McBride', 'Russell B.', ''],
['Sieh', 'Weiva', ''],
['Datta', 'Shounak', ''],
['Nag', 'Sayak', ''],
['Das', 'Swagatam', ''],
['Helber', 'Patrick', ''],
['Bischke', 'Benjamin', ''],
['Dengel', 'Andreas', ''],
['Borth', 'Damian', ''],
['He', 'Xiangteng', ''],
['Peng', 'Yuxin', ''],
['Cangea', 'Cătălina', ''],
['Veličković', 'Petar', ''],
['Liò', 'Pietro', ''],
['Wu', 'Cinna', ''],
['Tygert', 'Mark', ''],
['LeCun', 'Yann', ''],
['Garcia', 'Noa', ''],
['Vogiatzis', 'George', ''],
['Hu', 'Jie', ''],
['Shen', 'Li', ''],
['Albanie', 'Samuel', ''],
['Sun', 'Gang', ''],
['Wu', 'Enhua', ''],
['Anwar', 'Syed Muhammad', ''],
['Majid', 'Muhammad', ''],
['Qayyum', 'Adnan', ''],
['Awais', 'Muhammad', ''],
['Alnowami', 'Majdi', ''],
['Khan', 'Muhammad Khurram', ''],
['Wang', 'Qian', ''],
['Chen', 'Ke', ''],
['Lesort', 'Timothée', ''],
['Seurin', 'Mathieu', ''],
['Li', 'Xinrui', ''],
['Díaz-Rodríguez', 'Natalia', ''],
['Filliat', 'David', ''],
['Jiang', 'Lai', ''],
['Xu', 'Mai', ''],
['Wang', 'Zulin', ''],
['Rangesh', 'Akshay', ''],
['Yuen', 'Kevan', ''],
['Satzoda', 'Ravi Kumar', ''],
['Rajaram', 'Rakesh Nattoji', ''],
['Gunaratne', 'Pujitha', ''],
['Trivedi', 'Mohan M.', ''],
['Fong', 'Chamberlain', ''],
['Jha', 'Ranjeet Ranjan', ''],
['Thapar', 'Daksh', ''],
['Patil', 'Shreyas Malakarjun', ''],
['Nigam', 'Aditya', ''],
['Bhunia', 'Ankan Kumar', ''],
['Alaei', 'Alireza', ''],
['Roy', 'Partha Pratim', ''],
['Corbière', 'Charles', ''],
['Ben-Younes', 'Hedi', ''],
['Ramé', 'Alexandre', ''],
['Ollion', 'Charles', ''],
['Dubey', 'Shiv Ram', ''],
['Lerman', 'Gilad', ''],
['Shi', 'Yunpeng', ''],
['Zhang', 'Teng', ''],
['Pasquale', 'Giulia', ''],
['Ciliberto', 'Carlo', ''],
['Odone', 'Francesca', ''],
['Rosasco', 'Lorenzo', ''],
['Natale', 'Lorenzo', ''],
['Gong', 'Sixue', ''],
['Boddeti', 'Vishnu Naresh', ''],
['Jain', 'Anil K.', ''],
['Sanzari', 'Marta', ''],
['Ntouskos', 'Valsamis', ''],
['Pirri', 'Fiora', ''],
['Duran', 'Joan', ''],
['Buades', 'Antoni', ''],
['Di', 'Xing', ''],
['Sindagi', 'Vishwanath A.', ''],
['Patel', 'Vishal M.', ''],
['Xu', 'Mai', ''],
['Li', 'Tianyi', ''],
['Wang', 'Zulin', ''],
['Deng', 'Xin', ''],
['Yang', 'Ren', ''],
['Guan', 'Zhenyu', ''],
['Dar', 'Salman Ul Hassan', ''],
['Özbey', 'Muzaffer', ''],
['Çatlı', 'Ahmet Burak', ''],
['Çukur', 'Tolga', ''],
['Vidal', 'Rosaura G.', ''],
['Banerjee', 'Sreya', ''],
['Grm', 'Klemen', ''],
['Struc', 'Vitomir', ''],
['Scheirer', 'Walter J.', ''],
['Shi', 'Bowen', ''],
['Livescu', 'Karen', ''],
['Shin', 'Seung Yeon', ''],
['Lee', 'Soochahn', ''],
['Yun', 'Il Dong', ''],
['Kim', 'Sun Mi', ''],
['Lee', 'Kyoung Mu', ''],
['Li', 'Yijun', ''],
['Huang', 'Jia-Bin', ''],
['Ahuja', 'Narendra', ''],
['Yang', 'Ming-Hsuan', ''],
['Tu', 'Peihan', ''],
['Tu', 'Peihan', ''],
['Bacchuwar', 'Ketan', '', 'GE Healthcare, LIGM'],
['Cousty', 'Jean', '', 'LIGM'],
['Vaillant', 'Régis', '', 'GE Healthcare'],
['Najman', 'Laurent', '', 'LIGM'],
['Thapar', 'Daksh', ''],
['Aggarwal', 'Divyansh', ''],
['Agarwal', 'Punjal', ''],
['Nigam', 'Aditya', ''],
['Jiang', 'Zutao', ''],
['Zhu', 'Jihua', ''],
['Evangelidis', 'Georgios D.', ''],
['Zhang', 'Changqing', ''],
['Pang', 'Shanmin', ''],
['Li', 'Yaochen', ''],
['Dolz', 'Jose', ''],
['Ayed', 'Ismail Ben', ''],
['Yuan', 'Jing', ''],
['Desrosiers', 'Christian', ''],
['Zhou', 'Linjun', ''],
['Cui', 'Peng', ''],
['Yang', 'Shiqiang', ''],
['Zhu', 'Wenwu', ''],
['Tian', 'Qi', ''],
...]
# 拼接所有的作者
authors_names = [' '.join(x) for x in all_authors]
authors_names = pd.DataFrame(authors_name)
authors_names
0 | |
---|---|
0 | Pal Mahesh |
1 | Mokhov Serguei A. for the MARF R&D Group |
2 | Sinclair Stephen for the MARF R&D Group |
3 | Clément Ian for the MARF R&D Group |
4 | Nicolacopoulos Dimitrios for the MARF R&D Group |
... | ... |
49139 | Ti Yen-Wu |
49140 | Chen Dian |
49141 | Zhou Brady |
49142 | Koltun Vladlen |
49143 | Krähenbühl Philipp |
49144 rows × 1 columns
# 根据作者频率绘制直⽅方图
import matplotlib.pyplot as plt
plt.figure(figsize=(10, 6))
authors_names[0].value_counts().head(10).plot(kind='barh')
<AxesSubplot:>
# 修改图配置
import matplotlib.pyplot as plt
plt.figure(figsize=(10, 6))
authors_names[0].value_counts().head(10).plot(kind='barh')
names = authors_name[0].value_counts().index.values[:10]
_ = plt.yticks(range(0, len(names)), names)
plt.ylabel('Author')
plt.xlabel('Count')
Text(0.5, 0, 'Count')
# 接下来统计姓名姓,也就是authors_parsed 字段中作者第⼀一个单词:
authors_lastnames = [x[0] for x in all_authors]
authors_lastnames = pd.DataFrame(authors_lastnames)
plt.figure(figsize=(10, 6))
authors_lastnames[0].value_counts().head(10).plot(kind = 'barh')
names = authors_lastnames[0].value_counts().index.values[:10]
_ = plt.yticks(range(0, len(names)), names)
plt.ylabel('Author')
plt.xlabel('Count')
#绘制得到的结果,从结果看出这些都是华⼈人或者中国姓⽒氏~
Text(0.5, 0, 'Count')