pydata

Title : Welcome
Author : You
Logo : True

[TITLE]

python core

# groupby(iterable[, keyfunc]) -> create an iterator which returns
# (key, sub-iterator) grouped by each value of key(value).
from itertools import goupby
[k for k, g in groupby('AAAABBBCCDAABBB')]
>>> ['A', 'B', 'C', 'D', 'A', 'B']
[list(g) for k, g in groupby('AAAABBBCCD')]
>>> [['A', 'A', 'A', 'A'], ['B', 'B', 'B'], ['C', 'C'], ['D']]

numpy

x = np.arange(9.).reshape(3, 3)
np.where(x>5)
np.where(x < 5, x, -1)               # Note: broadcasting.
array([[ 0.,  1.,  2.],
       [ 3.,  4., -1.],
       [-1., -1., -1.]])
x[x>5] # simple boolean indexing suffices if you select a subarray

pandas

path = 'ch02/usagov_bitly_data2012-03-16-1331923249.txt'
records = [json.loads(line) for line in open(path)]
from pandas import DataFrame, Series
frame = DataFrame(records)
tz_counts = frame['tz'].value_counts() # Series.value_counts
clean_tz = frame['tz'].fillna('Missing') # Series.fillna
clean_tz[clean_tz == ''] = 'Unknown'
tz_counts = clean_tz.value_counts()
%matplotlib inline
tz_counts[:10].plot(kind='barh', rot=0) # Series.plot

results = Series([x.split()[0] for x in frame.a.dropna()]) # Series.dropna
cframe = frame[frame.a.notnull()] # Series.notnull
operating_system = np.where(cframe['a'].str.contains('Windows'),'Windows', 'Not Windows') # np.where

matplotlib

%matplotlib inline in ipython

Madoko

Madoko is a fast markdown processor for writing professional articles
with a focus on simplicity and plain text readability.

  • Read the reference manual.
  • Explore the upper-right toolbox menu to discover how Markdown works.
  • Alt-Q reformats the current paragraph.

Enjoy!

  • 0
    点赞
  • 0
    收藏
    觉得还不错? 一键收藏
  • 0
    评论

“相关推荐”对你有帮助么?

  • 非常没帮助
  • 没帮助
  • 一般
  • 有帮助
  • 非常有帮助
提交
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值