Fixing incomplete data display in PyCharm (pandas)
Tags: python data analysis, python, data analysis, probability theory
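I recently watched two lessons on data analysis, and the most basic and most important concepts in them were support, confidence, and lift. When I tried to print the lift results, though, I ran into some trouble!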
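For readers who have not met these three measures, here is a minimal sketch of how they are usually defined for a rule "A implies B". The transactions and the helper names `support`, `confidence`, and `lift` are made up for illustration and are not taken from the course.

```python
# Hypothetical transactions, invented for illustration only.
transactions = [
    {"Beer", "Chips"},
    {"Beer", "Chips", "Onion"},
    {"Beer"},
    {"Chips"},
    {"Onion"},
]


def support(items):
    """Fraction of transactions that contain every item in `items`."""
    items = set(items)
    return sum(items <= t for t in transactions) / len(transactions)


def confidence(antecedent, consequent):
    """Among transactions containing the antecedent, the share that also contain the consequent."""
    both = set(antecedent) | set(consequent)
    return support(both) / support(antecedent)


def lift(antecedent, consequent):
    """Confidence divided by the consequent's own support; above 1 suggests a positive association."""
    return confidence(antecedent, consequent) / support(consequent)


print(support({"Beer", "Chips"}))
print(confidence({"Beer"}, {"Chips"}))
print(lift({"Beer"}, {"Chips"}))
```

A lift close to 1 would suggest the two items appear together about as often as chance alone would predict.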
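The teacher demonstrated everything in Jupyter Notebook, while I was using PyCharm (which, as it turns out, has nothing to do with the editor). When I printed the lift results, I found that a lot of the data would not print at all! All I got in its place was a run of ellipses... I was puzzled: what on earth was causing this?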
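A likely cause, assuming default pandas settings, is that pandas limits how many columns and how much width it prints, and replaces whatever does not fit with "...". The sketch below uses a hypothetical 30-column frame (not the course data) to show the effect of the `display.max_columns` and `display.width` options.

```python
import pandas as pd

# Hypothetical wide frame, invented for illustration only.
wide = pd.DataFrame([[0] * 30], columns=[f"col{i}" for i in range(30)])

# With a column limit in place, the middle columns collapse into "...".
with pd.option_context("display.max_columns", 5, "display.width", 80):
    truncated_text = repr(wide)

# None removes the column limit; a large width keeps each row on one line.
with pd.option_context("display.max_columns", None, "display.width", 1000):
    full_text = repr(wide)

print(truncated_text)
print(full_text)
```

To change the behaviour for a whole script rather than a single block, the same options can be set once near the top with `pd.set_option("display.max_columns", None)` and `pd.set_option("display.width", 1000)`.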
import pandas as pd

# Six shopping baskets, each a list of item names
shopping_basket = {
    'ID': [1, 2, 3, 4, 5, 6],
    'Basket': [
        ['Onion', 'Beer', 'Chicken', 'Drink', 'Burger', 'Chips', 'Disper'],
        ['Onion', 'Beer', 'Chicken', 'Burger', 'Chips'],
        ['Onion', 'Chicken', 'Drink', 'Burger', 'Chips', 'Disper'],
        ['Onion', 'Chicken', 'Drink'],
        ['Beer', 'Chicken', 'Drink', 'Burger', 'Chips', 'Disper'],
        ['Drink', 'Burger', 'Chips', 'Disper']
    ]
}
retail = pd.DataFrame(shopping_basket)
print(retail)
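The baskets in this dataset are made up of strings, so they need to be converted into a numeric encoding.
This uses the drop method, which removes rows by default (axis=0); passing axis=1 removes a column instead.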
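To make the row-versus-column behaviour concrete, here is a small sketch on a hypothetical three-row frame (the names `df`, `without_first_row`, `without_basket`, and `same_result` are illustrative only).

```python
import pandas as pd

# Hypothetical frame, used only to illustrate the axis argument.
df = pd.DataFrame({"ID": [1, 2, 3], "Basket": ["a", "b", "c"]})

# Default axis=0: the label is looked up in the row index, so this removes row 0.
without_first_row = df.drop(0)

# axis=1 looks the label up among the column names instead.
without_basket = df.drop("Basket", axis=1)

# columns=... is an equivalent, more explicit spelling.
same_result = df.drop(columns="Basket")

print(without_first_row)
print(without_basket)
```

Note that recent pandas versions expect `axis` to be passed by keyword, so `drop('Basket', axis=1)` is the safer spelling than `drop('Basket', 1)`.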
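# Keep only the ID column; axis=1 drops a column rather than a row
retail_id = retail.drop('Basket', axis=1)
print(retail_id)
# Join each basket's items into one comma-separated string
retail_Basket = retail.Basket.str.join(',')
print(retail.Basket)
print(retail.Basket.str)
print(retail_Basket)
# Split on commas and one-hot encode: one 0/1 column per item
retail_Basket = retail_Basket.str.get_dummies(',')
print(retail_Basket)
# Attach the encoded item columns back to the IDs
retail = retail_id.join(retail_Basket)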
pri