apriori算法代码python_Apriori算法原理及Python代码

最新推荐文章于 2024-05-11 02:50:11 发布

欧乃诗派

最新推荐文章于 2024-05-11 02:50:11 发布

阅读量3k

点赞数

文章标签： apriori算法代码python

版权声明：本文为博主原创文章，遵循 CC 4.0 BY-SA 版权协议，转载请附上原文出处链接和本声明。

本文链接：https://blog.csdn.net/weixin_42514433/article/details/113891115

版权

一、Apriori算法原理

参考：Python --深入浅出Apriori关联分析算法(一)www.cnblogs.com

二、在Python中使用Apriori算法

查看Apriori算法的帮助文档：

from mlxtend.frequent_patterns import apriori

help(apriori)

Help on function apriori in module mlxtend.frequent_patterns.apriori:

apriori(df, min_support=0.5, use_colnames=False, max_len=None, verbose=0, low_memory=False)

Get frequent itemsets from a one-hot DataFrame

Parameters

-----------

df : pandas DataFrame

pandas DataFrame the encoded format. Also supports

DataFrames with sparse data;

Please note that the old pandas SparseDataFrame format

is no longer supported in mlxtend >= 0.17.2.

The allowed values are either 0/1 or True/False.

For example,

#apriori算法对输入数据类型有特殊要求！

#需要是数据框格式，并且数据要进行one-hot编码转换，转换后商品名称为列名，值为True或False。

#每一行记录代表一个顾客一次购物记录。

#第0条记录为[Apple,Beer,Chicken,Rice]，第1条记录为[Apple,Beer,Rice]，以此类推。

```

Apple Bananas Beer Chicken Milk Rice

0 True False True True False True

1 True False True False False True

2 True False True False False False

3 True True False False False False

4 False False True True True True

5 False False True False True True

6 False False True False True False

7 True True False False False False

```

min_support : float (default: 0.5)#最小支持度

A float between 0 and 1 for minumum support of the itemsets returned.

The support is computed as the fraction

`transactions_where_item(s)_occur / total_transactions`.

use_colnames : bool (default: False)

#设置为True，则返回的关联规则、频繁项集会使用商品名称，而不是商品所在列的索引值

If `True`, uses the DataFrames' column names in the returned DataFrame

instead of column indices.

max_len : int (default: None)

Maximum length of the itemsets generated. If `None` (default) all

possible itemsets lengths (under the apriori condition) are evaluated.

verbose : int (default: 0)

Shows the number of iterations if >= 1 and `low_memory` is `True`. If

>=1 and `low_memory` is `False`, shows the number of combinations.

low_memory : bool (default: False)

If `True`, uses an iterator to search for combinations above

`min_support`.

Note that while `low_memory=True` should only be used for large dataset

if memory resources are limited, because this implementation is approx.

3-6x slower than the default.

Returns

-----------

pandas DataFrame with columns ['support', 'itemsets'] of all itemsets

that are >= `min_support` and < than `max_len`

(if `max_len` is not None).

Each itemset in the 'itemsets' column is of type `frozenset`,

which is a Python built-in type that behaves similarly to

sets except that it is immutable.

练习数据集：

提取码: 6mbg

部分数据截图：

导入数据：

import pandas as pd

path = 'C:\\Users\\Cara\\Desktop\\store_data.csv'

records = pd.read_csv(path,header=None,encoding='utf-8')

print(records)

结果如下：

使用TransactionEncoder对交易数据进行one-hot编码：

先查看TransactionEncoder的帮助文档：

from mlxtend.preprocessing import TransactionEncoder

... help(TransactionEncoder)

...

Help on class TransactionEncoder in module mlxtend.preprocessing.transactionencoder:

class TransactionEncoder(sklearn.base.BaseEstimator, sklearn.base.TransformerMixin)

| Encoder class for transaction data in Python lists

|

| Parameters

| ------------<

最低0.47元/天解锁文章

关注

0
点赞
踩
10

收藏

觉得还不错? 一键收藏
0
评论
apriori算法代码python_Apriori算法原理及Python代码

一、Apriori算法原理参考：Python --深入浅出Apriori关联分析算法(一)www.cnblogs.com二、在Python中使用Apriori算法查看Apriori算法的帮助文档：from mlxtend.frequent_patterns import apriorihelp(apriori)Help on function apriori in module mlxtend....
复制链接

扫一扫

评论

被折叠的条评论为什么被折叠?

到【灌水乐园】发言

查看更多评论

添加红包

成就一亿技术人!

hope_wisdom

发出的红包

实付元

使用余额支付

点击重新获取

扫码支付

钱包余额 0

抵扣说明：

1.余额是钱包充值的虚拟货币，按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载，可以购买VIP、付费专栏及课程。