Rec Pangu Open-Source Recommender System Project Tutorial

Latest recommended article published on 2024-09-25 08:22:36

陈冉茉

Article link: https://blog.csdn.net/gitblog_01117/article/details/142506939

rec_pangu project URL: https://gitcode.com/gh_mirrors/re/rec_pangu

1. Project Overview

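Rec Pangu is a flexible open-source project built for recommender systems. It integrates a range of AI models, including ranking algorithms, sequence recall, multi-interest models, and graph-based techniques. The project aims to give both beginners and advanced users a platform for quickly building efficient, customized recommendation engines.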

2. Quick Start

2.1 Installation

First, clone the project repository and install it, together with its dependencies, in editable mode:

git clone https://github.com/HaSai666/rec_pangu.git
cd rec_pangu
pip install -e . --verbose
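
After installation, it can help to confirm that the packages the demo imports are visible to the current Python interpreter. The snippet below is a minimal sketch that uses only the standard library; the helper name is_installed is hypothetical and is not part of rec_pangu.

```python
import importlib.util


def is_installed(package_name: str) -> bool:
    """Return True if a top-level package can be located on the import path.

    Hypothetical helper for illustration; it only looks the package up
    and does not import or execute it.
    """
    return importlib.util.find_spec(package_name) is not None


if __name__ == "__main__":
    # These are the three top-level packages imported by the ranking demo.
    for name in ("torch", "pandas", "rec_pangu"):
        status = "found" if is_installed(name) else "MISSING"
        print(f"{name}: {status}")
```

If any package is reported as MISSING, the likely cause is that pip installed into a different environment than the one running the script.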

2.2 Ranking Task Demo

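The following is a simple example of a ranking task. Run it from the repository root so that the relative path to the sample data resolves: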

import torch
from rec_pangu.dataset import get_dataloader
from rec_pangu.models.ranking import xDeepFM
from rec_pangu.trainer import RankTrainer
import pandas as pd

if __name__ == '__main__':
    # Load the sample ranking data shipped with the repository
    df = pd.read_csv('sample_data/ranking_sample_data.csv')
    print(df.head())

    # Declare the data schema: categorical features, numeric features, and the label
    schema = {
        "sparse_cols": ['user_id', 'item_id', 'item_type', 'dayofweek', 'is_workday', 'city', 'county', 'town', 'village', 'lbs_city', 'lbs_district', 'hardware_platform', 'hardware_ischarging', 'os_type', 'network_type', 'position'],
        "dense_cols": ['item_expo_1d', 'item_expo_7d', 'item_expo_14d', 'item_expo_30d', 'item_clk_1d', 'item_clk_7d', 'item_clk_14d', 'item_clk_30d', 'use_duration'],
        "label_col": 'click'
    }

    # The demo reuses the same DataFrame for all three splits;
    # use separate splits when you need a meaningful evaluation
    train_df = df
    valid_df = df
    test_df = df

    # Device used for training and evaluation
    device = torch.device('cpu')

    # Build the dataloaders and the feature encoding dictionary
    train_loader, valid_loader, test_loader, enc_dict = get_dataloader(train_df, valid_df, test_df, schema)

    # Declare the ranking model
    model = xDeepFM(enc_dict=enc_dict)

    # Declare the trainer for a single-task setting
    trainer = RankTrainer(num_task=1)

    # Train the model
    trainer.fit(model, train_loader, valid_loader, epoch=5, lr=1e-3, device=device)

    # Save the model weights
    trainer.save_model(model, './model_ckpt')

    # Evaluate on the test set
    test_metric = trainer.evaluate_model(model, test_loader, device=device)
    print('Test metric:{}'.format(test_metric))
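
The demo above assigns the same DataFrame to train_df, valid_df, and test_df, so the reported test metric is measured on data the model has already seen. One way to produce disjoint splits is sketched below. It is not part of rec_pangu: the helper split_indices and its default fractions are assumptions made for illustration, and it relies only on the standard library.

```python
import random


def split_indices(n_rows, valid_frac=0.1, test_frac=0.1, seed=42):
    """Shuffle row positions 0..n_rows-1 and cut them into three disjoint lists.

    Hypothetical helper for illustration. Returns (train, valid, test),
    where each element is a list of integer row positions.
    """
    if n_rows < 0:
        raise ValueError("n_rows must be non-negative")
    if valid_frac < 0 or test_frac < 0 or valid_frac + test_frac >= 1:
        raise ValueError("fractions must be non-negative and sum to less than 1")

    positions = list(range(n_rows))
    # A dedicated Random instance keeps the shuffle reproducible
    # without touching the global random state.
    random.Random(seed).shuffle(positions)

    n_valid = int(n_rows * valid_frac)
    n_test = int(n_rows * test_frac)
    n_train = n_rows - n_valid - n_test

    train = positions[:n_train]
    valid = positions[n_train:n_train + n_valid]
    test = positions[n_train + n_valid:]
    return train, valid, test


if __name__ == "__main__":
    train_idx, valid_idx, test_idx = split_indices(100)
    print(len(train_idx), len(valid_idx), len(test_idx))
    # In the demo, the positions could then select rows with pandas, e.g.:
    #   train_df = df.iloc[train_idx]
    #   valid_df = df.iloc[valid_idx]
    #   test_df = df.iloc[test_idx]
```

A purely random split is the simplest choice. For click logs, splitting by time is often a better match for production conditions, but the sample schema shown here does not include an obvious timestamp column, so that option is left out.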