Twitter for BigQuery 项目使用教程-CSDN博客

本文链接：https://blog.csdn.net/gitblog_00048/article/details/139696173

Twitter for BigQuery 项目使用教程

twitter-for-bigquery Simplest way to get Tweets into BigQuery. Uses Google Cloud & App Engine, as well as Python and D3. 项目地址: https://gitcode.com/gh_mirrors/tw/twitter-for-bigquery

1. 项目介绍

Twitter for BigQuery 是一个开源项目，旨在简化将 Twitter 数据导入 Google BigQuery 的过程。该项目利用 Google Cloud 和 App Engine，结合 Python 和 D3 技术，帮助用户快速将 Twitter 数据流式传输到 BigQuery 中，并进行简单的可视化分析。通过该项目，用户可以轻松生成可以直接在 BigQuery 界面中运行的查询，或者扩展这些查询以满足自己的应用需求。

2. 项目快速启动

2.1 环境准备

在开始之前，请确保您已经完成以下准备工作：

创建一个 Twitter 应用并获取 API 密钥和令牌。
拥有一个 Google Cloud Platform 账户。
安装 Google App Engine SDK for Python。

2.2 项目克隆

首先，克隆项目到本地：

git clone https://github.com/twitterdev/twitter-for-bigquery.git
cd twitter-for-bigquery

2.3 配置文件设置

打开项目目录中的 config_template 文件，填写以下字段：
- TWITTER_CONSUMER_KEY
- TWITTER_CONSUMER_SECRET
- TWITTER_ACCESS_TOKEN
- TWITTER_ACCESS_TOKEN_SECRET
- GOOGLE_SERVICE_ACCOUNT_EMAIL
- GOOGLE_SERVICE_ACCOUNT_PRIVATE_KEY_PATH
将 config_template 文件重命名为 config。

2.4 数据加载

运行以下命令开始将 Twitter 数据加载到 BigQuery：

python load.py

2.5 本地运行

使用以下命令在本地运行应用：

dev_appserver.py --appidentity_email_address="YOUR_TOKEN@developer.gserviceaccount.com" --appidentity_private_key_path=/PATH/TO/key.pem

2.6 部署到 Google App Engine

更新 app.yaml 文件，将项目名称指向您的 Google Cloud 项目。
使用 Google App Engine Launcher 部署应用：
- 点击 "File -> New Application"。
- 指定应用 ID 和应用目录。
- 点击 "Save"。
- 在 "Extra Flags" 部分添加命令行参数。
- 点击 "Deploy"。