TMDb Dataset
Distributed TMDb API Data Download using AWS Lambda.
Wanna hear occasional rants about TensorFlow, Keras, DeepLearning4J, Python and Java?
Join me on twitter @ twitter.com/hudsonmendes!
Taking Machine Learning models to production is a battle. There I share my learnings (and my sorrows), so we can learn together!
Data Pipeline for Data Science Series
This is a large tutorial that we tried to keep conveniently small for the occasional reader. It is divided into the following parts:
Part 1: Problem/Solution Fit
Part 2: TMDb Data “Crawler”
Part 3: Infrastructure As Code
(soon available) Part 4: Airflow & Data Pipelines
(soon available) Part 5: DAG, Film Review Sentiment Classifier Model
(soon available) Part 6: DAG, Data Warehouse Building
(soon available) Part 7: Scheduling and Landings
The Problem: Linking IMDb ids and TMDb ids
This project has the following problem statement:
Data Analysts must be able to produce reports on demand, as well as run roll-up and drill-down queries on the Review Sentiment for both IMDb films and IMDb actors/actresses, based on their TMDb Film Reviews. The Sentiment Classifier must be our own.
Looking at the TMDb API specification, we find that they have links to the IMDb Film ids:
However, the "find" endpoint only supports one ID per request:
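To make the one-ID-per-request constraint concrete, here is a minimal sketch of what a single lookup could look like. It assumes the TMDb v3 URL shape `/3/find/{external_id}` with an `external_source=imdb_id` query parameter and a `movie_results` list in the JSON response. The helper names (`build_find_url`, `extract_tmdb_movie_id`, `plan_requests`) are hypothetical and are not taken from the project's code. No network call is made here; the sketch only builds the URLs and reads an already-decoded response.

```python
from typing import Iterable, List, Optional
from urllib.parse import urlencode

# Assumed URL shape of the TMDb v3 "find" endpoint (not taken from the article).
TMDB_FIND_URL = "https://api.themoviedb.org/3/find/{external_id}"


def build_find_url(imdb_id: str, api_key: str) -> str:
    """Build the URL that looks up exactly one IMDb id on the "find" endpoint."""
    query = urlencode({"api_key": api_key, "external_source": "imdb_id"})
    return TMDB_FIND_URL.format(external_id=imdb_id) + "?" + query


def extract_tmdb_movie_id(find_response: dict) -> Optional[int]:
    """Return the TMDb id of the first movie match, or None if nothing matched.

    Assumes the decoded JSON carries a "movie_results" list of objects with "id".
    """
    results = find_response.get("movie_results", [])
    return results[0]["id"] if results else None


def plan_requests(imdb_ids: Iterable[str], api_key: str) -> List[str]:
    """One IMDb id per request: N ids produce N URLs to fetch."""
    return [build_find_url(imdb_id, api_key) for imdb_id in imdb_ids]
```

Because every IMDb id costs one HTTP round trip, the number of requests grows linearly with the number of films, which is what makes spreading the download across many workers attractive.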