Stanford--Introduce to Databases(2)

Question 5

               

 

Write a query to return the ratings data in a more readable format: reviewer name, movie title, stars, and
ratingDate. Also, sort the data, first by reviewer name, then by movie title, and lastly by number of stars.
select name, title, stars, ratingDate from (Movie natural join Reviewer) natural join  Rating order by name, title, stars

Question 6

For all cases where the same reviewer rated the same movie twice and gave it a higher rating the second
time, return the reviewer's name and the title of the movie.
select name, title from Reviewer natural join Movie where rID in (select rID from ((select * from Rating R1 where rID in (select rID from Rating R2 group by rID, mID having count(stars)=2) and exists (select * from Rating R3 where R3.rID=R1.rID and R3.mID=R1.mID and R3.ratingDate>R1.ratingDate and R3.stars>R1.stars)) as Rs)) and mID in (select mID from ((select * from Rating R1 where rID in (select rID from Rating R2 group by rID, mID having count(stars)=2) and exists (select * from Rating R3 where R3.rID=R1.rID and R3.mID=R1.mID and R3.ratingDate>R1.ratingDate and R3.stars>R1.stars)) as Rs))

Question 7

For each movie that has at least one rating, find the highest number of stars that movie received. Return
the movie title and number of stars. Sort by movie title.
 
select title, Ep.stars from Movie,  (select mID, max(stars) as stars from Rating group by mID) Ep where Movie.mID=Ep.mID  order by title
 

Question 8

For each movie, return the title and the 'rating spread', that is, the difference between highest and lowest
 
ratings given to that movie. Sort by rating spread from highest to lowest, then by movie title.
select title, Ep.diff from Movie M, (select mID, max(stars)-min(stars) as diff from Rating group by mID) Ep where M.mID=Ep.mID order by Ep.diff desc, title;
 

Question 9

Find the difference between the average rating of movies released before 1980 and the average rating of
 
movies released after 1980. (Make sure to calculate the average rating for each movie, then the average
 
of those averages for movies before 1980 and movies after. Don't just calculate the overall average rating
 
before and after 1980.)
select distinct ((select avg(Ep.avgstar) from (select mID, avg(stars) avgstar from Rating join Movie using(mID) group by mID having year<1980) Ep)-(select avg(Ep.avgstar) from (select mID, avg(stars) avgstar from Rating join Movie using(mID) group by mID having year>1980) Ep)) from Rating

  • 0
    点赞
  • 0
    收藏
    觉得还不错? 一键收藏
  • 0
    评论
Stanford-sentiment-treebank(斯坦福情感树库)是一个用于情感分析的数据集。这个数据集由斯坦福大学的研究人员创建,并用于训练和测试情感分析模型。 Stanford-sentiment-treebank的数据集包含了超过11,855个句子,这些句子被分为了5个情感类别:非常负面、负面、中性、正面和非常正面。每个句子都经过了人工标注和建立了情感树结构。 情感树结构是指将每个句子划分为一个树状结构,其中每个节点表示一个短语或单词,并且每个节点都有一个对应的情感标签。根节点代表整个句子的情感,而叶子节点代表具体的短语或单词的情感。 借助这个数据集,研究人员可以训练机器学习模型来识别和预测句子的情感。通过对情感树进行分析,研究人员可以了解到不同单词和短语在不同情感类别中的关联性,从而提高情感分析模型的准确性。 Stanford-sentiment-treebank的数据集被广泛用于情感分析领域的研究和实践。它为研究人员提供了一个基准数据集,用于比较不同情感分析模型的效果。此外,它也为开发者提供了一个用于训练和测试自然语言处理模型的有价值资源。 总而言之,Stanford-sentiment-treebank是一个用于情感分析研究的重要数据集。它通过情感树的结构为研究人员提供了对句子情感的深入理解,并促进了情感分析模型的开发和改进。

“相关推荐”对你有帮助么?

  • 非常没帮助
  • 没帮助
  • 一般
  • 有帮助
  • 非常有帮助
提交
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值