0: 1609
1: 
2: the sonnets
3: 
4: by william shakespeare
5: 
6: 
7: 
8: 1
9: from fairest creatures we desire increase
10: that thereby beautys rose might never die
11: but as the riper should by time decease
12: his tender heir might bear his memory
13: but thou contracted to thine own bright eyes
14: feedst thy lights flame with selfsubstantial fuel
Word Count Lab: Building a word count application
This lab builds on the techniques covered in the Spark tutorial to develop a simple word count application. The volume of unstructured text in existence is growing dramatically, and Spark is an excellent tool for analyzing this type of data. In this lab, we will write code that calculates the most common words in the Complete Works of William Shakespeare, retrieved from Project Gutenberg.
The same approach could also be scaled up to find the most common words in Wikipedia.
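To make the goal concrete, here is a minimal single-machine sketch of the computation, written in plain Python rather than Spark so that it runs without a cluster. The helper names normalize and top_words are hypothetical and are not part of the lab; the normalization rule (lowercase, keep only letters, digits and spaces) is an assumption chosen to match the cleaned-up lines in the sample output, and the lab's own Spark-based steps may differ.

```python
import re
from collections import Counter


def normalize(line):
    """Lowercase a line, drop everything except a-z, 0-9 and spaces, trim the ends.

    Assumed rule, chosen to resemble the cleaned text in the sample output.
    """
    return re.sub(r"[^a-z0-9 ]", "", line.lower()).strip()


def top_words(lines, n):
    """Return the n most frequent words across all lines as (word, count) pairs."""
    counts = Counter()
    for line in lines:
        # split() with no arguments also discards empty strings from blank lines
        counts.update(normalize(line).split())
    return counts.most_common(n)


# The opening lines of the first sonnet, as a tiny stand-in for the full text.
sample = [
    "From fairest creatures we desire increase,",
    "That thereby beauty's rose might never die,",
    "But as the riper should by time decease,",
    "His tender heir might bear his memory:",
]

print(top_words(sample, 2))
```

In Spark the same three stages (normalize each line, split into words, count per word) are distributed across the cluster instead of running in one loop, which is what allows the approach to scale to much larger inputs.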
During this lab we will cover: