In this section, we will describe our real-life use case of learning from open data, and then describe how to prepare Apache Spark computing for our real-life projects.
As discussed in Chapter 9, City Analytics on Spark, in the United States and worldwide, more and more governments at various levels have made their collected data openly available to the public. As a result of expanding analytics of open data, many governmental and social organizations have used these open datasets to improve service to citizens, with a lot of good results recorded, such as in https://www.socrata.com/video/added-value-open-datas-internal-use-case/. Using data analytics for cities has a huge impact as more than half of us live in urban centers now, and this urban residence percentage is higher and higher every year.
Especially, using big data to measure communities is also favored by researchers and practitioners, as we can see at http://files.meetup.com/11744342...