Capstone Project – The Battle of Neighborhoods

Project Report | Capstone Project – The Battle of Neighborhoods

1. Intoduction

This project aims to find the better neighborhood environment, like pub, park or gym. With the map of Scarborough, Toranto, this project will help people to decide which neighborhood is the beneficial place compared to various participants neighborhood.

As lots of people are migrating to Toranto. They need information and resources to balance housing price and schools for their children. So, this project is for those people among choosing neighborhoods, like the access to cafes, schools, super markets, hospitals, etc.

This project will analyse features for people migrating to Scarborough to search a best neighborhood. The features include median housing price and better school, lower crime rates, road connectivity, good management for emergency facilities, and recreational facilities.

People will get an awareness of that area before moving to a new city.

Foursquare API Data:

Foursquare provides different data information in different venues among neighborhoods. Those information includes venue names, locations, menus and even photos. The foursquare platform is to obtain the required information through the API.

With the gaining information of neighborhoods, Foursquare API would gather information about venues of neighborhoods. For each neighborhood, the radius is 100 meters.

Foursquare data contains venues, longitude, latitude and postcodes. The information obtained per venue as follows:

  1. Neighborhood
  2. Neighborhood Latitude
  3. Neighborhood Longitude
  4. Venue
  5. Name of the venue e.g. the name of a store or restaurant
  6. Venue Latitude
  7. Venue Longitude
  8. Venue Category

Map of Scarborough
在这里插入图片描述

2. Data

Data Link: https://en.wikipedia.org/wiki/List_of_postal_codes_of_Canada:_M

In this project, I’d like to use Scarborough dataset which we scrapped from wikipedia on Week 3. Dataset consisting of latitude and longitude, zip codes.

3. Methodology Section

过程:

在维基上以lxml的形式获得街区数据,邮编,名称,地区,然后整理数据tr分行td分列,通过geocoder用邮编获取对应位置的经纬度,整理数据,通过folium画出各区地图
链接Foursquare,获取对应地区的详细信息,包括
在这里插入图片描述
数据清洗删除无效的行列,整理,统计数据,例如什么的多少,用one hot code表示各个类别,数据归一化,按照频率划分,比如某个社区的coffee占总数的比例,Kmeans分类,通过不同类别做出图形,结合school rate 和房价做评分。

Clustering Approach:

To compare the similarities of two cities, we decided to explore neighborhoods, segment them, and group them into clusters to find similar neighborhoods in a big city like New York and Toronto. To be able to do that, we need to cluster data which is a form of unsupervised machine learning: k-means clustering algorithm.

K-Means Clustering Algoritm
在这里插入图片描述

Most Common venues in each Neighborhood
在这里插入图片描述

Work Flow:

With my credentials of Foursquare API, features of the neighborhoods would be gathered and utilized. As the request limitations, the radius parameter of neighborhood is set to 700 and the total number of place requests is set to 100.

4. Results

** Map of Foursquare Request**
在这里插入图片描述

Map of Clusters in Scarborough
在这里插入图片描述

Average Housing Price by Clusters in Scarborough
在这里插入图片描述

Schools Rating by Clusters in Scarborough
在这里插入图片描述

5. Discussion

Problem Solved:

The purpose of this project offer suggestions to people with a better neighborhood in Scarborough.So the connectivity to the airport, bus stops, distance to the downtown, markets and etc are conuts.

Sorted list of house in terms of housing prices in a ascending or descending order
Sorted list of schools in terms of location, fees, rating and reviews

6. Conclusion

With the help of k-means cluster algorithm, the neighborhoods are separated into 10 clusters with 103 different lattitude and logitude from the dataset. The dataset has similar neighborhoods around. The charts represents a particular neighborhood with average house prices and school ratings .

I really appreciate this opportunity and experience with the efforts to deal with all the tasks. This project is a practical application in a real situation by using Data Science tools.The mapping with Folium is a useful tool to consolidate information and make analysis visualized.

Improvment:

With farther works, this project could be more precise in terms in find the best house not only based on price in Scarborough. It may requires something else around.

Depended Libraries :

Pandas: To create and edit dataframes.

Folium: To visualize the neighborhood clusters distribution.

Scikit Learn: To import clustering algorithms.

JSON: To handle JSON files.

XML: To separate data from presentation and XML stores data in plain text format.

Geocoder: To retrieve location from data.

Beautiful Soup and Requests: To extract data from HTML and XML.

Matplotlib: To draw plots.

  • 0
    点赞
  • 0
    收藏
    觉得还不错? 一键收藏
  • 0
    评论
资源包主要包含以下内容: ASP项目源码:每个资源包中都包含完整的ASP项目源码,这些源码采用了经典的ASP技术开发,结构清晰、注释详细,帮助用户轻松理解整个项目的逻辑和实现方式。通过这些源码,用户可以学习到ASP的基本语法、服务器端脚本编写方法、数据库操作、用户权限管理等关键技术。 数据库设计文件:为了方便用户更好地理解系统的后台逻辑,每个项目中都附带了完整的数据库设计文件。这些文件通常包括数据库结构图、数据表设计文档,以及示例数据SQL脚本。用户可以通过这些文件快速搭建项目所需的数据库环境,并了解各个数据表之间的关系和作用。 详细的开发文档:每个资源包都附有详细的开发文档,文档内容包括项目背景介绍、功能模块说明、系统流程图、用户界面设计以及关键代码解析等。这些文档为用户提供了深入的学习材料,使得即便是从零开始的开发者也能逐步掌握项目开发的全过程。 项目演示与使用指南:为帮助用户更好地理解和使用这些ASP项目,每个资源包中都包含项目的演示文件和使用指南。演示文件通常以视频或图文形式展示项目的主要功能和操作流程,使用指南则详细说明了如何配置开发环境、部署项目以及常见问题的解决方法。 毕业设计参考:对于正在准备毕业设计的学生来说,这些资源包是绝佳的参考材料。每个项目不仅功能完善、结构清晰,还符合常见的毕业设计要求和标准。通过这些项目,学生可以学习到如何从零开始构建一个完整的Web系统,并积累丰富的项目经验。
资源包主要包含以下内容: ASP项目源码:每个资源包中都包含完整的ASP项目源码,这些源码采用了经典的ASP技术开发,结构清晰、注释详细,帮助用户轻松理解整个项目的逻辑和实现方式。通过这些源码,用户可以学习到ASP的基本语法、服务器端脚本编写方法、数据库操作、用户权限管理等关键技术。 数据库设计文件:为了方便用户更好地理解系统的后台逻辑,每个项目中都附带了完整的数据库设计文件。这些文件通常包括数据库结构图、数据表设计文档,以及示例数据SQL脚本。用户可以通过这些文件快速搭建项目所需的数据库环境,并了解各个数据表之间的关系和作用。 详细的开发文档:每个资源包都附有详细的开发文档,文档内容包括项目背景介绍、功能模块说明、系统流程图、用户界面设计以及关键代码解析等。这些文档为用户提供了深入的学习材料,使得即便是从零开始的开发者也能逐步掌握项目开发的全过程。 项目演示与使用指南:为帮助用户更好地理解和使用这些ASP项目,每个资源包中都包含项目的演示文件和使用指南。演示文件通常以视频或图文形式展示项目的主要功能和操作流程,使用指南则详细说明了如何配置开发环境、部署项目以及常见问题的解决方法。 毕业设计参考:对于正在准备毕业设计的学生来说,这些资源包是绝佳的参考材料。每个项目不仅功能完善、结构清晰,还符合常见的毕业设计要求和标准。通过这些项目,学生可以学习到如何从零开始构建一个完整的Web系统,并积累丰富的项目经验。
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值