Statistical Analysis Plan (SAP) Example


@author: Mingran Jia

  • URL of data:
    https://nethouseprices.com/house-prices/Lanarkshire/GLASGOW?page=1
    https://nethouseprices.com/house-prices/Lanarkshire/GLASGOW?page=2

    https://nethouseprices.com/house-prices/Lanarkshire/GLASGOW?page=10

  • The context of the data:

    > glimpse(house_price_glasgow)
    Rows: 500
    Columns: 3
    $ address <chr> "1 Ettrick Place, Glasgow, G43 1UA", "2~
    $ prices  <dbl> 144500, 212750, 185000, 90000, 126894, ~
    $ types   <chr> "Flat", "Flat", "Flat", "Semi Detached"~
    
  • The content of the data:

    > str(house_price_glasgow)
    tibble [500 x 3] (S3: tbl_df/tbl/data.frame)
     $ address: chr [1:500] "1 Ettrick Place, Glasgow, G43 1UA" "2/1 26 Tassie Street, Glasgow, G41 3QF" "Flat 1/3 18 Prospecthill Grove, Glasgow, G42 9LD" "41 Ochil Street, Glasgow, G32 7SD" ...
     $ prices : num [1:500] 144500 212750 185000 90000 126894 ...
     $ types  : chr [1:500] "Flat" "Flat" "Flat" "Semi Detached" ...
    

Research Questions

We wish to evaluate the relationship among house types, locations and prices, to help real estate developers set more reasonable prices.

  • The effect of house type on house price
  • The effect of area on house price
  • The interaction between area and house type

Statistical Analysis Plan for House Price

Population

  • All recorded house sale prices in Glasgow

Primary Objective:

  • Estimate the influence of house types and locations on house prices

Secondary Objectives:

  • Identify the most popular house types and postcodes in the city
  • Estimate the interaction between location and house type
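
As a rough illustration of the first secondary objective, the sketch below ranks house types and areas by number of sales. It assumes that "top" means most frequently sold, and that an `area` value (for example a postcode district) is already available for each house; the toy vectors are hypothetical, not the scraped data.

```r
# Hypothetical toy values; in practice these would come from house_price_glasgow.
types <- c("Flat", "Flat", "Flat", "Semi Detached", "Terraced", "Flat", "Terraced")
area  <- c("G43", "G41", "G42", "G32", "G41", "G41", "G41")

# Number of sales per house type and per area, most frequent first.
type_counts <- sort(table(types), decreasing = TRUE)
area_counts <- sort(table(area), decreasing = TRUE)

# Cross-tabulation: sales of each type within each area.
type_by_area <- table(area, types)

top_type <- names(type_counts)[1]
top_area <- names(area_counts)[1]
print(type_counts)
print(type_by_area)
```

The cross-tabulation is what allows the top and bottom selling type to be read off per area.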

Data Collection methods:

  • Scrape the 500 most recent house sale prices (out of 201,191 in total) for Glasgow from the house sales website listed above, and use them as a sample representing the population of Glasgow house prices.
  • Each house is identified by a unique address.
  • Each house is classified into one of a limited set of types.
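
The list of pages to scrape can be generated rather than typed out. The sketch below only builds the URLs; it assumes the 500 records sit on pages 1 to 10 of the listing, and it does not perform any download or HTML parsing.

```r
# Base listing URL taken from the data source given at the top of this plan.
base_url <- "https://nethouseprices.com/house-prices/Lanarkshire/GLASGOW"

# Assumption: the 500 most recent sales are spread over pages 1 to 10.
n_pages <- 10
page_urls <- paste0(base_url, "?page=", seq_len(n_pages))

print(page_urls[c(1, n_pages)])
```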

Variables Under Consideration:

  • House prices grouped by area; house prices grouped by type; difference in house price between types within the same area; difference in house price between areas within the same type - Primary outcome variable
  • Area divisions derived from house locations - Primary explanatory variable
  • Top house type; top house area; top and bottom selling house type in each area; top and bottom selling area for each house type - Explanatory outcome variable
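
One possible way to derive an area from each location is to take the postcode district (the outward code, such as "G43") from the end of the address. This is an assumption about how areas are defined, not something the plan fixes; the addresses are the first entries shown in the `str()` output, and the names `clean`, `pattern` and `area` are illustrative.

```r
# Example addresses in the format of the scraped data; the second one keeps
# the non-breaking space that appears in the raw strings.
address <- c(
  "1 Ettrick Place, Glasgow, G43 1UA",
  "2/1 26 Tassie Street, Glasgow, G41\u00a03QF",
  "41 Ochil Street, Glasgow, G32 7SD"
)

# Replace non-breaking spaces with ordinary spaces before matching.
clean <- gsub("\u00a0", " ", address, fixed = TRUE)

# Capture the outward code that precedes the inward code (e.g. "1UA")
# at the end of the string.
pattern <- "([A-Z]{1,2}[0-9][0-9A-Z]?) [0-9][A-Z]{2}$"
matches <- regmatches(clean, regexec(pattern, clean))

# Take the captured group, or NA when the address has no recognisable postcode.
area <- vapply(
  matches,
  function(m) if (length(m) >= 2) m[2] else NA_character_,
  character(1)
)
print(area)
```

Addresses that yield `NA` here would count as having a missing location.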

Missing Data Procedures:

  • If the house type or location is missing, that house is excluded from the analysis.
  • If the price is missing, use the average price of that area; if there are fewer than two house prices in that area, then that area is excluded from the analysis.
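
The two rules above can be sketched in base R as follows. The data frame is a hypothetical toy example with an assumed `area` column, and the sketch reads the second rule as applying to every area with fewer than two observed prices, which is one interpretation of the wording.

```r
# Hypothetical toy data: same columns as house_price_glasgow (minus address)
# plus an assumed `area` column such as the postcode district.
houses <- data.frame(
  prices = c(100000, NA, 140000, 90000, NA, 200000, NA),
  types  = c("Flat", "Flat", "Terraced", NA, "Flat", "Detached", "Flat"),
  area   = c("G41", "G41", "G41", "G42", "G43", "G43", "G44"),
  stringsAsFactors = FALSE
)

# Rule 1: exclude houses with a missing type or location.
houses <- houses[!is.na(houses$types) & !is.na(houses$area), ]

# Rule 2a: exclude areas with fewer than two observed prices.
n_observed <- tapply(!is.na(houses$prices), houses$area, sum)
keep_areas <- names(n_observed)[n_observed >= 2]
houses <- houses[houses$area %in% keep_areas, ]

# Rule 2b: replace each remaining missing price with the mean of its area.
area_mean <- tapply(houses$prices, houses$area, mean, na.rm = TRUE)
is_missing <- is.na(houses$prices)
houses$prices[is_missing] <- area_mean[houses$area[is_missing]]

print(houses)
```

In this toy example only area G41 survives, and its one missing price is filled with the mean of the two observed prices.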

Summaries to be presented:

  • Basic descriptive statistics for house price, including mean, standard deviation, median, etc.
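
A minimal sketch of such a summary, computed per house type, is given below. The toy prices are hypothetical, and the helper `describe` is an illustrative name rather than an existing function.

```r
# Hypothetical toy data; in practice use house_price_glasgow.
houses <- data.frame(
  prices = c(100000, 200000, 300000, 90000, 110000),
  types  = c("Flat", "Flat", "Flat", "Semi Detached", "Semi Detached"),
  stringsAsFactors = FALSE
)

# Descriptive statistics for one numeric vector.
describe <- function(x) {
  c(n = length(x), mean = mean(x), sd = sd(x),
    median = median(x), min = min(x), max = max(x))
}

# One row of statistics per house type.
by_type <- t(sapply(split(houses$prices, houses$types), describe))
print(by_type)
```

Replacing `houses$types` with an area column would give the same table per area.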

Models to be fitted

  • Linear model to be
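
As a sketch only, one linear model consistent with the objectives above regresses price on house type, area and their interaction. The formula is an assumption and not the plan's stated specification; the data are simulated for illustration, and `area` is an assumed column.

```r
# Simulated, hypothetical data: two types, two areas, five houses per cell.
set.seed(1)
houses <- expand.grid(
  types = c("Flat", "Terraced"),
  area  = c("G41", "G42"),
  rep   = 1:5
)
houses$prices <- 120000 +
  30000 * (houses$types == "Terraced") +
  20000 * (houses$area == "G42") +
  rnorm(nrow(houses), sd = 5000)

# Main effects of type and area plus their interaction (types:area).
fit <- lm(prices ~ types * area, data = houses)

print(summary(fit)$coefficients)
print(anova(fit))
```

The `types:area` row of the ANOVA table is the test that corresponds to the "mutual influence" of location and house type.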