airbnb数据_Airbnb的数据驱动故事

airbnb数据

by Akshaj Verma

由Akshaj Verma

Airbnb的数据驱动故事 (A Data-Driven Story of Airbnb)

Users on AirBnB can book a place to stay in over 34,000 cities across more than 190 countries. The goal of this blog post is to analyze the data, identify problems and opportunities, and come up with insights to increase revenue. This data was taken from Kaggle. If you’re interested in similar projects, have a look at my other blogposts here.

AirBnB的用户可以预订在190多个国家/地区的34,000多个城市中居住的地方。 这篇博客文章的目的是分析数据,发现问题和机会,并提出增加收入的见解。 该数据取自Kaggle 。 如果您对类似的项目感兴趣,请在此处查看我的其他博客文章。

Why AirBnB though?

为什么选择AirBnB?

Well, I wanted to work on a real world data set that had real world implications with a wee bit of preprocessing. AirBnB was the most interesting dataset that I found. So, here we go.

好吧,我想处理一个现实世界的数据集,该数据集通过一点点预处理就对现实世界产生了影响。 AirBnB是我发现的最有趣的数据集。 所以,我们开始。

Import the libraries.

导入库。

Import the dataset.

导入数据集。

预处理 (Pre-Processing)

Convert to date-time format using lubridate. Replace -unknown- gender by NA.

使用lubridate转换为日期时间格式。 用NA替换-unknown-性别。

  • NA values in the date_first_booking column say that the user hasn’t booked any room.

    date_first_booking列中的NA值表示用户尚未预订任何房间。

  • NA values in the age column means that the user hasn’t specified their age. We can fill dummy values in the age column.

    age列中的NA值表示用户未指定年龄。 我们可以在age列中填写虚拟值。

  • NA values in the gender column means that the user hasn’t specified their gender.

    gender列中的NA值表示用户尚未指定其性别。

Note that there are 95,688 NA values in the gender column and 117,763 filled values. So, our analysis based on the gender demographic might not be entirely correct in the real world.

请注意 ,性别列中有95,688 NA值,填充值为117,763。 因此,我们基于性别人口统计的分析在现实世界中可能并不完全正确。

Number of NA values in each column of the data frame.

数据帧每一列中的NA值数。

colSums(is.na(train_users_2))

The age column contains values less than 18 and more than 80. In fact, age contains values as large as 104 and 2014. We will assign NA values to them.

age列包含的值小于18且大于80。实际上,“ age包含的值分别为104和2014。我们将为其分配NA值。

To fill in the NA values in the age column, we’ll calculate the mean and standard deviation of the age column. Then we’ll generate n numbers as random integers between the mean and the standard deviation to fill in the NA values. n is the number of NA values in the age column.

为了填补在的NA值age专栏中,我们将计算出的平均值和标准偏差age列。 然后,我们将在平均值和标准差之间生成n个数字作为随机整数以填充NA值。 nage列中的NA值数。

Create a new column called age_brackets and add it to the data frame.

创建一个名为age_brackets的新列,并将其添加到数据框中。

Finally, we’ll add 2 new columns to the data frame.

最后,我们将在数据框中添加2个新列。

  • The first column is time_first_active_to_booking, which is equal to number of days between date_first_booking and timestamp_first_active.

    第一列是time_first_active_to_booking ,它等于date_first_bookingtimestamp_first_active之间的数。

  • The second column is time_signup_to_booking, which is equal to number of days between date_first_booking and date_account_created

    第二列是time_signup_to_booking ,它等于date_first_bookingdate_account_created之间的

View the number of NAs per column.

查看每列NA的数量。

colSums(is.na(train_users_2))
  • As you can see in the table above, only the date_first_booking, time_first_active_to_booking, and gender columns have NA values. This is totally fine.

    如上表所示,只有date_first_bookingtime_first_active_to_bookinggender列具有NA值。 很好

  • NA values in the date_first_booking column means that the user hasn’t booked any hotels yet.

    date_first_booking列中的NA值表示用户尚未预订任何酒店。

  • NA values in the gender column means that the user hasn’t specified their gender. time_first_active_to_booking is derived from date_first_booking, so it will have NA values.

    gender列中的NA值表示用户尚未指定其性别。 time_first_active_to_booking是从date_first_booking派生的,因此它将具有NA值。

Reset the gender levels. If you don’t do this, the -unknown- level will still show up in the levels(train_users_2$gender). We don’t want that, as we’ve already set all the -unknown- gender values to NA.

重置gender级别。 如果您不这样做, -unknown-级别仍会显示在levels(train_users_2$gender) 。 我们不想要那样,因为我们已经将所有-unknown-性别值设置为NA。

train_users_2$gender <- factor(train_users_2$gender)

We are done with the pre-processing. Whew. :P

我们已经完成了预处理。 ew。 :P

探索性数据分析 (Exploratory data analysis)

年龄,性别和语言 (Age, Gender, and Language)
  1. We can see that there are a lot of missing values for gender. The majority of the users did not fill in their gender information on the platform.

    我们可以看到,性别存在很多缺失的价值观。 大多数用户没有在平台上填写其性别信息。
  2. In the second plot, we observe that the age group of the majority of the users lies between 25 and 47 with the most users around the age of 30. This tells us that young and middle aged users are dominant.

    在第二个图中,我们观察到大多数用户的年龄组在25到47岁之间,而大多数用户的年龄在30岁左右。这告诉我们,中青年用户占主导地位。
  3. For a company based in the US, it isn’t surprising that the most used language on their portal/app is English.

    对于一家位于美国的公司而言,其门户网站/应用程序上使用最多的语言是英语也就不足为奇了。
  4. If we remove the English language from the plot, Chinese(zh) is the next most popular language on AirBnB, followed by French and Spanish. This suggests that AirBnB, after the US, is really popular in French and Spanish speaking countries/ communities. French is predominantly spoken in France, so we know this app is popular in France. But we can’t say that for Spanish, because Spanish is spoken in a lot of countries including Spain, Columbia, and the US, among many others.

    如果我们从情节中删除英语,则中文(zh)是AirBnB上第二受欢迎的语言,其次是法语和西班牙语。 这表明,继美国之后,AirBnB在法语和西班牙语国家/地区中确实很受欢迎。 法语主要在法国使用,因此我们知道此应用在法国很流行。 但是我们不能说西班牙语,因为西班牙语在很多国家都说过,包括西班牙,哥伦比亚和美国等。

This data, along with the user’s location, can be used to identify which regions (inside countries) use what language. Then, maybe, we could show targeted ads to those communities.

该数据以及用户的位置可用于识别哪些地区(国家/地区内)使用哪种语言。 然后,也许我们可以向这些社区展示目标广告。

年龄与性别 (Age vs Gender)

Based on age, there is almost no difference between number of men and women who use AirBnB. Men and women in their 30s are the most prominent users of AirBnB.

根据年龄,使用AirBnB的男女人数几乎没有差异。 30多岁的男女是AirBnB的最主要用户。

会员营销又名广告 (Affiliate Marketing a.k.a Advertisements)

Before we begin to analyze the graphs, let’s understand what affiliate marketing is.

在开始分析图表之前,让我们了解什么是联盟营销。

Affiliate marketing is a type of performance-based marketing in which a business rewards one or more affiliates for each visitor or customer brought by the affiliates’ own marketing efforts. Affiliate marketing is quickly becoming a powerful way to increase sales.

会员营销是一种基于绩效的营销,其中,企业会根据会员的营销活动为每个访问者或客户奖励一个或多个会员。 会员营销正Swift成为增加销量的有力方法。

The 2 plots show the distribution of the affiliate channels used by different affiliate providers.

2个图显示了不同联盟提供商使用的联盟渠道的分布。

Direct marketing performed by AirBnB itself has had the most outreach in terms of marketing. Direct marketing is a form of advertising where organizations communicate directly to customers through a variety of media including text messages, email, websites, online adverts, promotional letters, and targeted television.

在营销方面,由AirBnB本身进行的直接营销影响最大。 直接营销是一种广告形式,组织可以通过多种媒体直接与客户进行沟通,包括短信,电子邮件,网站,在线广告,促销信和定向电视。

Google is a close second affiliate provider with semi-branding being its most popular affiliate channel. Bing, Facebook, and Craigslist are the other “major” contributors.

Google是第二紧密的会员供应商,其中半品牌是其最受欢迎的会员渠道。 必应,Facebook和Craigslist是其他“主要”贡献者。

根据年龄进行有针对性的营销 (Targeted Marketing based on Age)

These graphs show the comparison of the usage of the AirBnB platform based on the age demographic.

这些图显示了基于年龄人口统计的AirBnB平台使用情况的比较。

基于性别的针对性营销 (Targeted Marketing based on Gender)

These graphs show the comparison of the usage of the AirBnB platform based on the gender demographic.

这些图显示了基于性别人口统计的AirBnB平台使用情况的比较。

  1. More women than men are targeted by the direct affiliate channel.

    直接会员渠道针对的是女性多于男性。
  2. Same goes for the direct affiliate provider.

    直接会员提供商也是如此。
  3. If we remove the direct affiliate channel, we observe the semi branded and semi non-branded are the two most popular channels followed by API and SEO (Search Engine Optimization). With the exception of the API channel, all other channels cater to more women than men.

    如果删除direct会员渠道,我们会发现半品牌和半非品牌是两个最受欢迎的渠道,其次是API和SEO(搜索引擎优化)。 除了API频道外,其他所有频道迎合的女性人数都比男性多。

  4. Google as an affiliate provider is more common amongst women than men.

    谷歌作为会员提供者在女性中比男性更普遍。
注册应用程序和注册方法 (Signup App and Signup Method)
  1. Signing up using email is the most popular option, followed by signup using Facebook. No body likes to link their Google account to their Airbnb account.

    使用电子邮件注册是最受欢迎的选项,其次是使用Facebook注册。 没有人喜欢将其Google帐户关联到其Airbnb帐户。
  2. An overwhelming majority of people access the AirBnB platform using browsers on their computers followed by their iOS app. The fact that Android users are less than iOS users might seem odd, but remember that AirBnB is an American company with it’s largest user base being Americans. iOS is more popular in the US than Android.

    绝大多数人使用计算机上的浏览器以及iOS应用程序访问AirBnB平台。 Android用户少于iOS用户这一事实似乎很奇怪,但请记住,AirBnB是一家美国公司,其最大的用户群是美国人。 iOS在美国比Android更为流行。
  3. People probably don’t use the app as much. This could be beacuse they don’t like the Android/iOS app’s UI or functionality. Maybe the web version offers more functionality and is easier to use. Or people aren’t aware of the AirBnB app.

    人们可能不怎么使用该应用程序。 这可能是因为他们不喜欢Android / iOS应用的用户界面或功能。 也许网络版本提供了更多的功能并且更易于使用。 或人们不了解AirBnB应用。
基于年龄和性别的注册应用和方法 (Signup App and Method based on Age and Gender)
  1. As expected, elderly people do not use smart phones at all to use AirBnB. A large number of people in their 20s, 30s, and 40s use their computers to access the AirBnB platform. One would expect the “tech savvy” teens and 20 year olds to use smartphones more, but that isn’t the case. (Note that there a lot more people in the 30s age bracket. This assumption might be wrong.)

    不出所料,老年人根本不使用智能手机来使用AirBnB。 20多岁,30多岁和40多岁的许多人使用他们的计算机访问AirBnB平台。 人们会期望“精通技术”的青少年和20岁的年轻人更多地使用智能手机,但事实并非如此。 (请注意,在30岁年龄段的人群更多。这个假设可能是错误的。)
  2. More women prefer signing up using their computers while more men prefer iOS/Android apps.

    越来越多的女性更喜欢使用计算机进行注册,而越来越多的男性更喜欢使用iOS / Android应用。
  3. A lot more people in their 30s prefer signing up using email as compared to Facebook. An almost weirdly equal number of people in their 20s and 30s prefer to signup using Facebook.

    与Facebook相比,30多岁的人们更喜欢使用电子邮件进行注册。 在20多岁和30多岁之间,几乎几乎相等的人数更喜欢使用Facebook进行注册。
  4. More women than men prefer to use Facebook and email signup methods. Compared to the other 2, Google sign up method is like a 404 error, Does Not Exist.

    与男性相比,更多女性更喜欢使用Facebook和电子邮件注册方法。 与其他2相比,Google注册方法就像一个404错误,“不存在”。
第一设备类型与年龄和性别 (First Device type vs Age and Gender)
  1. Macs are the most preferred laptops to access the AirBnB platform followed by Windows Desktop. Again, Apple is supremely popular in the US.

    Mac是访问AirBnB平台的首选笔记本电脑,其次是Windows Desktop。 同样,苹果在美国非常受欢迎。
  2. iPhones and iPads are the second most widely used devices to access the AirBnB platform.

    iPhone和iPad是访问AirBnB平台的第二广泛使用的设备。
  1. Mac Desktops are highly popular amongst people in their 20s and 30s to access the AirBnB platform followed by the Windows Desktop.

    Mac桌面在20多岁和30多岁的人群中非常受欢迎,可以访问AirBnB平台,其次是Windows桌面。
  2. We see a decreasing trend in the usage of Macs as the age increases. There’s no disparity between Mac Desktops and Windows Desktops for people in their 60s.

    随着年龄的增长,我们发现Mac的使用量呈下降趋势。 Mac台式机和Windows台式机对60多岁的人们没有区别。
  3. Smart phones, however, become unpopular as the Age increases.

    然而,随着年龄的增长,智能手机变得不受欢迎。
多年来的预订和帐户 (Bookings and Accounts over the years)
  1. Number of bookings rapidly increase every year.

    预订数量每年都在Swift增加。
  2. The sharp drop in bookings for the year 2015 is because we only have data till 29–06–2015. filter(train_users_2, date_first_booking >= "2015-06-29")

    2015年的预订量急剧下降是因为我们只有截至2015年6月29日的数据。 filter(train_users_2, date_first_booking >= "2015-06-2 9”)

  3. For the number of accounts created, we only have data till 2014–06–30. filter(train_users_2, date_account_created >= "2014-06-30")

    对于创建的帐户数,我们只有2014年6月30日之前的数据。 filter(train_users_2, date_account_created >= "2014-06-3 0”)

每年的首次预订数量 (Number of First-bookings per year)
  1. Number of first-bookings is at its lowest around January. This could possibly be because the year has just ended, so people don’t travel anywhere so soon. Also, cold outside much?

    一月份的首次预订数量最低。 这可能是因为这一年刚刚结束,所以人们不会很快去任何地方。 还有,外面冷了多少?
  2. Number of first-bookings always spikes up between the months of July and October. This could be in anticipation of festivals like Thanksgiving and Oktoberfest (or the summer holidays).

    在7月到10月之间,首次预订的数量始终会激增。 这可能是在期待感恩节和慕尼黑啤酒节(或暑假)之类的节日。
  3. However, we see a sharp drop in the number of bookings starting from July, 2014 until July, 2015.

    但是,从2014年7月到2015年7月,预订数量急剧下降。
“In July 2014, Airbnb revealed design revisions to the site and mobile app and introduced a new logo. Some considered the new logo to be visually similar to genitalia, but a consumer survey by Survata showed only a minority of respondents thought this was the case.”
“ 2014年7月,Airbnb公开了对该网站和移动应用的设计修订,并引入了新徽标。 一些人认为新徽标在视觉上类似于生殖器,但Survata进行的一项消费者调查显示,只有少数受访者认为是这种情况。

Google “AirBnB 2014” to find the reason for the sudden drop in the number of bookings in 2014.

Google“ AirBnB 2014”查找2014年预订数量突然下降的原因。

每年创建的帐户数 (Number of Accounts created per year)
  1. This plot follows a similar trend as the above plot.

    该图遵循与上述图类似的趋势。
  2. Number of new (first) accounts created is less around January and spikes around September and October.

    1月前后创建的新(第一个)帐户数量较少,而9月和10月前后则增加。
  3. People probably create new accounts to book as well as compare prices between other services.

    人们可能会创建新帐户进行预订,并比较其他服务之间的价格。

AirBnB could probably reduce prices or give more discounts and offers during the months of August, September, and October so that more people book places.

AirBnB可能会在8月,9月和10月的几个月内降低价格或提供更多折扣和优惠,以使更多的人预订位子。

从注册到首次预订的时间(根据年龄和性别) (Time between Sign up and First booking based on Age and Gender)

The colored boxes indicate the interquartile range which represents the middle 50% of the data. The whiskers extend from either side of the box. The whiskers represent the ranges for the bottom 25% and the top 25% of the data values, excluding outliers.

彩色框表示四分位间距,代表数据的中间50%。 晶须从盒子的两侧伸出。 晶须代表数据值的底部25%和顶部25%的范围,不包括异常值。

  1. A heavy majority of people irrespective of age and gender book rooms on the day they signup. The median value is 0.

    在注册当天,绝大多数人不分年龄和性别的书房。 中值为0。
  2. You can see the “outliers” book rooms more 1000 days after signing up on the platform.

    在平台上注册1000天后,您可以看到“异常值”书房。
  3. The “wait-time” for the middle 50% users from each age bracket generally tends to decrease with age.

    每个年龄段的中50%用户的“等待时间”通常会随着年龄的增长而减少。
从第一次预订到第一次活动的时间 (Time between the First Booking and First Activity)
  1. Time between the first booking and first activity of the users 0 or close to 0 for a lot of people.

    很多人从第一次预订到用户的第一次活动之间的时间为0或接近0。
  2. There are people who’ve booked their first room more than 100 days after their first activity on the AirBnB platform. Damn.

    在AirBnB平台上进行首次活动后,有100多人预订了第一间房间。 该死的。
从第一次预订到注册的时间 (Time between First booking and Signup)
  1. Here, we see that the number of days is negative for quite a number of people. People have booked rooms as long as a year before creating an account. Other than that, the data seems similar to the above plot.

    在这里,我们看到许多人的天数是负数。 人们在创建帐户之前已经预订了长达一年的房间。 除此之外,数据似乎与上述曲线相似。
  2. A huge number of people book the rooms on the same day they signup on the AirBnB platform.

    在AirBnB平台上签约的同一天,就有大量的人预订房间。

Let’s analyze the negative values. How many negative values are there?

让我们分析负值。 有多少个负值?

We see that there are only 29 negative values. This means that there were 29 users who were able to book their rooms without creating an account!

我们看到只有29个负值。 这意味着有29位用户无需预订即可预订房间!

Let’s see in which years this happened. After filtering in only the negative values of time_signup_to_booking, that is people who booked rooms before signing up, we plot the following graph. This graph tells us that users could sign up before booking on the AirBnB platform from 2010 to 2013.

让我们看看这发生在哪几年。 在仅过滤了time_signup_to_booking的负值(即在注册之前预订房间的人)后,我们绘制了下图。 该图告诉我们,从2010年到2013年,用户可以在预订AirBnB平台之前进行注册。

The following statement was released by AirBnB.

以下声明由AirBnB发布。

“Up until early 2013 there was a handful of flows where a user was able to book before fully creating an account (by the definition of account creation we use today). After early 2013 this is no longer possible.”
“直到2013年初,用户可以在完全创建帐户之前进行预订(根据我们今天使用的帐户创建的定义),可以进行少量的预订。 在2013年初之后,这将不再可能。”

结果 (Results)

预订与未预订 (Booked vs Not Booked)

NDF means no booking was made.

NDF表示未预订。

  1. There are clearly more inert users than active users.

    显然,惰性用户比活跃用户更多。
  2. Male vs Female ratio is pretty much the same for users who book vs those who don’t. NA values, however, differ. There are lot of users who don’t provide their gender and don’t book any rooms.

    对于预订的用户与未预订的用户,男女比例几乎相同。 但是,NA值不同。 许多用户不提供性别且不预订任何房间。
  3. People in their 30s are the highest in the lot for both booking and not booking rooms. The ratio of Booked : NotBooked is less than 1 for people in their 20s, 30, and 40s. While the same ratio is kind of constant for people in their 50s, 60s, and 70s.

    无论是预定房间还是预定房间,30多岁的人都是最高的。 预订比例 对于20多岁,30多岁和40多岁的人, NotBooked小于1。 虽然相同的比例对于50年代,60年代和70年代的人来说是一个常数。

  4. As you can see, a huge number of people don’t end up booking rooms. The US has the most number of bookings after that. A large number of these bookings must be domestic, as the company itself is based in the USA.

    如您所见,很多人最终没有预订房间。 之后,美国的预订量最多。 由于公司本身位于美国,因此这些预订中的许多必须是国内预订。

The graph shows that people in their 20s, 30s, and 40s are AirBnB’s base customers.

该图显示,年龄在20、30和40岁的人是AirBnB的基本客户。

目的地国家/地区的频率 (Frequency of the Destination Country)
  1. Highest number of inactive users (people who haven’t booked a room) are in their 30s.

    闲置用户(未预订房间的人)数量最多的是30多岁。
  2. Women travel around slightly more than men using AirBnB.

    使用AirBnB的女性出行比男性略多。
  3. After the US and “other” countries, France is the next most popular destination.

    在美国和“其他”国家之后,法国是第二受欢迎的目的地。
  4. More women travel to France as compared to men, while more men visit Canada as compared to women.

    与男子相比,前往法国旅行的女性人数更多,而与加拿大相比,前往加拿大旅行的男性人数更多。

Note that the travel need not be only international. It can be both national or international, as the country of origin of users is not provided in the dataset.

请注意,旅行不必只是国际旅行。 它可以是国家或国际的,因为数据集中未提供用户的原籍国。

联盟渠道对目的地国家的影响 (Effect of Affiliate Channel on the Destination Country)
  1. Direct affiliate channels have played a major role to confirm bookings.

    直接会员渠道在确认预订方面起着重要作用。
  2. Semi branding affiliate channel plays an important role to confirm bookings, especially in the US.

    半品牌会员渠道在确认预订方面起着重要作用,尤其是在美国。
联盟提供商对目的地国家的影响 (Effect of Affiliate Provider on the Destination Country)

After the direct affiliate provider, Google plays an important role in confirming the bookings.

在直接联属网络营销提供商之后,Google在确认预订方面起着重要作用。

基于性别的月度预订统计 (Monthly Booking Statistics based on Gender)
  1. Women have always booked more hotels on AirBnB than men except for one month — December, 2013.

    除了一个月(2013年12月),女性在AirBnB上预订的酒店总是比男性多。
  2. We can see the total number of booking peaking around July, except in 2014 and 2015.

    我们可以看到预订总数在7月左右达到峰值,但2014和2015年除外。
基于性别的每周预订统计 (Weekly Booking Statistics based on Gender)
  1. Number of bookings is always the lowest during the weekend, ie. Saturday and Sunday.

    在周末,预订数量始终是最低的。 星期六和星期日。
  2. The number of bookings always peaks during Thursday and Friday. People probably book rooms for the weekend on Thursdays and Fridays.

    预订数量始终在周四和周五达到高峰。 人们可能在星期四和星期五预订周末的房间。
  3. Hotel bookings by men drop a lot on Sundays as compared to Saturdays, but is the opposite for women.

    与周六相比,周日男性的酒店预订量下降很多,但是女性则相反。
  4. The number of bookings keep increasing from Monday to Friday only to fall on the weekend.

    从星期一到星期五,预订数量一直在增加,直到周末才减少。
预订数量的每日变化 (Daily Variation in the Number of Bookings)

We observe the AirBnB went from from 0–50 bookings per day in 2010 to almost 200 bookings per day in the early months of 2014.The number dropped down to around 100 bookings per day thereafter.

我们观察到,AirBnB从2010年的每天0到50预订下降到2014年前几个月的每天近200预订。此后,这一数字下降到每天约100预订。

结论 (Conclusions)

Before I started working on this, I had a few preconceived notions about the kind of results I would get. For example, I did not expect iOS to be immensely popular. More so than Android.

在开始进行此工作之前,我对可以得到的结果有一些先入为主的观念。 例如,我没想到iOS会如此流行。 比Android更是如此。

Of the 432 million smart phones sold in the last quarter of 2016, 352 million ran Android (81.7 percent) and 77 million ran iOS (17.9 percent).

在2016年第四季度售出的4.32亿智能手机中,有3.52亿运行Android(占81.7%)和7700万运行iOS(占17.9%)。

The sharp drop in the number of users around July, 2014 was also kind of unexpected. I mean, the company had a terrific growth rate until that point. It all started to go downhill after that. Although we have data only until mid 2015, who knows bookings might have picked up after that.

2014年7月前后的用户数量急剧下降也令人感到意外。 我的意思是,到那时为止,公司的发展速度非常快。 之后,一切开始下坡。 尽管我们只有截至2015年年中的数据,但谁知道之后可能会有所增长。

Who would’ve expected the number of bookings to drop on weekends? Certainly not me. But as it turns out, if you want to go out over the weekend, you gotta book your hotels before that. My bad. :P

谁能想到周末的预订量会减少? 当然不是我。 但事实证明,如果您想在周末外出,则必须在此之前预订酒店。 我的错。 :P

Booking a room before creating an account? This part took me a lot of time to figure out. I thought I had erred. But when I finally Googled it, it turns out that this was possible before early 2013. AirBnB fixed the bug soon after.

在创建帐户之前预订房间? 这部分花了我很多时间弄清楚。 我以为我犯了错误。 但是,当我最终用Google搜索它时,事实证明这是有可能在2013年初之前完成的。AirBnB很快就修复了该错误。

The other thing that stuck out was, people in their 30s were more active on AirBnB as compared to people in their 20s. HOW? I certainly did not expect people in their 70s to travel, much less using AirBnB. *_*

突出的另一件事是,与20多岁的人相比,30多岁的人在AirBnB上的活跃度更高。 怎么样? 我当然没想到70年代的人们会旅行,更不用说使用AirBnB了。 * _ *

Thank you for reading. Suggestions and constructive criticism are welcome. :) You can find me on LinkedIn. You can view the the full code here.

感谢您的阅读。 欢迎提出建议和建设性批评。 :)您可以在LinkedIn上找到我 您可以在此处查看完整的代码。

You can also check out my other blogposts Exploratory Analysis of FIFA 18 dataset using R, Getting Started With Hive, and GgPlot ’Em All | Pokemon on R.

您还可以使用RHive入门GgPlot'Em All |查看我的其他博客文章FIFA 18数据集的探索性分析。 R上的口袋妖怪

翻译自: https://www.freecodecamp.org/news/a-data-driven-story-of-airbnb-25e6c5be8973/

airbnb数据

  • 0
    点赞
  • 1
    收藏
    觉得还不错? 一键收藏
  • 0
    评论
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值