4 Examples of How I Used Data to Reduce Costs and Increase Profits

There is a lot of talk about how data, data science, and machine learning can all be applied to help make critical business decisions.

As our team works with various companies across the U.S. and in various industries, we have had a lot of opportunities to do more than just talk. We have helped many companies take their data and turn it into valuable decisions.

Not every data project has required complex models or machine learning. However, all of them have turned into opportunities for our clients to either increase revenue or decrease costs, all through the power of data.

In today’s article, we will be discussing and reflecting on some of these use cases with you. Hopefully, they will help inspire you to look for ways you could help improve your company with data insights.

In particular, we will be focusing on fraud detection, service cannibalization, dynamic pricing, and tracking the cost of chronic disease. Each of these cases provided different challenges and opportunities for our team to help out by using a combination of data engineering and data science to provide a clear path to better decisions.

Detecting Outliers: Fraud and Superusers

A common question from clients is how to figure out who in a population is displaying some sort of abnormal behavior, such as committing fraud, being a superuser of a product, or showing unusual opioid prescription patterns.

These cases are often easy to spot by looking for outliers. The interesting thing about fraudulent or otherwise undesirable behavior in a population is that it usually sticks out.

Why?

Because these behaviors tend to deviate wildly, not only from the behavior of the rest of the population but also from the individual's own usual patterns.

For example, let's look at the opioid epidemic. Many reports have shown that some counties in the U.S. had a much higher pill-per-person rate than normal.

Jackson County in Ohio had a rate of 107 pills per person per year. In comparison, most other counties had one-third or a quarter of that.

On the map that accompanies those reports, Jackson County sticks out. Fraudulent behavior tends to show up in data the same way.

We have helped companies detect medical fraud in a similar way. In the medical field, there is a practice called upcoding, in which a medical provider bills for a service that is more expensive than the one actually performed.

There are a couple of ways upcoding can appear in data. For example, a general practitioner may be constantly coding for emergency procedures, which are more expensive than the normal version of the same procedure.

This means such practitioners will often have a lower ratio of normal to emergency procedures than their peers, and that usually stands out. You can typically see it when the ratio is graphed as either a scatter plot or a distribution plot.

This is shown below.

[Two figures omitted. Image source: Author]

A plot alone is rarely enough. However, it can usually help tell your story. When you bring this chart to a meeting, it becomes easy to show your directors where you could be saving money.
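The ratio check described in this section can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the method used for any client: the provider names and claim counts are made up, and the modified z-score (median and median absolute deviation, with the commonly used 3.5 cutoff) is one of several reasonable ways to define "sticks out."

```python
from statistics import median

# Hypothetical claim counts per provider: (normal procedures, emergency procedures).
claims = {
    "provider_a": (95, 5),
    "provider_b": (92, 8),
    "provider_c": (90, 10),
    "provider_d": (94, 6),
    "provider_e": (40, 60),
    "provider_f": (93, 7),
}


def emergency_share(normal, emergency):
    """Fraction of a provider's procedures that were coded as emergency."""
    total = normal + emergency
    return emergency / total if total else 0.0


def flag_upcoding_candidates(claims, threshold=3.5):
    """Return providers whose emergency share is unusually high.

    Uses the modified z-score, 0.6745 * (x - median) / MAD, which is less
    distorted by the outliers themselves than a mean-based z-score.
    Only the high side is flagged, since upcoding inflates emergency codes.
    """
    shares = {p: emergency_share(n, e) for p, (n, e) in claims.items()}
    center = median(shares.values())
    mad = median(abs(s - center) for s in shares.values())
    if mad == 0:
        return []  # every provider looks the same; nothing to flag
    return sorted(
        p for p, s in shares.items()
        if 0.6745 * (s - center) / mad > threshold
    )


if __name__ == "__main__":
    print(flag_upcoding_candidates(claims))
```

A flagged provider is a candidate for review, not proof of fraud; a practice that genuinely handles more emergencies would be flagged too.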

Service Cannibalization: Stealing Money From Yourself

As businesses grow, they often want to create new services and products. However, these new offerings often overlap with services and products the business is already selling.

Sometimes this is OK because you would rather cannibalize your own product vs. allowing a competitor to come in with their own new product. Think iPhone vs iPod. Yes, Apple destroyed the sales of the iPod. But if they hadn’t, someone else would have.

On the other hand, sometimes you are merely providing a duplicate service or product.

Think of a coffee chain opening stores too close to each other. Something like this happened in Washington with Krispy Kreme a long time ago, when they had to pull back after expanding and cannibalizing their own business.

One of our clients started doing something similar. It's not uncommon. Your business is doing well, so you think it is time to expand. But this client didn't realize they were cannibalizing their own services and getting minimal ROI.

Our team found this out quickly after we ran an analysis of their services. We saw that the new service added only about 3% extra income while costing the same as each of their other services, each of which was responsible for about 13%–15% of income.

In the end, the client just needed to steer customers toward the other services they already provided and cut back the duplicate service.
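The comparison behind that kind of analysis can be sketched as follows. The figures are hypothetical, chosen only to mirror the 3% versus 13%–15% pattern described above, and the "less than half the median return" rule is an assumption made for illustration rather than the rule used in the actual engagement.

```python
from statistics import median

# Hypothetical (revenue, cost) per service, in the same currency unit.
services = {
    "service_a": (14.0, 10.0),
    "service_b": (15.0, 10.0),
    "service_c": (13.0, 10.0),
    "service_d": (14.0, 10.0),
    "service_e": (14.0, 10.0),
    "service_f": (13.0, 10.0),
    "service_g": (14.0, 10.0),
    "new_service": (3.0, 10.0),
}


def revenue_shares(services):
    """Each service's share of total revenue, as a fraction of 1."""
    total = sum(revenue for revenue, _ in services.values())
    return {name: revenue / total for name, (revenue, _) in services.items()}


def flag_low_return(services, fraction=0.5):
    """Return services whose revenue-to-cost ratio is below
    `fraction` times the median ratio across all services."""
    ratios = {name: revenue / cost for name, (revenue, cost) in services.items()}
    cutoff = fraction * median(ratios.values())
    return sorted(name for name, ratio in ratios.items() if ratio < cutoff)


if __name__ == "__main__":
    for name, share in revenue_shares(services).items():
        print(f"{name}: {share:.0%} of revenue")
    print("Low return:", flag_low_return(services))
```

A low ratio on its own does not prove cannibalization. It only tells you which service to examine for overlap with the ones you already sell.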

Dynamic Pricing: Improve Profits

Companies like Uber and Expedia have used dynamic pricing to optimize costs for both users and their services. Through a combination of historical and current data, these companies have been able to better price their services.

But optimized pricing is not limited to tech companies. Many other industries can benefit from the same techniques that large tech companies use to price their services. We helped one such company in the transportation industry develop its own easy-to-use tool for managing pricing.

The tool has given the company an opportunity not only to increase revenue but also to better manage employees and overtime, because they now know which days to expect heavy usage of their services and in which days or months they can reduce overtime hours. We have continued to work with this client, optimizing and analyzing other parts of their business.
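To make the idea concrete, here is a minimal sketch of pricing from historical demand. It is not the client's tool: the booking numbers, the weekday-average demand estimate, and the floor and cap on the price multiplier are all assumptions chosen for illustration.

```python
from collections import defaultdict
from datetime import date
from statistics import mean

# Hypothetical daily bookings for 1-14 January 2024 (1 January is a Monday).
bookings = [90, 100, 95, 100, 190, 40, 55, 110, 100, 105, 100, 210, 60, 45]
history = [(date(2024, 1, d), b) for d, b in enumerate(bookings, start=1)]


def weekday_demand(history):
    """Average bookings per weekday (0 = Monday ... 6 = Sunday)."""
    by_day = defaultdict(list)
    for day, count in history:
        by_day[day.weekday()].append(count)
    return {weekday: mean(counts) for weekday, counts in by_day.items()}


def quote_price(day, history, base_price, floor=0.8, cap=1.5):
    """Scale the base price by expected demand relative to the overall
    average, clamped so prices never swing beyond the floor or cap."""
    overall = mean(count for _, count in history)
    expected = weekday_demand(history).get(day.weekday(), overall)
    multiplier = min(cap, max(floor, expected / overall))
    return round(base_price * multiplier, 2)


if __name__ == "__main__":
    for day in (date(2024, 1, 19), date(2024, 1, 20), date(2024, 1, 22)):
        print(day.strftime("%A"), quote_price(day, history, base_price=50.0))
```

The same weekday averages can feed staffing decisions: days with high expected demand are the ones where overtime is likely, and days with low expected demand are candidates for reduced hours.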

One final note about dynamic pricing and utilization: Dynamic pricing doesn’t even always require custom work.

There are several services out there that could help fill the gap. PricingHub is one example. However, you will often end up spending about as much in the long run to implement such a system and then manage it month to month as you would to build your own. Also, prepackaged tools are often a little less robust, and they may be designed to solve the problem broadly rather than to fit your specific needs.

Predicting the Cost of Chronic Disease

When you are developing an analysis or dashboard, often you will need to figure out what kinds of actions or decisions the end user is hoping to make from said deliverable.

For example, one use case our team took on was helping a healthcare provider figure out whether its new policies were both improving the health of patients and reducing costs. The thinking was that if you could improve the health of patients, you would in turn reduce costs. That was the case in our team's first project, which had two phases. In the first phase, we found out that the provider's analysts had been trying to run basic queries using Microsoft Access.

The problem was that the data was too large for Microsoft Access to handle.

Thus, our first step was transferring the data into a better system. In this case, the company was a Microsoft shop, so we used Microsoft SQL Server.

This alone took queries that had needed 10 to 20 minutes to run and made them run in milliseconds. Because of this small change, our team was able to develop much more complex queries. This is one of the benefits of having a team that is focused not only on complex models and algorithms but also on good data principles.

We were able to build a database that allowed not only us but also the other teams at the company to start performing analytics.

From there, our team developed a model that helped outline the future costs patients would experience after contracting a chronic disease as well as show how the healthcare provider’s new policies were improving those costs.

An example of how we ended up displaying the data is below. Our model used several features to predict the average cost of a patient diagnosed with a specific set of chronic diseases, and compared it with the predicted cost if the healthcare provider continued its improved policies.

[Figure omitted. Image source: Author]

Great Data Leads to Great Decisions

At the end of the day, having great data leads to great decisions. We have used data in the use cases above, as well as with other clients, to help them do everything from driving business strategy to developing dashboards to winning new clients.

Translated from: https://medium.com/better-programming/4-examples-of-how-i-used-data-to-reduce-costs-and-increase-profits-2921d8ad5107
