数据分布策略_有效数据项目的三种策略

数据分布策略

Many data science projects do not go into production, why is that? There is no doubt in my mind that data science is an efficient tool with impressive performances. However, a successful data project is also about effectiveness: doing the right things as Russell Ackoff would write in “A systemic view of transformational leadership”.

许多数据科学项目没有投入生产 ,为什么呢? 毫无疑问,数据科学是一种具有出色性能的有效工具。 但是,一个成功的数据项目也与有效性有关:如罗素·阿科夫(Russell Ackoff)在“ 变革型领导的系统观点 ”中所写, 做正确的事

Successful problem solving requires finding the right solution to the right problem. We fail more often because we solve the wrong problem than because we get the wrong solution to the right problem — Russell L. Ackoff (1974)

成功的问题解决需要找到正确问题的正确解决方案。 我们失败的原因更多是因为我们解决了错误的问题,而不是因为我们没有解决正确的问题— Russell L. Ackoff(1974)

How do you focus on your projects and make sure they will bring value to the company? Are you strategically thinking about how to bring your project to fruition?

您如何专注于您的项目并确保它们将为公司带来价值? 您是否在战略上考虑如何使您的项目实现?

NB: I will use golf — a strategic sport — as an illustrative analogy here.

注意:在这里,我将使用高尔夫这一具有战略意义的运动作为比喻。

OKR:设定您致力于实现的目标 (OKR: Setting objectives that you commit to achieve)

Objectives and Key Results (OKR) have been adopted in successful organisations to drive tremendous growth (Intel, Google, …). They were initially introduced by John Doerr to increase focus that produces value.

目标和关键结果(OKR)已通过 成功的组织来推动巨大的增长(英特尔,谷歌等)。 它们最初是由约翰·多尔(John Doerr)引入的,以增加对创造价值的关注。

The general idea is to set Objectives that motivate you. Imagine you are passionate about golf and next Friday there is a big competition. In the last few years, nobody won it performing well on more than 15 holes out of the 18 on the course. Setting yourself to win it is a good objective — it is specific, ambitious, and happens at a given time. You then set Key Results that can measure how you are doing on this objective. In this golf example, they could be:

一般的想法是设定激励您的目标 。 想象一下您对高尔夫充满热情,下周五将进行一场激烈的比赛。 在过去的几年中,没有人赢得过比赛中18个洞中超过15个洞的出色表现。 让自己赢得胜利是一个很好的目标-这是特定的,雄心勃勃的,并且在特定时间发生 。 然后,您可以设置关键结果 ,以衡量您在此目标上的表现 在这个高尔夫示例中,它们可能是:

  • Hit a par (ideal number of shots to get into a hole) on at least 16 out of the 18 holes.

    在18个洞中的至少16个洞上击出标准杆(理想的开枪次数)。
  • Avoid dropping the ball in a sand trap more than three times — because you know that you are bad at getting out of them.

    避免将球掉入沙坑中超过3次-因为您知道自己很难摆脱掉它们。
  • Go for a 20 min practice session before the competition — as you usually make a few bad shots with cold muscles.

    比赛前进行20分钟的练习-因为您通常会因肌肉冰冷而做出一些不好的动作。

Checking all the key results are then a good indicator that you could win.

检查所有关键结果便是您可能会获胜的良好指示。

In another scenario, working for a large bank, picture you are tasked to build a loan risk model with 80% accuracy. Here are some possible key results:

在另一种情况下(为一家大型银行工作),您需要负责建立准确性为80%的贷款风险模型。 以下是一些可能的关键结果:

  • Get 80% client repayment behaviour data by XX/YY/ZZZZ.

    通过XX / YY / ZZZZ获取80%的客户还款行为数据。
  • Test three explainable model types by AA/BB/CCCC.

    通过AA / BB / CCCC测试三种可解释的模型类型。
  • Define and track four metrics to follow the model’s performances and understand where the model is wrong.

    定义并跟踪四个指标,以跟踪模型的性能并了解该模型在哪里出错。

OKRs can be used to drive focus on anything. I find them useful to define my goals on a project: building a model or an application, when will it be good enough? Aiming for the key results brings clarity. Failing becomes a learning experience that stimulates better OKRs definitions and work. On the other hand, success is then crystal clear, and you should enjoy it.

OKR可用于推动对任何事物的关注。 我发现它们对于定义项目目标很有用:建立模型或应用程序,什么时候足够好? 瞄准关键结果会带来清晰度。 失败成为一种学习经验,可以激发更好的OKR定义和工作。 另一方面,成功是显而易见的,您应该享受成功。

Must read on the topic: Measure what matters by John Doerr.

必须阅读以下主题: 衡量 约翰·杜尔的重要性。

传动系统方法 (The Drivetrain Approach)

A drivetrain approach is a comprehensive strategy to data products definition. The following diagram shows its essential steps:

传动系统方法是数据产品定义的综合策略。 下图显示了其基本步骤:

Image for post
iStock under license to M. Koutero. iStock的元素已获M. Koutero许可。

In a new project we might ask ourselves:

在一个新项目中,我们可能会问自己:

  • Objectives

    目标

Setting objectives includes answering questions such as: Does it add value to the business? Is it aligned with the current roadmap? When should it be done? Is it opening new perspectives?

设定目标包括回答以下问题:是否能为企业增加价值? 它与当前路线图一致吗? 什么时候应该做? 它开辟了新的视角吗?

  • Levers

    杠杆

What elements in the final product are under my control? Can I change the price of the product? The ranking on the recommendation page? …

我可以控制最终产品中的哪些元素? 我可以更改产品的价格吗? 推荐页面上的排名? …

  • Data

    数据

Given objectives and levers, what kind of data could I use? What are the compliance issues?

给定目标和杠杆,我可以使用哪种数据? 有哪些合规性问题?

  • Model / Simulation

    模型/模拟

Simulations should indicate if there is enough information in your data combined with your levers to get to your objectives. Could you drive more sales with fewer risks in the loan model example?

模拟应表明您的数据中是否有足够的信息与您的杠杆相结合以实现目标。 在贷款模型示例中,您能否以更少的风险推动更多的销售?

Every step is also an exit point. If you can’t find a solution alone or collectively, it might be an indication that it is not worth your time and should focus on something else.

每一步也是一个出口点。 如果您不能单独或集体找到解决方案,则可能表明它不值得您花时间,而应专注于其他方面。

Must read on the topic: Designing great data products by Jeremy Howard, Margit Zwemer and Mike Loukides.

必须阅读的题目是: 设计大数据产品 杰里米·霍华德 玛吉特池维谋 麦克Loukides

决策智能 (Decision intelligence)

Decision intelligence is a more general discipline that tackles how to build a strategy given objectives in complex situations. The general process integrates notions such as external causes, multiple causal links, and feedback loops. Teams creating causal diagrams can then rationally decide upon a strategy with a clear perception of the problem at hand. One might understand decision intelligence as an extended merger between OKRs and the drive train approach.

决策智能是一门比较通用的学科,致力于解决复杂情况下给定目标的战略制定方法。 常规过程集成了诸如外部原因,多个因果链接和反馈循环之类的概念。 然后,创建因果图的团队可以合理地决定策略,并清楚地了解当前的问题。 人们可能将决策智能理解为OKR与动力传动系统方法之间的扩展合并。

Image for post
iStock under license to M. Koutero. iStock的元素,并已获得M. Koutero的许可。

In the small example above, once you select a club, whether the ball will fly high (and hopefully far) or stay rolling on the ground means the wind is more or less likely to affect. Staying on the ground might be safer, but making only small shots, you will need more of them. Having a good strategy means you will find a reasonable equilibrium to achieve your objectives and goals.

在上面的小示例中,一旦选择了一个球杆,球会飞高(并希望远飞)还是保持在地面上滚动,这意味着或多或少会影响风。 留在地面上可能会更安全,但只拍摄一点,您将需要更多。 拥有良好的策略意味着您将找到一个合理的平衡点来实现自己的目标。

In the OKR example about the risk loan model, we would make here deeper inquiries. Would having a loan model that makes mistakes on certain types of customers be a hazard on equity? Is it possible that employees in charge of validating loans would only rely on the model, become less critical thinkers and be less likely to adjust their behaviour when delicate cases occur? Causal diagrams enable you to understand the indirect consequences of your decisions. If you consider that getting the right clean data and building a model ready for production is often a task that takes months, is it not worth spending some time on the reasons you are doing it?

在有关风险贷款模型的OKR示例中,我们将在这里进行更深入的查询。 具有在某些类型的客户上犯错误的贷款模型会危害股本吗? 负责发生贷款问题的员工是否可能仅依靠模型,变得不那么批判性的思想家并且在发生细微情况时不太可能调整其行为? 因果图使您能够理解决策的间接后果。 如果您认为获取正确的干净数据并为生产做好准备的模型构建通常需要花费数月的时间,难道不应该花一些时间在做这些事情的原因上吗?

For engineers and scientists, it is not extremely different from specifying a classical digital product with its constraints and target performances but broadening the perspective. What is interesting to me, is the focus on the decision making (“should I build this product and how?”) putting both business and technical people together to make sure that at the scale of a whole ecosystem, the next move is the right one.

对于工程师和科学家而言,它与指定具有约束条件和目标性能的经典数字产品并没有什么不同,但是拓宽了视野。 对我而言,有趣的是将重点放在决策上(“我应该制造这种产品以及如何制造吗?”),将业务人员和技术人员放在一起,以确保在整个生态系统的规模上,下一步行动是正确的之一。

Must read on the topic: Link by Lorien Pratt

必须阅读以下主题: Lorien Pratt的 链接

Strategy is not limited to a top/down practice falling under the umbrella of leaders, managers, product managers, etc. I think it is part of any job to meet halfway and have some strategical thinking under the hood. Maybe these frameworks are sometimes too elaborated, but at its core, they start with a simple question that we can ask ourselves: why should I do this project?

战略不仅限于领导,经理,产品经理等领导下的自上而下的实践。我认为,中途开会并有一些战略思想是任何工作的一部分。 也许这些框架有时过于复杂,但从根本上讲,它们以一个简单的问题开始,我们可以问自己:我为什么要进行这个项目?

As a field, growing beyond the AI hype, we cannot stay in an isolated system, extending our level of specialisation without clearly showing its value. Missing middle professionals are likely to be of importance in this task (Paul R. Daugherty — CTIO at Accenture and Lorien Pratt). Whether they will be decision intelligence specialists, data strategists or data product managers will be a matter of semantics and establishing new practices in the field.

作为一个超越AI炒作的领域,我们不能停留在孤立的系统中,无法在没有清楚显示其价值的情况下扩展专业化水平。 缺少中层专业人员可能对这项任务很重要( Paul R. Daugherty-埃森哲公司的CTIO和Lorien Pratt)。 他们将是决策情报专家数据战略家还是数据产品经理,将取决于语义并确定该领域的新实践。

___________

___________

References :

参考文献:

  • Ackoff, R, L: 1998, A Systemic View of Transformational Leadership (Systemic Practice and Action Research).

    Ackoff,R,L:1998年, 《变革型领导的系统观点》 (系统实践与行动研究)。

  • Ackoff, R. L.: 1974, Redesigning the Future: A Systems Approach to Societal Problems (John Wiley & Sons).

    RL,阿科夫(Ackoff),1974年,《 重新设计未来:社会问题的系统方法》 (约翰·威利父子)。

  • Doerr, J: 2018, Measure what matters: How Google, Bono, and the Gates Foundation rock the world with OKRs (Portfolio Penguin).

    Doerr,J:2018, 衡量重要的事情:Google,Bono和盖茨基金会如何利用OKR (Portfolio Penguin) 震撼整个世界

  • Pratt, L: 2019, Link (Emerald Publishing Limited).

    普拉特,L:2019, Link (Emerald Publishing Limited)。

翻译自: https://towardsdatascience.com/three-strategies-towards-effective-data-projects-eed29ad05ded

数据分布策略

  • 0
    点赞
  • 0
    收藏
    觉得还不错? 一键收藏
  • 0
    评论

“相关推荐”对你有帮助么?

  • 非常没帮助
  • 没帮助
  • 一般
  • 有帮助
  • 非常有帮助
提交
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值