paper reading:PARTIES:Qos-Aware Resource Partitioning for Multiple Interative Services

PARTIES:Qos-Aware Resource Partitioning for Multiple Interative Services

2019, Shuang Chen,Chritistia Delimitrou, ASPOLOS

{引用格式}

Summary

写完笔记之后最后填,概述文章的内容,以后查阅笔记的时候先看这一段。注:写文章summary切记需要通过自己的思考,用自己的语言描述。忌讳直接Ctrl + c原文。

Research Objective(s)

We present PARTIES, a Qos-aware resoruce manager that enables an arbitrary number of interative, latency-crtitical services to share a physical node without Qos violations.

Background / Problem Statement

Unfortunately, mutli-tenancy oftern comes at a performance penalty, as co-scheduled applications contend for shared resources, leading to interference and performance unpredictability. Interference is partitcularly destructive for interative, latency-critical(LC) services, which must meet strict quality of service(QoS) guarantees.
Cloud applications are progressively shifting from batch to low-latency services. For example, traditionally throughput-bouand applications, like big data and graph analytics,are now moving to in-memory computation, with frameworks like Spark[60], which brings task execution latencies to a few milliseconds or seconds. Furthermore, cloud applications are undergoing a major redesign form large, monolithic services that encompass the entire application functionality in a single binary, to hundreds or thousands of loosely-coupled microsevices[28-30,52]. While the end-to-end latency of a large-scale service remains in the granularity of serveral milliseconds or seconds, each microservice must meet much tighter latency constraints, often in the order of a few hundrends of microseconds.

Method(s)

作者解决问题的方法/算法是什么?是否基于前人的方法?基于了哪些?
PARTIES leverages an online monitoring framework that operates at the granularity of a few hundred milliseconds, to quickly detect QoS violations. Upon detection, the runtime boosts the allocation of one or more resources to the LC service whose latency suffers the most. PARTIES assumes no a priori knowledge of incoming applications, making it applicable in settings like public clouds where user-submitted applications are not known in advance. PAPRTIES uses both OS- and hardware-level partitioning mechanisms available in modern platforms, including containers, tread pinning, cahe partitioning, frequency scaling, memory capacity partitioning, and disk and network bandwidth partitioning to sastisfy the instantaneous resource needs of each co-scheduled interative service.

We then introduce the concept of resource fungibility,i.e., the fact that resources can be traded for each other to arrive to equivalent application performance. Fungibility improves the controller’s flexibility and convergence speed.

在这里插入图片描述

Unlike resources in the storage wheel where the benefit in performancee is almost always immediate, adjusting compute resources may require multiple rounds befeore there are noticeable peformance gains. Indeed, when an aplication is servely starved for compute resoruces, fine-grained adjustments, e.g., in frequency, are not enough to dissipate the long queues taht have built up in the system. Every time the cotroller completes a turn in the compute wheel, it checks memory utilization before deciding whether to initiate another round or to jump to the storage wheel. If memoy slack is large and latency does not drop after scaling compute up, there is high probability that the allocated compute resources are not yet sufficient. On the other hand, if memory is almost saturated, the QoS violation is likely due to an increasing dataset, in which case the controller jumps to the storage wheel.

Evaluation

作者如何评估自己的方法?实验的setup是什么样的?感兴趣实验数据和结果有哪些?有没有问题或者可以借鉴的地方?

Conclusion

作者给出了哪些结论?哪些是strong conclusions, 哪些又是weak的conclusions(即作者并没有通过实验提供evidence,只在discussion中提到;或实验的数据并没有给出充分的evidence)?

Notes

(optional) 不在以上列表中,但需要特别记录的笔记。

References

(optional) 列出相关性高的文献,以便之后可以继续track下去。

  • 0
    点赞
  • 1
    收藏
    觉得还不错? 一键收藏
  • 0
    评论
(我现在主要在CSDN上整理计算机安全、软件工程(可信软件)、系统及通信方面的论文及相关理论书籍,如果对这方面内容感兴趣,可以访问:http://download.csdn.net/user/qysh123 查看我上传的所有资料。内容比较多,需要大家人工手动查找。另外,资料顺序并不按照时间排列,只是想起来了就上传。请大家见谅。) 体系结构和编程语言方面的著名会议,International conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS) 2012年的论文集。这个会议总体上感觉偏计算机系统结构,但是对操作系统、软件、编译器等topic均有涉及,例如目前在软件测试分析领域著名的符号执行平台S2E就是发表在11年的ASPLOS上。对这个会议,请参考以下介绍: 1 ASPLOS是由ACM主办的国际会议。主要关注硬件、体系结构、编译、操作系统等研究方向,在国内外学术界很高的影响。 2 ASPLOS(编程语言和操作系统的体系结构支持会议)是ACM开办的一个以体系结构为核心内容的多学科会议,其研究领域跨越硬件、体系结构、编译器、编程语言、操作系统、网络和应用,尤其关注这些学科间的交叉性研究课题。 ASPLOS的开会年份非常奇怪,82、87、89、91、92、94、96、98、00、02、04、06、08、09,既不是双年会,又不是但年会,还说不准奇数年或偶数年开会,真是个“不走寻常路”的会议。但ASPLOS绝对是一个精品会议,一年仅录用20多篇论文,几乎每篇都会受到计算机领域的大量引用。 2012年ASPLOS共收录了37篇文章,分为10个Session,欢迎需要的朋友下载。

“相关推荐”对你有帮助么?

  • 非常没帮助
  • 没帮助
  • 一般
  • 有帮助
  • 非常有帮助
提交
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值