Persistent Memory in Exadata X8M

Persistent Memory in Exadata X8M

Christian Craft
SENIOR DIRECTOR, PRODUCT MANAGEMENT

The introduction of Persistent Memory (PMEM) marks the beginning of a revolution in the computing industry. There has always been a separation between system memory where the contents are ephemeral and byte addressable, and storage where the data is persistent and block oriented. Persistent Memory (such as Intel Optane DC Persistent Memory) blurs the line between storage and memory by being both byte addressable as well as persistent.

This new class of Non-Volatile Memory is fast enough to operate alongside conventional (volatile) DRAM in a DIMM (Dual In-Line Memory Module) form factor. Integrating into systems in DIMM slots means that Persistent Memory is able to play a vastly different role than conventional block-oriented storage such as Disk or SSD.

This article is the 2nd in a series covering Persistent Memory and how Oracle is using this technology to revolutionize database systems. Exadata is the first system on the market that is specifically designed to take advantage of Persistent Memory and accelerate the performance of Oracle Databases, while providing full redundancy and data protection that users require for their mission critical systems.

If you haven't read our Persistent Memory Primer article, be sure to learn some of the fundamentals outlined in that article here: https://blogs.oracle.com/database/persistent-memory-primer

Oracle Exadata X8M

There are 3 major changes in Exadata X8M that are driving the latest performance advancements of the platform.  These changes are highlighted in RED below.

The internal fabric has been changed from 40Gbps (Gigabits per second) InfiniBand to 100Gbps RDMA over Converged Ethernet (RoCE) fabric. The RDMA (Remote Direct Memory Access) capabilities of this network fabric are critical for getting the most out of the Persistent Memory.  Of course Persistent Memory in Exadata resides in the storage servers, which means it's fully redundant and the amount of Persistent Memory scales as the system scales.  The Linux KVM (Kernel Virtual Machine) inside of Exadata reduces the overhead of virtualization and allows the system to be configured with the largest supported memory configuration of 1.5TB per database server. Taken in combination, these new features are keeping Exadata far ahead of the competition for database performance and price/performance. For more information on Exadata X8M, please see here: https://www.oracle.com/engineered-systems/exadata/.

Combination of RoCE and PMEM

The latest performance advances of Exadata rely on the combination of RoCE with Persistent Memory. While each of these technologies provides benefits alone, the combination of them was required to make the next leap in performance of the Exadata platform.

The performance of Persistent Memory is now measured in microseconds and even nanoseconds, which means other portions of the I/O stack have become much more significant. RDMA across the Converged Ethernet fabric allows Exadata to make the most of the performance of Persistent Memory. The best way to illustrate this is to look at what would happen if Persistent Memory was introduced into Exadata without RoCE.

Persistent Memory without RoCE

Persistent Memory should deliver internal response times in the range of 6 µsec (microseconds) for processing 8K blocks of data. This response time is getting into the range where time spent in the existing I/O path is becoming a significant slice of the overall time. While Exadata has, for many years, been delivering faster I/O response times than other storage solutions, this presents an opportunity for Oracle's development team to make some big performance improvements.

We would normally expect about 6 microsecond (6,000 nanosecond) read latency to access Persistent Memory from within the Exadata Storage Server.  However, this low latency would be overshadowed by the layers of software, context switches, and network protocol overhead. In the diagram below, we see an older version of Oracle Database (prior to 19c) running on Exadata X8M. The database makes I/O requests from the Exadata storage as normal, but data is cached in Persistent Memory:

The Exadata Storage Software will cache hot blocks regardless of the database version being used. Older database versions (prior to 19c) will use the conventional Exadata I/O path. However, Oracle Database 19c (and later versions) will use RDMA to access data directly in Persistent Memory rather than using the conventional I/O path. This level of integration between the database and storage is only possible due to the tight software/hardware integration of Exadata.  This new feature is known as the Exadata X8M Persistent Memory Data Accelerator.

Persistent Memory Data Accelerator

Oracle Exadata X8M uses Persistent Memory (internally) to achieve dramatically higher I/O rates as well as industry leading low-latency storage access. The Oracle Exadata Storage Software is fully integrated with Persistent Memory and addresses the topics discussed above, while allowing Exadata X8M to take full advantage of the performance of Persistent Memory. Exadata X8M is able to achieve less than 19µsec (microsecond) storage latency and 16 million IOPS (Input/Output Operations Per Second) within a single rack. Customers do not need to configure, tune or even choose Persistent Memory in Exadata X8M. Persistent Memory is automatically included in all Exadata systems and there is nothing to configure or administer.

The RoCE network inside Exadata X8M enables RDMA over a Converged Ethernet fabric, accessing data residing in Persistent Memory. This combination of technologies allows Exadata to achieve near-memory speeds with storage that is fully redundant and fully protected from failures.

Persistent Memory Commit Accelerator

Persistent Memory in Exadata also accelerates commit processing in Oracle Databases.  Commit processing in any database represents a performance bottleneck, especially for OLTP systems. If commits are slow, the entire database can be slowed down across all users and transactions system-wide. Persistent Memory in combination with RDMA is used to accelerate log writes in Exadata X8M, which therefore improves the performance of commits.

Exadata X8M delivers up to 8X faster log writes than the previous generation of Exadata, which was already the fastest on the market. We can see the dramatic effects of PMEM in action by simply looking at the database performance metrics.

Seeing PMEM in Action on Exadata!

The combination of RoCE and PMEM results in dramatically faster response times and much higher IOPS (Input/Output Operations Per Second) than other storage solutions. In this first release of PMEM on Exadata, the unsurpassed performance is focused on the biggest area of benefit, which is database operations that result in the highest IOPS. Single block reads are the highest IOPS event in any Oracle database. We see this as "cell single block physical read" in Exadata, which equates to "db file sequential read" on non-Exadata systems. The following AWR (Automatic Workload Repository) report screen shot shows this in action:

The Exadata Storage Software will cache the most frequently read blocks of data into Persistent Memory. Those blocks will be accessed by pre-19c databases via the pre-existing Exadata I/O path, whereas 19c databases and later will use Remote Direct Memory Access (RDMA) to access those blocks. Persistent Memory is currently used as a write-through cache, so data is always persisted to Flash and Disk.

Exadata also uses Persistent Memory to accelerate commit processing, which is one of the most performance sensitive operations of any database, regardless of the workload.  The Oracle Database has to externalize commit records into redo logs and ensure those records are persisted to storage before returning control to the application, so speeding up commit processing delivers performance increases to the entire database. There are 2 primary Oracle Database performance metrics related to commit processing as follows:

  • log file sync
  • log file parallel write

These events can be viewed in AWR reports to see the benefits of RoCE and PMEM in Exadata X8M, which delivers up to 8X faster commit processing than the previous X8 release.

Data Integrity with PMEM in Exadata

It is important to note that Exadata does not suffer from the data integrity issues outlined in MOS Note# 2608116.1. The Exadata Storage Software is designed to address the data integrity challenges that are presented by the behavior of Persistent Memory. Exadata Storage Software uses Persistent Memory in AppDirect mode with devdax (Device Direct Access), and directly manages how data is written into Persistent Memory to ensure data integrity. The Exadata Storage Software also mirrors all writes to data across storage cells to protect against data loss in the event of a failure. Oracle recommends triple mirroring, or what is known as High Redundancy in Exadata to provide the best protection, even ensuring redundancy during maintenance.

Summary

Exadata uses the combination of Remote Direct Memory Access over Converged Ethernet (RoCE) network with Persistent Memory in the Exadata storage layer to provide unprecedented performance with the data integrity and availability that customers have come to expect from the Exadata platform.

资源下载链接为: https://pan.quark.cn/s/a55a57705e7e “八爪鱼采集器8.1.24.zip”是一个包含八爪鱼采集器8.1.24版本的压缩包。八爪鱼采集器是一款功能强大的网页数据抓取工具,能够帮助用户自动化地从互联网中提取各类信息,例如文章内容、产品价格、用户评价。等它在数据分析、市场研究、竞争情报等领域具有重要的应用价值。 压缩包内包含以下四个文件: “Octopus Setup 8.1.24.exe”:这是八爪鱼采集器的安装程序。用户可以通过运行该文件在计算机上安装8.1.24版本的八爪鱼采集器。安装过程中,系统会提示用户阅读并接受许可协议,选择安装路径,并且可能需要管理员权限来完成安装。 “八爪鱼8版本说明.txt”:该文本文件详细介绍了八爪鱼采集器8版本的主要功能、改进点和更新内容。它可能涵盖新功能的说明,例如更智能的爬虫算法、增强的数据处理能力、优化的用户界面等。此外,还可能包含关于如何使用新版本的指导,以及与旧版本的对比。 “安装前必读.txt”:这是一个重要的文档,用户在安装八爪鱼采集器之前应仔细阅读。它可能包含系统需求、安装步骤以及注意事项,例如确保操作系统兼容性、关闭杀毒软件以避免误报、预留足够的硬盘空间等,以确保安装过程顺利且安全。 “配置规则必读.txt”:这是一份关于如何配置和定制八爪鱼采集器的指南,尤其是针对新用户。采集器的配置规则是其核心功能之一,用户可以根据这些规则设置要爬取的网站、指定抓取的数据字段、设定爬取频率以及数据处理方式等。该文档可能包含一系列实例和最佳实践,帮助用户更好地理解和使用八爪鱼采集器的功能。 八爪鱼采集器8.1.24版本提供了一套完整的网页数据采集解决方案,从安装到配置,再到实际采集操作,都有详细的指导文件。用户通过学习和使用该工具,可以高效地从网络上获取大量有价值的信息,为各种业务决策提供
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值