18. Amazon Elastic File System

Overview

  • Amazon Elastic File System (Amazon EFS) provides a simple, serverless, set-and-forget elastic file system for use with AWS Cloud services and on-premises resources.
  • It is built to scale on demand to petabytes without disrupting applications, growing and shrinking automatically as you add and remove files, eliminating the need to provision and manage capacity to accommodate growth.
  • Amazon EFS has a simple web services interface that allows you to create and configure file systems quickly and easily.
  • Amazon EFS supports the Network File System version 4 (NFSv4.1 and NFSv4.0)protocol
  • Amazon EFS offers a range of storage classes designed for different use cases. These include:
    • Standard storage classes – EFS Standard and EFS Standard–Infrequent Access (Standard–IA), which offer multi-AZ resilience and the highest levels of durability and availability.
    • One Zone storage classes – EFS One Zone and EFS One Zone–Infrequent Access (EFS One Zone–IA), which offer customers the choice of additional savings by choosing to save their data in a single AZ’.
  • EFS file systems can grow to petabyte scale, drive high levels of throughput, and allow massively parallel access from compute instances to your data.
  • Amazon EFS supports authentication, authorization, and encryption capabilities to help you meet your security and compliance requirements. 
  • Amazon EFS is designed to provide the throughput, IOPS, and low latency needed for a broad range of workloads.

Amazon EFS: How it works

  • To access your Amazon EFS file system in a VPC, you create one or more mount targets in the VPC.
    • For file systems using Standard storage classes, you can create a mount target in each availability Zone in the AWS Region.
    • For file systems using One Zone storage classes, you create only a single mount target that is in the same Availability Zone as the file system.
  • mount target provides an IP address for an NFSv4 endpoint at which you can mount an Amazon EFS file system.
  • You mount your file system using its Domain Name Service (DNS) name, which resolves to the IP address of the EFS mount target in the same Availability Zone as your EC2 instance.
  • Amazon EFS lifecycle management automatically manages cost-effective file storage for your file systems. When enabled, lifecycle management migrates files that haven't been accessed for a set period of time to an infrequent access storage class, Standard-IA or One Zone-IA. 
  • Amazon EFS with Standard storage classes

Diagram showing 3 Availability Zones in a VPC, containing EC2 instances and mount           targets, and a mounted EFS file system.

  • Amazon EFS with One Zone storage classes

Diagram showing 2 Availability Zones in a VPC, containing EC2 instances, only one mount target, and a mounted EFS One Zone file system.

  • Amazon EFS with AWS Direct Connect and AWS Managed VPN

Diagram showing Amazon EFS works with AWS Direct Connect to mount an EFS file system on an on-premises server.

 Transferring data into and out of Amazon EFS

  • You can use AWS Transfer Family and AWS DataSync to transfer data into and out of your Amazon EFS file systems.
  • You can also use DataSync to transfer files between two EFS file systems, including file systems in different AWS Regions and file systems owned by different AWS accounts.
  • Using Transfer Family, you can securely enable third parties such as your vendors, partners, or customers access to your files over the supported protocols at scale globally, without needing to manage any infrastructure.
    • The Transfer Family server and your Amazon EFS file system are located in the same AWS Region.

Managing Amazon EFS file systems

  • File system management tasks refer to creating and deleting file systems and managing tags, file system backups, access, and network accessibility with mount targets of existing file systems.

Managing mount targets

  • As long as you have one mount target in an Availability Zone, the EC2 instances launched in any of its subnets can share the same mount target.
  • At a minimum, you should create a mount target in each Availability Zone from which you want to access the file system.
  • To avoid application disruption, stop applications and unmount the file system before deleting the mount target.
  • you can create mount targets for the file system in one VPC at a time. If you want to access the file system from another VPC, first delete the mount targets from the current VPC. Then create new mount targets in another VPC.
  • When you create a mount target, you associate security groups with the mount target.

Managing EFS storage classes

Storage classDesigned forDurability (designed for)AvailabilityAvailability zonesOther considerations

EFS Standard

Frequently accessed data requiring the highest durability and availability.

99.999999999% (11 9's)

99.99%

>=3

None

EFS Standard–Infrequent Access (IA)

Long lived, infrequently accessed data requiring the highest durability and availability.

99.999999999% (11 9's)

99.99%

>=3

Per GB retrieval fees apply.

EFS One Zone

Frequently accessed data that doesn’t require highest levels of durability and availability.

99.999999999% (11 9's)*

99.90%

1

Not resilient to the loss of the Availability Zone.

EFS One Zone-IA

Long lived, infrequently accessed data that doesn’t require highest levels of durability and availability.

99.999999999% (11 9's)*

99.90%

1

Not resilient to the loss of the Availability Zone. Per GB retrieval fees apply.

Managing access to encrypted file systems

  • Amazon EFS supports two forms of encryption for file systems, encryption in transit and encryption at rest.
  • Any key management you need to perform is only related to encryption at rest.
  • Amazon EFS automatically manages the keys for encryption in transit.
  • When you create a file system using encryption at rest, you specify a customer master key (CMK). The CMK can be aws/elasticfilesystem (the AWS managed CMK for Amazon EFS), or it can be a CMK that you manage.
  • File data—the contents of your files—is encrypted at rest using the CMK that you specified when you created your file system.
  • Metadata—file names, directory names, and directory contents—is encrypted by a key that Amazon EFS manages.

Amazon EFS performance

  • Amazon EFS file systems are distributed across an unconstrained number of storage servers.
  • This distributed data storage design enables file systems to grow elastically to petabyte scale. It also enables massively parallel access from compute instances, including Amazon EC2, Amazon ECS, and AWS Lambda, to your data.
  • The Amazon EFS distributed design avoids the bottlenecks and constraints inherent to traditional file servers.
  • The distributed nature of Amazon EFS file systems enables high levels of availability, durability, and scalability.

Performance modes

  • General Purpose performance mode
    • We recommend the General Purpose performance mode for the majority of your Amazon EFS file systems.
    • General Purpose is ideal for latency-sensitive use cases, like web serving environments, content management systems, home directories, and general file serving.
    • If you don't choose a performance mode when you create your file system, Amazon EFS selects the General Purpose mode for you by default.
  • Max I/O performance mode
    • File systems in the Max I/O mode can scale to higher levels of aggregate throughput and operations per second.
    • This scaling is done with a tradeoff of slightly higher latencies for file metadata operations.
    • Highly parallelized applications and workloads, such as big data analysis, media processing, and genomic analysis, can benefit from this mode.

Throughput modes

  • Throughput bursting mode
    • All EFS file systems, regardless of size, can burst to 100 MiB/s of metered throughput.
    • File systems that have more than 1 TiB in the EFS Standard or One Zone storage class can burst to 100 MiB/s per TiB of data stored in the file system.
    • The portion of time that a file system can burst is determined by its size
    • Amazon EFS uses a credit system to determine when file systems can burst.
    • Each file system earns credits over time at a baseline rate that is determined by the size of the file system that is stored in the EFS Standard or One Zone storage class.
    • The baseline rate is 50 MiB/s per TiB of storage
    • Amazon EFS meters read operations at a one-third the rate of other operations toward the baseline rate
    • A file system accumulates burst credits whenever it is inactive or driving throughput below its baseline metered rate
    • File system size (GiB)Baseline metered throughput (MiB/s)Burst metered throughput (MiB/s)Maximum burst duration (Min/Day)% of Time file system can burst (per day)
      100.5*1007.20.5%
      25612.510018012.5%
      51225.010036025.0%
      102450.010072050.0%
      153675.015072050.0%
      2048100.020072050.0%
      3072150.030072050.0%
      4096200.040072050.0%
    • To calculate the file system size that you need to sustain the level activity that you want:
      • Identify your throughput needs by looking at your historical usage.
      • Divide this number by 24 hours, 60 minutes, 60 seconds, and 1024 bytes to get the average KiB/second your application required for that day.
      • Calculate the file system size (in GiB) required to sustain this average throughput by dividing the average throughput number (in KiB/s) by the baseline throughput number (50 KiB/s/GiB) that EFS provides.
  • Throughput Provisioned mode
    • Provisioned Throughput mode is available for applications with high throughput to storage (MiB/s per TiB) ratios, or with requirements greater than those allowed by the Bursting Throughput mode. 
  • By default, we recommend that you run your application in the Bursting Throughput mode.

Amazon EFS use cases

  • Big data and analytics——Amazon EFS provides the scale and performance required for big data applications that require high throughput to compute nodes coupled with read-after-write consistency and low-latency file operations.
  • Media processing workflows——A strong data consistency model with high throughput and shared file access can cut the time it takes to perform media processing workflows and consolidate multiple local file repositories into a single location for all users.
  • Content management and web serving——Amazon EFS provides a durable, high throughput file system for content management systems that store and serve information for a range of applications like websites, online publications, and archives.
  • Home directories——Amazon EFS can provide storage for organizations that have many users that need to access and share common datasets

Reference

  • 0
    点赞
  • 0
    收藏
    觉得还不错? 一键收藏
  • 0
    评论

“相关推荐”对你有帮助么?

  • 非常没帮助
  • 没帮助
  • 一般
  • 有帮助
  • 非常有帮助
提交
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值