Colossus: Successor to the Google File System (GFS)

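This article looks at Colossus, Google's next-generation cluster-level file system. As the successor to the Google File System (GFS), Colossus introduces major changes to the storage software architecture, including an automatically sharded metadata layer, data written with Reed-Solomon encoding, and client-driven replication and encoding. It also covers the advantages of Colossus in cost, availability, and flexibility.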


Colossus is the successor to the Google File System (GFS), as mentioned in the recent Spanner paper at OSDI 2012. Spanner also uses Colossus to store its tablets. Far less has been published about Colossus than about GFS, which was described in a paper at SOSP 2003. Still, some information about Colossus is available on the Web, and I list some of it here.

Storage Architecture and Challenges

Presented at the Faculty Summit on July 29, 2010, by Andrew Fikes, Principal Engineer.

Some interesting points from the slides:

  • Storage Software: Colossus
    • Next-generation cluster-level file system
    • Automatically sharded metadata layer
    • Data typically written using Reed-Solomon (1.5x)
    • Client-driven replication and encoding
    • Metadata space has enabled availability analyses
  • Why Reed-Solomon?
    • Cost. Especially w/ cross cluster replication.
    • Field data and simulations show improved MTTF
    • More flexible cost vs. availability choices
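The "1.5x" figure and the cost argument above can be checked with a little arithmetic. The sketch below is only an illustration: the slides do not give the code parameters, so the 6 data + 3 parity layout, the 3-copy replication baseline, and the two-cluster setup are all assumptions chosen because they reproduce the stated 1.5x overhead.

```python
from fractions import Fraction


def erasure_overhead(data_blocks: int, parity_blocks: int) -> Fraction:
    """Raw bytes stored per byte of user data for a (data, parity) code."""
    return Fraction(data_blocks + parity_blocks, data_blocks)


def replication_overhead(copies: int) -> Fraction:
    """Raw bytes stored per byte of user data with plain replication."""
    return Fraction(copies)


# ASSUMPTION: 6 data + 3 parity blocks; the slides only state "1.5x".
RS_DATA, RS_PARITY = 6, 3
# ASSUMPTION: 3 copies as the replication baseline.
COPIES = 3

rs = erasure_overhead(RS_DATA, RS_PARITY)
rep = replication_overhead(COPIES)

# Losses each scheme survives: a Reed-Solomon stripe can be rebuilt from any
# RS_DATA of its blocks; replication needs at least one surviving copy.
rs_tolerated = RS_PARITY
rep_tolerated = COPIES - 1

# ASSUMPTION: cross-cluster replication keeps one full encoded copy per cluster.
CLUSTERS = 2
print(f"Reed-Solomon: {float(rs)}x per cluster, {float(rs * CLUSTERS)}x total")
print(f"Replication:  {float(rep)}x per cluster, {float(rep * CLUSTERS)}x total")
```

Under these assumptions the erasure-coded layout stores half as many raw bytes as replication while surviving one more lost block per stripe, and the gap doubles with every additional cluster, which fits the "especially w/ cross cluster replication" remark.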

GFS: Evolution on Fast-forward

An interview with Google’s Sean Quinlan by the Association for Computing Machinery (ACM).

Some important points from the interview:

  • “We also ended up doing what we call a ‘multi-cell’ approach, which basically made it possible to put multiple GFS masters on top of a pool of chunkservers.”
  • “We also have something we called Name Spaces, which are just a very static way of partitioning a namespace that people can use to hide all of this from the actual application.” … “a namespace file describes”
  • “The distributed master certainly allows you to grow file counts, in line with the number of machines you’re willing to throw at it.” … “Our distributed master system that will provide for 1-MB files is essentially a whole new design. That way, we can aim for something on the order of 100 million files per master. You can also have hundreds of masters.”
  • BigTable “as one of the major adaptations made along the way to help keep GFS viable in the face of rapid and widespread change.”
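To make the "static way of partitioning a namespace" and the file-count figures concrete, here is a minimal sketch. It is not how GFS or Colossus is implemented: the prefix table, the master names, the longest-prefix rule, and the choice of 200 masters for "hundreds" are all hypothetical and serve only to illustrate the idea.

```python
from typing import Dict

# HYPOTHETICAL static partition table: path prefix -> master name.
NAMESPACE_TABLE: Dict[str, str] = {
    "/logs/": "master-a",
    "/logs/web/": "master-b",
    "/index/": "master-c",
}
DEFAULT_MASTER = "master-default"  # hypothetical fallback


def master_for(path: str, table: Dict[str, str] = NAMESPACE_TABLE) -> str:
    """Pick the master whose prefix is the longest match for the path."""
    best = ""
    for prefix in table:
        if path.startswith(prefix) and len(prefix) > len(best):
            best = prefix
    return table[best] if best else DEFAULT_MASTER


def total_file_capacity(files_per_master: int, masters: int) -> int:
    """Aggregate file count when each master holds its own partition."""
    return files_per_master * masters


# Applications see one namespace; the table hides which master serves a path.
print(master_for("/logs/web/2010/07/29.log"))
print(master_for("/tmp/scratch"))

# 100 million files per master comes from the interview; 200 masters is an
# assumed stand-in for "hundreds of masters".
print(total_file_capacity(100_000_000, 200))
```

Because the table is static, a lookup needs no coordination between masters, but rebalancing means editing the table by hand. That trade-off is presumably part of why the interview describes the distributed master as essentially a whole new design.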

Google File System II: Dawn of the Multiplying Master Nodes

Comments on GFS2 (Colossus), by Cade Metz in San Francisco.

See the article for excerpts.

Reposted from: https://www.cnblogs.com/dhcn/p/7389645.html
