5 Steps To Scaling MongoDB In 8 Minutes

This article proposes five ways to optimize MongoDB: use explain to check whether a query makes full use of indexes; make sure the hot data (plus its indexes) fits entirely in memory; use the ext4 filesystem, which allocates files much faster; use SSDs for data storage where possible; and shard the data when necessary. Also recommended is a book on MongoDB optimization, 50 Tips and Tricks for MongoDB Developers, which covers more of the things to watch for when programming.

Jared Rosoff gives a concise, effective, entertaining, and convincing 8 minute tutorial on scaling MongoDB at Scale Out Camp. The ideas aren't limited to MongoDB; they work for almost any database: optimize your queries, know your working set size, tune your file system, choose the right disks, and shard. Here's an explanation of all 5 strategies:

  1. Optimize your queries. Computer science works. Complexity analysis works. A btree search is faster than a table scan. So analyze your queries. Use explain to see what your query is doing. If explain reports a BasicCursor (a COLLSCAN on newer versions), the query is doing a table scan. That's slow. Look at the number of documents it scans to satisfy the query. Look at how long it takes. Fix: add indexes. It doesn't matter if you are running on 1 or 100 servers. (See the first sketch after this list.)
  2. Know your working set size. Sticking memcache in front of your database is silly. You have lots of RAM, use it. Embed your cache in the database, which is how MongoDB works. Working set = active documents + used indexes. Hitting something in RAM is fast, disk is slow. If you have a billion users and only 100K are active at a time, then 100K is your working set. You want enough RAM for those 100K so operations stay in RAM and not on disk. Remember that indexes take memory too. It doesn't matter if you are running on 1 or 100 servers. (See the second sketch after this list for a way to estimate the sizes.)
  3. Tune your file system. Performance problems are often traced to the filesystem. EXT3 is ancient. Use EXT4, XFS, or some other well performing file system. Turn off access time tracking (noatime): for a database there's no need to update a file's metadata every time the file is accessed, since that is just another write. Preallocating a 2GB file on EXT3 means actually writing out those bytes, which is slow; EXT4 and XFS can preallocate without that cost.
  4. Choose the right disks. Seek time is what matters. Most of what you are doing is random IO. Seek time is governed by a mechanical arm that has to swing across the disk. The average disk drive can do about 200 seeks a second. Faster drives will move data off the disk faster, that is they have higher bandwidth, but their seek times will be roughly the same. Single disk: you can do 200 queries a second. RAID 0 (stripe across multiple disks): 3 disks means 600 queries a second. RAID 10 (mirror and stripe): 6 disks means 1200 seeks a second. RAID matters. SSDs are awesome: 0.1 ms for a seek vs 5 ms for a spinning-disk seek, which is great for random access.
  5. Shard. If your app is slow, uses bad indexing, or has slow disk drives, then a single node will be slow. Fix all of that before scaling out by sharding. Sharding lets you spread your workload over more machines, along with high availability from replica sets. Data is partitioned to shards by ranges of a shard key, for example. You can scale out to 100s of servers, each of which can process 10s of 1000s of writes, and you can add more capacity easily. Sharding with a good database multiplies the benefits of having good queries, good drives, and good working sets. (The last sketch after this list shows the commands involved.)
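
To make step 1 concrete, here is a minimal sketch using pymongo. The "appdb" database, "users" collection, and "last_login" field are hypothetical, and the exact shape of the explain output depends on the MongoDB version; the point is simply to check the plan and add an index when you see a collection scan.

```python
from pymongo import MongoClient, ASCENDING

client = MongoClient("mongodb://localhost:27017")
db = client.appdb  # hypothetical database

# explain() shows how the query is executed: a COLLSCAN (or "BasicCursor"
# on older servers) means a full table scan.
plan = db.users.find({"last_login": {"$gte": "2024-01-01"}}).explain()
print(plan)  # look for IXSCAN vs COLLSCAN in the winning plan

# Fix: add an index so the query becomes a btree lookup.
db.users.create_index([("last_login", ASCENDING)])
```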
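
For step 2, a rough way to bound the working set is to compare collection and index sizes against available RAM. This sketch reuses the same hypothetical collection; collStats is a standard MongoDB command, and since the true working set is only the active documents plus the indexes they touch, the numbers below are an upper bound.

```python
from pymongo import MongoClient

client = MongoClient("mongodb://localhost:27017")
db = client.appdb  # hypothetical database

stats = db.command("collStats", "users")
data_mb = stats["size"] / 1024 / 1024             # total document size
index_mb = stats["totalIndexSize"] / 1024 / 1024  # size of all indexes

# If even this upper bound fits comfortably in RAM, the working set does too.
print(f"data: {data_mb:.0f} MB, indexes: {index_mb:.0f} MB")
```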
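
And for step 5, sharding is turned on with admin commands issued through a mongos router. The host name, database, collection, and shard key below are hypothetical; choose a shard key that matches how the data is actually queried so the load spreads evenly across shards.

```python
from pymongo import MongoClient

# Connect to a mongos router, not directly to a shard.
client = MongoClient("mongodb://mongos-host:27017")

# Enable sharding for the database, then range-partition the collection
# on the chosen shard key.
client.admin.command("enableSharding", "appdb")
client.admin.command("shardCollection", "appdb.users", key={"user_id": 1})
```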
