I am running a 3-node Cassandra cluster, version 3.0.9. About 200 to 300 million records are inserted into the cluster each day.
Over time the data has taken up more and more disk space. The current data distribution across the nodes is as follows:
UN 172.20.5.4 2.5 TiB 256 66.3% c5271e74-19a1-4cee-98d7-dc169cf87e95 rack1
UN 172.20.5.2 1.73 TiB 256 67.0% c623bbc0-9839-4d2d-8ff3-db7115719d59 rack1
UN 172.20.5.3 1.86 TiB 256 66.7% c555e44c-9590-4f45-aea4-f5eca68180b2 rack1
There is only one datacenter.
The compaction strategy and related table settings are:
compaction = {'class': 'org.apache.cassandra.db.compaction.SizeTieredCompactionStrategy', 'max_threshold': '32', 'min_threshold': '12', 'tombstone_threshold': '0.1', 'unchecked_tombstone_compaction': 'true'}
AND compression = {'chunk_length_in_kb': '64', 'class': 'org.apache.cassandra.io.compress.LZ4Compressor'}
AND crc_check_chance = 1.0
AND dclocal_read_repair_chance = 0.1
AND default_time_to_live = 8640000
AND gc_grace_seconds = 432000
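
For reference, default_time_to_live and gc_grace_seconds above are given in seconds. Below is a rough sketch of the arithmetic, assuming the insert rate stays at 200 to 300 million records per day and every row is written with the table's default TTL (the variable names are only for illustration):

```python
SECONDS_PER_DAY = 24 * 60 * 60  # 86400

default_time_to_live = 8_640_000  # seconds, from the table definition
gc_grace_seconds = 432_000        # seconds, from the table definition

ttl_days = default_time_to_live / SECONDS_PER_DAY
gc_grace_days = gc_grace_seconds / SECONDS_PER_DAY

# Assumption: constant insert rate, all rows written with the default TTL.
inserts_per_day_low = 200_000_000
inserts_per_day_high = 300_000_000

# Rows still within their TTL once inserts and expirations balance out.
live_rows_low = int(inserts_per_day_low * ttl_days)
live_rows_high = int(inserts_per_day_high * ttl_days)

print(f"TTL: {ttl_days:.0f} days, gc_grace: {gc_grace_days:.0f} days")
print(f"Unexpired rows at steady state: {live_rows_low:,} to {live_rows_high:,}")
```

So the TTL is 100 days and gc_grace is 5 days. Under these assumptions the number of unexpired rows should keep growing for roughly the first 100 days, before counting replication.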