Index
A
- ACID
- about / ACID
- ALL, consistency level / Consistency levels
- ANY, consistency level / Consistency levels
- asynchronous read repair / Repairing data
- asynchronous requests
- handling / Handling asynchronous requests
- queries, executing in parallel / Running queries in parallel
- atomicity, ACID / ACID
- availability, CAP theorem / The CAP theorem
B
- batched statements
- executing / Batched statements
- pitfalls / Caution with batches
- ByteOrderedPartitioner
- URL / Partitioners
- about / Partitioners
- used, for scaling out / Effects of scaling out using ByteOrderedPartitioner
C
- CAP theorem
- about / The CAP theorem
- consistency / The CAP theorem
- availability / The CAP theorem
- partition tolerance / The CAP theorem
- Cassandra
- solution / Cassandra's solution
- architecture / Cassandra's architecture
- token, assigning / Token assignment
- replication factor / The replication factor
- data storage / How Cassandra stores data
- monitoring, via Java Management Extensions (JMX) / Monitoring via Java Management Extensions
- monitoring, URL / Monitoring via Java Management Extensions
- Cassandra architecture
- about / Cassandra's architecture
- distributed hash table (DHT) / Distributed hash table
- replication / Replication
- tunable consistency / Tunable consistency
- CassandraFS
- about / Online analysis
- Cassandra logs
- about / Cassandra logs
- Cassandra Query Language (CQL) / SimpleStrategy
- CloudstackSnitch / Snitches
- cluster connection
- creating / Connecting to the cluster
- clustering columns
- about / Clustering columns
- collections
- about / Denormalizing with collections
- using / Denormalizing with collections
- sets / How collections are stored, Sets
- lists / Lists
- maps / Maps
- columns
- expiring / Expiring columns
- communication metrics / Communication metrics
- compaction
- about / Understanding compaction
- size-tiered compaction / Understanding compaction, Size-tiered compaction
- leveled compaction / Understanding compaction, Leveled compaction
- date-tiered compaction / Understanding compaction, Date-tiered compaction
- composite partition key
- about / Composite partition keys
- creating / Composite partition keys
- compound key
- using / Compound keys
- partition key / Partition keys
- clustering columns / Clustering columns
- composite partition key / Composite partition keys
- consistency
- between data centers, achieving / Achieving stronger consistency between data centers
- consistency, ACID / ACID
- consistency, CAP theorem / The CAP theorem
- consistency conflicts
- about / Consistency conflicts
- consistency levels
- about / Consistency levels
- ANY / Consistency levels
- ONE / Consistency levels
- TWO / Consistency levels
- THREE / Consistency levels
- QUORUM / Consistency levels
- SERIAL / Consistency levels
- LOCAL_ONE / Consistency levels
- LOCAL_QUORUM / Consistency levels
- LOCAL_SERIAL / Consistency levels
- EACH_QUORUM / Consistency levels
- ALL / Consistency levels
- consistent hashing
- about / Consistent hashing
- working / The mechanics of consistent hashing
- CQL
- about / CQL under the hood
- storage model / The importance of the storage model
- CQL statements
- with single primary key / Single primary key
- with compound key / Compound keys
D
- data, backing up
- about / Backing up data
- snapshot, taking / Taking a snapshot
- incremental backups / Incremental backups
- snapshot, restoring from / Restoring from a snapshot
- data center
- multiple data centers, use cases / Use cases for multiple data centers
- setup / Data center setup
- adding / Adding a data center
- nodes, removing / Removing nodes within a data center
- decommissioning / Decommissioning a data center
- data center, setup
- RackInferringSnitch / RackInferringSnitch
- PropertyFileSnitch / PropertyFileSnitch
- GossipingPropertyFileSnitch / GossipingPropertyFileSnitch
- Cloud snitches / Cloud snitches
- data center replication
- used, for scaling up / Scaling up using data center replication
- data centers, replication
- about / Replication across data centers
- replication factor, setting / Setting the replication factor
- data deleting
- about / Deleting data
- garbage collection / Garbage collection, Resurrecting the dead
- unexpected deletes / Unexpected deletes
- tombstones, issues / The problem with tombstones
- columns, expiring / Expiring columns
- TTL antipatterns / TTL antipatterns
- data migration
- scenarios / Other data migration scenarios
- data repair operations
- about / Repairing data
- synchronous read repair / Repairing data
- asynchronous read repair / Repairing data
- manually running repair / Repairing data
- DataStax documentation
- URL / Manually assigned tokens
- data storage, Cassandra
- log-structured storage, implications / Implications of a log-structured storage
- date-tiered compaction
- DCAwareRoundRobinPolicy
- about / Load balancing
- usedHostsPerRemoteDc parameter / Failing over to a remote data center
- allowRemoteDCsForLocalConsistencyLevel parameter / Failing over to a remote data center
- dequeue operation
- about / Cassandra is not a queue
- development environment, native Java driver
- setting up / Setting up the environment
- prerequisites / Setting up the environment
- distributed hash table (DHT) / Distributed hash table
- about / Distributing hash tables
- distributed joins
- about / Distributed joins
- downed node
- about / When a node goes down
- marking / Marking a downed node
- handling / Handling a downed node
- durability, ACID / ACID
E
- EACH_QUORUM, consistency level / Consistency levels
- EC2MultiRegionSnitch / Snitches
- EC2Snitch / Snitches
- enqueue operation
- about / Cassandra is not a queue
- extract, transform, and load (ETL)
- about / Online analysis
F
- failover
- about / Failover
G
- garbage collection
- about / Garbage collection, Resurrecting the dead
- garbage collector logs
- about / Garbage collector logs
- geographic distribution
- about / Geographic distribution
- geospatial data
- storing / Working with geospatial data
- querying / Working with geospatial data
- GoogleCloudSnitch / Snitches
- GossipingPropertyFileSnitch / Snitches, GossipingPropertyFileSnitch
H
- Hadoop
- used, for online analysis / Analysis using Hadoop
- Hadoop Distributed File System (HDFS)
- about / Online analysis
- hardware configuration
- selecting / Choosing the right hardware configuration
- hash table
- fundamentals / Hash table fundamentals
- distributed hash table (DHT) / Distributing hash tables
- heterogeneous clusters / Manually assigned tokens
- heterogeneous nodes / Heterogeneous nodes
- hotspots
- about / Hotspots
- scaling out, with ByteOrderedPartitioner / Effects of scaling out using ByteOrderedPartitioner
- time-series example / A time-series example
I
- Integrated Development Environment (IDE)
- about / Setting up the environment
- isolation, ACID / ACID
J
- Java Development Kit (JDK)
- about / Setting up the environment
- URL / Setting up the environment
- Java Management Extensions (JMX)
- used, for monitoring Cassandra / Monitoring via Java Management Extensions
- Java Virtual Machine (JVM) / Monitoring via Java Management Extensions
L
- LatencyAwarePolicy
- about / Load balancing
- leveled compaction
- about / Understanding compaction, Leveled compaction
- advantages / Leveled compaction
- lists
- about / Lists
- live backup
- about / Live backup
- load balancing
- about / Load balancing, Load balancing
- failing over, to remote data center / Failing over to a remote data center
- consistency level, downgrading / Downgrading the consistency level
- retry policy, defining / Defining your own retry policy
- token awareness / Token awareness
- load balancing, policies
- RoundRobinPolicy / Load balancing
- DCAwareRoundRobinPolicy / Load balancing
- LatencyAwarePolicy / Load balancing
- WhiteListRoundRobinPolicy / Load balancing
- TokenAwarePolicy / Load balancing
- LOCAL_ONE, consistency level / Consistency levels
- LOCAL_QUORUM, consistency level / Consistency levels
- LOCAL_SERIAL, consistency level / Consistency levels
- log-structured storage
- implications / Implications of a log-structured storage
- logging
- about / Logging
- Cassandra logs / Cassandra logs
- garbage collector logs / Garbage collector logs
M
- management toolset
- selecting / Choosing a management toolset
- selecting, pointers / Choosing a management toolset
- manual token assignment
- potential issues / Manually assigned tokens
- maps
- about / Maps
- master-slave architecture
- about / The master-slave architecture
- sharding / Sharding
- traits / Master failover
- monitoring, Cassandra
- via Java Management Extensions / Monitoring via Java Management Extensions
- OpsCenter used / Using OpsCenter
- management toolset, selecting / Choosing a management toolset
- monolithic architecture
- about / The monolithic architecture
- multikey queries
- about / Multikey queries
- multiple data centers
- use cases / Use cases for multiple data centers
- Live backup / Live backup
- failover / Failover
- load balancing / Load balancing
- geographic distribution / Geographic distribution
- online analysis / Online analysis
- environment, consistency / Consistency in a multiple data center environment
- multiple data centers environment, consistency
- replicated write, anatomy / The anatomy of a replicated write
- consistency between data centers, achieving / Achieving stronger consistency between data centers
- Murmur3Partitioner
N
- native protocol
- versus Thrift / Thrift versus the native protocol
- Netflix
- Netflix engineering blog
- network attached storage (NAS) / The monolithic architecture
- NetworkTopologyStrategy, replication
- about / Replication strategies, NetworkTopologyStrategy
- rack awareness / NetworkTopologyStrategy
- snitches, configurable / NetworkTopologyStrategy
- node
- failure, replication factor maintaining on / Maintaining the replication factor when a node fails
- node metrics
- monitoring / Monitoring node metrics
- thread pools / Thread pools
- column family statistics / Column family statistics
- latency outliers, finding / Finding latency outliers
- communication metrics / Communication metrics
- nodes
- adding / Manually assigned tokens, Growing your cluster
- removing / Manually assigned tokens, Removing nodes
- rebuilds / Manually assigned tokens
- adding, without vnodes / Adding nodes without vnodes
- adding, with vnodes / Adding nodes with vnodes
- removing, within data center / Removing nodes within a data center
- data center, decommissioning / Decommissioning a data center
- nodetool netstats command / Communication metrics
- nodetool tpstats command / Thread pools
- null values
O
- ONE, consistency level / Consistency levels
- online analysis
- Hadoop used / Analysis using Hadoop
- Spark used / Analysis using Spark
- Online Analytical Processing (OLAP)
- about / Online analysis
- OpsCenter
- using / Using OpsCenter
- installing, URL / Using OpsCenter
P
- partitioners
- about / Partitioners
- Murmur3Partitioner / Partitioners
- RandomPartitioner / Partitioners
- ByteOrderedPartitioner / Partitioners
- hotspots / Hotspots
- partition key
- declaring / Partition keys
- partition tolerance, CAP theorem / The CAP theorem
- phi / Marking a downed node
- prepared statements
- executing / Prepared statements
- primary key
- using / Single primary key
- PropertyFileSnitch / Snitches, PropertyFileSnitch
Q
- queries
- about / Understanding queries
- creating, with key / Query by key
- range queries, creating / Range queries
- denormalizing, with collections / Denormalizing with collections
- QUORUM, consistency level / Consistency levels
R
- RackInferringSnitch / Snitches, RackInferringSnitch
- RandomPartitioner
- about / Partitioners
- URL / Partitioners
- range queries
- creating / Range queries
- rapid read protection / Handling slow nodes
- replicated write
- anatomy / The anatomy of a replicated write
- replication
- about / Replication
- across data centers / Replication across data centers, Replication across data centers
- factor, setting / Setting the replication factor
- replication factor
- about / The replication factor
- maintaining, on node failure / Maintaining the replication factor when a node fails
- balancing, with consistency / Balancing the replication factor with consistency
- replication strategies
- about / Replication strategies
- SimpleStrategy / Replication strategies, SimpleStrategy
- NetworkTopologyStrategy / Replication strategies, NetworkTopologyStrategy
- retry policy
- defining / Defining your own retry policy
- implementation / Tying it all together
- fallback to QUORUM / Falling back to QUORUM
- RoundRobinPolicy
- about / Load balancing
- rule of transparency
- about / Knowledge is power
S
- scaling out
- versus scaling up / Scaling out versus scaling up
- steps / How to scale out
- data center, adding / Adding a data center
- scaling up
- versus scaling out / Scaling out versus scaling up
- steps / How to scale up
- upgrading, in place / How to scale up, Upgrading in place
- data center replication, using / How to scale up, Scaling up using data center replication
- secondary indices
- about / Secondary indices
- under the hood / Secondary indices under the hood
- sensor data model
- about / Modeling sensor data
- queries / Queries
- time-based ordering / Time-based ordering
- sentinel value, using / Using a sentinel value
- time-ordered data, querying / Satisfying our queries
- querying / When time is all that matters
- SERIAL, consistency level / Consistency levels
- sets
- about / Sets
- sharding, master-slave architecture / Sharding
- SimpleSnitch / Snitches
- SimpleStrategy, replication / Replication strategies, SimpleStrategy
- size-tiered compaction
- about / Understanding compaction, Size-tiered compaction
- disadvantages / Size-tiered compaction
- slow nodes
- handling / Handling slow nodes
- snapshot
- taking / Taking a snapshot
- restoring / Restoring from a snapshot
- snitch
- changing / Snitch changes
- snitches
- Solid-state drives (SSDs) / Choosing the right hardware configuration
- Spark
- about / Online analysis
- used, for online analysis / Analysis using Spark
- staged event-driven architecture (SEDA) / Thread pools
- statements
- executing / Executing statements
- prepared statements, executing / Prepared statements
- batched statements, executing / Batched statements
- storage area network (SAN) / The monolithic architecture
- storage model
- importance / The importance of the storage model
- synchronous read repair / Repairing data
T
- thread pools
- about / Thread pools
- Thrift
- versus native protocol / Thrift versus the native protocol
- about / Thrift versus the native protocol
- disadvantages / Thrift versus the native protocol
- time-series data
- working with / Working with time-series data
- designing, for immutability / Designing for immutability
- sensor data, modeling / Modeling sensor data
- time-series example / A time-series example
- token
- assigning / Token assignment
- assigning, manual method / Manually assigned tokens
- virtual nodes (vnodes) / vnodes
- token awareness
- about / Token awareness
- TokenAwarePolicy
- about / Load balancing
- tombstone
- about / Deleting data
- tombstones
- issues / The problem with tombstones
- TTL antipatterns
- about / TTL antipatterns
- tunable consistency, Cassandra
- about / Tunable consistency
- CAP theorem / The CAP theorem
U
- unbounded row growth
- about / Unbounded row growth
V
- virtual nodes (vnodes)
- about / vnodes
- availability, improving / How vnodes improve availability
- adding / Adding and removing nodes
- removing / Adding and removing nodes
- bootstrapping process / Adding and removing nodes
- rebuilding / Node rebuilding
- heterogeneous nodes / Heterogeneous nodes
- vnodes
- using / Adding nodes with vnodes
W
- WhiteListRoundRobinPolicy
- about / Load balancing