Appendix K – Big Data
Page K-4
K.7 What NoSQL categories does Cassandra support?
The Apache Software Foundation’s Cassandra project is a column family database that is
K.8 As illustrated in Figure K-4, what is column family database storage and how are
such systems organized? How do column family database storage systems compare
to RDBMS systems?
Figure K-4 is shown below.
The smallest unit of storage is called a column, but it is really the equivalent of an RDBMS table
Figure K-4(c) clearly illustrates the difference between structured storage column families and
RDBMS tables: Column families can have variable columns and data stored in each row in a way
that is impossible in an RDBMS table. This storage column structure is definitely not in 1NF as
defined in Chapter 2, let alone BCNF! For example, note that the first row has no Phone or City
columns, while the third row not only has no FirstName, Phone, or City columns, but also
contains an EmailAddress column that does not exist in the other rows.