Abstract—The primary data storage for business applications use RDBMSs (Traditional relational databases) for 20 years. Today, another revolutionize is required because most of the applications must now scale to levels that were unbelievable just a few years ago. But scaling alone isn’t enough; companies also require that their applications are always available and scattering fast. Hence Apache Cassandra is a massively scalable Distributed database (NoSQL) that allows for amazing performance at extreme data. This paper provides a brief overview of the Apache Cassandra.
Keywords— Hadoop, HDFS, MapReduce, Cassandra, CQL.