Have you heard of a HDFS?
A Hadoop Distributed File Systemย (HDFS) stores and manages big data. It works by splittingย large filesย into smallerย blocks, aroundย 128MB, and then replicating them 3 times on different nodes.
Each node runs a background process called aย DataNode, which stores the block and talks to aย NameNode, the node that tracks all the blocks.
A HDFS can scale to thousands of nodes and is great for availability because if one node fails, data is still available.