Apache Hadoop is the product which is used for building open-source software for reliable, scalable, distributed computing.
- Apache Hadoop allows to process large data sets across clusters of computers to do the distributed processing using simple programming models.
- It is designed for single machine as well as for n number of machines. Each of which offers local computation and storage.
- It is designed to detect and handle failures at application layer rather than rely on hardware to deliver high-availability. So you can deliver highly-available service on top of a cluster of computers, each of which can be failed.