Apache Hadoop

Original author(s)	Doug Cutting, Mike Cafarella
Developer(s)	Apache Software Foundation
Initial release	April 1, 2006
Stable release
2.10.x	2.10.2 / May 31, 2022
3.4.x	3.4.0 / March 17, 2024
Repository	Hadoop Repository
Written in	Java
Operating system	Cross-platform
Type	Distributed file system
License	Apache License 2.0
Website	hadoop.apache.org

Apache Hadoop (/həˈduːp/) is a collection of open-source software utilities for using a network of many computers to store and process very large data sets. It provides a software framework for distributed storage and processing of big data using the MapReduce programming model. Hadoop was originally designed for computer clusters built from commodity hardware, which is still its common use.^[3] It has since also found use on clusters of higher-end hardware.^[4]^[5] All Hadoop modules are designed on the fundamental assumption that hardware failures are common and should be handled automatically by the framework.^[6]
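
As a rough illustration of the MapReduce programming model mentioned above, the following toy sketch counts words in a single Python process. It is not Hadoop's API: the function names (`map_phase`, `shuffle`, `reduce_phase`, `word_count`) are hypothetical, and the distribution across machines, distributed storage, and failure handling that Hadoop supplies are deliberately left out.

```python
from collections import defaultdict


def map_phase(record):
    """Map step: emit a (word, 1) pair for every word in one input line."""
    for word in record.lower().split():
        yield word, 1


def shuffle(pairs):
    """Group all emitted values by key, standing in for the framework's
    shuffle step between the map and reduce phases."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Reduce step: combine all values for one key into a single result."""
    return key, sum(values)


def word_count(records):
    """Run map, shuffle, and reduce in sequence over in-memory records."""
    pairs = (pair for record in records for pair in map_phase(record))
    grouped = shuffle(pairs)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


if __name__ == "__main__":
    print(word_count(["big data", "big clusters"]))
```

In a real cluster the map and reduce steps would run in parallel on different machines over different parts of the input; this sketch only shows how a computation is split into the two phases.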


Distributed data processing framework / From Wikipedia, the free encyclopedia
