The below architecture diagram is designed using EdrawMax as the free online tool has tons of flexible features to offer. Using the predefined templates, architecture diagrams can be easily created. The following image shows the Hortonworks Data Platform (HDP), a security-rich, enterprise-ready, open-source Apache Hadoop distribution based on a centralized architecture (YARN). HDP addresses data needs at rest, powers real-time customer applications, and delivers robust analytics that help accelerate decision-making. While the data lake concept can be applied more broadly to include other types of systems, it most frequently involves storing data in the Hadoop Distributed File System (HDFS) across a set of clustered compute nodes based on commodity server hardware. As the below image suggests, a Hadoop enterprise data lake can complement an enterprise data warehouse rather than supplant it entirely. It should be noted here that the data lake can host new analytics applications.