The below architecture diagram is designed using EdrawMax as it has free architecture symbols. As per the below architecture diagram, Hadoop Distributed File System (HDFS) exposes a file system namespace and allows user data to be stored in files. There are several Data Repositories like EDW, ERP, CRM, and RDBMS. The HDFS and Data Operation System accesses the data from these repositories. As the image suggests, a data source may be a database, a flat-file, live measurements from physical devices, scraped web data, or any of the myriad static. A Hadoop for the retail data lake can complement an enterprise data warehouse rather than supplant it entirely. It should be noted here that the data lake can host new analytics applications.