![cover image](https://wikiwandv2-19431.kxcdn.com/_next/image?url=https://upload.wikimedia.org/wikipedia/commons/thumb/0/00/Datensammlung_Studie.png/640px-Datensammlung_Studie.png&w=640&q=50)
Data lake
System or repository of data stored in its natural/raw format / From Wikipedia, the free encyclopedia
A data lake is a system or repository of data stored in its natural/raw format,[1] usually object blobs or files. A data lake is usually a single store of data including raw copies of source system data, sensor data, social data etc.,[2] and transformed data used for tasks such as reporting, visualization, advanced analytics, and machine learning. A data lake can include structured data from relational databases (rows and columns), semi-structured data (CSV, logs, XML, JSON), unstructured data (emails, documents, PDFs), and binary data (images, audio, video).[3] A data lake can be established "on premises" (within an organization's data centers) or "in the cloud" (using cloud services).
![Thumb image](http://upload.wikimedia.org/wikipedia/commons/thumb/0/00/Datensammlung_Studie.png/640px-Datensammlung_Studie.png)