2
votes

It looks to be a confusion for users like me as what are the main differences between azure blob storage and azure data lake storage, and in what user case azure blob storage fits better than azure data lake storage, and vice versa?

Thank you.

2

2 Answers

1
votes

Data Lake Storage Gen1 Purpose: Optimized storage for big data analytics workloads

Azure Blob Storage Purpose: General purpose object store for a wide variety of storage scenarios, including big data analytics

Data Lake Storage Gen1 Use Cases: Batch, interactive, streaming analytics and machine learning data such as log files, IoT data, click streams, large datasets

Azure Blob Storage Use Cases: Any type of text or binary data, such as application back end, backup data, media storage for streaming and general purpose data. Additionally, full support for analytics workloads; batch, interactive, streaming analytics and machine learning data such as log files, IoT data, click streams, large datasets

Further more details you could refer to this doc:Comparing Azure Data Lake Storage Gen1 and Azure Blob Storage, there is a table summarizes the differences between Azure Data Lake Storage Gen1 and Azure Blob Storage along some key aspects of big data processing.

0
votes

Adding to the above, Azure Data Lake Storage Gen2 is a set of capabilities dedicated to big data analytics, built on top of Azure Blob Storage.