Description (include relevant info but please do not post solution code or your entire notebook)
According to the slides Apache iceberg and hudi is considered as storage systems. But Apache iceberg and hudi are datalakehouse which is also considered as storage abstraction
So the question is Apache iceberg and hudi are storage systems or storage abstractions ?
You will learn more about storage abstractions and there is a lab on AWS lake formations and Iceberg later in Course 3. In the week 2 slides we can see that storage abstractions (e.g. data Lakes, data Warehouse etc) are in the top of the hierachy. Storage systems like the one you mentioned (iceberg and hudi and other) are in the middle and add features to the storage abstractions. Iceberg adds shema flexibility, data partioning for large datasets and other features as we will see later in the course. Hope it helps
Ok I think I understand what you mean but according to the lectures of Week 2 Iceberg and hudi are definitely not storage abstractions sitting on top of the Hierarchy. You are correct about Apache iceberg and hudi are open datalakehouse solutions and Joe explains that in his book after Delta lakes (which he calls storage management system).
If you follow the course 3 later we implement a datalakehouse in this Assignment: Building a Data Lakehouse with AWS Lake Formation and Apache Iceberg. That will explain about any differences in the datalakehouse and Iceberg architecture. Hope it helps