Apache Iceberg

Apache Iceberg

  • Features

    • Transactional consistency between multiple applications where files can be added, removed or modified atomically, with full read isolation and multiple concurrent writes
    • Full schema evolution to track changes to a table over time
    • Time travel to query historical data and verify changes between updates
    • Partition layout and evolution enabling updates to partition schemes as queries and data volumes change without relying on hidden partitions or physical directories
    • Rollback to prior versions to quickly correct issues and return tables to a known good state
    • Advanced planning and filtering capabilities for high performance on large data volumes
  • Metadata

    Iceberg metadata structure

  • Usage patterns

  • Insights

    • Great for slowly changing big datasets (read heavy, write light)