Warsaw, PGE National Stadium

Building data lakehouse architecture on AWS

During the Data Science Summit 2023 — the largest independent data & AI conference in CEE — we introduced the audience to the modern lakehouse data architecture on AWS, covering some of its key concepts through both theory and a practical showcase.

Agenda

  1. Data platforms — a brief history
  2. Data lakehouse architecture — primer
  3. Transactional table formats
  4. Practical applications of transactional table formats — demo

Key topics

  • the evolution of data platforms, leading up to the modern data lakehouse architecture on AWS;
  • open-source data solutions that tie in well with AWS, such as Apache Iceberg and Apache Hudi;
  • transactional table formats — their importance in modern data architectures and their capabilities;
  • a real-world example leveraging AWS analytics services to showcase the above.

Speaker

Rafał Mituła, AWS Community Hero

Head of Data

Rafał is Head of Data at Chaos Gears, wearing an Architect’s hat whenever needed. Recognized by AWS as an AWS Hero, he actively co-organizes the AWS User Group Warsaw meetups and the AWS Community Day Poland conference.

Amazon Athena
