Warsaw, PGE National Stadium

Building data lakehouse architecture on AWS

During the Data Science Summit 2023 — the largest independent data & AI conference in CEE — we introduced the audience to the modern lakehouse data architecture on AWS, covering some of its key concepts through both theory and a practical showcase.

Agenda

  1. Data platforms — a brief history
  2. Data lakehouse architecture — primer
  3. Transactional table formats
  4. Practical applications of transactional table formats — demo

Key topics

  • the evolution of data platforms, leading up to the modern data lakehouse architecture on AWS;
  • open-source data solutions that tie in well with AWS, such as Apache Iceberg and Apache Hudi;
  • transactional table formats — their importance in modern data architectures and their capabilities;
  • a real-world example leveraging AWS analytics services to showcase the above.

Speaker

Rafał Mituła, AWS Community Hero

Tech Lead, Data & AI

Cloud Data Architect and Engineer in Chaos Gears’ Data & AI team. Always eager to tackle new challenges and find innovative and effective solutions.

Distinguished as an AWS Hero, Rafał is actively involved in the AWS community by co-organising the AWS User Group Warsaw meetups and the AWS Community Day Poland conference.
