top of page
Search


A Beginner’s Guide to Apache Parquet
Apache Parquet is a widely used file format in modern data analytics and data engineering. It is especially common in in data lakes and data lakehouse architectures, where performance, scalability, and efficient storage are critical. This guide explains what Parquet is, where it came from, how it is structured internally, and how to query it effectively. From the AWS Builder Center Blog: https://builder.aws.com/content/38xMNi5KpMwMMVNvBaHPjEOlv8X/a-beginners-guide-to-apache-p

David McAmis
Jan 301 min read
bottom of page