PrestoDB Blog - PrestoDB

Elevating Presto Query Optimization: Leveraging State-of-the-Art Techniques for Improved Performance

By David Simmen, Anant Aneja, Vivek Bharathan, Zachary Blanco, Aditi Pandit & Ethan Zhang March 21, 2024March 21, 2024

Presto, a prominent open-source distributed SQL query engine, has been at the leading edge of high-performance data analytics for over a decade. In analytical data processing, the effectiveness of query optimization is paramount. Over the last half-century, optimizing SQL queries has been a hotbed of research and development, resulting in groundbreaking innovations. This blog post…

2023 Presto Community Year in Review

By Girish Baliga, Tim Meehan & Ali LeClerc December 27, 2023December 27, 2023

Hello Presto community! As we wrap up the year, it’s a great time to reflect on all we accomplished with Presto. We had a lot to celebrate in 2023, from the official launch of Presto 2.0 (see our keynote from PrestoCon) to continued growth in both the community and the Presto Foundation. We’re happy to…

PrestoCon 2023: Highlighting Presto 2.0 and the Presto community

By Ali LeClerc December 19, 2023December 19, 2023

PrestoCon 2023 was a success! Part education, part celebration, this year’s event highlighted the excitement about Prestissimo, aka Presto 2.0.

Top 3 reasons why you should attend PrestoCon 2023: Halloween Edition

By Ali LeClerc October 31, 2023October 31, 2023

Two days of Presto, hands-on workshops, and Prestissimo…oh my! Happy Halloween, Presto community! Over the last few weeks, I’ve had people reach out to ask more about PrestoCon 2023, so I figured I’d write a blog to share my thoughts. First, a quick overview: When: December 5-6th, 2023 at the Computer History Museum in Mountain View,…

Happy 11th Birthday to Presto! We’re celebrating with some project contributions

By Ali LeClerc October 12, 2023October 12, 2023

Happy Birthday to Presto! Although it officially happened in August, we’re still celebrating because Presto turned 11 and because we’ve logged four vibrant years under the Linux Foundation’s governance. 🎂🎉Check out the video of the Presto Community wishing the project a happy birthday! What’s a birthday without gifts, right? 🎁 Coinciding with this birthday, the…

Introducing Presto Working Groups

By Tim Meehan August 29, 2023October 4, 2023

Recently we introduced some new working groups within the Presto open-source project. Before sharing more about those groups, I wanted to give a perspective on why working groups in open-source projects like Presto are important and will help move the project forward at a faster and more effective pace. Below you’ll see an FAQ that…

Scaling Presto for Data Analytics – Insights from Meta, Uber, and Intuit

By Ali LeClerc August 17, 2023September 14, 2023

At PrestoCon Day 2023, we had a fantastic panel discussion with speakers from Meta, Uber, and Intuit. Each shared their experiences and use cases of scaling Presto in their respective companies. Let’s take a look at the key points discussed by each panelist, including use cases, key metrics, and future plans for Presto. Sign up…

Simplifying Presto on Kubernetes – Introducing the Presto Helm Chart

By Ali LeClerc August 2, 2023September 14, 2023

Let’s explore how to run Presto on Kubernetes. At PrestoCon Day 2023, Denis Krivenko of Platform24 shared his work on the Presto Helm Chart and why Presto on Kubernetes helps make for an efficient deployment. He also demoed a step-by-step process of deploying Presto on a Kubernetes cluster using the Helm package manager. Sign up…

Quick Stats – Runtime ANALYZE for Better Query Plans with Presto

By Ali LeClerc July 20, 2023September 14, 2023

At PrestoCon Day, Anant Aneja of Ahana, an IBM Company introduced a new feature for Presto called Quick Stats which aims to enhance query optimization by providing up-to-date statistics for the query optimizer. This enables more accurate cost-based decisions and better selectivity for non-trivial queries. In this blog post we’ll recap the details and benefits…

Migrating to Presto – How Bolt Built a Data Platform Architecture for Scalability and Cost Efficiency

By Ali LeClerc July 13, 2023September 14, 2023

At PrestoCon Day we heard from Bolt, a ride sharing app with 100 million users across 45 countries in Eastern Europe, who shared why they chose Presto to underpin their data architecture platform. By leveraging Presto’s capabilities, Bolt was able to address scalability limits, cost efficiency, and workload management challenges. In this blog we’ll recap…

IBM watsonx.data – a modern open data lakehouse architecture, built on Presto!

By Vikram Murali & Steven Mih July 11, 2023September 14, 2023

Today we are happy to share that IBM watsonx.data, a Presto-based Open Data Lakehouse architecture, is now generally available. Back in April we shared that IBM had joined the Presto Foundation through the acquisition of Ahana. To reiterate what we talked about then, we believe that this is an exciting time for the Presto open…

Harnessing Presto – A Deep Dive into Adobe Advertising’s Three Use Cases

By Ali LeClerc June 29, 2023September 14, 2023

At PrestoCon Day 2023, we had a team from Adobe showcasing three different Presto-based use cases. As part of Adobe Advertising, Rajmani Arya, Varun Senthilnathan and Manoj Kumar Dhakad detailed the Adobe Data Processing platform (ADP) and three use cases for Presto: scheduled pipelines, ad-hoc query, and custom reporting. Let’s dive into what they covered….

Recapping PrestoCon Day 2023 – Presto for the Data Lakehouse, Presto at scale

By Ali LeClerc June 23, 2023September 14, 2023

Just a few weeks ago we hosted PrestoCon Day, our annual virtual community conference. Thank you to everyone who attended – it was an awesome day! We had a fantastic agenda with many Presto users sharing why they chose Presto and how they’re using it to power some pretty sizable workloads. Sign up for the…

Denodo Joins the Presto Foundation

By Pablo Alvarez-Yanez June 14, 2023September 14, 2023

We are pleased to announce that Denodo Technologies has joined the Presto Foundation. The Denodo Platform is a popular data management platform based on the concept of data virtualization and logical data models, which includes capabilities for data integration, privacy, governance, and data cataloging. Denodo is often used to implement logical and distributed data architectures…

Hudi tables via Presto-Hive connector: A Deep Dive

By Pratyaksh Sharma May 30, 2023September 14, 2023

With the growing popularity of the lakehouse approach, it has become increasingly important for query engines to support these new formats such as Hudi. A previous blog discusses the evolution of presto-hudi integration via hive connector at a high level. With the latest community developments, a separate presto-hudi connector has come up but it is…

IBM joins the Presto Foundation through acquisition of Ahana

By Vikram Murali & Steven Mih April 12, 2023September 14, 2023

Today we’re thrilled to share that IBM has acquired Ahana, the venture-backed SaaS for Presto startup company, and we want to write more about our belief in Open Source and why IBM and Ahana are joining forces for the benefit of Presto. We believe that this is an exciting time for the Presto project. We’re…

Customer-Facing Presto at Rippling – Andy Li, Rippling

By Ali LeClerc January 9, 2023September 14, 2023

Last month we hosted PrestoCon, a return to in-person events that showcased the community development of Presto. In this blog we’ll detail Rippling’s presentation on their Presto use case, including their architecture, key optimizations, and hard earned lessons. You can also check out their full presentation here. Background Rippling is a popular HR and payroll…

A recap of PrestoCon 2022 – Bringing Data Lakehouse Analytics to Life (plus a special video recap)

By Ali LeClerc January 9, 2023September 14, 2023

Last month the Computer History Museum in Mountain View, California, reverberated with “all things Presto,” at our PrestoCon 2022 conference. Back for the third time—and the first time post-pandemic—PrestoCon was ground zero for training, knowledge sharing, and inspiration about the open-source Presto for data analytics and lakehouses, as well as for the vibrant Presto community….