The Tale of Two Vehicles: Apache Druid's New Shape Takes Form
Release Date: 02/14/2023
Tales at Scale
Held in October 2024, Druid Summit brought together community contributors from companies including Netflix, Salesforce, Atlassian, Imply, Roblox and more to discuss the latest trends, challenges, and best practices across the Druid community. The summit explored user experience design, operations and optimization techniques, and lakehouse and streaming analytics pipelines. On this featured partner episode, host Danielle DiKayo interviews Larissa Klitzke, a Senior Product Marketing Manager, about the highlights from Druid Summit. Want to dive deeper into content? Watch all the...
On this episode, we explore how Kong, a cloud-native API gateway, leverages Apache Druid for real-time data processing and analytics in their platform, Kong Konnect. Hiroshi Fukada, Staff Software Engineer at Kong, shares his insights on managing customer data through Kong Gateway and transitioning to managed Druid services to simplify their infrastructure. Discover the benefits of Druid, like low latency and ease of use, and learn about Kong's contributions to open source Druid, including the DDSketch extension for improved handling of long-tail distributions. Want to learn more about...
On this episode, we are joined by special co-host Hugh Evans and returning guest Will Xu as we announce Druid Summit 2024 and dive into Druid 30.0's new features and enhancements. Improvements include better ingestion for Amazon Kinesis and Apache Kafka, enhanced support for Delta Lake, and advanced integrations with Google Cloud Storage and Azure Blob Storage. Come for the technical upgrades like GROUP BY and ORDER BY for complex columns and faster query processing with new IN and AND filters, stay for the stabilized concurrent append and replace API for late-arriving streaming data. We...
On this episode, we’re diving into digital ad spend and real-time data with Miguel Rodrigues, Head of Engineering at British media company Global. We’ll discuss their use of Apache Druid to enhance real-time analytics for their digital advertising platform and get the details on their transition from traditional databases to Druid, which added the scalable streaming capabilities and fast query speeds they needed to improve critical data freshness. But that’s not all! Miguel was also kind enough to share insights for newcomers to Druid on how to embrace its flexibility and...
On this episode, we are joined by Ross Morrow, a Software Engineer at Finix, the payment processor working to create the most accessible financial services ecosystem in history. Finix’s B2B payments platform is designed for flexibility and scalability, streamlining financial transactions for businesses and delivering a truly customer-centric experience. Faced with the need for a powerful database for real-time insights, Finix turned to Apache Druid. Listen to learn how they’re able to access real-time data with sub-second query times, how they transformed their data operations, and how...
On this episode, we’re going all in on cybersecurity! Carrell Jackson, CISO at Imply, joins us to explain which critical aspects of security you need to focus on when building analytics applications. We’ll discuss the importance of protecting sensitive data by implementing role-based access control and encryption and hear about best practices for securing a Druid cluster. Listen to learn more about how Imply takes a security-first approach to their product development and stick around to hear where Certified Ethical Hacking fits into how Imply’s security stays ahead of threats.
On this episode, we explore Apache Druid 29.0, focusing on three specific themes: performance, ecosystem, and SQL compliance. Discover new features such as EARLIEST / LATEST support for numerical columns, system fields ingestion, and enhanced array support like UNNEST and JSON_QUERY_ARRAY. In addition, get the full scoop on community-contributed extensions like Spectator Histogram and DDSketch for efficient quantile calculations and long-tailed distribution support. Learn about what’s new with MSQ, what’s up with PIVOT / UNPIVOT, and so much more!
In this special episode of Tales at Scale - this is our final episode of our first season! - Peter Marshall, Director of Developer Relations at Imply, joins the show to discuss the highlights of 2023 for Apache Druid. We dive into the significant feature releases and enhancements that have transformed Druid over the past year, including SQL standardization, query from deep storage, and experimental window functions, as well as the growing Druid community. Come for the retrospective, stay for the peek into the future of what’s to come for us and for Druid in 2024. See you all next year!
On this episode, we dive into Apache Druid 28. This latest Druid release includes improved ANSI SQL and Apache Calcite support, the addition of window functions as an experimental feature, async queries and query from deep storage going GA, array enhancements, multi-topic Apache Kafka ingestion, and so much more! Will Xu, program manager at Imply, returns to give us the full scoop.
On this episode, we debunk the myth that Druid can't do joins. Druid doesn't function as a traditional relational database because it was purpose-built for lightning-fast queries on large datasets. However, this doesn't mean Druid is entirely devoid of join capabilities – it simply approaches them differently. Our myth-busting team features returning guests Sergio Ferragut and Hellmar Becker from Imply, ready to clarify how Druid handles joins in its own unique way and tackle what Druid is for in the first place.
Apache Druid today isn’t the Druid that you’re used to. It’s so much more. The addition of the multi-stage query engine didn’t just change the way Druid handles queries; it also enabled data transformation on ingestion and inside of Druid, from one table to another, using SQL. This has made Druid about 40% faster. But why stop there? Get the inside scoop on what’s coming to Druid this year, from cold tier storage to asynchronous queries and more.