Run DuckDB over S3 files using Lambda for limitless scalability. BoilingData is a fully-managed service to make S3 querying as easy as BoilingData.execQuery({ sql: `SELECT COUNT(*) FROM parquet_scan('s3://your-bucket/datafile.parquet');` })
.
We keep data hot within AWS Lambda memory and ensure there are enough hot Lambda instances for your datasets as long as you need.
We route your queries to the right Lambda instances which have the data, so your hit rate is always 100% once the dataset is loaded. You get in-memory computing speed, single tenant, and always full resources to each and every of your queries. No shared clusters, but serverless power.
Everything works over WebSockets and SQL. Authenticated WebSocket connections carry your SQL queries over and the results are sent back via the WebSocket as well.
Simple and powerful, no tedious API integrations needed. With websockets you also get your query results streamed back to you, which allows faster initial results delivery.
We provide full visibility on your hot datasets and their concurrency setting. We make AWS Lambda containers visible for you. When you query your data set the first time, we load your data into AWS Lambda memory with the biggest Lambda instances.
We ensure you have always required number of concurrent warm Lambda instances ready to serve instantly. After specified idle time has passed for the data set, we let the containers cool off.
Using BoilingData you know exactly the caching status of your data sets and always get predictable high query performance.
Your AWS IAM Role - your control
Hosted DuckDB* - high performance embedded OLAP SQL engine
Lambda Transparency and control - managed service & events
SQL over WebSocket - simple integration and interaction