#apachedatafusion search results
I am happy to announce 🤡 the winner of TPCH-SF30 on the free tier Colab notebook is #Tableau Hyper Engine; #DuckDB and #Apachedatafusion could not finish it as they get OOM. This is what I call serious engineering!!! colab.research.google.com/drive/1sqp_V34…
#apachedatafusion is progressing well. Querying 42 million rows using the free tier of Colab is something; other SQL engines will crash. I think it will eventually catch up to #DuckDB colab.research.google.com/drive/1KzofqAW…
A piece I wrote about rebuilding InfluxDB in #rustlang and #ApacheArrow and #ApacheDataFusion is up on InfoQ!
Engineering a Time Series Database Using Open Source: Rebuilding InfluxDB 3 in Apache Arrow and Rust bit.ly/46fmuUK authored by @pauldix, reviewed by @olimpiupop
infoq.com
Engineering a Time Series Database Using Open Source: Rebuilding InfluxDB 3 in Apache Arrow and Rust
At times, to evolve your product, you need to rebuild it from scratch. The article provides the story behind the rewrite of InfluxDB from scratch using a different programming language - Rust - and...
Just merged our first SQL query using #ApacheDataFusion at work! 🎉 Integrating it was a fantastic experience. It will soon be part of the core of #Materia. Wrote about why DataFusion is a game-changer for #Rustlang #database systems: pierrezemb.fr/posts/thank-yo…
#delta_rs, which is basically the standard way of writing Delta tables using Python (no Spark), just exposed a SQL interface. It was a simple change, as it is already based on #apachedatafusion. Tested using a #Microsoftfabric notebook: github.com/djouallah/Fabr…
Started playing with #Apachedatafusion. The good thing is that it is mature enough that I could run the same test unmodified; that's a testament to SQL, I guess. Still memory issues when I increase the data size colab.research.google.com/drive/1WJ2ICxJ…
Check out this Kubeflow in-memory data cache solution built on #ApacheArrow & #ApacheDataFusion! It optimizes sharding of #ApacheIceberg tables and enhances #Kubernetes for #GenAI workloads. github.com/kubeflow/commu… 📽️ #KubeCon + #CloudNativeCon talk: youtu.be/s4KAe7AtN7s
youtube.com
YouTube
Speed up Your ML Workloads With Kubernetes Powered In-memory Data......
Exploring #ApacheDataFusion’s Catalog: maps tables across S3/Postgres/Iceberg & turns unresolved plans into typed logical plans. Still wrapping my head around it, but once the catalog clicks the optimizer is wide open to tweak. 🦀🚀 #RustLang #DataEngineering
In this week's Developer Voices, Andrew Lamb takes us through #ApacheDataFusion, exploring how this #Rust toolkit shaves years off the prospect of creating a custom database. Fascinating stuff for any data and architecture fans like me. 😁 youtu.be/8QNNCr8WfDM
youtube.com
YouTube
DataFusion - The Database Building Toolkit (with Andrew Lamb)
Let's try a simple sort of a Parquet file, but with a bigger VM (16 CPU / 60 GB RAM): #DuckDB 29.6s; #ApacheDatafusion 1min 4s; #Apachespark: need to configure Java, not interested; #Polars 1min 6s; #Clickhouse 1min 24s; #pyarrow 2m 3s github.com/djouallah/parq…
You can pass an #apachedatafusion dataframe directly to delta table Python and it works great, but ... behind the scenes it calls collect(), which loads the whole damn data into memory. If you have a lot of data to process, use daft or duckdb
Startups are doubling down on #ApacheDataFusion for its powerful, scalable data processing capabilities! 🚀 With a flexible architecture and open-source community support, it's transforming data analytics. Discover why it's the future! #BigData #TechTrends #Startups
Dive into the world of #TimeSeries data with InfluxData's @AndrewLamb1111 on The Data Stack Show. Discover insights on high cardinality, query performance, #ApacheDataFusion, and the future of data systems. 🎙️ bit.ly/3zhJCoL #InfluxDB
I guess that's a big win for #apachedatafusion as it is the engine used by SDF
🔥 The #ApacheDataFusion community is growing! 🔥 Chicago’s Apache DataFusion Meetup brought together passionate engineers, developers, and data experts to dive into the latest advancements and real-world applications. 🚀 Check out the key discussions and see how DataFusion…
influxdata.com
Apache DataFusion Meetup: Chicago December 2024 Recap
A recap of the Apache DataFusion Chicago meetup in December, 2024.
🗓️ January 15 🕗 5–7:30 PM ET 📍 Boston, MA Join InfluxData's @AndrewLamb1111 and fellow @ApacheDataFusion enthusiasts for an in-person meetup! Save your spot here: bit.ly/4fCFKPw #InfluxDB #ApacheDataFusion
Why Startups Are Betting Everything on #ApacheDataFusion. Discover why startups like Flarion, LakeSail, and major companies are betting on Apache DataFusion — the Rust-based query engine that's reshaping #data #analytics. bit.ly/4m5nQro
Join us for an @ApacheDataFusio Meetup on January 15th in Boston! Meet Influxer @AndrewLamb1111 and other fellow DataFusion enthusiasts, project contributors, and database aficionados: bit.ly/4fCFKPw #InfluxDB #ApacheDataFusion
📅 Hey, Boston, save the date! On January 15th, #InfluxDB Staff Software Engineer @AndrewLamb1111 is co-hosting an @ApacheDataFusio Meetup! Come meet fellow #ApacheDataFusion enthusiasts, project contributors, and database aficionados at this event: bit.ly/4fCFKPw
Catch up on the #Database Building Blocks session with #InfluxDB Staff Engineer @andrewlamb1111! 🔎 Dive into @ApacheDataFusio's inner workings and discover the data-centric systems it powers: bit.ly/4ej6qnZ #ApacheDataFusion
youtube.com
YouTube
Apache Arrow DataFusion: A Fast, Embeddable, Modular Analytic Query...
.@ApacheDataFusio 43.0.0 is the fastest engine for querying Apache Parquet files in ClickBench! 👏 For the first time, a Rust-based engine holds the top spot: bit.ly/4fZ5RQV #InfluxDB #ApacheDataFusion #Rustlang
influxdata.com
Apache DataFusion is Now the Fastest Single Node Engine for Querying Apache Parquet Files
Best in class performance on Parquet is now available to anyone. DataFusion’s open design lets you start quickly with a full-featured Query Engine, including SQL, data formats, catalogs, and more,...
🎉 Big news! @ApacheDataFusio—the engine behind the rebuilt #InfluxDB—has been named one of @CRN's 10 Coolest #OpenSource Software Tools of 2024! 🚀 Optimized for speed, scalability, & versatility: bit.ly/3VM9kdH via @CRN #ApacheDataFusion
Watch InfluxData Staff Software Engineer @AndrewLamb1111 explain @ApacheDataFusio in detail, describing the types of data-centric systems it's used to build. 🔗bit.ly/4ej6qnZ #InfluxDB #ApacheDataFusion
youtube.com
YouTube
Apache Arrow DataFusion: A Fast, Embeddable, Modular Analytic Query...
Must-see: @AndrewLamb1111's Database Building Blocks Seminar talk explains @ApacheDataFusio in detail and describes the types of data-centric systems it's used to build. Watch here bit.ly/4ej6qnZ #InfluxDB #ApacheDataFusion
youtube.com
YouTube
Apache Arrow DataFusion: A Fast, Embeddable, Modular Analytic Query...