#apachedatafusion search results
A piece I wrote about rebuilding InfluxDB in #rustlang and #ApacheArrow and #ApacheDataFusion is up on InfoQ!
Engineering a Time Series Database Using Open Source: Rebuilding InfluxDB 3 in Apache Arrow and Rust bit.ly/46fmuUK authored by @pauldix, reviewed by @olimpiupop
I am happy to announce 🤡 the winner of TPCH-SF30 on the free tier Colab notebook is #Tableau Hyper Engine. #DuckDB and #Apachedatafusion could not finish it as they ran out of memory (OOM). This is what I call serious engineering!!! colab.research.google.com/drive/1sqp_V34…
#apachedatafusion is progressing well. Querying 42 million rows using the free tier of Colab is something; other SQL engines will crash. I think it will eventually catch up to #DuckDB colab.research.google.com/drive/1KzofqAW…
Started playing with #Apachedatafusion. The good thing is that it is mature enough that I could run the same test unmodified; that's a testament to SQL, I guess. Still memory issues when I increase the data size colab.research.google.com/drive/1WJ2ICxJ…
#delta_rs, which is basically the standard way of writing Delta tables using Python (no Spark), just exposed a SQL interface. It was a simple change, as it is already based on #apachedatafusion. Tested using a #Microsoftfabric notebook: github.com/djouallah/Fabr…
Just merged our first SQL query using #ApacheDataFusion at work! 🎉 Integrating it was a fantastic experience. It will soon be part of the core of #Materia. Wrote about why DataFusion is a game-changer for #Rustlang #database systems: pierrezemb.fr/posts/thank-yo…
Exploring #ApacheDataFusion’s Catalog: maps tables across S3/Postgres/Iceberg & turns unresolved plans into typed logical plans. Still wrapping my head around it, but once the catalog clicks the optimizer is wide open to tweak. 🦀🚀 #RustLang #DataEngineering
Check out this Kubeflow in-memory data cache solution built on #ApacheArrow & #ApacheDataFusion! It optimizes sharding of #ApacheIceberg tables and enhances #Kuberentes for #GenAI workloads. github.com/kubeflow/commu… 📽️ #KubeCon + #CloudNativeCon talk: youtu.be/s4KAe7AtN7s
Speed up Your ML Workloads With Kubernetes Powered In-memory Data......
Let's try simple sorting of a Parquet file, but with a bigger VM (16 CPU / 60 GB RAM):
#DuckDB: 29.6s
#ApacheDatafusion: 1min 4s
#Apachespark: need to configure Java, not interested
#Polars: 1min 6s
#Clickhouse: 1min 24s
#pyarrow: 2m 3s
github.com/djouallah/parq…
In this week's Developer Voices, Andrew Lamb takes us through #ApacheDataFusion, exploring how this #Rust toolkit shaves years off the prospect of creating a custom database. Fascinating stuff for any data and architecture fans like me. 😁 youtu.be/8QNNCr8WfDM
DataFusion - The Database Building Toolkit (with Andrew Lamb)
You can pass an #apachedatafusion dataframe directly to the Delta table Python writer and it works great, but ... behind the scenes it calls collect(), which loads the whole damn dataset into memory. If you have a lot of data to process, use daft or duckdb
Dive into the world of #TimeSeries data with InfluxData's @AndrewLamb1111 on The Data Stack Show. Discover insights on high cardinality, query performance, #ApacheDataFusion, and the future of data systems. 🎙️ bit.ly/3zhJCoL #InfluxDB
Startups are doubling down on #ApacheDataFusion for its powerful, scalable data processing capabilities! 🚀 With a flexible architecture and open-source community support, it's transforming data analytics. Discover why it's the future! #BigData #TechTrends #Startups
I guess that's a big win for #apachedatafusion as it is the engine used by SDF
🔥 The #ApacheDataFusion community is growing! 🔥 Chicago’s Apache DataFusion Meetup brought together passionate engineers, developers, and data experts to dive into the latest advancements and real-world applications. 🚀 Check out the key discussions and see how DataFusion…
influxdata.com
Apache DataFusion Meetup: Chicago December 2024 Recap
A recap of the Apache DataFusion Chicago meetup in December, 2024.
#ApacheDataFusion @GoogleBigQuery @AzureSynapse #AmazonRedshift @SnowflakeDB @ApachePinot @databricks #ClouderaImpala @StarRocksLabs @ApacheKylin @FireboltHQ
Why Startups Are Betting Everything on #ApacheDataFusion. Discover why startups like Flarion, LakeSail, and major companies are betting on Apache DataFusion — the Rust-based query engine that's reshaping #data #analytics. bit.ly/4m5nQro
🔎 Learn about our experience implementing StringView in the Rust implementation of Apache Arrow, and integrating it into Apache DataFusion. #ApacheDataFusion #Rustlang #InfluxDB
Xiangpeng Hao strikes again: Turns out you can both 1. Implement 🇩🇪 strings in Rust 2. They actually improve end to end performance in @ApacheDataFusio. It takes a lot more than a straightforward naive implementation to do so: influxdata.com/blog/faster-qu…
🗓️ January 15 🕗 5–7:30 PM ET 📍 Boston, MA Join InfluxData's @AndrewLamb1111 and fellow @ApacheDataFusion enthusiasts for an in-person meetup! Save your spot here: bit.ly/4fCFKPw #InfluxDB #ApacheDataFusion
luma.com
Boston DataFusion Meetup · Luma
Apache DataFusion Meetup: Boston https://datafusion.apache.org/ Come meet fellow DataFusion enthusiasts, project contributors, and database aficionados at an…
Catch up on the #Database Building Blocks session with #InfluxDB Staff Engineer @andrewlamb1111! 🔎 Dive into @ApacheDataFusio's inner workings and discover the data-centric systems it powers: bit.ly/4ej6qnZ #ApacheDataFusion
Apache Arrow DataFusion: A Fast, Embeddable, Modular Analytic Query...
.@ApacheDataFusio 43.0.0 is the fastest engine for querying Apache Parquet files in ClickBench! 👏 For the first time, a Rust-based engine holds the top spot: bit.ly/4fZ5RQV #InfluxDB #ApacheDataFusion #Rustlang
influxdata.com
Apache DataFusion is Now the Fastest Single Node Engine for Querying Apache Parquet Files
Best in class performance on Parquet is now available to anyone. DataFusion’s open design lets you start quickly with a full-featured Query Engine, including SQL, data formats, catalogs, and more,...
🎉 Big news! @ApacheDataFusio—the engine behind the rebuilt #InfluxDB—has been named one of @CRN's 10 Coolest #OpenSource Software Tools of 2024! 🚀 Optimized for speed, scalability, & versatility: bit.ly/3VM9kdH via @CRN #ApacheDataFusion
crn.com
The 10 Coolest Open-Source Software Tools Of 2024
The 10 coolest open-source software tools in 2024 include tools for developing AI applications, implementing zero trust networks and managing large volumes of data.
Watch InfluxData Staff Software Engineer @AndrewLamb1111 explain @ApacheDataFusio in detail, describing the types of data-centric systems it's used to build. 🔗bit.ly/4ej6qnZ #InfluxDB #ApacheDataFusion
Join us for an @ApacheDataFusio Meetup on January 15th in Boston! Meet Influxer @AndrewLamb1111 and other fellow DataFusion enthusiasts, project contributors, and database aficionados: bit.ly/4fCFKPw #InfluxDB #ApacheDataFusion
📅 Hey, Boston, save the date! On January 15th, #InfluxDB Staff Software Engineer @AndrewLamb1111 is co-hosting an @ApacheDataFusio Meetup! Come meet fellow #ApacheDataFusion enthusiasts, project contributors, and database aficionados at this event: bit.ly/4fCFKPw
Must-see: @AndrewLamb1111's Database Building Blocks Seminar talk explains @ApacheDataFusio in detail and describes the types of data-centric systems it's used to build. Watch here bit.ly/4ej6qnZ #InfluxDB #ApacheDataFusion