#apachedatafusion search results

A piece I wrote about rebuilding InfluxDB in #rustlang and #ApacheArrow and #ApacheDataFusion is up on InfoQ!

Engineering a Time Series Database Using Open Source: Rebuilding InfluxDB 3 in Apache Arrow and Rust bit.ly/46fmuUK authored by @pauldix, reviewed by @olimpiupop



I am happy to announce 🤡 the winner of of TPCH-SF30 on the free tier Colab notebook is #Tableau Hyper Engine, #DuckDB and #Apachedatafusion could not finish it as they get OOM. this is what I call serieus engineering !!! colab.research.google.com/drive/1sqp_V34…

mim_djo's tweet image. I am happy to announce 🤡 the winner of of TPCH-SF30 on the free tier Colab notebook is #Tableau Hyper Engine, #DuckDB and #Apachedatafusion could not finish it as they get OOM.
this is what I call serieus engineering !!!
colab.research.google.com/drive/1sqp_V34…

#apachedatafusion is progressing well, querying 42 million rows using the free tier of Colab is something, other SQL engines will crash, I think it will eventually catch up to #DuckDB colab.research.google.com/drive/1KzofqAW…

mim_djo's tweet image. #apachedatafusion is progressing well, querying 42 million rows using the free tier of Colab is something, other SQL engines will crash, I think it will eventually catch up to #DuckDB
colab.research.google.com/drive/1KzofqAW…

start playing with #Apachedatafusion, the good thing it is mature enough that I could run the same test unmodified, that's a testament to SQL I guess, still memory issues, when I increase the data size colab.research.google.com/drive/1WJ2ICxJ…

mim_djo's tweet image. start playing with #Apachedatafusion, the good thing it is mature enough that I could run the same test unmodified, that's a testament to SQL I guess, still memory issues, when I increase the data size

colab.research.google.com/drive/1WJ2ICxJ…

#delta_rs which is basically the standard way for writing delta table using Python ( no spark) just exposed a SQL interface , it was a simple change as it is already based on #apachedatafusion tested using #Microsoftfabric notebook : github.com/djouallah/Fabr…

mim_djo's tweet image. #delta_rs which is basically the standard way for writing delta table using Python ( no spark)
just exposed  a SQL interface , it was a simple change as it is already based on #apachedatafusion
tested using #Microsoftfabric notebook : 
github.com/djouallah/Fabr…

Just merged our first SQL query using #ApacheDataFusion at work! 🎉 Integrating it was a fantastic experience, It will soon be part of the core of #Materia. Wrote about why DataFusion is a game-changer for #Rustlang #database systems: pierrezemb.fr/posts/thank-yo…


Exploring #ApacheDataFusion’s Catalog: maps tables across S3/Postgres/Iceberg & turns unresolved plans into typed logical plans. Still wrapping my head around it, but once the catalog clicks the optimizer is wide open to tweak. 🦀🚀 #RustLang #DataEngineering


let's try simple sorting of a parquet file but with bigger VM, 16 CPU/60 GB RAM #DuckDB 29.6s #ApacheDatafusion 1min 4s #Apachespark : need to configure java, not interested #Polars 1min 6s #Clickhouse 1min 24s #pyarrow 2m 3s github.com/djouallah/parq…


In this week's Developer Voices, Andrew Lamb takes us through #ApacheDataFusion, exploring how this #Rust toolkit shaves years off the prospect of creating a custom database. Fascinating stuff for any data and architecture fans like me. 😁 youtu.be/8QNNCr8WfDM

krisajenkins's tweet card. DataFusion - The Database Building Toolkit (with Andrew Lamb)

youtube.com

YouTube

DataFusion - The Database Building Toolkit (with Andrew Lamb)


you can pass #apachedatafusion dataframe directly to delta table python and it works great but ... behind the scene it calls collect() which load the whole damn data into memory, if you have a lot of data to process use daft or duckdb


Dive into the world of #TimeSeries data with InfluxData's @AndrewLamb1111 on The Data Stack Show. Discover insights on high cardinality, query performance, #ApacheDataFusion, and the future of data systems. 🎙️ bit.ly/3zhJCoL #InfluxDB

InfluxDB's tweet image. Dive into the world of #TimeSeries data with InfluxData's @AndrewLamb1111 on The Data Stack Show.

Discover insights on high cardinality, query performance, #ApacheDataFusion, and the future of data systems.

🎙️ bit.ly/3zhJCoL #InfluxDB

Startups are doubling down on #ApacheDataFusion for its powerful, scalable data processing capabilities! 🚀 With a flexible architecture and open-source community support, it's transforming data analytics. Discover why it's the future! #BigData #TechTrends #Startups


I guess that's a big win for #apachedatafusion as it is the engine used by SDF

The company I joined after VoDa, @SDFLabs, was acquired by @dbt_labs which has been a very exciting turn of events. So I’m now joining dbt Labs together with the SDF team. 🎉



🔥 The #ApacheDataFusion community is growing! 🔥 Chicago’s Apache DataFusion Meetup brought together passionate engineers, developers, and data experts to dive into the latest advancements and real-world applications. 🚀 Check out the key discussions and see how DataFusion…

influxdata.com

​Apache DataFusion Meetup: Chicago December 2024 Recap

A recap of the Apache DataFusion Chicago meetup in December, 2024.


Why Startups Are Betting Everything on #ApacheDataFusion. Discover why startups like Flarion, LakeSail, and major companies are betting on Apache DataFusion — the Rust-based query engine that's reshaping #data #analytics. bit.ly/4m5nQro


🔎 Learn about our experience implementing StringView in the Rust implementation of Apache Arrow, and integrating it into Apache DataFusion. #ApacheDataFusion #Rustlang #InfluxDB

Xiangpeng Hao strikes again: Turns out you can both 1. Implement 🇩🇪 strings in Rust 2. They actually improve end to end performance in @ApacheDataFusio. It takes a lot more than a straightforward naive implementation to do so: influxdata.com/blog/faster-qu…



A piece I wrote about rebuilding InfluxDB in #rustlang and #ApacheArrow and #ApacheDataFusion is up on InfoQ!

Engineering a Time Series Database Using Open Source: Rebuilding InfluxDB 3 in Apache Arrow and Rust bit.ly/46fmuUK authored by @pauldix, reviewed by @olimpiupop



Why Startups Are Betting Everything on #ApacheDataFusion. Discover why startups like Flarion, LakeSail, and major companies are betting on Apache DataFusion — the Rust-based query engine that's reshaping #data #analytics. bit.ly/4m5nQro


Startups are doubling down on #ApacheDataFusion for its powerful, scalable data processing capabilities! 🚀 With a flexible architecture and open-source community support, it's transforming data analytics. Discover why it's the future! #BigData #TechTrends #Startups


you can pass #apachedatafusion dataframe directly to delta table python and it works great but ... behind the scene it calls collect() which load the whole damn data into memory, if you have a lot of data to process use daft or duckdb


Just merged our first SQL query using #ApacheDataFusion at work! 🎉 Integrating it was a fantastic experience, It will soon be part of the core of #Materia. Wrote about why DataFusion is a game-changer for #Rustlang #database systems: pierrezemb.fr/posts/thank-yo…


Exploring #ApacheDataFusion’s Catalog: maps tables across S3/Postgres/Iceberg & turns unresolved plans into typed logical plans. Still wrapping my head around it, but once the catalog clicks the optimizer is wide open to tweak. 🦀🚀 #RustLang #DataEngineering


In this week's Developer Voices, Andrew Lamb takes us through #ApacheDataFusion, exploring how this #Rust toolkit shaves years off the prospect of creating a custom database. Fascinating stuff for any data and architecture fans like me. 😁 youtu.be/8QNNCr8WfDM

krisajenkins's tweet card. DataFusion - The Database Building Toolkit (with Andrew Lamb)

youtube.com

YouTube

DataFusion - The Database Building Toolkit (with Andrew Lamb)


🔥 The #ApacheDataFusion community is growing! 🔥 Chicago’s Apache DataFusion Meetup brought together passionate engineers, developers, and data experts to dive into the latest advancements and real-world applications. 🚀 Check out the key discussions and see how DataFusion…

influxdata.com

​Apache DataFusion Meetup: Chicago December 2024 Recap

A recap of the Apache DataFusion Chicago meetup in December, 2024.


I guess that's a big win for #apachedatafusion as it is the engine used by SDF

The company I joined after VoDa, @SDFLabs, was acquired by @dbt_labs which has been a very exciting turn of events. So I’m now joining dbt Labs together with the SDF team. 🎉



🗓️ January 15 🕗 5–7:30 PM ET 📍 Boston, MA Join InfluxData's @AndrewLamb1111 and fellow @ApacheDataFusion enthusiasts for an in-person meetup! Save your spot here: bit.ly/4fCFKPw #InfluxDB #ApacheDataFusion


Catch up on the #Database Building Blocks session with #InfluxDB Staff Engineer @andrewlamb1111! 🔎 Dive into @ApacheDataFusio's inner workings and discover the data-centric systems it powers: bit.ly/4ej6qnZ #ApacheDataFusion

InfluxDB's tweet card. Apache Arrow DataFusion: A Fast, Embeddable, Modular Analytic Query...

youtube.com

YouTube

Apache Arrow DataFusion: A Fast, Embeddable, Modular Analytic Query...


#delta_rs which is basically the standard way for writing delta table using Python ( no spark) just exposed a SQL interface , it was a simple change as it is already based on #apachedatafusion tested using #Microsoftfabric notebook : github.com/djouallah/Fabr…

mim_djo's tweet image. #delta_rs which is basically the standard way for writing delta table using Python ( no spark)
just exposed  a SQL interface , it was a simple change as it is already based on #apachedatafusion
tested using #Microsoftfabric notebook : 
github.com/djouallah/Fabr…

Watch InfluxData Staff Software Engineer @AndrewLamb1111 explain @ApacheDataFusio in detail, describing the types of data-centric systems it's used to build. 🔗bit.ly/4ej6qnZ #InfluxDB #ApacheDataFusion

InfluxDB's tweet card. Apache Arrow DataFusion: A Fast, Embeddable, Modular Analytic Query...

youtube.com

YouTube

Apache Arrow DataFusion: A Fast, Embeddable, Modular Analytic Query...


Dive into the world of #TimeSeries data with InfluxData's @AndrewLamb1111 on The Data Stack Show. Discover insights on high cardinality, query performance, #ApacheDataFusion, and the future of data systems. 🎙️ bit.ly/3zhJCoL #InfluxDB

InfluxDB's tweet image. Dive into the world of #TimeSeries data with InfluxData's @AndrewLamb1111 on The Data Stack Show.

Discover insights on high cardinality, query performance, #ApacheDataFusion, and the future of data systems.

🎙️ bit.ly/3zhJCoL #InfluxDB

Join us for an @ApacheDataFusio Meetup on January 15th in Boston! Meet Influxer @AndrewLamb1111 and other fellow DataFusion enthusiasts, project contributors, and database aficionados: bit.ly/4fCFKPw #InfluxDB #ApacheDataFusion


📅 Hey, Boston, save the date! On January 15th, #InfluxDB Staff Software Engineer @AndrewLamb1111 is co-hosting an @ApacheDataFusio Meetup! Come meet fellow #ApacheDataFusion enthusiasts, project contributors, and database aficionados at this event: bit.ly/4fCFKPw


Must-see: @AndrewLamb1111's Database Building Blocks Seminar talk explains @ApacheDataFusio in detail and describes the types of data-centric systems it's used to build. Watch here bit.ly/4ej6qnZ #InfluxDB #ApacheDataFusion

InfluxDB's tweet card. Apache Arrow DataFusion: A Fast, Embeddable, Modular Analytic Query...

youtube.com

YouTube

Apache Arrow DataFusion: A Fast, Embeddable, Modular Analytic Query...


Dive into the world of #TimeSeries data with InfluxData's @AndrewLamb1111 on The Data Stack Show. Discover insights on high cardinality, query performance, #ApacheDataFusion, and the future of data systems. 🎙️ bit.ly/3zhJCoL #InfluxDB

InfluxDB's tweet image. Dive into the world of #TimeSeries data with InfluxData's @AndrewLamb1111 on The Data Stack Show.

Discover insights on high cardinality, query performance, #ApacheDataFusion, and the future of data systems.

🎙️ bit.ly/3zhJCoL #InfluxDB

start playing with #Apachedatafusion, the good thing it is mature enough that I could run the same test unmodified, that's a testament to SQL I guess, still memory issues, when I increase the data size colab.research.google.com/drive/1WJ2ICxJ…

mim_djo's tweet image. start playing with #Apachedatafusion, the good thing it is mature enough that I could run the same test unmodified, that's a testament to SQL I guess, still memory issues, when I increase the data size

colab.research.google.com/drive/1WJ2ICxJ…

#apachedatafusion is progressing well, querying 42 million rows using the free tier of Colab is something, other SQL engines will crash, I think it will eventually catch up to #DuckDB colab.research.google.com/drive/1KzofqAW…

mim_djo's tweet image. #apachedatafusion is progressing well, querying 42 million rows using the free tier of Colab is something, other SQL engines will crash, I think it will eventually catch up to #DuckDB
colab.research.google.com/drive/1KzofqAW…

I am happy to announce 🤡 the winner of of TPCH-SF30 on the free tier Colab notebook is #Tableau Hyper Engine, #DuckDB and #Apachedatafusion could not finish it as they get OOM. this is what I call serieus engineering !!! colab.research.google.com/drive/1sqp_V34…

mim_djo's tweet image. I am happy to announce 🤡 the winner of of TPCH-SF30 on the free tier Colab notebook is #Tableau Hyper Engine, #DuckDB and #Apachedatafusion could not finish it as they get OOM.
this is what I call serieus engineering !!!
colab.research.google.com/drive/1sqp_V34…

#delta_rs which is basically the standard way for writing delta table using Python ( no spark) just exposed a SQL interface , it was a simple change as it is already based on #apachedatafusion tested using #Microsoftfabric notebook : github.com/djouallah/Fabr…

mim_djo's tweet image. #delta_rs which is basically the standard way for writing delta table using Python ( no spark)
just exposed  a SQL interface , it was a simple change as it is already based on #apachedatafusion
tested using #Microsoftfabric notebook : 
github.com/djouallah/Fabr…

Loading...

Something went wrong.


Something went wrong.


United States Trends